linux_old1

History

Tim Chen 2554db9165 sched/wait: Break up long wake list walk We encountered workloads that have very long wake up list on large systems. A waker takes a long time to traverse the entire wake list and execute all the wake functions. We saw page wait list that are up to 3700+ entries long in tests of large 4 and 8 socket systems. It took 0.8 sec to traverse such list during wake up. Any other CPU that contends for the list spin lock will spin for a long time. It is a result of the numa balancing migration of hot pages that are shared by many threads. Multiple CPUs waking are queued up behind the lock, and the last one queued has to wait until all CPUs did all the wakeups. The page wait list is traversed with interrupt disabled, which caused various problems. This was the original cause that triggered the NMI watch dog timer in: https://patchwork.kernel.org/patch/9800303/ . Only extending the NMI watch dog timer there helped. This patch bookmarks the waker's scan position in wake list and break the wake up walk, to allow access to the list before the waker resume its walk down the rest of the wait list. It lowers the interrupt and rescheduling latency. This patch also provides a performance boost when combined with the next patch to break up page wakeup list walk. We saw 22% improvement in the will-it-scale file pread2 test on a Xeon Phi system running 256 threads. [ v2: Merged in Linus' changes to remove the bookmark_wake_function, and simply access to flags. ] Reported-by: Kan Liang <kan.liang@intel.com> Tested-by: Kan Liang <kan.liang@intel.com> Signed-off-by: Tim Chen <tim.c.chen@linux.intel.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>		2017-09-14 09:56:17 -07:00
..
Makefile	membarrier: Provide expedited private command	2017-08-17 07:28:05 -07:00
autogroup.c	sched/autogroup: Fix error reporting printk text in autogroup_create()	2017-08-10 17:06:03 +02:00
autogroup.h	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/autogroup.h>	2017-03-02 08:42:28 +01:00
clock.c	sched/clock: Fix early boot preempt assumption in __set_sched_clock_stable()	2017-05-24 09:10:00 +02:00
completion.c	Merge branch 'locking-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-09-04 11:52:29 -07:00
core.c	sched/core: WARN() when migrating to an offline CPU	2017-09-12 17:41:04 +02:00
cpuacct.c	sched/cputime: Convert kcpustat to nsecs	2017-02-01 09:13:47 +01:00
cpuacct.h	sched/cpuacct: Simplify the cpuacct code	2016-03-21 11:00:28 +01:00
cpudeadline.c	sched/deadline: Change return value of cpudl_find()	2017-08-10 12:18:17 +02:00
cpudeadline.h	sched/deadline: Split cpudl_set() into cpudl_set() and cpudl_clear()	2016-09-05 13:29:43 +02:00
cpufreq.c	cpufreq / sched: Pass flags to cpufreq_update_util()	2016-08-16 22:14:55 +02:00
cpufreq_schedutil.c	Merge branch 'pm-cpufreq-sched'	2017-09-04 00:05:22 +02:00
cpupri.c	sched/cpupri: Don't re-initialize 'struct cpupri'	2017-08-10 12:18:14 +02:00
cpupri.h	sched/cpupri: Remove unnecessary definitions in cpupri.h	2014-11-16 10:58:59 +01:00
cputime.c	sched/cputime: Don't use smp_processor_id() in preemptible context	2017-07-14 10:27:15 +02:00
deadline.c	sched/deadline: replace earliest dl and rq leftmost caching	2017-09-08 18:26:49 -07:00
debug.c	Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-09-13 12:22:32 -07:00
fair.c	Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-09-13 12:22:32 -07:00
features.h	sched/core: Implement new approach to scale select_idle_cpu()	2017-06-08 10:25:17 +02:00
idle.c	PM / s2idle: Rename ->enter_freeze to ->enter_s2idle	2017-08-11 01:29:56 +02:00
idle_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
loadavg.c	sched/loadavg: Generalize "_idle" naming to "_nohz"	2017-06-22 11:30:01 +02:00
membarrier.c	membarrier: Provide expedited private command	2017-08-17 07:28:05 -07:00
rt.c	sched: cpufreq: Allow remote cpufreq callbacks	2017-08-01 14:24:53 +02:00
sched-pelt.h	sched/fair: Move the PELT constants into a generated header	2017-04-14 10:26:37 +02:00
sched.h	Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-09-13 12:22:32 -07:00
stats.c	sched: use %*pb[l] to print bitmaps including cpumasks and nodemasks	2015-02-13 21:21:37 -08:00
stats.h	sched/headers: Move cputime functionality from <linux/sched.h> and <linux/cputime.h> into <linux/sched/cputime.h>	2017-03-03 01:45:22 +01:00
stop_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
swait.c	sched/wait: Remove the lockless swait_active() check in swake_up*()	2017-08-10 12:28:53 +02:00
topology.c	Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-09-13 12:22:32 -07:00
wait.c	sched/wait: Break up long wake list walk	2017-09-14 09:56:17 -07:00
wait_bit.c	sched/wait: Disambiguate wq_entry->task_list and wq_head->task_list naming	2017-06-20 12:19:14 +02:00