linux

History

Eric Dumazet be852795e1 alloc_percpu() fails to allocate percpu data Some oprofile results obtained while using tbench on a 2x2 cpu machine were very surprising. For example, loopback_xmit() function was using high number of cpu cycles to perform the statistic updates, supposed to be real cheap since they use percpu data pcpu_lstats = netdev_priv(dev); lb_stats = per_cpu_ptr(pcpu_lstats, smp_processor_id()); lb_stats->packets++; /* HERE : serious contention / lb_stats->bytes += skb->len; struct pcpu_lstats is a small structure containing two longs. It appears that on my 32bits platform, alloc_percpu(8) allocates a single cache line, instead of giving to each cpu a separate cache line. Using the following patch gave me impressive boost in various benchmarks ( 6 % in tbench) (all percpu_counters hit this bug too) Long term fix (ie >= 2.6.26) would be to let each CPU allocate their own block of memory, so that we dont need to roudup sizes to L1_CACHE_BYTES, or merging the SGI stuff of course... Note : SLUB vs SLAB is important here to show* the improvement, since they dont have the same minimum allocation sizes (8 bytes vs 32 bytes). This could very well explain regressions some guys reported when they switched to SLUB. Signed-off-by: Eric Dumazet <dada1@cosmosbay.com> Acked-by: Peter Zijlstra <a.p.zijlstra@chello.nl> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>		2008-03-04 16:35:11 -08:00
..
Kconfig	sh: Bump number of quicklists for SH-5.	2008-01-28 13:18:55 +09:00
Makefile	Memory controller: cgroups setup	2008-02-07 08:42:18 -08:00
allocpercpu.c	alloc_percpu() fails to allocate percpu data	2008-03-04 16:35:11 -08:00
backing-dev.c	mm/backing-dev.c: fix percpu_counter_destroy call bug in bdi_init	2007-12-05 09:21:18 -08:00
bootmem.c	Introduce flags for reserve_bootmem()	2008-02-07 08:42:25 -08:00
bounce.c	block: Initial support for data-less (or empty) barrier support	2007-10-16 11:03:56 +02:00
dmapool.c	pool: Improve memory usage for devices which can't cross boundaries	2007-12-04 10:39:58 -05:00
fadvise.c	check ADVICE of fadvise64_64 even if get_xip_page is given	2008-02-05 09:44:19 -08:00
filemap.c	remove final fastcall users	2008-02-13 16:21:18 -08:00
filemap_xip.c	Use pgoff_t instead of unsigned long	2008-02-08 09:22:32 -08:00
fremap.c	sys_remap_file_pages: fix ->vm_file accounting	2008-02-05 09:44:07 -08:00
highmem.c	mm: remove fastcall from mm/	2008-02-05 09:44:18 -08:00
hugetlb.c	hugetlb: ensure we do not reference a surplus page after handing it to buddy	2008-02-23 17:12:13 -08:00
internal.h	Solve section mismatch for free_area_init_core.	2008-02-23 17:13:24 -08:00
madvise.c	speed up madvise_need_mmap_write() usage	2007-07-16 09:05:36 -07:00
memcontrol.c	memcgroup: return negative error code in mem_cgroup_create()	2008-02-23 17:13:25 -08:00
memory.c	Merge git://git.kernel.org/pub/scm/linux/kernel/git/x86/linux-2.6-x86	2008-02-14 21:23:19 -08:00
memory_hotplug.c	Page allocator: clean up pcp draining functions	2008-02-05 09:44:17 -08:00
mempolicy.c	d_path: Make seq_path() use a struct path argument	2008-02-14 21:17:08 -08:00
mempool.c	spelling fixes: mm/	2007-10-20 01:27:18 +02:00
migrate.c	bugfix for memory cgroup controller: migration under memory controller fix	2008-02-07 08:42:19 -08:00
mincore.c	[PATCH] mincore: vma crossing fix	2007-02-15 09:57:03 -08:00
mlock.c	do not limit locked memory when RLIMIT_MEMLOCK is RLIM_INFINITY	2007-07-16 09:05:37 -07:00
mmap.c	mm: special mapping nopage	2008-02-08 18:57:39 -08:00
mmzone.c	[PATCH] remove EXPORT_UNUSED_SYMBOL'ed symbols	2006-12-07 08:39:44 -08:00
mprotect.c	fix mprotect vma_wants_writenotify prot	2007-10-23 08:32:06 -07:00
mremap.c	sparse pointer use of zero as null	2007-10-18 14:37:31 -07:00
msync.c	Detach sched.h from mm.h	2007-05-21 09:18:19 -07:00
nommu.c	nommu: add new vmalloc_user() and remap_vmalloc_range() interfaces.	2008-02-05 09:44:21 -08:00
oom_kill.c	oom: add sysctl to enable task memory dump	2008-02-07 08:42:19 -08:00
page-writeback.c	writeback: speed up writeback of big dirty files	2008-02-05 09:44:19 -08:00
page_alloc.c	zlc_setup(): handle jiffies wraparound	2008-03-04 16:35:10 -08:00
page_io.c	mm: fix PageUptodate data race	2008-02-05 09:44:19 -08:00
page_isolation.c	memory hotremove: unset migrate type "ISOLATE" after removal	2007-11-14 18:45:38 -08:00
pagewalk.c	maps4: introduce a generic page walker	2008-02-05 09:44:16 -08:00
pdflush.c	Freezer: make kernel threads nonfreezable by default	2007-07-17 10:23:02 -07:00
prio_tree.c	spelling fixes: mm/	2007-10-20 01:27:18 +02:00
quicklist.c	quicklists: Only consider memory that can be used with GFP_KERNEL	2008-01-14 08:52:22 -08:00
readahead.c	mm: bdi init hooks	2007-10-17 08:42:45 -07:00
rmap.c	memcontrol: add vm_match_cgroup()	2008-02-09 11:08:33 -08:00
shmem.c	mount-options-fix-tmpfs-fix	2008-02-08 09:22:41 -08:00
shmem_acl.c	[PATCH] Fix typos in mm/shmem_acl.c	2006-10-11 11:14:23 -07:00
slab.c	slab: avoid double initialization & do initialization in 1 place	2008-02-14 15:30:01 -08:00
slob.c	slob: reduce external fragmentation by using three free lists	2008-02-05 09:44:19 -08:00
slub.c	slub: fix possible NULL pointer dereference	2008-03-03 12:22:32 -08:00
sparse-vmemmap.c	memory hotplug fix: fix section mismatch in vmammap_allock_block()	2007-11-29 09:24:54 -08:00
sparse.c	mm: fix section mismatch warning in sparse.c	2008-02-05 09:44:19 -08:00
swap.c	Memory controller: add per cgroup LRU and reclaim	2008-02-07 08:42:18 -08:00
swap_state.c	memcgroup: revert swap_state mods	2008-02-07 08:42:20 -08:00
swapfile.c	d_path: Make seq_path() use a struct path argument	2008-02-14 21:17:08 -08:00
thrash.c	Bug in mm/thrash.c function grab_swap_token()	2007-05-11 08:29:32 -07:00
tiny-shmem.c	Remove unused code from mm/tiny-shmem.c	2008-02-05 09:44:17 -08:00
truncate.c	docbook: fix kernel-api source files	2008-03-03 10:47:14 -08:00
util.c	fix mm/util.c:krealloc()	2007-11-14 18:45:41 -08:00
vmalloc.c	CONFIG_HIGHPTE vs. sub-page page tables.	2008-02-08 09:22:42 -08:00
vmscan.c	per-zone and reclaim enhancements for memory controller: modifies vmscan.c for isolate globa/cgroup lru activity	2008-02-07 08:42:22 -08:00
vmstat.c	vmstat: remove prefetch	2008-02-05 09:44:18 -08:00