linux_old1

Alexey Budankov 470530bbb8 perf record: Implement --mmap-flush=<number> option

Implement a --mmap-flush option that specifies minimal number of bytes that is extracted from mmaped kernel buffer to store into a trace. The default option value is 1 byte what means every time trace writing thread finds some new data in the mmaped buffer the data is extracted, possibly compressed and written to a trace.

  $ tools/perf/perf record --mmap-flush 1024 -e cycles -- matrix.gcc
  $ tools/perf/perf record --aio --mmap-flush 1K -e cycles -- matrix.gcc

The option is independent from -z setting, doesn't vary with compression level and can serve two purposes.

The first purpose is to increase the compression ratio of a trace data. Larger data chunks are compressed more effectively so the implemented option allows specifying data chunk size to compress. Also at some cases executing more write syscalls with smaller data size can take longer than executing less write syscalls with bigger data size due to syscall overhead so extracting bigger data chunks specified by the option value could additionally decrease runtime overhead.

The second purpose is to avoid self monitoring live-lock issue in system wide (-a) profiling mode. Profiling in system wide mode with compression (-a -z) can additionally induce data into the kernel buffers along with the data from monitored processes. If performance data rate and volume from the monitored processes is high then trace streaming and compression activity in the tool is also high. High tool process activity can lead to subtle live-lock effect when compression of single new byte from some of mmaped kernel buffer leads to generation of the next single byte at some mmaped buffer. So perf tool process ends up in endless self monitoring.

Implemented synch parameter is the mean to force data move independently from the specified flush threshold value. Despite the provided flush value the tool needs capability to unconditionally drain memory buffers, at least in the end of the collection.

Committer testing:

Running with the default value, i.e. as soon as there is something to read go on consuming, we first write the synthesized events, small chunks of about 128 bytes:

  # perf trace -m 2048 --call-graph dwarf -e write -- perf record
  <SNIP>
  101.142 ( 0.004 ms): perf/25821 write(fd: 3</root/perf.data>, buf: 0x210db60, count: 120) = 120
                                     __libc_write (/usr/lib64/libpthread-2.28.so)
                                     ion (/home/acme/bin/perf)
                                     record__write (inlined)
                                     process_synthesized_event (/home/acme/bin/perf)
                                     perf_tool__process_synth_event (inlined)
                                     perf_event__synthesize_mmap_events (/home/acme/bin/perf)

Then we move to reading the mmap buffers consuming the events put there by the kernel perf infrastructure:

  107.561 ( 0.005 ms): perf/25821 write(fd: 3</root/perf.data>, buf: 0x7f1befc02000, count: 336) = 336
                                     __libc_write (/usr/lib64/libpthread-2.28.so)
                                     ion (/home/acme/bin/perf)
                                     record__write (inlined)
                                     record__pushfn (/home/acme/bin/perf)
                                     perf_mmap__push (/home/acme/bin/perf)
                                     record__mmap_read_evlist (inlined)
                                     record__mmap_read_all (inlined)
                                     __cmd_record (inlined)
                                     cmd_record (/home/acme/bin/perf)
  12919.953 ( 0.136 ms): perf/25821 write(fd: 3</root/perf.data>, buf: 0x7f1befc83150, count: 184984) = 184984
  <SNIP same backtrace as in the 107.561 timestamp>
  12920.094 ( 0.155 ms): perf/25821 write(fd: 3</root/perf.data>, buf: 0x7f1befc02150, count: 261816) = 261816
  <SNIP same backtrace as in the 107.561 timestamp>
  12920.253 ( 0.093 ms): perf/25821 write(fd: 3</root/perf.data>, buf: 0x7f1befb81120, count: 170832) = 170832
  <SNIP same backtrace as in the 107.561 timestamp>

If we limit it to write only when more than 16MB are available for reading, it throttles that to a quarter of the --mmap-pages set for 'perf record', which by default get to 528384 bytes, found out using 'record -v':

  mmap flush: 132096
  mmap size 528384B

With that in place all the writes coming from record__mmap_read_evlist(), i.e. from the mmap buffers setup by the kernel perf infrastructure were at least 132096 bytes long.

Trying with a bigger mmap size:

  perf trace -e write perf record -v -m 2048 --mmap-flush 16M
  74982.928 ( 2.471 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff94a6cc000, count: 3580888) = 3580888
  74985.406 ( 2.353 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff949ecb000, count: 3453256) = 3453256
  74987.764 ( 2.629 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff9496ca000, count: 3859232) = 3859232
  74990.399 ( 2.341 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff948ec9000, count: 3769032) = 3769032
  74992.744 ( 2.064 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff9486c8000, count: 3310520) = 3310520
  74994.814 ( 2.619 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff947ec7000, count: 4194688) = 4194688
  74997.439 ( 2.787 ms): perf/26500 write(fd: 3</root/perf.data>, buf: 0x7ff9476c6000, count: 4029760) = 4029760

Was again limited to a quarter of the mmap size:

  mmap flush: 2098176
  mmap size 8392704B

A warning about that would be good to have but can be added later, something like:

  "max flush is a quarter of the mmap size, if wanting to bump the mmap flush further, bump the mmap size as well using -m/--mmap-pages"

Also rename the 'sync' parameters to 'synch' to keep tools/perf building with older glibcs:

  cc1: warnings being treated as errors
  builtin-record.c: In function 'record__mmap_read_evlist':
  builtin-record.c:775: warning: declaration of 'sync' shadows a global declaration
  /usr/include/unistd.h:933: warning: shadowed declaration is here
  builtin-record.c: In function 'record__mmap_read_all':
  builtin-record.c:856: warning: declaration of 'sync' shadows a global declaration
  /usr/include/unistd.h:933: warning: shadowed declaration is here

Signed-off-by: Alexey Budankov <alexey.budankov@linux.intel.com>
Reviewed-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/f6600d72-ecfa-2eb7-7e51-f6954547d500@linux.intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

2019-04-01 15:18:10 -03:00
..
Build.txt	perf tools: Add doc about how to build perf with Asan and UBSan	2019-03-19 16:52:04 -03:00
Makefile	perf Documentation: Fix out-of-tree asciidoctor man page generation	2018-09-18 10:17:16 -03:00
android.txt	perf tools: Update android build documentation	2016-07-04 20:27:27 -03:00
asciidoc.conf	…
asciidoctor-extensions.rb	perf Documentation: Support for asciidoctor	2018-04-26 13:47:10 -03:00
build-xed.txt	perf script: Add --insn-trace for instruction decoding	2018-10-24 15:29:50 -03:00
callchain-overhead-calculation.txt	perf tools: Document --children option in more detail	2015-04-29 10:38:06 -03:00
examples.txt	perf record: Remove -f/--force option	2013-07-08 17:37:25 -03:00
intel-bts.txt	perf tools: Add Intel BTS support	2015-08-21 11:34:10 -03:00
intel-pt.txt	perf scripts python: call-graph-from-sql.py: Rename to exported-sql-viewer.py	2018-10-23 14:26:44 -03:00
itrace.txt	perf script: Make itrace script default to all calls	2018-10-24 15:29:54 -03:00
jit-interface.txt	perf symbols: Add description of JIT interface	2012-08-13 14:55:02 -03:00
jitdump-specification.txt	perf jit: Add jitdump format specification document	2016-10-24 11:07:41 -03:00
manpage-1.72.xsl	…
manpage-base.xsl	…
manpage-bold-literal.xsl	…
manpage-normal.xsl	…
manpage-suppress-sp.xsl	…
perf-annotate.txt	perf annotate: Add --percent-type option	2018-08-08 15:55:53 -03:00
perf-archive.txt	perf archive: Remove duplicated 'runs' in man page	2013-12-09 15:21:45 -03:00
perf-bench.txt	perf bench: Add epoll_ctl(2) benchmark	2018-11-21 22:39:55 -03:00
perf-buildid-cache.txt	perf buildid-cache: Support --purge-all option	2018-04-26 09:30:26 -03:00
perf-buildid-list.txt	perf report: Accept fifos as input file	2011-12-23 17:01:03 -02:00
perf-c2c.txt	perf mem/c2c: Fix perf_mem_events to support powerpc	2019-02-04 11:32:14 -03:00
perf-config.txt	perf config: Fix an error in the config template documentation	2019-03-19 16:52:04 -03:00
perf-data.txt	perf tools: Correct title markers for asciidoctor	2018-03-07 10:26:32 -03:00
perf-diff.txt	perf diff: Support --pid/--tid filter options	2019-03-06 18:06:16 -03:00
perf-evlist.txt	perf evlist: Document missing --force option	2017-11-16 14:50:07 -03:00
perf-ftrace.txt	perf tools: Correct title markers for asciidoctor	2018-03-07 10:26:32 -03:00
perf-help.txt	…
perf-inject.txt	perf inject: Document missing options	2017-11-16 14:50:05 -03:00
perf-kallsyms.txt	perf tools: Correct title markers for asciidoctor	2018-03-07 10:26:32 -03:00
perf-kmem.txt	perf kmem: Document a missing option & an argument	2018-02-16 14:55:42 -03:00
perf-kvm.txt	perf tools: Configurable per thread proc map processing time out	2015-06-19 18:27:13 -03:00
perf-list.txt	perf tools Documentation: Fix diverse typos	2018-12-17 14:56:36 -03:00
perf-lock.txt	perf lock: Document missing options	2017-11-16 14:50:04 -03:00
perf-mem.txt	perf mem/c2c: Fix perf_mem_events to support powerpc	2019-02-04 11:32:14 -03:00
perf-probe.txt	perf probe: Support escaped character in parser	2017-12-27 12:15:55 -03:00
perf-record.txt	perf record: Implement --mmap-flush=<number> option	2019-04-01 15:18:10 -03:00
perf-report.txt	perf report: Implement browsing of individual samples	2019-03-11 16:33:19 -03:00
perf-sched.txt	perf sched: Fix documentation for timehist	2018-04-12 10:33:36 -03:00
perf-script-perl.txt	perf tools: Correct title markers for asciidoctor	2018-03-07 10:26:32 -03:00
perf-script-python.txt	perf script python: Add dict fields introduction to Documentation	2018-06-06 15:40:10 -03:00
perf-script.txt	perf script: Support relative time	2019-03-19 16:52:03 -03:00
perf-stat.txt	perf stat: Fix --no-scale	2019-03-19 16:52:03 -03:00
perf-test.txt	perf test: Add -F/--dont-fork option	2016-06-30 18:27:45 -03:00
perf-timechart.txt	perf timechart: Document missing --force option	2017-11-16 14:50:06 -03:00
perf-top.txt	perf top: Allow passing a kallsyms file	2018-12-17 14:54:40 -03:00
perf-trace.txt	perf trace: Allow dumping a BPF map after setting up BPF events	2019-02-19 16:35:45 -03:00
perf-version.txt	perf version: Add man page	2018-04-02 13:52:23 -03:00
perf.data-file-format.txt	perf doc: Fix documentation of the Flags section in perf.data	2019-02-19 13:39:12 -03:00
perf.txt	perf tools: Handle -h and -v options	2015-10-05 16:36:18 -03:00
perfconfig.example	perf config: Show default report configuration in example and docs	2016-09-01 09:44:13 -03:00
tips.txt	perf tools: Add some new tips describing the new options	2019-03-11 16:33:19 -03:00
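
The committer testing notes in the --mmap-flush commit message above report that a requested flush threshold is capped at a quarter of the mmap size. The sketch below only reproduces that arithmetic from the figures quoted in the message; it is not the perf implementation. The function names, the 4096-byte page size, and the "one extra page on top of the -m value" rule are assumptions chosen because they match the quoted "mmap size" values.

```python
PAGE_SIZE = 4096  # assumption: 4 KiB pages, consistent with the sizes quoted above


def mmap_size_bytes(data_pages: int, page_size: int = PAGE_SIZE) -> int:
    """Hypothetical helper: buffer size for a given -m/--mmap-pages value.

    Assumes one extra page on top of the data pages, which matches the
    quoted 'mmap size 8392704B' for '-m 2048'.
    """
    return (data_pages + 1) * page_size


def effective_flush(requested: int, mmap_size: int) -> int:
    """Cap the requested flush threshold at a quarter of the mmap size."""
    return min(requested, mmap_size // 4)


SIXTEEN_MB = 16 * 1024 * 1024

# Default-sized buffer quoted in the commit message (528384 bytes).
print(effective_flush(SIXTEEN_MB, 528384))

# Buffer created with '-m 2048'.
print(effective_flush(SIXTEEN_MB, mmap_size_bytes(2048)))

# The default threshold of 1 byte is far below the cap and passes through.
print(effective_flush(1, 528384))
```

Under these assumptions the first two calls give the "mmap flush" values shown by 'record -v' in the commit message (132096 and 2098176), which is why raising --mmap-flush further also requires raising -m/--mmap-pages.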