perf mem/c2c: Document that SPE is used for mem and c2c on ARM

author James Clark <james.clark@arm.com>

Tue, 24 Jan 2023 14:59:29 +0000 (14:59 +0000)

committer Arnaldo Carvalho de Melo <acme@redhat.com>

Fri, 27 Jan 2023 18:00:34 +0000 (15:00 -0300)
author James Clark <james.clark@arm.com>
Tue, 24 Jan 2023 14:59:29 +0000 (14:59 +0000)
committer Arnaldo Carvalho de Melo <acme@redhat.com>
Fri, 27 Jan 2023 18:00:34 +0000 (15:00 -0300)
diff --git a/tools/perf/Documentation/perf-c2c.txt b/tools/perf/Documentation/perf-c2c.txt

index af5c310..4e8c263 100644 (file)
--- a/tools/perf/Documentation/perf-c2c.txt
+++ b/tools/perf/Documentation/perf-c2c.txt
@@ -22,7 +22,11 @@ you to track down the cacheline contentions.
  On Intel, the tool is based on load latency and precise store facility events
  provided by Intel CPUs. On PowerPC, the tool uses random instruction sampling
  with thresholding feature. On AMD, the tool uses IBS op pmu (due to hardware
-limitations, perf c2c is not supported on Zen3 cpus).
+limitations, perf c2c is not supported on Zen3 cpus). On Arm64 it uses SPE to
+sample load and store operations, therefore hardware and kernel support is
+required. See linkperf:perf-arm-spe[1] for a setup guide. Due to the
+statistical nature of Arm SPE sampling, not every memory operation will be
+sampled.
  
  These events provide:
    - memory address of the access
@@ -333,4 +337,4 @@ Check Joe's blog on c2c tool for detailed use case explanation:
  
  SEE ALSO
  --------
-linkperf:perf-record[1], linkperf:perf-mem[1]
+linkperf:perf-record[1], linkperf:perf-mem[1], linkperf:perf-arm-spe[1]
diff --git a/tools/perf/Documentation/perf-mem.txt b/tools/perf/Documentation/perf-mem.txt

index 005c955..1986257 100644 (file)
--- a/tools/perf/Documentation/perf-mem.txt
+++ b/tools/perf/Documentation/perf-mem.txt
@@ -23,6 +23,11 @@ Note that on Intel systems the memory latency reported is the use-latency,
  not the pure load (or store latency). Use latency includes any pipeline
  queueing delays in addition to the memory subsystem latency.
  
+On Arm64 this uses SPE to sample load and store operations, therefore hardware
+and kernel support is required. See linkperf:perf-arm-spe[1] for a setup guide.
+Due to the statistical nature of SPE sampling, not every memory operation will
+be sampled.
+
  OPTIONS
  -------
  <command>...::
@@ -93,4 +98,4 @@ all perf record options.
  
  SEE ALSO
  --------
-linkperf:perf-record[1], linkperf:perf-report[1]
+linkperf:perf-record[1], linkperf:perf-report[1], linkperf:perf-arm-spe[1]
author	James Clark <james.clark@arm.com>
	Tue, 24 Jan 2023 14:59:29 +0000 (14:59 +0000)
committer	Arnaldo Carvalho de Melo <acme@redhat.com>
	Fri, 27 Jan 2023 18:00:34 +0000 (15:00 -0300)
tools/perf/Documentation/perf-c2c.txt		patch \| blob \| history
tools/perf/Documentation/perf-mem.txt		patch \| blob \| history