Merge tag 'loongarch-6.3' of git://git.kernel.org/pub/scm/linux/kernel/git/chenhuacai...
[platform/kernel/linux-starfive.git] / Documentation / trace / histogram-design.rst
1 .. SPDX-License-Identifier: GPL-2.0
2
3 ======================
4 Histogram Design Notes
5 ======================
6
7 :Author: Tom Zanussi <zanussi@kernel.org>
8
9 This document attempts to provide a description of how the ftrace
10 histograms work and how the individual pieces map to the data
11 structures used to implement them in trace_events_hist.c and
12 tracing_map.c.
13
14 Note: All the ftrace histogram command examples assume the working
15       directory is the ftrace /tracing directory. For example::
16
17         # cd /sys/kernel/tracing
18
19 Also, the histogram output displayed for those commands will be
20 generally be truncated - only enough to make the point is displayed.
21
22 'hist_debug' trace event files
23 ==============================
24
25 If the kernel is compiled with CONFIG_HIST_TRIGGERS_DEBUG set, an
26 event file named 'hist_debug' will appear in each event's
27 subdirectory.  This file can be read at any time and will display some
28 of the hist trigger internals described in this document. Specific
29 examples and output will be described in test cases below.
30
31 Basic histograms
32 ================
33
34 First, basic histograms.  Below is pretty much the simplest thing you
35 can do with histograms - create one with a single key on a single
36 event and cat the output::
37
38   # echo 'hist:keys=pid' >> events/sched/sched_waking/trigger
39
40   # cat events/sched/sched_waking/hist
41
42   { pid:      18249 } hitcount:          1
43   { pid:      13399 } hitcount:          1
44   { pid:      17973 } hitcount:          1
45   { pid:      12572 } hitcount:          1
46   ...
47   { pid:         10 } hitcount:        921
48   { pid:      18255 } hitcount:       1444
49   { pid:      25526 } hitcount:       2055
50   { pid:       5257 } hitcount:       2055
51   { pid:      27367 } hitcount:       2055
52   { pid:       1728 } hitcount:       2161
53
54   Totals:
55     Hits: 21305
56     Entries: 183
57     Dropped: 0
58
59 What this does is create a histogram on the sched_waking event using
60 pid as a key and with a single value, hitcount, which even if not
61 explicitly specified, exists for every histogram regardless.
62
63 The hitcount value is a per-bucket value that's automatically
64 incremented on every hit for the given key, which in this case is the
65 pid.
66
67 So in this histogram, there's a separate bucket for each pid, and each
68 bucket contains a value for that bucket, counting the number of times
69 sched_waking was called for that pid.
70
71 Each histogram is represented by a hist_data struct.
72
73 To keep track of each key and value field in the histogram, hist_data
74 keeps an array of these fields named fields[].  The fields[] array is
75 an array containing struct hist_field representations of each
76 histogram val and key in the histogram (variables are also included
77 here, but are discussed later). So for the above histogram we have one
78 key and one value; in this case the one value is the hitcount value,
79 which all histograms have, regardless of whether they define that
80 value or not, which the above histogram does not.
81
82 Each struct hist_field contains a pointer to the ftrace_event_field
83 from the event's trace_event_file along with various bits related to
84 that such as the size, offset, type, and a hist_field_fn_t function,
85 which is used to grab the field's data from the ftrace event buffer
86 (in most cases - some hist_fields such as hitcount don't directly map
87 to an event field in the trace buffer - in these cases the function
88 implementation gets its value from somewhere else).  The flags field
89 indicates which type of field it is - key, value, variable, variable
90 reference, etc., with value being the default.
91
92 The other important hist_data data structure in addition to the
93 fields[] array is the tracing_map instance created for the histogram,
94 which is held in the .map member.  The tracing_map implements the
95 lock-free hash table used to implement histograms (see
96 kernel/trace/tracing_map.h for much more discussion about the
97 low-level data structures implementing the tracing_map).  For the
98 purposes of this discussion, the tracing_map contains a number of
99 buckets, each bucket corresponding to a particular tracing_map_elt
100 object hashed by a given histogram key.
101
102 Below is a diagram the first part of which describes the hist_data and
103 associated key and value fields for the histogram described above.  As
104 you can see, there are two fields in the fields array, one val field
105 for the hitcount and one key field for the pid key.
106
107 Below that is a diagram of a run-time snapshot of what the tracing_map
108 might look like for a given run.  It attempts to show the
109 relationships between the hist_data fields and the tracing_map
110 elements for a couple hypothetical keys and values.::
111
112   +------------------+
113   | hist_data        |
114   +------------------+     +----------------+
115     | .fields[]      |---->| val = hitcount |----------------------------+
116     +----------------+     +----------------+                            |
117     | .map           |       | .size        |                            |
118     +----------------+       +--------------+                            |
119                              | .offset      |                            |
120                              +--------------+                            |
121                              | .fn()        |                            |
122                              +--------------+                            |
123                                    .                                     |
124                                    .                                     |
125                                    .                                     |
126                            +----------------+ <--- n_vals                |
127                            | key = pid      |----------------------------|--+
128                            +----------------+                            |  |
129                              | .size        |                            |  |
130                              +--------------+                            |  |
131                              | .offset      |                            |  |
132                              +--------------+                            |  |
133                              | .fn()        |                            |  |
134                            +----------------+ <--- n_fields              |  |
135                            | unused         |                            |  |
136                            +----------------+                            |  |
137                              |              |                            |  |
138                              +--------------+                            |  |
139                              |              |                            |  |
140                              +--------------+                            |  |
141                              |              |                            |  |
142                              +--------------+                            |  |
143                                             n_keys = n_fields - n_vals   |  |
144
145 The hist_data n_vals and n_fields delineate the extent of the fields[]   |  |
146 array and separate keys from values for the rest of the code.            |  |
147
148 Below is a run-time representation of the tracing_map part of the        |  |
149 histogram, with pointers from various parts of the fields[] array        |  |
150 to corresponding parts of the tracing_map.                               |  |
151
152 The tracing_map consists of an array of tracing_map_entrys and a set     |  |
153 of preallocated tracing_map_elts (abbreviated below as map_entry and     |  |
154 map_elt).  The total number of map_entrys in the hist_data.map array =   |  |
155 map->max_elts (actually map->map_size but only max_elts of those are     |  |
156 used.  This is a property required by the map_insert() algorithm).       |  |
157
158 If a map_entry is unused, meaning no key has yet hashed into it, its     |  |
159 .key value is 0 and its .val pointer is NULL.  Once a map_entry has      |  |
160 been claimed, the .key value contains the key's hash value and the       |  |
161 .val member points to a map_elt containing the full key and an entry     |  |
162 for each key or value in the map_elt.fields[] array.  There is an        |  |
163 entry in the map_elt.fields[] array corresponding to each hist_field     |  |
164 in the histogram, and this is where the continually aggregated sums      |  |
165 corresponding to each histogram value are kept.                          |  |
166
167 The diagram attempts to show the relationship between the                |  |
168 hist_data.fields[] and the map_elt.fields[] with the links drawn         |  |
169 between diagrams::
170
171   +-----------+                                                          |  |
172   | hist_data |                                                          |  |
173   +-----------+                                                          |  |
174     | .fields |                                                          |  |
175     +---------+     +-----------+                                        |  |
176     | .map    |---->| map_entry |                                        |  |
177     +---------+     +-----------+                                        |  |
178                       | .key    |---> 0                                  |  |
179                       +---------+                                        |  |
180                       | .val    |---> NULL                               |  |
181                     +-----------+                                        |  |
182                     | map_entry |                                        |  |
183                     +-----------+                                        |  |
184                       | .key    |---> pid = 999                          |  |
185                       +---------+    +-----------+                       |  |
186                       | .val    |--->| map_elt   |                       |  |
187                       +---------+    +-----------+                       |  |
188                            .           | .key    |---> full key *        |  |
189                            .           +---------+    +---------------+  |  |
190                            .           | .fields |--->| .sum (val)    |<-+  |
191                     +-----------+      +---------+    | 2345          |  |  |
192                     | map_entry |                     +---------------+  |  |
193                     +-----------+                     | .offset (key) |<----+
194                       | .key    |---> 0               | 0             |  |  |
195                       +---------+                     +---------------+  |  |
196                       | .val    |---> NULL                    .          |  |
197                     +-----------+                             .          |  |
198                     | map_entry |                             .          |  |
199                     +-----------+                     +---------------+  |  |
200                       | .key    |                     | .sum (val) or |  |  |
201                       +---------+    +---------+      | .offset (key) |  |  |
202                       | .val    |--->| map_elt |      +---------------+  |  |
203                     +-----------+    +---------+      | .sum (val) or |  |  |
204                     | map_entry |                     | .offset (key) |  |  |
205                     +-----------+                     +---------------+  |  |
206                       | .key    |---> pid = 4444                         |  |
207                       +---------+    +-----------+                       |  |
208                       | .val    |    | map_elt   |                       |  |
209                       +---------+    +-----------+                       |  |
210                                        | .key    |---> full key *        |  |
211                                        +---------+    +---------------+  |  |
212                                        | .fields |--->| .sum (val)    |<-+  |
213                                        +---------+    | 65523         |     |
214                                                       +---------------+     |
215                                                       | .offset (key) |<----+
216                                                       | 0             |
217                                                       +---------------+
218                                                               .
219                                                               .
220                                                               .
221                                                       +---------------+
222                                                       | .sum (val) or |
223                                                       | .offset (key) |
224                                                       +---------------+
225                                                       | .sum (val) or |
226                                                       | .offset (key) |
227                                                       +---------------+
228
229 Abbreviations used in the diagrams::
230
231   hist_data = struct hist_trigger_data
232   hist_data.fields = struct hist_field
233   fn = hist_field_fn_t
234   map_entry = struct tracing_map_entry
235   map_elt = struct tracing_map_elt
236   map_elt.fields = struct tracing_map_field
237
238 Whenever a new event occurs and it has a hist trigger associated with
239 it, event_hist_trigger() is called.  event_hist_trigger() first deals
240 with the key: for each subkey in the key (in the above example, there
241 is just one subkey corresponding to pid), the hist_field that
242 represents that subkey is retrieved from hist_data.fields[] and the
243 hist_field_fn_t fn() associated with that field, along with the
244 field's size and offset, is used to grab that subkey's data from the
245 current trace record.
246
247 Once the complete key has been retrieved, it's used to look that key
248 up in the tracing_map.  If there's no tracing_map_elt associated with
249 that key, an empty one is claimed and inserted in the map for the new
250 key.  In either case, the tracing_map_elt associated with that key is
251 returned.
252
253 Once a tracing_map_elt available, hist_trigger_elt_update() is called.
254 As the name implies, this updates the element, which basically means
255 updating the element's fields.  There's a tracing_map_field associated
256 with each key and value in the histogram, and each of these correspond
257 to the key and value hist_fields created when the histogram was
258 created.  hist_trigger_elt_update() goes through each value hist_field
259 and, as for the keys, uses the hist_field's fn() and size and offset
260 to grab the field's value from the current trace record.  Once it has
261 that value, it simply adds that value to that field's
262 continually-updated tracing_map_field.sum member.  Some hist_field
263 fn()s, such as for the hitcount, don't actually grab anything from the
264 trace record (the hitcount fn() just increments the counter sum by 1),
265 but the idea is the same.
266
267 Once all the values have been updated, hist_trigger_elt_update() is
268 done and returns.  Note that there are also tracing_map_fields for
269 each subkey in the key, but hist_trigger_elt_update() doesn't look at
270 them or update anything - those exist only for sorting, which can
271 happen later.
272
273 Basic histogram test
274 --------------------
275
276 This is a good example to try.  It produces 3 value fields and 2 key
277 fields in the output::
278
279   # echo 'hist:keys=common_pid,call_site.sym:values=bytes_req,bytes_alloc,hitcount' >> events/kmem/kmalloc/trigger
280
281 To see the debug data, cat the kmem/kmalloc's 'hist_debug' file. It
282 will show the trigger info of the histogram it corresponds to, along
283 with the address of the hist_data associated with the histogram, which
284 will become useful in later examples.  It then displays the number of
285 total hist_fields associated with the histogram along with a count of
286 how many of those correspond to keys and how many correspond to values.
287
288 It then goes on to display details for each field, including the
289 field's flags and the position of each field in the hist_data's
290 fields[] array, which is useful information for verifying that things
291 internally appear correct or not, and which again will become even
292 more useful in further examples::
293
294   # cat events/kmem/kmalloc/hist_debug
295
296   # event histogram
297   #
298   # trigger info: hist:keys=common_pid,call_site.sym:vals=hitcount,bytes_req,bytes_alloc:sort=hitcount:size=2048 [active]
299   #
300
301   hist_data: 000000005e48c9a5
302
303   n_vals: 3
304   n_keys: 2
305   n_fields: 5
306
307   val fields:
308
309     hist_data->fields[0]:
310       flags:
311         VAL: HIST_FIELD_FL_HITCOUNT
312       type: u64
313       size: 8
314       is_signed: 0
315
316     hist_data->fields[1]:
317       flags:
318         VAL: normal u64 value
319       ftrace_event_field name: bytes_req
320       type: size_t
321       size: 8
322       is_signed: 0
323
324     hist_data->fields[2]:
325       flags:
326         VAL: normal u64 value
327       ftrace_event_field name: bytes_alloc
328       type: size_t
329       size: 8
330       is_signed: 0
331
332   key fields:
333
334     hist_data->fields[3]:
335       flags:
336         HIST_FIELD_FL_KEY
337       ftrace_event_field name: common_pid
338       type: int
339       size: 8
340       is_signed: 1
341
342     hist_data->fields[4]:
343       flags:
344         HIST_FIELD_FL_KEY
345       ftrace_event_field name: call_site
346       type: unsigned long
347       size: 8
348       is_signed: 0
349
350 The commands below can be used to clean things up for the next test::
351
352   # echo '!hist:keys=common_pid,call_site.sym:values=bytes_req,bytes_alloc,hitcount' >> events/kmem/kmalloc/trigger
353
354 Variables
355 =========
356
357 Variables allow data from one hist trigger to be saved by one hist
358 trigger and retrieved by another hist trigger.  For example, a trigger
359 on the sched_waking event can capture a timestamp for a particular
360 pid, and later a sched_switch event that switches to that pid event
361 can grab the timestamp and use it to calculate a time delta between
362 the two events::
363
364   # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >>
365           events/sched/sched_waking/trigger
366
367   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >>
368           events/sched/sched_switch/trigger
369
370 In terms of the histogram data structures, variables are implemented
371 as another type of hist_field and for a given hist trigger are added
372 to the hist_data.fields[] array just after all the val fields.  To
373 distinguish them from the existing key and val fields, they're given a
374 new flag type, HIST_FIELD_FL_VAR (abbreviated FL_VAR) and they also
375 make use of a new .var.idx field member in struct hist_field, which
376 maps them to an index in a new map_elt.vars[] array added to the
377 map_elt specifically designed to store and retrieve variable values.
378 The diagram below shows those new elements and adds a new variable
379 entry, ts0, corresponding to the ts0 variable in the sched_waking
380 trigger above.
381
382 sched_waking histogram
383 ----------------------::
384
385   +------------------+
386   | hist_data        |<-------------------------------------------------------+
387   +------------------+   +-------------------+                                |
388     | .fields[]      |-->| val = hitcount    |                                |
389     +----------------+   +-------------------+                                |
390     | .map           |     | .size           |                                |
391     +----------------+     +-----------------+                                |
392                            | .offset         |                                |
393                            +-----------------+                                |
394                            | .fn()           |                                |
395                            +-----------------+                                |
396                            | .flags          |                                |
397                            +-----------------+                                |
398                            | .var.idx        |                                |
399                          +-------------------+                                |
400                          | var = ts0         |                                |
401                          +-------------------+                                |
402                            | .size           |                                |
403                            +-----------------+                                |
404                            | .offset         |                                |
405                            +-----------------+                                |
406                            | .fn()           |                                |
407                            +-----------------+                                |
408                            | .flags & FL_VAR |                                |
409                            +-----------------+                                |
410                            | .var.idx        |----------------------------+-+ |
411                            +-----------------+                            | | |
412                                     .                                     | | |
413                                     .                                     | | |
414                                     .                                     | | |
415                          +-------------------+ <--- n_vals                | | |
416                          | key = pid         |                            | | |
417                          +-------------------+                            | | |
418                            | .size           |                            | | |
419                            +-----------------+                            | | |
420                            | .offset         |                            | | |
421                            +-----------------+                            | | |
422                            | .fn()           |                            | | |
423                            +-----------------+                            | | |
424                            | .flags & FL_KEY |                            | | |
425                            +-----------------+                            | | |
426                            | .var.idx        |                            | | |
427                          +-------------------+ <--- n_fields              | | |
428                          | unused            |                            | | |
429                          +-------------------+                            | | |
430                            |                 |                            | | |
431                            +-----------------+                            | | |
432                            |                 |                            | | |
433                            +-----------------+                            | | |
434                            |                 |                            | | |
435                            +-----------------+                            | | |
436                            |                 |                            | | |
437                            +-----------------+                            | | |
438                            |                 |                            | | |
439                            +-----------------+                            | | |
440                                              n_keys = n_fields - n_vals   | | |
441                                                                           | | |
442
443 This is very similar to the basic case.  In the above diagram, we can     | | |
444 see a new .flags member has been added to the struct hist_field           | | |
445 struct, and a new entry added to hist_data.fields representing the ts0    | | |
446 variable.  For a normal val hist_field, .flags is just 0 (modulo          | | |
447 modifier flags), but if the value is defined as a variable, the .flags    | | |
448 contains a set FL_VAR bit.                                                | | |
449
450 As you can see, the ts0 entry's .var.idx member contains the index        | | |
451 into the tracing_map_elts' .vars[] array containing variable values.      | | |
452 This idx is used whenever the value of the variable is set or read.       | | |
453 The map_elt.vars idx assigned to the given variable is assigned and       | | |
454 saved in .var.idx by create_tracing_map_fields() after it calls           | | |
455 tracing_map_add_var().                                                    | | |
456
457 Below is a representation of the histogram at run-time, which             | | |
458 populates the map, along with correspondence to the above hist_data and   | | |
459 hist_field data structures.                                               | | |
460
461 The diagram attempts to show the relationship between the                 | | |
462 hist_data.fields[] and the map_elt.fields[] and map_elt.vars[] with       | | |
463 the links drawn between diagrams.  For each of the map_elts, you can      | | |
464 see that the .fields[] members point to the .sum or .offset of a key      | | |
465 or val and the .vars[] members point to the value of a variable.  The     | | |
466 arrows between the two diagrams show the linkages between those           | | |
467 tracing_map members and the field definitions in the corresponding        | | |
468 hist_data fields[] members.::
469
470   +-----------+                                                           | | |
471   | hist_data |                                                           | | |
472   +-----------+                                                           | | |
473     | .fields |                                                           | | |
474     +---------+     +-----------+                                         | | |
475     | .map    |---->| map_entry |                                         | | |
476     +---------+     +-----------+                                         | | |
477                       | .key    |---> 0                                   | | |
478                       +---------+                                         | | |
479                       | .val    |---> NULL                                | | |
480                     +-----------+                                         | | |
481                     | map_entry |                                         | | |
482                     +-----------+                                         | | |
483                       | .key    |---> pid = 999                           | | |
484                       +---------+    +-----------+                        | | |
485                       | .val    |--->| map_elt   |                        | | |
486                       +---------+    +-----------+                        | | |
487                            .           | .key    |---> full key *         | | |
488                            .           +---------+    +---------------+   | | |
489                            .           | .fields |--->| .sum (val)    |   | | |
490                            .           +---------+    | 2345          |   | | |
491                            .        +--| .vars   |    +---------------+   | | |
492                            .        |  +---------+    | .offset (key) |   | | |
493                            .        |                 | 0             |   | | |
494                            .        |                 +---------------+   | | |
495                            .        |                         .           | | |
496                            .        |                         .           | | |
497                            .        |                         .           | | |
498                            .        |                 +---------------+   | | |
499                            .        |                 | .sum (val) or |   | | |
500                            .        |                 | .offset (key) |   | | |
501                            .        |                 +---------------+   | | |
502                            .        |                 | .sum (val) or |   | | |
503                            .        |                 | .offset (key) |   | | |
504                            .        |                 +---------------+   | | |
505                            .        |                                     | | |
506                            .        +---------------->+---------------+   | | |
507                            .                          | ts0           |<--+ | |
508                            .                          | 113345679876  |   | | |
509                            .                          +---------------+   | | |
510                            .                          | unused        |   | | |
511                            .                          |               |   | | |
512                            .                          +---------------+   | | |
513                            .                                  .           | | |
514                            .                                  .           | | |
515                            .                                  .           | | |
516                            .                          +---------------+   | | |
517                            .                          | unused        |   | | |
518                            .                          |               |   | | |
519                            .                          +---------------+   | | |
520                            .                          | unused        |   | | |
521                            .                          |               |   | | |
522                            .                          +---------------+   | | |
523                            .                                              | | |
524                     +-----------+                                         | | |
525                     | map_entry |                                         | | |
526                     +-----------+                                         | | |
527                       | .key    |---> pid = 4444                          | | |
528                       +---------+    +-----------+                        | | |
529                       | .val    |--->| map_elt   |                        | | |
530                       +---------+    +-----------+                        | | |
531                            .           | .key    |---> full key *         | | |
532                            .           +---------+    +---------------+   | | |
533                            .           | .fields |--->| .sum (val)    |   | | |
534                                        +---------+    | 2345          |   | | |
535                                     +--| .vars   |    +---------------+   | | |
536                                     |  +---------+    | .offset (key) |   | | |
537                                     |                 | 0             |   | | |
538                                     |                 +---------------+   | | |
539                                     |                         .           | | |
540                                     |                         .           | | |
541                                     |                         .           | | |
542                                     |                 +---------------+   | | |
543                                     |                 | .sum (val) or |   | | |
544                                     |                 | .offset (key) |   | | |
545                                     |                 +---------------+   | | |
546                                     |                 | .sum (val) or |   | | |
547                                     |                 | .offset (key) |   | | |
548                                     |                 +---------------+   | | |
549                                     |                                     | | |
550                                     |                 +---------------+   | | |
551                                     +---------------->| ts0           |<--+ | |
552                                                       | 213499240729  |     | |
553                                                       +---------------+     | |
554                                                       | unused        |     | |
555                                                       |               |     | |
556                                                       +---------------+     | |
557                                                               .             | |
558                                                               .             | |
559                                                               .             | |
560                                                       +---------------+     | |
561                                                       | unused        |     | |
562                                                       |               |     | |
563                                                       +---------------+     | |
564                                                       | unused        |     | |
565                                                       |               |     | |
566                                                       +---------------+     | |
567
568 For each used map entry, there's a map_elt pointing to an array of          | |
569 .vars containing the current value of the variables associated with         | |
570 that histogram entry.  So in the above, the timestamp associated with       | |
571 pid 999 is 113345679876, and the timestamp variable in the same             | |
572 .var.idx for pid 4444 is 213499240729.                                      | |
573
574 sched_switch histogram                                                      | |
575 ----------------------                                                      | |
576
577 The sched_switch histogram paired with the above sched_waking               | |
578 histogram is shown below.  The most important aspect of the                 | |
579 sched_switch histogram is that it references a variable on the              | |
580 sched_waking histogram above.                                               | |
581
582 The histogram diagram is very similar to the others so far displayed,       | |
583 but it adds variable references.  You can see the normal hitcount and       | |
584 key fields along with a new wakeup_lat variable implemented in the          | |
585 same way as the sched_waking ts0 variable, but in addition there's an       | |
586 entry with the new FL_VAR_REF (short for HIST_FIELD_FL_VAR_REF) flag.       | |
587
588 Associated with the new var ref field are a couple of new hist_field        | |
589 members, var.hist_data and var_ref_idx.  For a variable reference, the      | |
590 var.hist_data goes with the var.idx, which together uniquely identify       | |
591 a particular variable on a particular histogram.  The var_ref_idx is        | |
592 just the index into the var_ref_vals[] array that caches the values of      | |
593 each variable whenever a hist trigger is updated.  Those resulting          | |
594 values are then finally accessed by other code such as trace action         | |
595 code that uses the var_ref_idx values to assign param values.               | |
596
597 The diagram below describes the situation for the sched_switch              | |
598 histogram referred to before::
599
600   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >>     | |
601           events/sched/sched_switch/trigger                                 | |
602                                                                             | |
603   +------------------+                                                      | |
604   | hist_data        |                                                      | |
605   +------------------+   +-----------------------+                          | |
606     | .fields[]      |-->| val = hitcount        |                          | |
607     +----------------+   +-----------------------+                          | |
608     | .map           |     | .size               |                          | |
609     +----------------+     +---------------------+                          | |
610  +--| .var_refs[]    |     | .offset             |                          | |
611  |  +----------------+     +---------------------+                          | |
612  |                         | .fn()               |                          | |
613  |   var_ref_vals[]        +---------------------+                          | |
614  |  +-------------+        | .flags              |                          | |
615  |  | $ts0        |<---+   +---------------------+                          | |
616  |  +-------------+    |   | .var.idx            |                          | |
617  |  |             |    |   +---------------------+                          | |
618  |  +-------------+    |   | .var.hist_data      |                          | |
619  |  |             |    |   +---------------------+                          | |
620  |  +-------------+    |   | .var_ref_idx        |                          | |
621  |  |             |    | +-----------------------+                          | |
622  |  +-------------+    | | var = wakeup_lat      |                          | |
623  |         .           | +-----------------------+                          | |
624  |         .           |   | .size               |                          | |
625  |         .           |   +---------------------+                          | |
626  |  +-------------+    |   | .offset             |                          | |
627  |  |             |    |   +---------------------+                          | |
628  |  +-------------+    |   | .fn()               |                          | |
629  |  |             |    |   +---------------------+                          | |
630  |  +-------------+    |   | .flags & FL_VAR     |                          | |
631  |                     |   +---------------------+                          | |
632  |                     |   | .var.idx            |                          | |
633  |                     |   +---------------------+                          | |
634  |                     |   | .var.hist_data      |                          | |
635  |                     |   +---------------------+                          | |
636  |                     |   | .var_ref_idx        |                          | |
637  |                     |   +---------------------+                          | |
638  |                     |             .                                      | |
639  |                     |             .                                      | |
640  |                     |             .                                      | |
641  |                     | +-----------------------+ <--- n_vals              | |
642  |                     | | key = pid             |                          | |
643  |                     | +-----------------------+                          | |
644  |                     |   | .size               |                          | |
645  |                     |   +---------------------+                          | |
646  |                     |   | .offset             |                          | |
647  |                     |   +---------------------+                          | |
648  |                     |   | .fn()               |                          | |
649  |                     |   +---------------------+                          | |
650  |                     |   | .flags              |                          | |
651  |                     |   +---------------------+                          | |
652  |                     |   | .var.idx            |                          | |
653  |                     | +-----------------------+ <--- n_fields            | |
654  |                     | | unused                |                          | |
655  |                     | +-----------------------+                          | |
656  |                     |   |                     |                          | |
657  |                     |   +---------------------+                          | |
658  |                     |   |                     |                          | |
659  |                     |   +---------------------+                          | |
660  |                     |   |                     |                          | |
661  |                     |   +---------------------+                          | |
662  |                     |   |                     |                          | |
663  |                     |   +---------------------+                          | |
664  |                     |   |                     |                          | |
665  |                     |   +---------------------+                          | |
666  |                     |                         n_keys = n_fields - n_vals | |
667  |                     |                                                    | |
668  |                     |                                                    | |
669  |                     | +-----------------------+                          | |
670  +---------------------->| var_ref = $ts0        |                          | |
671                        | +-----------------------+                          | |
672                        |   | .size               |                          | |
673                        |   +---------------------+                          | |
674                        |   | .offset             |                          | |
675                        |   +---------------------+                          | |
676                        |   | .fn()               |                          | |
677                        |   +---------------------+                          | |
678                        |   | .flags & FL_VAR_REF |                          | |
679                        |   +---------------------+                          | |
680                        |   | .var.idx            |--------------------------+ |
681                        |   +---------------------+                            |
682                        |   | .var.hist_data      |----------------------------+
683                        |   +---------------------+
684                        +---| .var_ref_idx        |
685                            +---------------------+
686
687 Abbreviations used in the diagrams::
688
689   hist_data = struct hist_trigger_data
690   hist_data.fields = struct hist_field
691   fn = hist_field_fn_t
692   FL_KEY = HIST_FIELD_FL_KEY
693   FL_VAR = HIST_FIELD_FL_VAR
694   FL_VAR_REF = HIST_FIELD_FL_VAR_REF
695
696 When a hist trigger makes use of a variable, a new hist_field is
697 created with flag HIST_FIELD_FL_VAR_REF.  For a VAR_REF field, the
698 var.idx and var.hist_data take the same values as the referenced
699 variable, as well as the referenced variable's size, type, and
700 is_signed values.  The VAR_REF field's .name is set to the name of the
701 variable it references.  If a variable reference was created using the
702 explicit system.event.$var_ref notation, the hist_field's system and
703 event_name variables are also set.
704
705 So, in order to handle an event for the sched_switch histogram,
706 because we have a reference to a variable on another histogram, we
707 need to resolve all variable references first.  This is done via the
708 resolve_var_refs() calls made from event_hist_trigger().  What this
709 does is grabs the var_refs[] array from the hist_data representing the
710 sched_switch histogram.  For each one of those, the referenced
711 variable's var.hist_data along with the current key is used to look up
712 the corresponding tracing_map_elt in that histogram.  Once found, the
713 referenced variable's var.idx is used to look up the variable's value
714 using tracing_map_read_var(elt, var.idx), which yields the value of
715 the variable for that element, ts0 in the case above.  Note that both
716 the hist_fields representing both the variable and the variable
717 reference have the same var.idx, so this is straightforward.
718
719 Variable and variable reference test
720 ------------------------------------
721
722 This example creates a variable on the sched_waking event, ts0, and
723 uses it in the sched_switch trigger.  The sched_switch trigger also
724 creates its own variable, wakeup_lat, but nothing yet uses it::
725
726   # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
727
728   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >> events/sched/sched_switch/trigger
729
730 Looking at the sched_waking 'hist_debug' output, in addition to the
731 normal key and value hist_fields, in the val fields section we see a
732 field with the HIST_FIELD_FL_VAR flag, which indicates that that field
733 represents a variable.  Note that in addition to the variable name,
734 contained in the var.name field, it includes the var.idx, which is the
735 index into the tracing_map_elt.vars[] array of the actual variable
736 location.  Note also that the output shows that variables live in the
737 same part of the hist_data->fields[] array as normal values::
738
739   # cat events/sched/sched_waking/hist_debug
740
741   # event histogram
742   #
743   # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
744   #
745
746   hist_data: 000000009536f554
747
748   n_vals: 2
749   n_keys: 1
750   n_fields: 3
751
752   val fields:
753
754     hist_data->fields[0]:
755       flags:
756         VAL: HIST_FIELD_FL_HITCOUNT
757       type: u64
758       size: 8
759       is_signed: 0
760
761     hist_data->fields[1]:
762       flags:
763         HIST_FIELD_FL_VAR
764       var.name: ts0
765       var.idx (into tracing_map_elt.vars[]): 0
766       type: u64
767       size: 8
768       is_signed: 0
769
770   key fields:
771
772     hist_data->fields[2]:
773       flags:
774         HIST_FIELD_FL_KEY
775       ftrace_event_field name: pid
776       type: pid_t
777       size: 8
778       is_signed: 1
779
780 Moving on to the sched_switch trigger hist_debug output, in addition
781 to the unused wakeup_lat variable, we see a new section displaying
782 variable references.  Variable references are displayed in a separate
783 section because in addition to being logically separate from
784 variables and values, they actually live in a separate hist_data
785 array, var_refs[].
786
787 In this example, the sched_switch trigger has a reference to a
788 variable on the sched_waking trigger, $ts0.  Looking at the details,
789 we can see that the var.hist_data value of the referenced variable
790 matches the previously displayed sched_waking trigger, and the var.idx
791 value matches the previously displayed var.idx value for that
792 variable.  Also displayed is the var_ref_idx value for that variable
793 reference, which is where the value for that variable is cached for
794 use when the trigger is invoked::
795
796   # cat events/sched/sched_switch/hist_debug
797
798   # event histogram
799   #
800   # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global [active]
801   #
802
803   hist_data: 00000000f4ee8006
804
805   n_vals: 2
806   n_keys: 1
807   n_fields: 3
808
809   val fields:
810
811     hist_data->fields[0]:
812       flags:
813         VAL: HIST_FIELD_FL_HITCOUNT
814       type: u64
815       size: 8
816       is_signed: 0
817
818     hist_data->fields[1]:
819       flags:
820         HIST_FIELD_FL_VAR
821       var.name: wakeup_lat
822       var.idx (into tracing_map_elt.vars[]): 0
823       type: u64
824       size: 0
825       is_signed: 0
826
827   key fields:
828
829     hist_data->fields[2]:
830       flags:
831         HIST_FIELD_FL_KEY
832       ftrace_event_field name: next_pid
833       type: pid_t
834       size: 8
835       is_signed: 1
836
837   variable reference fields:
838
839     hist_data->var_refs[0]:
840       flags:
841         HIST_FIELD_FL_VAR_REF
842       name: ts0
843       var.idx (into tracing_map_elt.vars[]): 0
844       var.hist_data: 000000009536f554
845       var_ref_idx (into hist_data->var_refs[]): 0
846       type: u64
847       size: 8
848       is_signed: 0
849
850 The commands below can be used to clean things up for the next test::
851
852   # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0' >> events/sched/sched_switch/trigger
853
854   # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
855
856 Actions and Handlers
857 ====================
858
859 Adding onto the previous example, we will now do something with that
860 wakeup_lat variable, namely send it and another field as a synthetic
861 event.
862
863 The onmatch() action below basically says that whenever we have a
864 sched_switch event, if we have a matching sched_waking event, in this
865 case if we have a pid in the sched_waking histogram that matches the
866 next_pid field on this sched_switch event, we retrieve the
867 variables specified in the wakeup_latency() trace action, and use
868 them to generate a new wakeup_latency event into the trace stream.
869
870 Note that the way the trace handlers such as wakeup_latency() (which
871 could equivalently be written trace(wakeup_latency,$wakeup_lat,next_pid)
872 are implemented, the parameters specified to the trace handler must be
873 variables.  In this case, $wakeup_lat is obviously a variable, but
874 next_pid isn't, since it's just naming a field in the sched_switch
875 trace event.  Since this is something that almost every trace() and
876 save() action does, a special shortcut is implemented to allow field
877 names to be used directly in those cases.  How it works is that under
878 the covers, a temporary variable is created for the named field, and
879 this variable is what is actually passed to the trace handler.  In the
880 code and documentation, this type of variable is called a 'field
881 variable'.
882
883 Fields on other trace event's histograms can be used as well.  In that
884 case we have to generate a new histogram and an unfortunately named
885 'synthetic_field' (the use of synthetic here has nothing to do with
886 synthetic events) and use that special histogram field as a variable.
887
888 The diagram below illustrates the new elements described above in the
889 context of the sched_switch histogram using the onmatch() handler and
890 the trace() action.
891
892 First, we define the wakeup_latency synthetic event::
893
894   # echo 'wakeup_latency u64 lat; pid_t pid' >> synthetic_events
895
896 Next, the sched_waking hist trigger as before::
897
898   # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >>
899           events/sched/sched_waking/trigger
900
901 Finally, we create a hist trigger on the sched_switch event that
902 generates a wakeup_latency() trace event.  In this case we pass
903 next_pid into the wakeup_latency synthetic event invocation, which
904 means it will be automatically converted into a field variable::
905
906   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0: \
907           onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid)' >>
908           /sys/kernel/tracing/events/sched/sched_switch/trigger
909
910 The diagram for the sched_switch event is similar to previous examples
911 but shows the additional field_vars[] array for hist_data and shows
912 the linkages between the field_vars and the variables and references
913 created to implement the field variables.  The details are discussed
914 below::
915
916     +------------------+
917     | hist_data        |
918     +------------------+   +-----------------------+
919       | .fields[]      |-->| val = hitcount        |
920       +----------------+   +-----------------------+
921       | .map           |     | .size               |
922       +----------------+     +---------------------+
923   +---| .field_vars[]  |     | .offset             |
924   |   +----------------+     +---------------------+
925   |+--| .var_refs[]    |     | .offset             |
926   ||  +----------------+     +---------------------+
927   ||                         | .fn()               |
928   ||   var_ref_vals[]        +---------------------+
929   ||  +-------------+        | .flags              |
930   ||  | $ts0        |<---+   +---------------------+
931   ||  +-------------+    |   | .var.idx            |
932   ||  | $next_pid   |<-+ |   +---------------------+
933   ||  +-------------+  | |   | .var.hist_data      |
934   ||+>| $wakeup_lat |  | |   +---------------------+
935   ||| +-------------+  | |   | .var_ref_idx        |
936   ||| |             |  | | +-----------------------+
937   ||| +-------------+  | | | var = wakeup_lat      |
938   |||        .         | | +-----------------------+
939   |||        .         | |   | .size               |
940   |||        .         | |   +---------------------+
941   ||| +-------------+  | |   | .offset             |
942   ||| |             |  | |   +---------------------+
943   ||| +-------------+  | |   | .fn()               |
944   ||| |             |  | |   +---------------------+
945   ||| +-------------+  | |   | .flags & FL_VAR     |
946   |||                  | |   +---------------------+
947   |||                  | |   | .var.idx            |
948   |||                  | |   +---------------------+
949   |||                  | |   | .var.hist_data      |
950   |||                  | |   +---------------------+
951   |||                  | |   | .var_ref_idx        |
952   |||                  | |   +---------------------+
953   |||                  | |              .
954   |||                  | |              .
955   |||                  | |              .
956   |||                  | |              .
957   ||| +--------------+ | |              .
958   +-->| field_var    | | |              .
959    || +--------------+ | |              .
960    ||   | var        | | |              .
961    ||   +------------+ | |              .
962    ||   | val        | | |              .
963    || +--------------+ | |              .
964    || | field_var    | | |              .
965    || +--------------+ | |              .
966    ||   | var        | | |              .
967    ||   +------------+ | |              .
968    ||   | val        | | |              .
969    ||   +------------+ | |              .
970    ||         .        | |              .
971    ||         .        | |              .
972    ||         .        | | +-----------------------+ <--- n_vals
973    || +--------------+ | | | key = pid             |
974    || | field_var    | | | +-----------------------+
975    || +--------------+ | |   | .size               |
976    ||   | var        |--+|   +---------------------+
977    ||   +------------+ |||   | .offset             |
978    ||   | val        |-+||   +---------------------+
979    ||   +------------+ |||   | .fn()               |
980    ||                  |||   +---------------------+
981    ||                  |||   | .flags              |
982    ||                  |||   +---------------------+
983    ||                  |||   | .var.idx            |
984    ||                  |||   +---------------------+ <--- n_fields
985    ||                  |||
986    ||                  |||                           n_keys = n_fields - n_vals
987    ||                  ||| +-----------------------+
988    ||                  |+->| var = next_pid        |
989    ||                  | | +-----------------------+
990    ||                  | |   | .size               |
991    ||                  | |   +---------------------+
992    ||                  | |   | .offset             |
993    ||                  | |   +---------------------+
994    ||                  | |   | .flags & FL_VAR     |
995    ||                  | |   +---------------------+
996    ||                  | |   | .var.idx            |
997    ||                  | |   +---------------------+
998    ||                  | |   | .var.hist_data      |
999    ||                  | | +-----------------------+
1000    ||                  +-->| val for next_pid      |
1001    ||                  | | +-----------------------+
1002    ||                  | |   | .size               |
1003    ||                  | |   +---------------------+
1004    ||                  | |   | .offset             |
1005    ||                  | |   +---------------------+
1006    ||                  | |   | .fn()               |
1007    ||                  | |   +---------------------+
1008    ||                  | |   | .flags              |
1009    ||                  | |   +---------------------+
1010    ||                  | |   |                     |
1011    ||                  | |   +---------------------+
1012    ||                  | |
1013    ||                  | |
1014    ||                  | | +-----------------------+
1015    +|------------------|-|>| var_ref = $ts0        |
1016     |                  | | +-----------------------+
1017     |                  | |   | .size               |
1018     |                  | |   +---------------------+
1019     |                  | |   | .offset             |
1020     |                  | |   +---------------------+
1021     |                  | |   | .fn()               |
1022     |                  | |   +---------------------+
1023     |                  | |   | .flags & FL_VAR_REF |
1024     |                  | |   +---------------------+
1025     |                  | +---| .var_ref_idx        |
1026     |                  |   +-----------------------+
1027     |                  |   | var_ref = $next_pid   |
1028     |                  |   +-----------------------+
1029     |                  |     | .size               |
1030     |                  |     +---------------------+
1031     |                  |     | .offset             |
1032     |                  |     +---------------------+
1033     |                  |     | .fn()               |
1034     |                  |     +---------------------+
1035     |                  |     | .flags & FL_VAR_REF |
1036     |                  |     +---------------------+
1037     |                  +-----| .var_ref_idx        |
1038     |                      +-----------------------+
1039     |                      | var_ref = $wakeup_lat |
1040     |                      +-----------------------+
1041     |                        | .size               |
1042     |                        +---------------------+
1043     |                        | .offset             |
1044     |                        +---------------------+
1045     |                        | .fn()               |
1046     |                        +---------------------+
1047     |                        | .flags & FL_VAR_REF |
1048     |                        +---------------------+
1049     +------------------------| .var_ref_idx        |
1050                              +---------------------+
1051
1052 As you can see, for a field variable, two hist_fields are created: one
1053 representing the variable, in this case next_pid, and one to actually
1054 get the value of the field from the trace stream, like a normal val
1055 field does.  These are created separately from normal variable
1056 creation and are saved in the hist_data->field_vars[] array.  See
1057 below for how these are used.  In addition, a reference hist_field is
1058 also created, which is needed to reference the field variables such as
1059 $next_pid variable in the trace() action.
1060
1061 Note that $wakeup_lat is also a variable reference, referencing the
1062 value of the expression common_timestamp-$ts0, and so also needs to
1063 have a hist field entry representing that reference created.
1064
1065 When hist_trigger_elt_update() is called to get the normal key and
1066 value fields, it also calls update_field_vars(), which goes through
1067 each field_var created for the histogram, and available from
1068 hist_data->field_vars and calls val->fn() to get the data from the
1069 current trace record, and then uses the var's var.idx to set the
1070 variable at the var.idx offset in the appropriate tracing_map_elt's
1071 variable at elt->vars[var.idx].
1072
1073 Once all the variables have been updated, resolve_var_refs() can be
1074 called from event_hist_trigger(), and not only can our $ts0 and
1075 $next_pid references be resolved but the $wakeup_lat reference as
1076 well.  At this point, the trace() action can simply access the values
1077 assembled in the var_ref_vals[] array and generate the trace event.
1078
1079 The same process occurs for the field variables associated with the
1080 save() action.
1081
1082 Abbreviations used in the diagram::
1083
1084   hist_data = struct hist_trigger_data
1085   hist_data.fields = struct hist_field
1086   field_var = struct field_var
1087   fn = hist_field_fn_t
1088   FL_KEY = HIST_FIELD_FL_KEY
1089   FL_VAR = HIST_FIELD_FL_VAR
1090   FL_VAR_REF = HIST_FIELD_FL_VAR_REF
1091
1092 trace() action field variable test
1093 ----------------------------------
1094
1095 This example adds to the previous test example by finally making use
1096 of the wakeup_lat variable, but in addition also creates a couple of
1097 field variables that then are all passed to the wakeup_latency() trace
1098 action via the onmatch() handler.
1099
1100 First, we create the wakeup_latency synthetic event::
1101
1102   # echo 'wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
1103
1104 Next, the sched_waking trigger from previous examples::
1105
1106   # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1107
1108 Finally, as in the previous test example, we calculate and assign the
1109 wakeup latency using the $ts0 reference from the sched_waking trigger
1110 to the wakeup_lat variable, and finally use it along with a couple
1111 sched_switch event fields, next_pid and next_comm, to generate a
1112 wakeup_latency trace event.  The next_pid and next_comm event fields
1113 are automatically converted into field variables for this purpose::
1114
1115   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,next_comm)' >> /sys/kernel/tracing/events/sched/sched_switch/trigger
1116
1117 The sched_waking hist_debug output shows the same data as in the
1118 previous test example::
1119
1120   # cat events/sched/sched_waking/hist_debug
1121
1122   # event histogram
1123   #
1124   # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1125   #
1126
1127   hist_data: 00000000d60ff61f
1128
1129   n_vals: 2
1130   n_keys: 1
1131   n_fields: 3
1132
1133   val fields:
1134
1135     hist_data->fields[0]:
1136       flags:
1137         VAL: HIST_FIELD_FL_HITCOUNT
1138       type: u64
1139       size: 8
1140       is_signed: 0
1141
1142     hist_data->fields[1]:
1143       flags:
1144         HIST_FIELD_FL_VAR
1145       var.name: ts0
1146       var.idx (into tracing_map_elt.vars[]): 0
1147       type: u64
1148       size: 8
1149       is_signed: 0
1150
1151   key fields:
1152
1153     hist_data->fields[2]:
1154       flags:
1155         HIST_FIELD_FL_KEY
1156       ftrace_event_field name: pid
1157       type: pid_t
1158       size: 8
1159       is_signed: 1
1160
1161 The sched_switch hist_debug output shows the same key and value fields
1162 as in the previous test example - note that wakeup_lat is still in the
1163 val fields section, but that the new field variables are not there -
1164 although the field variables are variables, they're held separately in
1165 the hist_data's field_vars[] array.  Although the field variables and
1166 the normal variables are located in separate places, you can see that
1167 the actual variable locations for those variables in the
1168 tracing_map_elt.vars[] do have increasing indices as expected:
1169 wakeup_lat takes the var.idx = 0 slot, while the field variables for
1170 next_pid and next_comm have values var.idx = 1, and var.idx = 2.  Note
1171 also that those are the same values displayed for the variable
1172 references corresponding to those variables in the variable reference
1173 fields section.  Since there are two triggers and thus two hist_data
1174 addresses, those addresses also need to be accounted for when doing
1175 the matching - you can see that the first variable refers to the 0
1176 var.idx on the previous hist trigger (see the hist_data address
1177 associated with that trigger), while the second variable refers to the
1178 0 var.idx on the sched_switch hist trigger, as do all the remaining
1179 variable references.
1180
1181 Finally, the action tracking variables section just shows the system
1182 and event name for the onmatch() handler::
1183
1184   # cat events/sched/sched_switch/hist_debug
1185
1186   # event histogram
1187   #
1188   # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,next_comm) [active]
1189   #
1190
1191   hist_data: 0000000008f551b7
1192
1193   n_vals: 2
1194   n_keys: 1
1195   n_fields: 3
1196
1197   val fields:
1198
1199     hist_data->fields[0]:
1200       flags:
1201         VAL: HIST_FIELD_FL_HITCOUNT
1202       type: u64
1203       size: 8
1204       is_signed: 0
1205
1206     hist_data->fields[1]:
1207       flags:
1208         HIST_FIELD_FL_VAR
1209       var.name: wakeup_lat
1210       var.idx (into tracing_map_elt.vars[]): 0
1211       type: u64
1212       size: 0
1213       is_signed: 0
1214
1215   key fields:
1216
1217     hist_data->fields[2]:
1218       flags:
1219         HIST_FIELD_FL_KEY
1220       ftrace_event_field name: next_pid
1221       type: pid_t
1222       size: 8
1223       is_signed: 1
1224
1225   variable reference fields:
1226
1227     hist_data->var_refs[0]:
1228       flags:
1229         HIST_FIELD_FL_VAR_REF
1230       name: ts0
1231       var.idx (into tracing_map_elt.vars[]): 0
1232       var.hist_data: 00000000d60ff61f
1233       var_ref_idx (into hist_data->var_refs[]): 0
1234       type: u64
1235       size: 8
1236       is_signed: 0
1237
1238     hist_data->var_refs[1]:
1239       flags:
1240         HIST_FIELD_FL_VAR_REF
1241       name: wakeup_lat
1242       var.idx (into tracing_map_elt.vars[]): 0
1243       var.hist_data: 0000000008f551b7
1244       var_ref_idx (into hist_data->var_refs[]): 1
1245       type: u64
1246       size: 0
1247       is_signed: 0
1248
1249     hist_data->var_refs[2]:
1250       flags:
1251         HIST_FIELD_FL_VAR_REF
1252       name: next_pid
1253       var.idx (into tracing_map_elt.vars[]): 1
1254       var.hist_data: 0000000008f551b7
1255       var_ref_idx (into hist_data->var_refs[]): 2
1256       type: pid_t
1257       size: 4
1258       is_signed: 0
1259
1260     hist_data->var_refs[3]:
1261       flags:
1262         HIST_FIELD_FL_VAR_REF
1263       name: next_comm
1264       var.idx (into tracing_map_elt.vars[]): 2
1265       var.hist_data: 0000000008f551b7
1266       var_ref_idx (into hist_data->var_refs[]): 3
1267       type: char[16]
1268       size: 256
1269       is_signed: 0
1270
1271   field variables:
1272
1273     hist_data->field_vars[0]:
1274
1275       field_vars[0].var:
1276       flags:
1277         HIST_FIELD_FL_VAR
1278       var.name: next_pid
1279       var.idx (into tracing_map_elt.vars[]): 1
1280
1281       field_vars[0].val:
1282       ftrace_event_field name: next_pid
1283       type: pid_t
1284       size: 4
1285       is_signed: 1
1286
1287     hist_data->field_vars[1]:
1288
1289       field_vars[1].var:
1290       flags:
1291         HIST_FIELD_FL_VAR
1292       var.name: next_comm
1293       var.idx (into tracing_map_elt.vars[]): 2
1294
1295       field_vars[1].val:
1296       ftrace_event_field name: next_comm
1297       type: char[16]
1298       size: 256
1299       is_signed: 0
1300
1301   action tracking variables (for onmax()/onchange()/onmatch()):
1302
1303     hist_data->actions[0].match_data.event_system: sched
1304     hist_data->actions[0].match_data.event: sched_waking
1305
1306 The commands below can be used to clean things up for the next test::
1307
1308   # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,next_comm)' >> /sys/kernel/tracing/events/sched/sched_switch/trigger
1309
1310   # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1311
1312   # echo '!wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
1313
1314 action_data and the trace() action
1315 ----------------------------------
1316
1317 As mentioned above, when the trace() action generates a synthetic
1318 event, all the parameters to the synthetic event either already are
1319 variables or are converted into variables (via field variables), and
1320 finally all those variable values are collected via references to them
1321 into a var_ref_vals[] array.
1322
1323 The values in the var_ref_vals[] array, however, don't necessarily
1324 follow the same ordering as the synthetic event params.  To address
1325 that, struct action_data contains another array, var_ref_idx[] that
1326 maps the trace action params to the var_ref_vals[] values.  Below is a
1327 diagram illustrating that for the wakeup_latency() synthetic event::
1328
1329   +------------------+     wakeup_latency()
1330   | action_data      |       event params               var_ref_vals[]
1331   +------------------+    +-----------------+        +-----------------+
1332     | .var_ref_idx[] |--->| $wakeup_lat idx |---+    |                 |
1333     +----------------+    +-----------------+   |    +-----------------+
1334     | .synth_event   |    | $next_pid idx   |---|-+  | $wakeup_lat val |
1335     +----------------+    +-----------------+   | |  +-----------------+
1336                                    .            | +->| $next_pid val   |
1337                                    .            |    +-----------------+
1338                                    .            |           .
1339                           +-----------------+   |           .
1340                           |                 |   |           .
1341                           +-----------------+   |    +-----------------+
1342                                                 +--->| $wakeup_lat val |
1343                                                      +-----------------+
1344
1345 Basically, how this ends up getting used in the synthetic event probe
1346 function, trace_event_raw_event_synth(), is as follows::
1347
1348   for each field i in .synth_event
1349     val_idx = .var_ref_idx[i]
1350     val = var_ref_vals[val_idx]
1351
1352 action_data and the onXXX() handlers
1353 ------------------------------------
1354
1355 The hist trigger onXXX() actions other than onmatch(), such as onmax()
1356 and onchange(), also make use of and internally create hidden
1357 variables.  This information is contained in the
1358 action_data.track_data struct, and is also visible in the hist_debug
1359 output as will be described in the example below.
1360
1361 Typically, the onmax() or onchange() handlers are used in conjunction
1362 with the save() and snapshot() actions.  For example::
1363
1364   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0: \
1365           onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm)' >>
1366           /sys/kernel/tracing/events/sched/sched_switch/trigger
1367
1368 or::
1369
1370   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0: \
1371           onmax($wakeup_lat).snapshot()' >>
1372           /sys/kernel/tracing/events/sched/sched_switch/trigger
1373
1374 save() action field variable test
1375 ---------------------------------
1376
1377 For this example, instead of generating a synthetic event, the save()
1378 action is used to save field values whenever an onmax() handler
1379 detects that a new max latency has been hit.  As in the previous
1380 example, the values being saved are also field values, but in this
1381 case, are kept in a separate hist_data array named save_vars[].
1382
1383 As in previous test examples, we set up the sched_waking trigger::
1384
1385   # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1386
1387 In this case, however, we set up the sched_switch trigger to save some
1388 sched_switch field values whenever we hit a new maximum latency.  For
1389 both the onmax() handler and save() action, variables will be created,
1390 which we can use the hist_debug files to examine::
1391
1392   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm)' >> events/sched/sched_switch/trigger
1393
1394 The sched_waking hist_debug output shows the same data as in the
1395 previous test examples::
1396
1397   # cat events/sched/sched_waking/hist_debug
1398
1399   #
1400   # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1401   #
1402
1403   hist_data: 00000000e6290f48
1404
1405   n_vals: 2
1406   n_keys: 1
1407   n_fields: 3
1408
1409   val fields:
1410
1411     hist_data->fields[0]:
1412       flags:
1413         VAL: HIST_FIELD_FL_HITCOUNT
1414       type: u64
1415       size: 8
1416       is_signed: 0
1417
1418     hist_data->fields[1]:
1419       flags:
1420         HIST_FIELD_FL_VAR
1421       var.name: ts0
1422       var.idx (into tracing_map_elt.vars[]): 0
1423       type: u64
1424       size: 8
1425       is_signed: 0
1426
1427   key fields:
1428
1429     hist_data->fields[2]:
1430       flags:
1431         HIST_FIELD_FL_KEY
1432       ftrace_event_field name: pid
1433       type: pid_t
1434       size: 8
1435       is_signed: 1
1436
1437 The output of the sched_switch trigger shows the same val and key
1438 values as before, but also shows a couple new sections.
1439
1440 First, the action tracking variables section now shows the
1441 actions[].track_data information describing the special tracking
1442 variables and references used to track, in this case, the running
1443 maximum value.  The actions[].track_data.var_ref member contains the
1444 reference to the variable being tracked, in this case the $wakeup_lat
1445 variable.  In order to perform the onmax() handler function, there
1446 also needs to be a variable that tracks the current maximum by getting
1447 updated whenever a new maximum is hit.  In this case, we can see that
1448 an auto-generated variable named ' __max' has been created and is
1449 visible in the actions[].track_data.track_var variable.
1450
1451 Finally, in the new 'save action variables' section, we can see that
1452 the 4 params to the save() function have resulted in 4 field variables
1453 being created for the purposes of saving the values of the named
1454 fields when the max is hit.  These variables are kept in a separate
1455 save_vars[] array off of hist_data, so are displayed in a separate
1456 section::
1457
1458   # cat events/sched/sched_switch/hist_debug
1459
1460   # event histogram
1461   #
1462   # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm) [active]
1463   #
1464
1465   hist_data: 0000000057bcd28d
1466
1467   n_vals: 2
1468   n_keys: 1
1469   n_fields: 3
1470
1471   val fields:
1472
1473     hist_data->fields[0]:
1474       flags:
1475         VAL: HIST_FIELD_FL_HITCOUNT
1476       type: u64
1477       size: 8
1478       is_signed: 0
1479
1480     hist_data->fields[1]:
1481       flags:
1482         HIST_FIELD_FL_VAR
1483       var.name: wakeup_lat
1484       var.idx (into tracing_map_elt.vars[]): 0
1485       type: u64
1486       size: 0
1487       is_signed: 0
1488
1489   key fields:
1490
1491     hist_data->fields[2]:
1492       flags:
1493         HIST_FIELD_FL_KEY
1494       ftrace_event_field name: next_pid
1495       type: pid_t
1496       size: 8
1497       is_signed: 1
1498
1499   variable reference fields:
1500
1501     hist_data->var_refs[0]:
1502       flags:
1503         HIST_FIELD_FL_VAR_REF
1504       name: ts0
1505       var.idx (into tracing_map_elt.vars[]): 0
1506       var.hist_data: 00000000e6290f48
1507       var_ref_idx (into hist_data->var_refs[]): 0
1508       type: u64
1509       size: 8
1510       is_signed: 0
1511
1512     hist_data->var_refs[1]:
1513       flags:
1514         HIST_FIELD_FL_VAR_REF
1515       name: wakeup_lat
1516       var.idx (into tracing_map_elt.vars[]): 0
1517       var.hist_data: 0000000057bcd28d
1518       var_ref_idx (into hist_data->var_refs[]): 1
1519       type: u64
1520       size: 0
1521       is_signed: 0
1522
1523   action tracking variables (for onmax()/onchange()/onmatch()):
1524
1525     hist_data->actions[0].track_data.var_ref:
1526       flags:
1527         HIST_FIELD_FL_VAR_REF
1528       name: wakeup_lat
1529       var.idx (into tracing_map_elt.vars[]): 0
1530       var.hist_data: 0000000057bcd28d
1531       var_ref_idx (into hist_data->var_refs[]): 1
1532       type: u64
1533       size: 0
1534       is_signed: 0
1535
1536     hist_data->actions[0].track_data.track_var:
1537       flags:
1538         HIST_FIELD_FL_VAR
1539       var.name: __max
1540       var.idx (into tracing_map_elt.vars[]): 1
1541       type: u64
1542       size: 8
1543       is_signed: 0
1544
1545   save action variables (save() params):
1546
1547     hist_data->save_vars[0]:
1548
1549       save_vars[0].var:
1550       flags:
1551         HIST_FIELD_FL_VAR
1552       var.name: next_comm
1553       var.idx (into tracing_map_elt.vars[]): 2
1554
1555       save_vars[0].val:
1556       ftrace_event_field name: next_comm
1557       type: char[16]
1558       size: 256
1559       is_signed: 0
1560
1561     hist_data->save_vars[1]:
1562
1563       save_vars[1].var:
1564       flags:
1565         HIST_FIELD_FL_VAR
1566       var.name: prev_pid
1567       var.idx (into tracing_map_elt.vars[]): 3
1568
1569       save_vars[1].val:
1570       ftrace_event_field name: prev_pid
1571       type: pid_t
1572       size: 4
1573       is_signed: 1
1574
1575     hist_data->save_vars[2]:
1576
1577       save_vars[2].var:
1578       flags:
1579         HIST_FIELD_FL_VAR
1580       var.name: prev_prio
1581       var.idx (into tracing_map_elt.vars[]): 4
1582
1583       save_vars[2].val:
1584       ftrace_event_field name: prev_prio
1585       type: int
1586       size: 4
1587       is_signed: 1
1588
1589     hist_data->save_vars[3]:
1590
1591       save_vars[3].var:
1592       flags:
1593         HIST_FIELD_FL_VAR
1594       var.name: prev_comm
1595       var.idx (into tracing_map_elt.vars[]): 5
1596
1597       save_vars[3].val:
1598       ftrace_event_field name: prev_comm
1599       type: char[16]
1600       size: 256
1601       is_signed: 0
1602
1603 The commands below can be used to clean things up for the next test::
1604
1605   # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmax($wakeup_lat).save(next_comm,prev_pid,prev_prio,prev_comm)' >> events/sched/sched_switch/trigger
1606
1607   # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1608
1609 A couple special cases
1610 ======================
1611
1612 While the above covers the basics of the histogram internals, there
1613 are a couple of special cases that should be discussed, since they
1614 tend to create even more confusion.  Those are field variables on other
1615 histograms, and aliases, both described below through example tests
1616 using the hist_debug files.
1617
1618 Test of field variables on other histograms
1619 -------------------------------------------
1620
1621 This example is similar to the previous examples, but in this case,
1622 the sched_switch trigger references a hist trigger field on another
1623 event, namely the sched_waking event.  In order to accomplish this, a
1624 field variable is created for the other event, but since an existing
1625 histogram can't be used, as existing histograms are immutable, a new
1626 histogram with a matching variable is created and used, and we'll see
1627 that reflected in the hist_debug output shown below.
1628
1629 First, we create the wakeup_latency synthetic event.  Note the
1630 addition of the prio field::
1631
1632   # echo 'wakeup_latency u64 lat; pid_t pid; int prio' >> synthetic_events
1633
1634 As in previous test examples, we set up the sched_waking trigger::
1635
1636   # echo 'hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1637
1638 Here we set up a hist trigger on sched_switch to send a wakeup_latency
1639 event using an onmatch handler naming the sched_waking event.  Note
1640 that the third param being passed to the wakeup_latency() is prio,
1641 which is a field name that needs to have a field variable created for
1642 it.  There isn't however any prio field on the sched_switch event so
1643 it would seem that it wouldn't be possible to create a field variable
1644 for it.  The matching sched_waking event does have a prio field, so it
1645 should be possible to make use of it for this purpose.  The problem
1646 with that is that it's not currently possible to define a new variable
1647 on an existing histogram, so it's not possible to add a new prio field
1648 variable to the existing sched_waking histogram.  It is however
1649 possible to create an additional new 'matching' sched_waking histogram
1650 for the same event, meaning that it uses the same key and filters, and
1651 define the new prio field variable on that.
1652
1653 Here's the sched_switch trigger::
1654
1655   # echo 'hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,prio)' >> events/sched/sched_switch/trigger
1656
1657 And here's the output of the hist_debug information for the
1658 sched_waking hist trigger.  Note that there are two histograms
1659 displayed in the output: the first is the normal sched_waking
1660 histogram we've seen in the previous examples, and the second is the
1661 special histogram we created to provide the prio field variable.
1662
1663 Looking at the second histogram below, we see a variable with the name
1664 synthetic_prio.  This is the field variable created for the prio field
1665 on that sched_waking histogram::
1666
1667   # cat events/sched/sched_waking/hist_debug
1668
1669   # event histogram
1670   #
1671   # trigger info: hist:keys=pid:vals=hitcount:ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1672   #
1673
1674   hist_data: 00000000349570e4
1675
1676   n_vals: 2
1677   n_keys: 1
1678   n_fields: 3
1679
1680   val fields:
1681
1682     hist_data->fields[0]:
1683       flags:
1684         VAL: HIST_FIELD_FL_HITCOUNT
1685       type: u64
1686       size: 8
1687       is_signed: 0
1688
1689     hist_data->fields[1]:
1690       flags:
1691         HIST_FIELD_FL_VAR
1692       var.name: ts0
1693       var.idx (into tracing_map_elt.vars[]): 0
1694       type: u64
1695       size: 8
1696       is_signed: 0
1697
1698   key fields:
1699
1700     hist_data->fields[2]:
1701       flags:
1702         HIST_FIELD_FL_KEY
1703       ftrace_event_field name: pid
1704       type: pid_t
1705       size: 8
1706       is_signed: 1
1707
1708
1709   # event histogram
1710   #
1711   # trigger info: hist:keys=pid:vals=hitcount:synthetic_prio=prio:sort=hitcount:size=2048 [active]
1712   #
1713
1714   hist_data: 000000006920cf38
1715
1716   n_vals: 2
1717   n_keys: 1
1718   n_fields: 3
1719
1720   val fields:
1721
1722     hist_data->fields[0]:
1723       flags:
1724         VAL: HIST_FIELD_FL_HITCOUNT
1725       type: u64
1726       size: 8
1727       is_signed: 0
1728
1729     hist_data->fields[1]:
1730       flags:
1731         HIST_FIELD_FL_VAR
1732       ftrace_event_field name: prio
1733       var.name: synthetic_prio
1734       var.idx (into tracing_map_elt.vars[]): 0
1735       type: int
1736       size: 4
1737       is_signed: 1
1738
1739   key fields:
1740
1741     hist_data->fields[2]:
1742       flags:
1743         HIST_FIELD_FL_KEY
1744       ftrace_event_field name: pid
1745       type: pid_t
1746       size: 8
1747       is_signed: 1
1748
1749 Looking at the sched_switch histogram below, we can see a reference to
1750 the synthetic_prio variable on sched_waking, and looking at the
1751 associated hist_data address we see that it is indeed associated with
1752 the new histogram.  Note also that the other references are to a
1753 normal variable, wakeup_lat, and to a normal field variable, next_pid,
1754 the details of which are in the field variables section::
1755
1756   # cat events/sched/sched_switch/hist_debug
1757
1758   # event histogram
1759   #
1760   # trigger info: hist:keys=next_pid:vals=hitcount:wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,prio) [active]
1761   #
1762
1763   hist_data: 00000000a73b67df
1764
1765   n_vals: 2
1766   n_keys: 1
1767   n_fields: 3
1768
1769   val fields:
1770
1771     hist_data->fields[0]:
1772       flags:
1773         VAL: HIST_FIELD_FL_HITCOUNT
1774       type: u64
1775       size: 8
1776       is_signed: 0
1777
1778     hist_data->fields[1]:
1779       flags:
1780         HIST_FIELD_FL_VAR
1781       var.name: wakeup_lat
1782       var.idx (into tracing_map_elt.vars[]): 0
1783       type: u64
1784       size: 0
1785       is_signed: 0
1786
1787   key fields:
1788
1789     hist_data->fields[2]:
1790       flags:
1791         HIST_FIELD_FL_KEY
1792       ftrace_event_field name: next_pid
1793       type: pid_t
1794       size: 8
1795       is_signed: 1
1796
1797   variable reference fields:
1798
1799     hist_data->var_refs[0]:
1800       flags:
1801         HIST_FIELD_FL_VAR_REF
1802       name: ts0
1803       var.idx (into tracing_map_elt.vars[]): 0
1804       var.hist_data: 00000000349570e4
1805       var_ref_idx (into hist_data->var_refs[]): 0
1806       type: u64
1807       size: 8
1808       is_signed: 0
1809
1810     hist_data->var_refs[1]:
1811       flags:
1812         HIST_FIELD_FL_VAR_REF
1813       name: wakeup_lat
1814       var.idx (into tracing_map_elt.vars[]): 0
1815       var.hist_data: 00000000a73b67df
1816       var_ref_idx (into hist_data->var_refs[]): 1
1817       type: u64
1818       size: 0
1819       is_signed: 0
1820
1821     hist_data->var_refs[2]:
1822       flags:
1823         HIST_FIELD_FL_VAR_REF
1824       name: next_pid
1825       var.idx (into tracing_map_elt.vars[]): 1
1826       var.hist_data: 00000000a73b67df
1827       var_ref_idx (into hist_data->var_refs[]): 2
1828       type: pid_t
1829       size: 4
1830       is_signed: 0
1831
1832     hist_data->var_refs[3]:
1833       flags:
1834         HIST_FIELD_FL_VAR_REF
1835       name: synthetic_prio
1836       var.idx (into tracing_map_elt.vars[]): 0
1837       var.hist_data: 000000006920cf38
1838       var_ref_idx (into hist_data->var_refs[]): 3
1839       type: int
1840       size: 4
1841       is_signed: 1
1842
1843   field variables:
1844
1845     hist_data->field_vars[0]:
1846
1847       field_vars[0].var:
1848       flags:
1849         HIST_FIELD_FL_VAR
1850       var.name: next_pid
1851       var.idx (into tracing_map_elt.vars[]): 1
1852
1853       field_vars[0].val:
1854       ftrace_event_field name: next_pid
1855       type: pid_t
1856       size: 4
1857       is_signed: 1
1858
1859   action tracking variables (for onmax()/onchange()/onmatch()):
1860
1861     hist_data->actions[0].match_data.event_system: sched
1862     hist_data->actions[0].match_data.event: sched_waking
1863
1864 The commands below can be used to clean things up for the next test::
1865
1866   # echo '!hist:keys=next_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,next_pid,prio)' >> events/sched/sched_switch/trigger
1867
1868   # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1869
1870   # echo '!wakeup_latency u64 lat; pid_t pid; int prio' >> synthetic_events
1871
1872 Alias test
1873 ----------
1874
1875 This example is very similar to previous examples, but demonstrates
1876 the alias flag.
1877
1878 First, we create the wakeup_latency synthetic event::
1879
1880   # echo 'wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events
1881
1882 Next, we create a sched_waking trigger similar to previous examples,
1883 but in this case we save the pid in the waking_pid variable::
1884
1885   # echo 'hist:keys=pid:waking_pid=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
1886
1887 For the sched_switch trigger, instead of using $waking_pid directly in
1888 the wakeup_latency synthetic event invocation, we create an alias of
1889 $waking_pid named $woken_pid, and use that in the synthetic event
1890 invocation instead::
1891
1892   # echo 'hist:keys=next_pid:woken_pid=$waking_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,$woken_pid,next_comm)' >> events/sched/sched_switch/trigger
1893
1894 Looking at the sched_waking hist_debug output, in addition to the
1895 normal fields, we can see the waking_pid variable::
1896
1897   # cat events/sched/sched_waking/hist_debug
1898
1899   # event histogram
1900   #
1901   # trigger info: hist:keys=pid:vals=hitcount:waking_pid=pid,ts0=common_timestamp.usecs:sort=hitcount:size=2048:clock=global [active]
1902   #
1903
1904   hist_data: 00000000a250528c
1905
1906   n_vals: 3
1907   n_keys: 1
1908   n_fields: 4
1909
1910   val fields:
1911
1912     hist_data->fields[0]:
1913       flags:
1914         VAL: HIST_FIELD_FL_HITCOUNT
1915       type: u64
1916       size: 8
1917       is_signed: 0
1918
1919     hist_data->fields[1]:
1920       flags:
1921         HIST_FIELD_FL_VAR
1922       ftrace_event_field name: pid
1923       var.name: waking_pid
1924       var.idx (into tracing_map_elt.vars[]): 0
1925       type: pid_t
1926       size: 4
1927       is_signed: 1
1928
1929     hist_data->fields[2]:
1930       flags:
1931         HIST_FIELD_FL_VAR
1932       var.name: ts0
1933       var.idx (into tracing_map_elt.vars[]): 1
1934       type: u64
1935       size: 8
1936       is_signed: 0
1937
1938   key fields:
1939
1940     hist_data->fields[3]:
1941       flags:
1942         HIST_FIELD_FL_KEY
1943       ftrace_event_field name: pid
1944       type: pid_t
1945       size: 8
1946       is_signed: 1
1947
1948 The sched_switch hist_debug output shows that a variable named
1949 woken_pid has been created but that it also has the
1950 HIST_FIELD_FL_ALIAS flag set.  It also has the HIST_FIELD_FL_VAR flag
1951 set, which is why it appears in the val field section.
1952
1953 Despite that implementation detail, an alias variable is actually more
1954 like a variable reference; in fact it can be thought of as a reference
1955 to a reference.  The implementation copies the var_ref->fn() from the
1956 variable reference being referenced, in this case, the waking_pid
1957 fn(), which is hist_field_var_ref() and makes that the fn() of the
1958 alias.  The hist_field_var_ref() fn() requires the var_ref_idx of the
1959 variable reference it's using, so waking_pid's var_ref_idx is also
1960 copied to the alias.  The end result is that when the value of alias
1961 is retrieved, in the end it just does the same thing the original
1962 reference would have done and retrieves the same value from the
1963 var_ref_vals[] array.  You can verify this in the output by noting
1964 that the var_ref_idx of the alias, in this case woken_pid, is the same
1965 as the var_ref_idx of the reference, waking_pid, in the variable
1966 reference fields section.
1967
1968 Additionally, once it gets that value, since it is also a variable, it
1969 then saves that value into its var.idx.  So the var.idx of the
1970 woken_pid alias is 0, which it fills with the value from var_ref_idx 0
1971 when its fn() is called to update itself.  You'll also notice that
1972 there's a woken_pid var_ref in the variable refs section.  That is the
1973 reference to the woken_pid alias variable, and you can see that it
1974 retrieves the value from the same var.idx as the woken_pid alias, 0,
1975 and then in turn saves that value in its own var_ref_idx slot, 3, and
1976 the value at this position is finally what gets assigned to the
1977 $woken_pid slot in the trace event invocation::
1978
1979   # cat events/sched/sched_switch/hist_debug
1980
1981   # event histogram
1982   #
1983   # trigger info: hist:keys=next_pid:vals=hitcount:woken_pid=$waking_pid,wakeup_lat=common_timestamp.usecs-$ts0:sort=hitcount:size=2048:clock=global:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,$woken_pid,next_comm) [active]
1984   #
1985
1986   hist_data: 0000000055d65ed0
1987
1988   n_vals: 3
1989   n_keys: 1
1990   n_fields: 4
1991
1992   val fields:
1993
1994     hist_data->fields[0]:
1995       flags:
1996         VAL: HIST_FIELD_FL_HITCOUNT
1997       type: u64
1998       size: 8
1999       is_signed: 0
2000
2001     hist_data->fields[1]:
2002       flags:
2003         HIST_FIELD_FL_VAR
2004         HIST_FIELD_FL_ALIAS
2005       var.name: woken_pid
2006       var.idx (into tracing_map_elt.vars[]): 0
2007       var_ref_idx (into hist_data->var_refs[]): 0
2008       type: pid_t
2009       size: 4
2010       is_signed: 1
2011
2012     hist_data->fields[2]:
2013       flags:
2014         HIST_FIELD_FL_VAR
2015       var.name: wakeup_lat
2016       var.idx (into tracing_map_elt.vars[]): 1
2017       type: u64
2018       size: 0
2019       is_signed: 0
2020
2021   key fields:
2022
2023     hist_data->fields[3]:
2024       flags:
2025         HIST_FIELD_FL_KEY
2026       ftrace_event_field name: next_pid
2027       type: pid_t
2028       size: 8
2029       is_signed: 1
2030
2031   variable reference fields:
2032
2033     hist_data->var_refs[0]:
2034       flags:
2035         HIST_FIELD_FL_VAR_REF
2036       name: waking_pid
2037       var.idx (into tracing_map_elt.vars[]): 0
2038       var.hist_data: 00000000a250528c
2039       var_ref_idx (into hist_data->var_refs[]): 0
2040       type: pid_t
2041       size: 4
2042       is_signed: 1
2043
2044     hist_data->var_refs[1]:
2045       flags:
2046         HIST_FIELD_FL_VAR_REF
2047       name: ts0
2048       var.idx (into tracing_map_elt.vars[]): 1
2049       var.hist_data: 00000000a250528c
2050       var_ref_idx (into hist_data->var_refs[]): 1
2051       type: u64
2052       size: 8
2053       is_signed: 0
2054
2055     hist_data->var_refs[2]:
2056       flags:
2057         HIST_FIELD_FL_VAR_REF
2058       name: wakeup_lat
2059       var.idx (into tracing_map_elt.vars[]): 1
2060       var.hist_data: 0000000055d65ed0
2061       var_ref_idx (into hist_data->var_refs[]): 2
2062       type: u64
2063       size: 0
2064       is_signed: 0
2065
2066     hist_data->var_refs[3]:
2067       flags:
2068         HIST_FIELD_FL_VAR_REF
2069       name: woken_pid
2070       var.idx (into tracing_map_elt.vars[]): 0
2071       var.hist_data: 0000000055d65ed0
2072       var_ref_idx (into hist_data->var_refs[]): 3
2073       type: pid_t
2074       size: 4
2075       is_signed: 1
2076
2077     hist_data->var_refs[4]:
2078       flags:
2079         HIST_FIELD_FL_VAR_REF
2080       name: next_comm
2081       var.idx (into tracing_map_elt.vars[]): 2
2082       var.hist_data: 0000000055d65ed0
2083       var_ref_idx (into hist_data->var_refs[]): 4
2084       type: char[16]
2085       size: 256
2086       is_signed: 0
2087
2088   field variables:
2089
2090     hist_data->field_vars[0]:
2091
2092       field_vars[0].var:
2093       flags:
2094         HIST_FIELD_FL_VAR
2095       var.name: next_comm
2096       var.idx (into tracing_map_elt.vars[]): 2
2097
2098       field_vars[0].val:
2099       ftrace_event_field name: next_comm
2100       type: char[16]
2101       size: 256
2102       is_signed: 0
2103
2104   action tracking variables (for onmax()/onchange()/onmatch()):
2105
2106     hist_data->actions[0].match_data.event_system: sched
2107     hist_data->actions[0].match_data.event: sched_waking
2108
2109 The commands below can be used to clean things up for the next test::
2110
2111   # echo '!hist:keys=next_pid:woken_pid=$waking_pid:wakeup_lat=common_timestamp.usecs-$ts0:onmatch(sched.sched_waking).wakeup_latency($wakeup_lat,$woken_pid,next_comm)' >> events/sched/sched_switch/trigger
2112
2113   # echo '!hist:keys=pid:ts0=common_timestamp.usecs' >> events/sched/sched_waking/trigger
2114
2115   # echo '!wakeup_latency u64 lat; pid_t pid; char comm[16]' >> synthetic_events