Instantly share code, notes, and snippets.

Embed
What would you like to do?
perf output vectorized
[obelavina@bbetty spo600_20173_inline_assembler_lab]$ perf stat -r 10 -d ./vol_simd
Generating sample data.
Scaling samples.
Summing samples.
Result: 700
Generating sample data.
Scaling samples.
Summing samples.
Result: -411
Generating sample data.
Scaling samples.
Summing samples.
Result: 240
Generating sample data.
Scaling samples.
Summing samples.
Result: -906
Generating sample data.
Scaling samples.
Summing samples.
Result: -906
Generating sample data.
Scaling samples.
Summing samples.
Result: 917
Generating sample data.
Scaling samples.
Summing samples.
Result: -651
Generating sample data.
Scaling samples.
Summing samples.
Result: -236
Generating sample data.
Scaling samples.
Summing samples.
Result: -411
Generating sample data.
Scaling samples.
Summing samples.
Result: 525
Performance counter stats for './vol_simd' (10 runs):
31818.842270 task-clock:u (msec) # 1.000 CPUs utilized ( +- 0.10% )
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
488,324 page-faults:u # 0.015 M/sec ( +- 0.00% )
74,046,473,628 cycles:u # 2.327 GHz ( +- 0.11% )
44,326,772,181 instructions:u # 0.60 insn per cycle ( +- 0.00% )
<not supported> branches:u
61,723 branch-misses:u ( +- 1.18% )
17,767,681,424 L1-dcache-loads:u # 558.401 M/sec ( +- 0.00% )
1,031,222,429 L1-dcache-load-misses:u # 5.80% of all L1-dcache hits ( +- 0.00% )
<not supported> LLC-loads:u
<not supported> LLC-load-misses:u
31.818169200 seconds time elapsed ( +- 0.10% )
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment