Last active
October 13, 2019 12:56
-
-
Save mj41/e6ba5ef48d2de61b0c72 to your computer and use it in GitHub Desktop.
Output of https://github.com/mj41/threading
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
~/gd/dalsi/nanoparrot/threading [master|✔]$ cat /proc/cpuinfo | head -n26 | |
processor : 0 | |
vendor_id : GenuineIntel | |
cpu family : 6 | |
model : 69 | |
model name : Intel(R) Core(TM) i7-4600U CPU @ 2.10GHz | |
stepping : 1 | |
microcode : 0x1c | |
cpu MHz : 2483.894 | |
cache size : 4096 KB | |
physical id : 0 | |
siblings : 4 | |
core id : 0 | |
cpu cores : 2 | |
apicid : 0 | |
initial apicid : 0 | |
fpu : yes | |
fpu_exception : yes | |
cpuid level : 13 | |
wp : yes | |
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm ida arat epb pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid xsaveopt | |
bugs : | |
bogomips : 5387.43 | |
clflush size : 64 | |
cache_alignment : 64 | |
address sizes : 39 bits physical, 48 bits virtual | |
power management: |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
~/gd/dalsi/nanoparrot/threading [master|✔]$ make bench-perf | |
for i in subroutine direct indirect switch call repl-switch; do perf stat -B ./$i 2>&1; done | |
failed to read counter stalled-cycles-frontend | |
failed to read counter stalled-cycles-backend | |
Performance counter stats for './subroutine': | |
184.224555 task-clock (msec) # 0.998 CPUs utilized | |
0 context-switches # 0.000 K/sec | |
0 cpu-migrations # 0.000 K/sec | |
44 page-faults # 0.239 K/sec | |
602,399,539 cycles # 3.270 GHz | |
<not supported> stalled-cycles-frontend | |
<not supported> stalled-cycles-backend | |
901,130,018 instructions # 1.50 insns per cycle | |
400,210,002 branches # 2172.403 M/sec | |
6,894 branch-misses # 0.00% of all branches | |
0.184512295 seconds time elapsed | |
failed to read counter stalled-cycles-frontend | |
failed to read counter stalled-cycles-backend | |
Performance counter stats for './direct': | |
974.359432 task-clock (msec) # 0.999 CPUs utilized | |
1 context-switches # 0.001 K/sec | |
0 cpu-migrations # 0.000 K/sec | |
45 page-faults # 0.046 K/sec | |
3,156,374,204 cycles # 3.239 GHz | |
<not supported> stalled-cycles-frontend | |
<not supported> stalled-cycles-backend | |
3,804,006,962 instructions # 1.21 insns per cycle | |
1,600,728,014 branches # 1642.852 M/sec | |
19,705 branch-misses # 0.00% of all branches | |
0.974880355 seconds time elapsed | |
failed to read counter stalled-cycles-frontend | |
failed to read counter stalled-cycles-backend | |
Performance counter stats for './indirect': | |
1023.301660 task-clock (msec) # 0.999 CPUs utilized | |
1 context-switches # 0.001 K/sec | |
0 cpu-migrations # 0.000 K/sec | |
45 page-faults # 0.044 K/sec | |
3,313,678,659 cycles # 3.238 GHz | |
<not supported> stalled-cycles-frontend | |
<not supported> stalled-cycles-backend | |
4,704,236,354 instructions # 1.42 insns per cycle | |
1,600,769,255 branches # 1564.318 M/sec | |
28,639 branch-misses # 0.00% of all branches | |
1.023872110 seconds time elapsed | |
failed to read counter stalled-cycles-frontend | |
failed to read counter stalled-cycles-backend | |
Performance counter stats for './switch': | |
924.091463 task-clock (msec) # 0.999 CPUs utilized | |
2 context-switches # 0.002 K/sec | |
0 cpu-migrations # 0.000 K/sec | |
45 page-faults # 0.049 K/sec | |
3,014,079,050 cycles # 3.262 GHz | |
<not supported> stalled-cycles-frontend | |
<not supported> stalled-cycles-backend | |
7,403,821,842 instructions # 2.46 insns per cycle | |
3,100,694,179 branches # 3355.397 M/sec | |
19,968 branch-misses # 0.00% of all branches | |
0.924619920 seconds time elapsed | |
failed to read counter stalled-cycles-frontend | |
failed to read counter stalled-cycles-backend | |
Performance counter stats for './call': | |
1965.726030 task-clock (msec) # 0.999 CPUs utilized | |
66 context-switches # 0.034 K/sec | |
0 cpu-migrations # 0.000 K/sec | |
44 page-faults # 0.022 K/sec | |
6,353,837,418 cycles # 3.232 GHz | |
<not supported> stalled-cycles-frontend | |
<not supported> stalled-cycles-backend | |
7,607,903,295 instructions # 1.20 insns per cycle | |
3,101,434,298 branches # 1577.755 M/sec | |
47,353 branch-misses # 0.00% of all branches | |
1.968477874 seconds time elapsed | |
failed to read counter stalled-cycles-frontend | |
failed to read counter stalled-cycles-backend | |
Performance counter stats for './repl-switch': | |
834.074390 task-clock (msec) # 0.999 CPUs utilized | |
1 context-switches # 0.001 K/sec | |
0 cpu-migrations # 0.000 K/sec | |
43 page-faults # 0.052 K/sec | |
2,707,107,648 cycles # 3.246 GHz | |
<not supported> stalled-cycles-frontend | |
<not supported> stalled-cycles-backend | |
5,403,498,514 instructions # 2.00 insns per cycle | |
2,100,635,344 branches # 2518.523 M/sec | |
90,195 branch-misses # 0.00% of all branches | |
0.834544443 seconds time elapsed | |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
~/gd/dalsi/nanoparrot/threading [master|✔]$ cat /proc/cpuinfo | head -n26 | |
processor : 0 | |
vendor_id : GenuineIntel | |
cpu family : 6 | |
model : 142 | |
model name : Intel(R) Core(TM) i7-7600U CPU @ 2.80GHz | |
stepping : 9 | |
microcode : 0x80 | |
cpu MHz : 2897.433 | |
cache size : 4096 KB | |
physical id : 0 | |
siblings : 4 | |
core id : 0 | |
cpu cores : 2 | |
apicid : 0 | |
initial apicid : 0 | |
fpu : yes | |
fpu_exception : yes | |
cpuid level : 22 | |
wp : yes | |
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti retpoline intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp | |
bugs : cpu_meltdown spectre_v1 spectre_v2 | |
bogomips : 5808.00 | |
clflush size : 64 | |
cache_alignment : 64 | |
address sizes : 39 bits physical, 48 bits virtual | |
power management: |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
~/gd/dalsi/nanoparrot/threading [master|✔]$ make bench-perf | |
for i in subroutine direct indirect switch call repl-switch; do perf stat -B ./$i 2>&1; done | |
Performance counter stats for './subroutine': | |
210.003092 task-clock:u (msec) # 0.996 CPUs utilized | |
0 context-switches:u # 0.000 K/sec | |
0 cpu-migrations:u # 0.000 K/sec | |
41 page-faults:u # 0.195 K/sec | |
732,761,679 cycles:u # 3.489 GHz | |
900,115,400 instructions:u # 1.23 insn per cycle | |
400,025,421 branches:u # 1904.855 M/sec | |
1,594 branch-misses:u # 0.00% of all branches | |
0.210930225 seconds time elapsed | |
Performance counter stats for './direct': | |
824.330951 task-clock:u (msec) # 1.000 CPUs utilized | |
0 context-switches:u # 0.000 K/sec | |
0 cpu-migrations:u # 0.000 K/sec | |
43 page-faults:u # 0.052 K/sec | |
3,200,368,233 cycles:u # 3.882 GHz | |
3,800,116,069 instructions:u # 1.19 insn per cycle | |
1,600,026,084 branches:u # 1941.000 M/sec | |
4,163 branch-misses:u # 0.00% of all branches | |
0.824570804 seconds time elapsed | |
Performance counter stats for './indirect': | |
812.115058 task-clock:u (msec) # 1.000 CPUs utilized | |
0 context-switches:u # 0.000 K/sec | |
0 cpu-migrations:u # 0.000 K/sec | |
40 page-faults:u # 0.049 K/sec | |
3,148,865,245 cycles:u # 3.877 GHz | |
4,700,116,059 instructions:u # 1.49 insn per cycle | |
1,600,026,051 branches:u # 1970.196 M/sec | |
4,762 branch-misses:u # 0.00% of all branches | |
0.812410844 seconds time elapsed | |
Performance counter stats for './switch': | |
1068.688555 task-clock:u (msec) # 1.000 CPUs utilized | |
0 context-switches:u # 0.000 K/sec | |
0 cpu-migrations:u # 0.000 K/sec | |
41 page-faults:u # 0.038 K/sec | |
3,891,814,296 cycles:u # 3.642 GHz | |
7,400,116,374 instructions:u # 1.90 insn per cycle | |
3,100,026,370 branches:u # 2900.776 M/sec | |
9,599 branch-misses:u # 0.00% of all branches | |
1.068976510 seconds time elapsed | |
Performance counter stats for './call': | |
1685.838029 task-clock:u (msec) # 0.999 CPUs utilized | |
0 context-switches:u # 0.000 K/sec | |
0 cpu-migrations:u # 0.000 K/sec | |
39 page-faults:u # 0.023 K/sec | |
6,329,243,109 cycles:u # 3.754 GHz | |
7,600,117,030 instructions:u # 1.20 insn per cycle | |
3,100,027,024 branches:u # 1838.864 M/sec | |
9,065 branch-misses:u # 0.00% of all branches | |
1.686720270 seconds time elapsed | |
Performance counter stats for './repl-switch': | |
591.962332 task-clock:u (msec) # 1.000 CPUs utilized | |
0 context-switches:u # 0.000 K/sec | |
0 cpu-migrations:u # 0.000 K/sec | |
43 page-faults:u # 0.073 K/sec | |
2,108,003,967 cycles:u # 3.561 GHz | |
5,400,115,856 instructions:u # 2.56 insn per cycle | |
2,100,025,848 branches:u # 3547.567 M/sec | |
5,098 branch-misses:u # 0.00% of all branches | |
0.592229400 seconds time elapsed | |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment