@mj41
Last active October 13, 2019 12:56
~/gd/dalsi/nanoparrot/threading [master|✔]$ cat /proc/cpuinfo | head -n26
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 69
model name : Intel(R) Core(TM) i7-4600U CPU @ 2.10GHz
stepping : 1
microcode : 0x1c
cpu MHz : 2483.894
cache size : 4096 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 2
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 13
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm ida arat epb pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid xsaveopt
bugs :
bogomips : 5387.43
clflush size : 64
cache_alignment : 64
address sizes : 39 bits physical, 48 bits virtual
power management:
~/gd/dalsi/nanoparrot/threading [master|✔]$ make bench-perf
for i in subroutine direct indirect switch call repl-switch; do perf stat -B ./$i 2>&1; done
failed to read counter stalled-cycles-frontend
failed to read counter stalled-cycles-backend
Performance counter stats for './subroutine':
184.224555 task-clock (msec) # 0.998 CPUs utilized
0 context-switches # 0.000 K/sec
0 cpu-migrations # 0.000 K/sec
44 page-faults # 0.239 K/sec
602,399,539 cycles # 3.270 GHz
<not supported> stalled-cycles-frontend
<not supported> stalled-cycles-backend
901,130,018 instructions # 1.50 insns per cycle
400,210,002 branches # 2172.403 M/sec
6,894 branch-misses # 0.00% of all branches
0.184512295 seconds time elapsed
failed to read counter stalled-cycles-frontend
failed to read counter stalled-cycles-backend
Performance counter stats for './direct':
974.359432 task-clock (msec) # 0.999 CPUs utilized
1 context-switches # 0.001 K/sec
0 cpu-migrations # 0.000 K/sec
45 page-faults # 0.046 K/sec
3,156,374,204 cycles # 3.239 GHz
<not supported> stalled-cycles-frontend
<not supported> stalled-cycles-backend
3,804,006,962 instructions # 1.21 insns per cycle
1,600,728,014 branches # 1642.852 M/sec
19,705 branch-misses # 0.00% of all branches
0.974880355 seconds time elapsed
failed to read counter stalled-cycles-frontend
failed to read counter stalled-cycles-backend
Performance counter stats for './indirect':
1023.301660 task-clock (msec) # 0.999 CPUs utilized
1 context-switches # 0.001 K/sec
0 cpu-migrations # 0.000 K/sec
45 page-faults # 0.044 K/sec
3,313,678,659 cycles # 3.238 GHz
<not supported> stalled-cycles-frontend
<not supported> stalled-cycles-backend
4,704,236,354 instructions # 1.42 insns per cycle
1,600,769,255 branches # 1564.318 M/sec
28,639 branch-misses # 0.00% of all branches
1.023872110 seconds time elapsed
failed to read counter stalled-cycles-frontend
failed to read counter stalled-cycles-backend
Performance counter stats for './switch':
924.091463 task-clock (msec) # 0.999 CPUs utilized
2 context-switches # 0.002 K/sec
0 cpu-migrations # 0.000 K/sec
45 page-faults # 0.049 K/sec
3,014,079,050 cycles # 3.262 GHz
<not supported> stalled-cycles-frontend
<not supported> stalled-cycles-backend
7,403,821,842 instructions # 2.46 insns per cycle
3,100,694,179 branches # 3355.397 M/sec
19,968 branch-misses # 0.00% of all branches
0.924619920 seconds time elapsed
failed to read counter stalled-cycles-frontend
failed to read counter stalled-cycles-backend
Performance counter stats for './call':
1965.726030 task-clock (msec) # 0.999 CPUs utilized
66 context-switches # 0.034 K/sec
0 cpu-migrations # 0.000 K/sec
44 page-faults # 0.022 K/sec
6,353,837,418 cycles # 3.232 GHz
<not supported> stalled-cycles-frontend
<not supported> stalled-cycles-backend
7,607,903,295 instructions # 1.20 insns per cycle
3,101,434,298 branches # 1577.755 M/sec
47,353 branch-misses # 0.00% of all branches
1.968477874 seconds time elapsed
failed to read counter stalled-cycles-frontend
failed to read counter stalled-cycles-backend
Performance counter stats for './repl-switch':
834.074390 task-clock (msec) # 0.999 CPUs utilized
1 context-switches # 0.001 K/sec
0 cpu-migrations # 0.000 K/sec
43 page-faults # 0.052 K/sec
2,707,107,648 cycles # 3.246 GHz
<not supported> stalled-cycles-frontend
<not supported> stalled-cycles-backend
5,403,498,514 instructions # 2.00 insns per cycle
2,100,635,344 branches # 2518.523 M/sec
90,195 branch-misses # 0.00% of all branches
0.834544443 seconds time elapsed
~/gd/dalsi/nanoparrot/threading [master|✔]$ cat /proc/cpuinfo | head -n26
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 142
model name : Intel(R) Core(TM) i7-7600U CPU @ 2.80GHz
stepping : 9
microcode : 0x80
cpu MHz : 2897.433
cache size : 4096 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 2
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 22
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti retpoline intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs : cpu_meltdown spectre_v1 spectre_v2
bogomips : 5808.00
clflush size : 64
cache_alignment : 64
address sizes : 39 bits physical, 48 bits virtual
power management:
~/gd/dalsi/nanoparrot/threading [master|✔]$ make bench-perf
for i in subroutine direct indirect switch call repl-switch; do perf stat -B ./$i 2>&1; done
Performance counter stats for './subroutine':
210.003092 task-clock:u (msec) # 0.996 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
41 page-faults:u # 0.195 K/sec
732,761,679 cycles:u # 3.489 GHz
900,115,400 instructions:u # 1.23 insn per cycle
400,025,421 branches:u # 1904.855 M/sec
1,594 branch-misses:u # 0.00% of all branches
0.210930225 seconds time elapsed
Performance counter stats for './direct':
824.330951 task-clock:u (msec) # 1.000 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
43 page-faults:u # 0.052 K/sec
3,200,368,233 cycles:u # 3.882 GHz
3,800,116,069 instructions:u # 1.19 insn per cycle
1,600,026,084 branches:u # 1941.000 M/sec
4,163 branch-misses:u # 0.00% of all branches
0.824570804 seconds time elapsed
Performance counter stats for './indirect':
812.115058 task-clock:u (msec) # 1.000 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
40 page-faults:u # 0.049 K/sec
3,148,865,245 cycles:u # 3.877 GHz
4,700,116,059 instructions:u # 1.49 insn per cycle
1,600,026,051 branches:u # 1970.196 M/sec
4,762 branch-misses:u # 0.00% of all branches
0.812410844 seconds time elapsed
Performance counter stats for './switch':
1068.688555 task-clock:u (msec) # 1.000 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
41 page-faults:u # 0.038 K/sec
3,891,814,296 cycles:u # 3.642 GHz
7,400,116,374 instructions:u # 1.90 insn per cycle
3,100,026,370 branches:u # 2900.776 M/sec
9,599 branch-misses:u # 0.00% of all branches
1.068976510 seconds time elapsed
Performance counter stats for './call':
1685.838029 task-clock:u (msec) # 0.999 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
39 page-faults:u # 0.023 K/sec
6,329,243,109 cycles:u # 3.754 GHz
7,600,117,030 instructions:u # 1.20 insn per cycle
3,100,027,024 branches:u # 1838.864 M/sec
9,065 branch-misses:u # 0.00% of all branches
1.686720270 seconds time elapsed
Performance counter stats for './repl-switch':
591.962332 task-clock:u (msec) # 1.000 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
43 page-faults:u # 0.073 K/sec
2,108,003,967 cycles:u # 3.561 GHz
5,400,115,856 instructions:u # 2.56 insn per cycle
2,100,025,848 branches:u # 3547.567 M/sec
5,098 branch-misses:u # 0.00% of all branches
0.592229400 seconds time elapsed