Skip to content

Instantly share code, notes, and snippets.

@tanakamura
Created December 19, 2020 07:55
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save tanakamura/7c96b797b7fe940af7f43bf6a45011e1 to your computer and use it in GitHub Desktop.
Save tanakamura/7c96b797b7fe940af7f43bf6a45011e1 to your computer and use it in GitHub Desktop.
ostimer: clock_gettime
userland_timer: rdtscp
perf_counter: yes
Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
==== actual-freq ====
-> : num_thread
| | 1| 2| 4| 7
================================================
| iadd64 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| fadd64 (0.01 [sec]) |3.38|3.38|3.38|2.79
------------------------------------------------
| iadd32x4 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| fadd64x2 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| fmul64x2 (0.01 [sec]) |3.38|3.38|3.38|2.79
------------------------------------------------
|busy_iadd32x8 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
|busy_imul32x8 (0.01 [sec]) |3.38|3.39|3.38|2.78
------------------------------------------------
|busy_fadd64x4 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
|busy_fmul64x4 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| busy_fma64x4 (0.01 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| iadd64 (0.10 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| fadd64 (0.10 [sec]) |3.38|3.38|3.38|2.78
------------------------------------------------
| iadd32x4 (0.10 [sec]) |3.38|3.38|3.38|2.71
------------------------------------------------
| fadd64x2 (0.10 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
| fmul64x2 (0.10 [sec]) |3.38|3.38|3.38|2.71
------------------------------------------------
|busy_iadd32x8 (0.10 [sec]) |3.38|3.38|3.38|2.71
------------------------------------------------
|busy_imul32x8 (0.10 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
|busy_fadd64x4 (0.10 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
|busy_fmul64x4 (0.10 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
| busy_fma64x4 (0.10 [sec]) |3.38|3.38|3.38|2.67
------------------------------------------------
| iadd64 (0.20 [sec]) |3.38|3.38|3.38|2.57
------------------------------------------------
| fadd64 (0.20 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
| iadd32x4 (0.20 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
| fadd64x2 (0.20 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
| fmul64x2 (0.20 [sec]) |3.38|3.38|3.38|2.69
------------------------------------------------
|busy_iadd32x8 (0.20 [sec]) |3.38|3.38|3.38|2.65
------------------------------------------------
|busy_imul32x8 (0.20 [sec]) |3.38|3.38|3.38|2.67
------------------------------------------------
|busy_fadd64x4 (0.20 [sec]) |3.38|3.38|3.38|2.68
------------------------------------------------
|busy_fmul64x4 (0.20 [sec]) |3.38|3.38|3.38|2.64
------------------------------------------------
| busy_fma64x4 (0.20 [sec]) |3.38|3.38|3.38|2.58
------------------------------------------------
| iadd64 (0.40 [sec]) |3.38|3.38|3.38|2.54
------------------------------------------------
| fadd64 (0.40 [sec]) |3.38|3.38|3.38|2.61
------------------------------------------------
| iadd32x4 (0.40 [sec]) |3.38|3.38|3.36|2.61
------------------------------------------------
| fadd64x2 (0.40 [sec]) |3.38|3.38|3.05|2.61
------------------------------------------------
| fmul64x2 (0.40 [sec]) |3.38|3.38|2.96|2.68
------------------------------------------------
|busy_iadd32x8 (0.40 [sec]) |3.38|3.38|2.84|2.60
------------------------------------------------
|busy_imul32x8 (0.40 [sec]) |3.38|3.38|2.87|2.64
------------------------------------------------
|busy_fadd64x4 (0.40 [sec]) |3.38|3.38|2.90|2.63
------------------------------------------------
|busy_fmul64x4 (0.40 [sec]) |3.38|3.38|2.87|2.52
------------------------------------------------
| busy_fma64x4 (0.40 [sec]) |3.38|3.38|2.78|2.53
------------------------------------------------
v : instruction
ostimer: clock_gettime
userland_timer: rdtscp
perf_counter: yes
11th Gen Intel(R) Core(TM) i5-1135G7 @ 2.40GHz
==== actual-freq ====
-> : num_thread
| | 1| 2| 4| 7
=================================================
| iadd64 (0.01 [sec]) |4.19|4.18|4.18|3.79
-------------------------------------------------
| fadd64 (0.01 [sec]) |4.19|4.18|4.18|3.79
-------------------------------------------------
| iadd32x4 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fadd64x2 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fmul64x2 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_iadd32x8 (0.01 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_imul32x8 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fadd64x4 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fmul64x4 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fma64x4 (0.01 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
|busy_iadd32x16 (0.01 [sec]) |4.08|4.09|4.08|3.79
-------------------------------------------------
|busy_imul32x16 (0.01 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fadd64x8 (0.01 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fmul64x8 (0.01 [sec]) |4.08|4.09|4.08|3.79
-------------------------------------------------
| busy_fma64x8 (0.01 [sec]) |4.08|4.09|4.08|3.79
-------------------------------------------------
| iadd64 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fadd64 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| iadd32x4 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fadd64x2 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fmul64x2 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_iadd32x8 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_imul32x8 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fadd64x4 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fmul64x4 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fma64x4 (0.10 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
|busy_iadd32x16 (0.10 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
|busy_imul32x16 (0.10 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fadd64x8 (0.10 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fmul64x8 (0.10 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fma64x8 (0.10 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| iadd64 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fadd64 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| iadd32x4 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fadd64x2 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fmul64x2 (0.20 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_iadd32x8 (0.20 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_imul32x8 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fadd64x4 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_fmul64x4 (0.20 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_fma64x4 (0.20 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
|busy_iadd32x16 (0.20 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
|busy_imul32x16 (0.20 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fadd64x8 (0.20 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fmul64x8 (0.20 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fma64x8 (0.20 [sec]) |4.09|4.09|4.07|3.79
-------------------------------------------------
| iadd64 (0.40 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| fadd64 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| iadd32x4 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| fadd64x2 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| fmul64x2 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_iadd32x8 (0.40 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
| busy_imul32x8 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_fadd64x4 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_fmul64x4 (0.40 [sec]) |4.19|4.19|4.17|3.79
-------------------------------------------------
| busy_fma64x4 (0.40 [sec]) |4.19|4.19|4.18|3.79
-------------------------------------------------
|busy_iadd32x16 (0.40 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
|busy_imul32x16 (0.40 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fadd64x8 (0.40 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fmul64x8 (0.40 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
| busy_fma64x8 (0.40 [sec]) |4.09|4.09|4.08|3.79
-------------------------------------------------
v : instruction
ostimer: clock_gettime
userland_timer: rdtscp
perf_counter: yes
AMD Ryzen 7 3700X 8-Core Processor
==== actual-freq ====
-> : num_thread
| | 1| 2| 4| 8| 15
=====================================================
| iadd64 (0.01 [sec]) |4.32|4.33|4.21|4.27|4.12
-----------------------------------------------------
| fadd64 (0.01 [sec]) |4.32|4.34|4.33|4.29|4.16
-----------------------------------------------------
| iadd32x4 (0.01 [sec]) |4.27|4.27|4.30|4.25|4.12
-----------------------------------------------------
| fadd64x2 (0.01 [sec]) |4.32|4.33|4.34|4.27|4.15
-----------------------------------------------------
| fmul64x2 (0.01 [sec]) |4.25|4.35|4.34|4.28|4.16
-----------------------------------------------------
|busy_iadd32x8 (0.01 [sec]) |4.34|4.31|4.32|4.24|4.10
-----------------------------------------------------
|busy_imul32x8 (0.01 [sec]) |4.32|4.35|4.32|4.29|4.18
-----------------------------------------------------
|busy_fadd64x4 (0.01 [sec]) |4.35|4.35|4.34|4.27|4.15
-----------------------------------------------------
|busy_fmul64x4 (0.01 [sec]) |4.35|4.35|4.34|4.28|4.16
-----------------------------------------------------
| busy_fma64x4 (0.01 [sec]) |4.34|4.34|4.33|4.27|4.18
-----------------------------------------------------
| iadd64 (0.10 [sec]) |4.33|4.29|4.28|4.19|4.10
-----------------------------------------------------
| fadd64 (0.10 [sec]) |4.36|4.34|4.31|4.18|4.13
-----------------------------------------------------
| iadd32x4 (0.10 [sec]) |4.34|4.27|4.26|4.12|4.07
-----------------------------------------------------
| fadd64x2 (0.10 [sec]) |4.36|4.34|4.29|4.19|4.11
-----------------------------------------------------
| fmul64x2 (0.10 [sec]) |4.36|4.34|4.27|4.18|4.11
-----------------------------------------------------
|busy_iadd32x8 (0.10 [sec]) |4.33|4.26|4.23|4.11|4.07
-----------------------------------------------------
|busy_imul32x8 (0.10 [sec]) |4.33|4.36|4.33|4.21|4.14
-----------------------------------------------------
|busy_fadd64x4 (0.10 [sec]) |4.36|4.33|4.31|4.19|4.08
-----------------------------------------------------
|busy_fmul64x4 (0.10 [sec]) |4.35|4.33|4.31|4.17|4.08
-----------------------------------------------------
| busy_fma64x4 (0.10 [sec]) |4.35|4.33|4.28|4.16|4.09
-----------------------------------------------------
| iadd64 (0.20 [sec]) |4.34|4.28|4.27|4.13|4.06
-----------------------------------------------------
| fadd64 (0.20 [sec]) |4.36|4.33|4.29|4.18|4.09
-----------------------------------------------------
| iadd32x4 (0.20 [sec]) |4.34|4.26|4.24|4.13|4.06
-----------------------------------------------------
| fadd64x2 (0.20 [sec]) |4.36|4.33|4.30|4.19|4.08
-----------------------------------------------------
| fmul64x2 (0.20 [sec]) |4.36|4.34|4.31|4.16|4.09
-----------------------------------------------------
|busy_iadd32x8 (0.20 [sec]) |4.34|4.24|4.23|4.12|4.02
-----------------------------------------------------
|busy_imul32x8 (0.20 [sec]) |4.37|4.35|4.32|4.21|4.10
-----------------------------------------------------
|busy_fadd64x4 (0.20 [sec]) |4.36|4.33|4.30|4.17|4.08
-----------------------------------------------------
|busy_fmul64x4 (0.20 [sec]) |4.36|4.31|4.30|4.16|4.08
-----------------------------------------------------
| busy_fma64x4 (0.20 [sec]) |4.36|4.31|4.30|4.16|4.09
-----------------------------------------------------
| iadd64 (0.40 [sec]) |4.34|4.27|4.24|4.14|4.05
-----------------------------------------------------
| fadd64 (0.40 [sec]) |4.35|4.34|4.30|4.17|4.08
-----------------------------------------------------
| iadd32x4 (0.40 [sec]) |4.33|4.26|4.23|4.12|4.04
-----------------------------------------------------
| fadd64x2 (0.40 [sec]) |4.35|4.34|4.31|4.16|4.06
-----------------------------------------------------
| fmul64x2 (0.40 [sec]) |4.36|4.33|4.29|4.16|4.08
-----------------------------------------------------
|busy_iadd32x8 (0.40 [sec]) |4.33|4.25|4.23|4.11|4.03
-----------------------------------------------------
|busy_imul32x8 (0.40 [sec]) |4.38|4.36|4.33|4.17|4.09
-----------------------------------------------------
|busy_fadd64x4 (0.40 [sec]) |4.36|4.33|4.29|4.16|4.08
-----------------------------------------------------
|busy_fmul64x4 (0.40 [sec]) |4.36|4.32|4.29|4.15|4.07
-----------------------------------------------------
| busy_fma64x4 (0.40 [sec]) |4.35|4.33|4.28|4.15|4.05
-----------------------------------------------------
v : instruction
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment