Created
December 19, 2020 07:55
ostimer: clock_gettime | |
userland_timer: rdtscp | |
perf_counter: yes | |
Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz | |
==== actual-freq ==== | |
-> : num_thread | |
| | 1| 2| 4| 7 | |
================================================ | |
| iadd64 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| fadd64 (0.01 [sec]) |3.38|3.38|3.38|2.79 | |
------------------------------------------------ | |
| iadd32x4 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| fadd64x2 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| fmul64x2 (0.01 [sec]) |3.38|3.38|3.38|2.79 | |
------------------------------------------------ | |
|busy_iadd32x8 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
|busy_imul32x8 (0.01 [sec]) |3.38|3.39|3.38|2.78 | |
------------------------------------------------ | |
|busy_fadd64x4 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
|busy_fmul64x4 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| busy_fma64x4 (0.01 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| iadd64 (0.10 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| fadd64 (0.10 [sec]) |3.38|3.38|3.38|2.78 | |
------------------------------------------------ | |
| iadd32x4 (0.10 [sec]) |3.38|3.38|3.38|2.71 | |
------------------------------------------------ | |
| fadd64x2 (0.10 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
| fmul64x2 (0.10 [sec]) |3.38|3.38|3.38|2.71 | |
------------------------------------------------ | |
|busy_iadd32x8 (0.10 [sec]) |3.38|3.38|3.38|2.71 | |
------------------------------------------------ | |
|busy_imul32x8 (0.10 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
|busy_fadd64x4 (0.10 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
|busy_fmul64x4 (0.10 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
| busy_fma64x4 (0.10 [sec]) |3.38|3.38|3.38|2.67 | |
------------------------------------------------ | |
| iadd64 (0.20 [sec]) |3.38|3.38|3.38|2.57 | |
------------------------------------------------ | |
| fadd64 (0.20 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
| iadd32x4 (0.20 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
| fadd64x2 (0.20 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
| fmul64x2 (0.20 [sec]) |3.38|3.38|3.38|2.69 | |
------------------------------------------------ | |
|busy_iadd32x8 (0.20 [sec]) |3.38|3.38|3.38|2.65 | |
------------------------------------------------ | |
|busy_imul32x8 (0.20 [sec]) |3.38|3.38|3.38|2.67 | |
------------------------------------------------ | |
|busy_fadd64x4 (0.20 [sec]) |3.38|3.38|3.38|2.68 | |
------------------------------------------------ | |
|busy_fmul64x4 (0.20 [sec]) |3.38|3.38|3.38|2.64 | |
------------------------------------------------ | |
| busy_fma64x4 (0.20 [sec]) |3.38|3.38|3.38|2.58 | |
------------------------------------------------ | |
| iadd64 (0.40 [sec]) |3.38|3.38|3.38|2.54 | |
------------------------------------------------ | |
| fadd64 (0.40 [sec]) |3.38|3.38|3.38|2.61 | |
------------------------------------------------ | |
| iadd32x4 (0.40 [sec]) |3.38|3.38|3.36|2.61 | |
------------------------------------------------ | |
| fadd64x2 (0.40 [sec]) |3.38|3.38|3.05|2.61 | |
------------------------------------------------ | |
| fmul64x2 (0.40 [sec]) |3.38|3.38|2.96|2.68 | |
------------------------------------------------ | |
|busy_iadd32x8 (0.40 [sec]) |3.38|3.38|2.84|2.60 | |
------------------------------------------------ | |
|busy_imul32x8 (0.40 [sec]) |3.38|3.38|2.87|2.64 | |
------------------------------------------------ | |
|busy_fadd64x4 (0.40 [sec]) |3.38|3.38|2.90|2.63 | |
------------------------------------------------ | |
|busy_fmul64x4 (0.40 [sec]) |3.38|3.38|2.87|2.52 | |
------------------------------------------------ | |
| busy_fma64x4 (0.40 [sec]) |3.38|3.38|2.78|2.53 | |
------------------------------------------------ | |
v : instruction | |
ostimer: clock_gettime | |
userland_timer: rdtscp | |
perf_counter: yes | |
11th Gen Intel(R) Core(TM) i5-1135G7 @ 2.40GHz | |
==== actual-freq ==== | |
-> : num_thread | |
| | 1| 2| 4| 7 | |
================================================= | |
| iadd64 (0.01 [sec]) |4.19|4.18|4.18|3.79 | |
------------------------------------------------- | |
| fadd64 (0.01 [sec]) |4.19|4.18|4.18|3.79 | |
------------------------------------------------- | |
| iadd32x4 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fadd64x2 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fmul64x2 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_iadd32x8 (0.01 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_imul32x8 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fadd64x4 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fmul64x4 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fma64x4 (0.01 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
|busy_iadd32x16 (0.01 [sec]) |4.08|4.09|4.08|3.79 | |
------------------------------------------------- | |
|busy_imul32x16 (0.01 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fadd64x8 (0.01 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fmul64x8 (0.01 [sec]) |4.08|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fma64x8 (0.01 [sec]) |4.08|4.09|4.08|3.79 | |
------------------------------------------------- | |
| iadd64 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fadd64 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| iadd32x4 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fadd64x2 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fmul64x2 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_iadd32x8 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_imul32x8 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fadd64x4 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fmul64x4 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fma64x4 (0.10 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
|busy_iadd32x16 (0.10 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
|busy_imul32x16 (0.10 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fadd64x8 (0.10 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fmul64x8 (0.10 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fma64x8 (0.10 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| iadd64 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fadd64 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| iadd32x4 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fadd64x2 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fmul64x2 (0.20 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_iadd32x8 (0.20 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_imul32x8 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fadd64x4 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_fmul64x4 (0.20 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_fma64x4 (0.20 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
|busy_iadd32x16 (0.20 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
|busy_imul32x16 (0.20 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fadd64x8 (0.20 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fmul64x8 (0.20 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fma64x8 (0.20 [sec]) |4.09|4.09|4.07|3.79 | |
------------------------------------------------- | |
| iadd64 (0.40 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| fadd64 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| iadd32x4 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| fadd64x2 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| fmul64x2 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_iadd32x8 (0.40 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
| busy_imul32x8 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_fadd64x4 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_fmul64x4 (0.40 [sec]) |4.19|4.19|4.17|3.79 | |
------------------------------------------------- | |
| busy_fma64x4 (0.40 [sec]) |4.19|4.19|4.18|3.79 | |
------------------------------------------------- | |
|busy_iadd32x16 (0.40 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
|busy_imul32x16 (0.40 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fadd64x8 (0.40 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fmul64x8 (0.40 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
| busy_fma64x8 (0.40 [sec]) |4.09|4.09|4.08|3.79 | |
------------------------------------------------- | |
v : instruction | |
ostimer: clock_gettime | |
userland_timer: rdtscp | |
perf_counter: yes | |
AMD Ryzen 7 3700X 8-Core Processor | |
==== actual-freq ==== | |
-> : num_thread | |
| | 1| 2| 4| 8| 15 | |
===================================================== | |
| iadd64 (0.01 [sec]) |4.32|4.33|4.21|4.27|4.12 | |
----------------------------------------------------- | |
| fadd64 (0.01 [sec]) |4.32|4.34|4.33|4.29|4.16 | |
----------------------------------------------------- | |
| iadd32x4 (0.01 [sec]) |4.27|4.27|4.30|4.25|4.12 | |
----------------------------------------------------- | |
| fadd64x2 (0.01 [sec]) |4.32|4.33|4.34|4.27|4.15 | |
----------------------------------------------------- | |
| fmul64x2 (0.01 [sec]) |4.25|4.35|4.34|4.28|4.16 | |
----------------------------------------------------- | |
|busy_iadd32x8 (0.01 [sec]) |4.34|4.31|4.32|4.24|4.10 | |
----------------------------------------------------- | |
|busy_imul32x8 (0.01 [sec]) |4.32|4.35|4.32|4.29|4.18 | |
----------------------------------------------------- | |
|busy_fadd64x4 (0.01 [sec]) |4.35|4.35|4.34|4.27|4.15 | |
----------------------------------------------------- | |
|busy_fmul64x4 (0.01 [sec]) |4.35|4.35|4.34|4.28|4.16 | |
----------------------------------------------------- | |
| busy_fma64x4 (0.01 [sec]) |4.34|4.34|4.33|4.27|4.18 | |
----------------------------------------------------- | |
| iadd64 (0.10 [sec]) |4.33|4.29|4.28|4.19|4.10 | |
----------------------------------------------------- | |
| fadd64 (0.10 [sec]) |4.36|4.34|4.31|4.18|4.13 | |
----------------------------------------------------- | |
| iadd32x4 (0.10 [sec]) |4.34|4.27|4.26|4.12|4.07 | |
----------------------------------------------------- | |
| fadd64x2 (0.10 [sec]) |4.36|4.34|4.29|4.19|4.11 | |
----------------------------------------------------- | |
| fmul64x2 (0.10 [sec]) |4.36|4.34|4.27|4.18|4.11 | |
----------------------------------------------------- | |
|busy_iadd32x8 (0.10 [sec]) |4.33|4.26|4.23|4.11|4.07 | |
----------------------------------------------------- | |
|busy_imul32x8 (0.10 [sec]) |4.33|4.36|4.33|4.21|4.14 | |
----------------------------------------------------- | |
|busy_fadd64x4 (0.10 [sec]) |4.36|4.33|4.31|4.19|4.08 | |
----------------------------------------------------- | |
|busy_fmul64x4 (0.10 [sec]) |4.35|4.33|4.31|4.17|4.08 | |
----------------------------------------------------- | |
| busy_fma64x4 (0.10 [sec]) |4.35|4.33|4.28|4.16|4.09 | |
----------------------------------------------------- | |
| iadd64 (0.20 [sec]) |4.34|4.28|4.27|4.13|4.06 | |
----------------------------------------------------- | |
| fadd64 (0.20 [sec]) |4.36|4.33|4.29|4.18|4.09 | |
----------------------------------------------------- | |
| iadd32x4 (0.20 [sec]) |4.34|4.26|4.24|4.13|4.06 | |
----------------------------------------------------- | |
| fadd64x2 (0.20 [sec]) |4.36|4.33|4.30|4.19|4.08 | |
----------------------------------------------------- | |
| fmul64x2 (0.20 [sec]) |4.36|4.34|4.31|4.16|4.09 | |
----------------------------------------------------- | |
|busy_iadd32x8 (0.20 [sec]) |4.34|4.24|4.23|4.12|4.02 | |
----------------------------------------------------- | |
|busy_imul32x8 (0.20 [sec]) |4.37|4.35|4.32|4.21|4.10 | |
----------------------------------------------------- | |
|busy_fadd64x4 (0.20 [sec]) |4.36|4.33|4.30|4.17|4.08 | |
----------------------------------------------------- | |
|busy_fmul64x4 (0.20 [sec]) |4.36|4.31|4.30|4.16|4.08 | |
----------------------------------------------------- | |
| busy_fma64x4 (0.20 [sec]) |4.36|4.31|4.30|4.16|4.09 | |
----------------------------------------------------- | |
| iadd64 (0.40 [sec]) |4.34|4.27|4.24|4.14|4.05 | |
----------------------------------------------------- | |
| fadd64 (0.40 [sec]) |4.35|4.34|4.30|4.17|4.08 | |
----------------------------------------------------- | |
| iadd32x4 (0.40 [sec]) |4.33|4.26|4.23|4.12|4.04 | |
----------------------------------------------------- | |
| fadd64x2 (0.40 [sec]) |4.35|4.34|4.31|4.16|4.06 | |
----------------------------------------------------- | |
| fmul64x2 (0.40 [sec]) |4.36|4.33|4.29|4.16|4.08 | |
----------------------------------------------------- | |
|busy_iadd32x8 (0.40 [sec]) |4.33|4.25|4.23|4.11|4.03 | |
----------------------------------------------------- | |
|busy_imul32x8 (0.40 [sec]) |4.38|4.36|4.33|4.17|4.09 | |
----------------------------------------------------- | |
|busy_fadd64x4 (0.40 [sec]) |4.36|4.33|4.29|4.16|4.08 | |
----------------------------------------------------- | |
|busy_fmul64x4 (0.40 [sec]) |4.36|4.32|4.29|4.15|4.07 | |
----------------------------------------------------- | |
| busy_fma64x4 (0.40 [sec]) |4.35|4.33|4.28|4.15|4.05 | |
----------------------------------------------------- | |
v : instruction | |
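
The tables above can be post-processed with a few lines of code. The following is a minimal sketch, not part of the original benchmark tool: `parse_row` and `relative_drop` are hypothetical helper names, and the values are assumed to be clock frequencies in GHz (the output only labels them "actual-freq"). It splits one result row into instruction name, run duration, and per-thread-count values, then computes how far the last column falls below the first.

```python
def parse_row(line):
    """Split one result row into (instruction, duration_sec, [value per thread count])."""
    cells = [c.strip() for c in line.split("|")]
    cells = [c for c in cells if c]  # drop empty cells left by leading/trailing pipes
    label, freqs = cells[0], [float(c) for c in cells[1:]]
    name, rest = label.split("(", 1)          # "busy_fma64x4 ", "0.40 [sec])"
    duration = float(rest.split("[", 1)[0])   # "0.40 " -> 0.4
    return name.strip(), duration, freqs


def relative_drop(freqs):
    """Fractional loss from the first column (1 thread) to the last column."""
    return (freqs[0] - freqs[-1]) / freqs[0]


# Row copied from the i5-8250U table (0.40 sec run).
row = "| busy_fma64x4 (0.40 [sec]) |3.38|3.38|2.78|2.53 | |"
name, duration, freqs = parse_row(row)
print(name, duration, freqs, f"{relative_drop(freqs):.1%}")
```

Applying the same helper to every row makes it easier to compare machines, for example the larger multi-thread drop on the i5-8250U against the nearly flat columns of the i5-1135G7.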