Created
July 15, 2024 09:44
-
-
Save cyyself/470eab44ab97425882bddb88d43c41b4 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
baseline commit 6882f8c55e9ab55f298e5203c193d0ccd94e2a57: | |
➜ XiangShan git:(master) ✗ ~/.homebrew/bin/numactl --physcpubind=0-7,16-23 perf stat -e L1-dcache-load-misses -e L1-icache-load-misses -e branch-misses -e branch-load-misses -e cpu-cycles -e instructions -e stalled-cycles-frontend -e stalled-cycles-backend -e branch-instructions sh -c "./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null" | |
emu compiled at Jul 15 2024, 17:21:46 | |
Using simulated 32768B flash | |
Using simulated 8192MB RAM | |
The image is /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin | |
Running CoreMark for 2 iterations | |
2K performance run parameters for coremark. | |
CoreMark Size : 666 | |
Total time (ms) : 3161 | |
Iterations : 2 | |
Compiler version : GCC10.2.0 | |
seedcrc : 0xe9f5 | |
[0]crclist : 0xe714 | |
[0]crcmatrix : 0x1fd7 | |
[0]crcstate : 0x8e3a | |
[0]crcfinal : 0x72be | |
Finised in 3161 ms. | |
================================================== | |
CoreMark Iterations/Sec 632 | |
Core 0: HIT GOOD TRAP at pc = 0x800026ba | |
Core-0 instrCnt = 626,872, cycleCnt = 367,843, IPC = 1.704184 | |
Seed=0 Guest cycle spent: 367,847 (this will be different from cycleCnt if emu loads a snapshot) | |
Host time spent: 36,665ms | |
Performance counter stats for 'sh -c ./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null': | |
29,688,446,591 L1-dcache-load-misses (55.55%) | |
2,282,973,604 L1-icache-load-misses (55.55%) | |
1,386,393,206 branch-misses # 1.03% of all branches (55.55%) | |
1,383,956,833 branch-load-misses (55.55%) | |
1,460,411,727,859 cpu-cycles (55.55%) | |
1,661,233,695,046 instructions # 1.14 insn per cycle | |
# 0.13 stalled cycles per insn (55.56%) | |
12,163,534,465 stalled-cycles-frontend # 0.83% frontend cycles idle (55.57%) | |
220,732,222,891 stalled-cycles-backend # 15.11% backend cycles idle (55.56%) | |
134,405,866,602 branch-instructions (55.56%) | |
36.696658039 seconds time elapsed | |
292.938192000 seconds user | |
0.031999000 seconds sys | |
optimized 775d1684d840ef433a17766e48ab3968a26a0607: | |
➜ XiangShan git:(master) ✗ ~/.homebrew/bin/numactl --physcpubind=0-7,16-23 perf stat -e L1-dcache-load-misses -e L1-icache-load-misses -e branch-misses -e branch-load-misses -e cpu-cycles -e instructions -e stalled-cycles-frontend -e stalled-cycles-backend -e branch-instructions sh -c "./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null" | |
emu compiled at Jul 15 2024, 16:52:39 | |
Using simulated 32768B flash | |
Using simulated 8192MB RAM | |
The image is /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin | |
Running CoreMark for 2 iterations | |
2K performance run parameters for coremark. | |
CoreMark Size : 666 | |
Total time (ms) : 3161 | |
Iterations : 2 | |
Compiler version : GCC10.2.0 | |
seedcrc : 0xe9f5 | |
[0]crclist : 0xe714 | |
[0]crcmatrix : 0x1fd7 | |
[0]crcstate : 0x8e3a | |
[0]crcfinal : 0x72be | |
Finised in 3161 ms. | |
================================================== | |
CoreMark Iterations/Sec 632 | |
Core 0: HIT GOOD TRAP at pc = 0x800026ba | |
Core-0 instrCnt = 626,872, cycleCnt = 367,843, IPC = 1.704184 | |
Seed=0 Guest cycle spent: 367,847 (this will be different from cycleCnt if emu loads a snapshot) | |
Host time spent: 36,345ms | |
Performance counter stats for 'sh -c ./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null': | |
29,874,084,281 L1-dcache-load-misses (55.56%) | |
2,305,157,516 L1-icache-load-misses (55.56%) | |
1,406,497,447 branch-misses # 1.07% of all branches (55.56%) | |
1,406,122,854 branch-load-misses (55.56%) | |
1,444,627,064,092 cpu-cycles (55.56%) | |
1,664,024,873,662 instructions # 1.15 insn per cycle | |
# 0.13 stalled cycles per insn (55.56%) | |
11,406,108,646 stalled-cycles-frontend # 0.79% frontend cycles idle (55.56%) | |
223,995,817,060 stalled-cycles-backend # 15.51% backend cycles idle (55.56%) | |
131,632,170,101 branch-instructions (55.56%) | |
36.443674697 seconds time elapsed | |
288.636205000 seconds user | |
0.089624000 seconds sys |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment