Skip to content

Instantly share code, notes, and snippets.

@cyyself
Created July 15, 2024 09:44
Show Gist options
  • Save cyyself/470eab44ab97425882bddb88d43c41b4 to your computer and use it in GitHub Desktop.
Save cyyself/470eab44ab97425882bddb88d43c41b4 to your computer and use it in GitHub Desktop.
baseline commit 6882f8c55e9ab55f298e5203c193d0ccd94e2a57:
➜ XiangShan git:(master) ✗ ~/.homebrew/bin/numactl --physcpubind=0-7,16-23 perf stat -e L1-dcache-load-misses -e L1-icache-load-misses -e branch-misses -e branch-load-misses -e cpu-cycles -e instructions -e stalled-cycles-frontend -e stalled-cycles-backend -e branch-instructions sh -c "./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null"
emu compiled at Jul 15 2024, 17:21:46
Using simulated 32768B flash
Using simulated 8192MB RAM
The image is /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin
Running CoreMark for 2 iterations
2K performance run parameters for coremark.
CoreMark Size : 666
Total time (ms) : 3161
Iterations : 2
Compiler version : GCC10.2.0
seedcrc : 0xe9f5
[0]crclist : 0xe714
[0]crcmatrix : 0x1fd7
[0]crcstate : 0x8e3a
[0]crcfinal : 0x72be
Finised in 3161 ms.
==================================================
CoreMark Iterations/Sec 632
Core 0: HIT GOOD TRAP at pc = 0x800026ba
Core-0 instrCnt = 626,872, cycleCnt = 367,843, IPC = 1.704184
Seed=0 Guest cycle spent: 367,847 (this will be different from cycleCnt if emu loads a snapshot)
Host time spent: 36,665ms
Performance counter stats for 'sh -c ./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null':
29,688,446,591 L1-dcache-load-misses (55.55%)
2,282,973,604 L1-icache-load-misses (55.55%)
1,386,393,206 branch-misses # 1.03% of all branches (55.55%)
1,383,956,833 branch-load-misses (55.55%)
1,460,411,727,859 cpu-cycles (55.55%)
1,661,233,695,046 instructions # 1.14 insn per cycle
# 0.13 stalled cycles per insn (55.56%)
12,163,534,465 stalled-cycles-frontend # 0.83% frontend cycles idle (55.57%)
220,732,222,891 stalled-cycles-backend # 15.11% backend cycles idle (55.56%)
134,405,866,602 branch-instructions (55.56%)
36.696658039 seconds time elapsed
292.938192000 seconds user
0.031999000 seconds sys
optimized 775d1684d840ef433a17766e48ab3968a26a0607:
➜ XiangShan git:(master) ✗ ~/.homebrew/bin/numactl --physcpubind=0-7,16-23 perf stat -e L1-dcache-load-misses -e L1-icache-load-misses -e branch-misses -e branch-load-misses -e cpu-cycles -e instructions -e stalled-cycles-frontend -e stalled-cycles-backend -e branch-instructions sh -c "./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null"
emu compiled at Jul 15 2024, 16:52:39
Using simulated 32768B flash
Using simulated 8192MB RAM
The image is /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin
Running CoreMark for 2 iterations
2K performance run parameters for coremark.
CoreMark Size : 666
Total time (ms) : 3161
Iterations : 2
Compiler version : GCC10.2.0
seedcrc : 0xe9f5
[0]crclist : 0xe714
[0]crcmatrix : 0x1fd7
[0]crcstate : 0x8e3a
[0]crcfinal : 0x72be
Finised in 3161 ms.
==================================================
CoreMark Iterations/Sec 632
Core 0: HIT GOOD TRAP at pc = 0x800026ba
Core-0 instrCnt = 626,872, cycleCnt = 367,843, IPC = 1.704184
Seed=0 Guest cycle spent: 367,847 (this will be different from cycleCnt if emu loads a snapshot)
Host time spent: 36,345ms
Performance counter stats for 'sh -c ./build/emu --no-diff -i /nfs/home/chenyangyu/xs-env/XiangShan/ready-to-run/coremark-2-iteration.bin 2>/dev/null':
29,874,084,281 L1-dcache-load-misses (55.56%)
2,305,157,516 L1-icache-load-misses (55.56%)
1,406,497,447 branch-misses # 1.07% of all branches (55.56%)
1,406,122,854 branch-load-misses (55.56%)
1,444,627,064,092 cpu-cycles (55.56%)
1,664,024,873,662 instructions # 1.15 insn per cycle
# 0.13 stalled cycles per insn (55.56%)
11,406,108,646 stalled-cycles-frontend # 0.79% frontend cycles idle (55.56%)
223,995,817,060 stalled-cycles-backend # 15.51% backend cycles idle (55.56%)
131,632,170,101 branch-instructions (55.56%)
36.443674697 seconds time elapsed
288.636205000 seconds user
0.089624000 seconds sys
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment