Created
April 26, 2021 09:36
-
-
Save XanClic/526aaedec0c60feb689188bb584ac06c to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
0000000000000000 <matrix_mul>: | |
matrix_mul(): | |
0: c4 e2 7d 19 1e vbroadcastsd ymm3,QWORD PTR [rsi] | |
5: c4 e2 7d 19 56 18 vbroadcastsd ymm2,QWORD PTR [rsi+0x18] | |
b: c5 e5 59 1f vmulpd ymm3,ymm3,YMMWORD PTR [rdi] | |
f: c4 e2 7d 19 4e 08 vbroadcastsd ymm1,QWORD PTR [rsi+0x8] | |
15: c4 e2 7d 19 46 10 vbroadcastsd ymm0,QWORD PTR [rsi+0x10] | |
1b: c4 e2 7d 19 66 38 vbroadcastsd ymm4,QWORD PTR [rsi+0x38] | |
21: c4 e2 7d 19 6e 40 vbroadcastsd ymm5,QWORD PTR [rsi+0x40] | |
27: c4 e2 7d 19 76 68 vbroadcastsd ymm6,QWORD PTR [rsi+0x68] | |
2d: c4 e2 f5 b8 5f 20 vfmadd231pd ymm3,ymm1,YMMWORD PTR [rdi+0x20] | |
33: c4 e2 7d 19 4e 28 vbroadcastsd ymm1,QWORD PTR [rsi+0x28] | |
39: c5 d5 59 2f vmulpd ymm5,ymm5,YMMWORD PTR [rdi] | |
3d: c4 e2 e5 98 47 40 vfmadd132pd ymm0,ymm3,YMMWORD PTR [rdi+0x40] | |
43: c5 fd 28 da vmovapd ymm3,ymm2 | |
47: c4 e2 7d 19 56 20 vbroadcastsd ymm2,QWORD PTR [rsi+0x20] | |
4d: c5 ed 59 17 vmulpd ymm2,ymm2,YMMWORD PTR [rdi] | |
51: c4 e2 f5 b8 57 20 vfmadd231pd ymm2,ymm1,YMMWORD PTR [rdi+0x20] | |
57: c4 e2 7d 19 4e 58 vbroadcastsd ymm1,QWORD PTR [rsi+0x58] | |
5d: c4 e2 fd 98 5f 60 vfmadd132pd ymm3,ymm0,YMMWORD PTR [rdi+0x60] | |
63: c4 e2 7d 19 46 30 vbroadcastsd ymm0,QWORD PTR [rsi+0x30] | |
69: c4 e2 ed 98 47 40 vfmadd132pd ymm0,ymm2,YMMWORD PTR [rdi+0x40] | |
6f: c4 e2 fd 98 67 60 vfmadd132pd ymm4,ymm0,YMMWORD PTR [rdi+0x60] | |
75: c4 e2 7d 19 46 50 vbroadcastsd ymm0,QWORD PTR [rsi+0x50] | |
7b: c5 fd 28 d4 vmovapd ymm2,ymm4 | |
7f: c4 e2 7d 19 66 48 vbroadcastsd ymm4,QWORD PTR [rsi+0x48] | |
85: c4 e2 d5 98 67 20 vfmadd132pd ymm4,ymm5,YMMWORD PTR [rdi+0x20] | |
8b: c4 e2 7d 19 6e 70 vbroadcastsd ymm5,QWORD PTR [rsi+0x70] | |
91: c4 e2 dd 98 47 40 vfmadd132pd ymm0,ymm4,YMMWORD PTR [rdi+0x40] | |
97: c4 e2 7d 19 66 78 vbroadcastsd ymm4,QWORD PTR [rsi+0x78] | |
9d: c4 e2 fd 98 4f 60 vfmadd132pd ymm1,ymm0,YMMWORD PTR [rdi+0x60] | |
a3: c4 e2 7d 19 46 60 vbroadcastsd ymm0,QWORD PTR [rsi+0x60] | |
a9: c5 fd 59 07 vmulpd ymm0,ymm0,YMMWORD PTR [rdi] | |
ad: c4 e2 cd b8 47 20 vfmadd231pd ymm0,ymm6,YMMWORD PTR [rdi+0x20] | |
b3: c4 e2 d5 b8 47 40 vfmadd231pd ymm0,ymm5,YMMWORD PTR [rdi+0x40] | |
b9: c4 e2 dd b8 47 60 vfmadd231pd ymm0,ymm4,YMMWORD PTR [rdi+0x60] | |
bf: c5 fd 29 1f vmovapd YMMWORD PTR [rdi],ymm3 | |
c3: c5 fd 29 57 20 vmovapd YMMWORD PTR [rdi+0x20],ymm2 | |
c8: c5 fd 29 4f 40 vmovapd YMMWORD PTR [rdi+0x40],ymm1 | |
cd: c5 fd 29 47 60 vmovapd YMMWORD PTR [rdi+0x60],ymm0 | |
d2: c5 f8 77 vzeroupper | |
d5: c3 ret |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment