Skip to content

Instantly share code, notes, and snippets.

@XanClic
Created April 26, 2021 09:22
Show Gist options
  • Save XanClic/a3a113b8f673438579f8c0213164939e to your computer and use it in GitHub Desktop.
Save XanClic/a3a113b8f673438579f8c0213164939e to your computer and use it in GitHub Desktop.
0000000000000000 <matrix_mul>:
matrix_mul():
0: c5 f8 28 1f vmovaps xmm3,XMMWORD PTR [rdi]
4: c4 e2 79 18 36 vbroadcastss xmm6,DWORD PTR [rsi]
9: c4 e2 79 18 6e 04 vbroadcastss xmm5,DWORD PTR [rsi+0x4]
f: c5 f8 28 57 10 vmovaps xmm2,XMMWORD PTR [rdi+0x10]
14: c4 e2 79 18 7e 0c vbroadcastss xmm7,DWORD PTR [rsi+0xc]
1a: c5 f8 28 4f 20 vmovaps xmm1,XMMWORD PTR [rdi+0x20]
1f: c4 e2 79 18 66 08 vbroadcastss xmm4,DWORD PTR [rsi+0x8]
25: c5 f8 28 47 30 vmovaps xmm0,XMMWORD PTR [rdi+0x30]
2a: c4 62 79 18 46 1c vbroadcastss xmm8,DWORD PTR [rsi+0x1c]
30: c4 62 79 18 4e 20 vbroadcastss xmm9,DWORD PTR [rsi+0x20]
36: c4 62 79 18 56 34 vbroadcastss xmm10,DWORD PTR [rsi+0x34]
3c: c5 c8 59 f3 vmulps xmm6,xmm6,xmm3
40: c4 e2 69 b8 f5 vfmadd231ps xmm6,xmm2,xmm5
45: c4 e2 79 18 6e 10 vbroadcastss xmm5,DWORD PTR [rsi+0x10]
4b: c5 30 59 cb vmulps xmm9,xmm9,xmm3
4f: c4 e2 49 98 e1 vfmadd132ps xmm4,xmm6,xmm1
54: c5 f8 28 f7 vmovaps xmm6,xmm7
58: c4 e2 79 18 7e 14 vbroadcastss xmm7,DWORD PTR [rsi+0x14]
5e: c5 d0 59 eb vmulps xmm5,xmm5,xmm3
62: c4 e2 59 98 f0 vfmadd132ps xmm6,xmm4,xmm0
67: c4 e2 79 18 66 18 vbroadcastss xmm4,DWORD PTR [rsi+0x18]
6d: c4 e2 69 b8 ef vfmadd231ps xmm5,xmm2,xmm7
72: c4 e2 79 18 7e 28 vbroadcastss xmm7,DWORD PTR [rsi+0x28]
78: c4 e2 51 98 e1 vfmadd132ps xmm4,xmm5,xmm1
7d: c5 78 29 c5 vmovaps xmm5,xmm8
81: c4 62 79 18 46 24 vbroadcastss xmm8,DWORD PTR [rsi+0x24]
87: c4 e2 59 98 e8 vfmadd132ps xmm5,xmm4,xmm0
8c: c4 e2 79 18 66 2c vbroadcastss xmm4,DWORD PTR [rsi+0x2c]
92: c4 62 31 98 c2 vfmadd132ps xmm8,xmm9,xmm2
97: c4 62 79 18 4e 38 vbroadcastss xmm9,DWORD PTR [rsi+0x38]
9d: c4 e2 39 98 f9 vfmadd132ps xmm7,xmm8,xmm1
a2: c4 62 79 18 46 3c vbroadcastss xmm8,DWORD PTR [rsi+0x3c]
a8: c4 e2 41 98 e0 vfmadd132ps xmm4,xmm7,xmm0
ad: c4 e2 79 18 7e 30 vbroadcastss xmm7,DWORD PTR [rsi+0x30]
b3: c5 f8 29 37 vmovaps XMMWORD PTR [rdi],xmm6
b7: c5 f8 29 6f 10 vmovaps XMMWORD PTR [rdi+0x10],xmm5
bc: c5 f8 29 67 20 vmovaps XMMWORD PTR [rdi+0x20],xmm4
c1: c5 c0 59 db vmulps xmm3,xmm7,xmm3
c5: c4 c2 61 98 d2 vfmadd132ps xmm2,xmm3,xmm10
ca: c4 c2 69 98 c9 vfmadd132ps xmm1,xmm2,xmm9
cf: c4 c2 71 98 c0 vfmadd132ps xmm0,xmm1,xmm8
d4: c5 f8 29 47 30 vmovaps XMMWORD PTR [rdi+0x30],xmm0
d9: c3 ret
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment