@ErwanAliasr1
Created January 13, 2023 13:20
AMD Zen4 7950X avx-turbo
[erwan@7950 avx-turbo]$ ./avx-turbo
CPUID highest leaf : [10h]
Running as root : [NO ]
MSR reads supported : [NO ]
CPU pinning enabled : [YES]
CPU supports zeroupper: [YES]
CPU supports AVX2 : [YES]
CPU supports AVX-512F : [YES]
CPU supports AVX-512VL: [YES]
CPU supports AVX-512BW: [YES]
CPU supports AVX-512CD: [YES]
CPUID doesn't support leaf 0x15, falling back to manual TSC calibration.
tsc_freq = 4500.0 MHz (from calibration loop)
CPU brand string: AMD Ryzen 9 7950X 16-Core Processor
32 available CPUs: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31]
16 physical cores: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]
Will test up to 16 CPUs
Cores | ID | Description | OVRLP3 | Mops
1 | pause_only | pause instruction | 1.000 | 8396
1 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387
1 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387
1 | scalar_iadd | Scalar integer adds | 1.000 | 5747
1 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549
1 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549
1 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549
1 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5549
1 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549
1 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549
1 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22222
1 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222
1 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32680
1 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32573
1 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32680
1 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5747
1 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5747
1 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5750
1 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5750
1 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5747
1 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5747
1 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549
1 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549
1 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549
1 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11099
1 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11099
1 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549
1 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775
1 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775
1 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775
1 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099
1 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099
1 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549
1 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850
1 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850
1 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850
1 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5747
1 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5750
1 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5750
1 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387
1 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387
1 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1388
1 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11099
1 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11099
1 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5552
1 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110
1 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5552
1 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110
1 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5552
Cores | ID | Description | OVRLP3 | Mops
2 | pause_only | pause instruction | 1.000 | 8410, 8410
2 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387, 1387
2 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387, 1387
2 | scalar_iadd | Scalar integer adds | 1.000 | 5747, 5747
2 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549, 5549
2 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549, 5549
2 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549, 5549
2 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5549, 5549
2 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549, 5549
2 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549, 5549
2 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22222, 22222
2 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222, 22222
2 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32680, 32680
2 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32680, 32680
2 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32680, 32680
2 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5747, 5747
2 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5750, 5501
2 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5747, 5504
2 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5750, 5747
2 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5750, 5747
2 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5747, 4602
2 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549
2 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549
2 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549
2 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11013, 11013
2 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11013, 11013
2 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549
2 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775
2 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775
2 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775
2 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099
2 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099
2 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549, 5549
2 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850, 1850
2 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850, 1850
2 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850, 1850
2 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5750, 5747
2 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5747, 5501
2 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5556, 5747
2 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387, 1387
2 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387, 1387
2 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1387, 1387
2 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11013
2 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11013, 11013
2 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5546, 5546
2 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110, 1110
2 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5549, 5549
2 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110, 1110
2 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5549, 5549
Cores | ID | Description | OVRLP3 | Mops
3 | pause_only | pause instruction | 1.000 | 8340, 8340, 8340
3 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387, 1387, 1387
3 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387, 1387, 1388
3 | scalar_iadd | Scalar integer adds | 1.000 | 5747, 5747, 5747
3 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549, 5549, 5549
3 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549, 5549, 5549
3 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549, 5549, 5549
3 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549
3 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549
3 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549
3 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 21978, 22026
3 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222, 22222, 22222
3 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362
3 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32680, 32680, 32680
3 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 33784, 27100, 33784
3 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5747, 5747, 5747
3 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5747, 5747, 5747
3 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5747, 5747, 5747
3 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5747, 5747, 5747
3 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5750, 5747, 5747
3 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5750, 5750, 5747
3 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549
3 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549
3 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549
3 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099
3 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099
3 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549
3 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775
3 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775
3 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775
3 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099
3 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099
3 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549, 5549, 5549
3 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850
3 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850
3 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850
3 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5747, 5750, 5747
3 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5750, 5747, 5750
3 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5747, 5747, 5747
3 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387, 1387, 1387
3 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387, 1387, 1387
3 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1387, 1387, 1387
3 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099
3 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099
3 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5549, 5552, 5552
3 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110, 1110, 1110
3 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5546, 5549, 5549
3 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110, 1110, 1110
3 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5546, 5546, 5546
Cores | ID | Description | OVRLP3 | Mops
4 | pause_only | pause instruction | 1.000 | 8410, 8410, 8410, 8410
4 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387, 1387, 1387, 1387
4 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387, 1387, 1387, 1387
4 | scalar_iadd | Scalar integer adds | 1.000 | 5747, 5750, 5747, 5747
4 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549, 5549, 5549, 5549
4 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549, 5549, 5549, 5549
4 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549, 5549, 5549, 5549
4 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5546, 5546, 5549, 5546
4 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549, 5549
4 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549, 5549
4 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22222, 22222, 22222, 22222
4 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222, 22222, 22124, 22222
4 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 33784, 27027, 33784, 33784
4 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32468, 31153, 32362, 32362
4 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 33784, 27100, 33784, 33784
4 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5750, 5750, 5750, 5750
4 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5747, 5750, 5747, 5747
4 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5747, 5747, 5747, 5747
4 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5747, 5747, 5747, 5747
4 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5747, 5747, 5747, 5747
4 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5750, 5750, 5747, 5750
4 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549
4 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549
4 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549
4 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099, 11099
4 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099, 11099
4 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549
4 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775, 2775
4 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775, 2775
4 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775, 2775
4 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099, 11099
4 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099, 11099
4 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549, 5549, 5549, 5549
4 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850, 1850
4 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850, 1850
4 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850, 1850
4 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5747, 5747, 5747, 5750
4 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5747, 5750, 5747, 5747
4 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5747, 5747, 5747, 5747
4 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387, 1387, 1387, 1387
4 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387, 1387, 1387, 1387
4 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1388, 1388, 1387, 1388
4 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099, 11099
4 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099, 11099
4 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5549, 5549, 5549, 5549
4 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110, 1110, 1110, 1110
4 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5549, 5549, 5549, 5546
4 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110, 1110, 1110, 1110
4 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5549, 5549, 5549, 5549
Cores | ID | Description | OVRLP3 | Mops
5 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333
5 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375
5 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375
5 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5498, 5501, 5501, 5501
5 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 22026, 22026, 22026
5 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 21978, 21978, 22026
5 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362
5 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362
5 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362
5 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501
5 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5501
5 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5498, 5498, 5498, 5501, 5498
5 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 10989, 11001
5 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001
5 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750
5 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750
5 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750
5 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001
5 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 10989, 11001
5 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1833
5 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1833, 1833
5 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1833, 1833, 1834
5 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375
5 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375
5 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375
5 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001
5 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001
5 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100
5 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501
5 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100
5 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501
Cores | ID | Description | OVRLP3 | Mops
6 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333
6 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375
6 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375
6 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501
6 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 21978, 22026, 22026, 22026
6 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 22026, 22026, 21978
6 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362
6 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362
6 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362
6 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501
6 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501
6 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501
6 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501
6 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001
6 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001
6 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750
6 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750
6 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750
6 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001
6 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001
6 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834
6 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1834, 1833, 1834
6 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1833, 1834, 1833
6 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375
6 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375
6 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375
6 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001
6 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001
6 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100
6 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
6 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100
6 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501
Cores | ID | Description | OVRLP3 | Mops
7 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333
7 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375
7 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375
7 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 21978, 22026, 22026, 22026, 21978
7 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 21978, 22026, 21978, 22026, 21978
7 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362
7 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362
7 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362
7 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5498, 5501, 5498, 5501, 5501, 5501, 5501
7 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498
7 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5498
7 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 10989, 10989, 11001, 11001, 11001
7 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001
7 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750
7 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750
7 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750
7 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 10989, 11001, 11001, 11001, 11001, 11001, 11001
7 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001
7 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1833, 1833, 1833, 1834, 1833
7 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1834, 1834, 1834, 1834
7 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1833, 1834, 1833, 1834, 1833
7 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5498
7 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375
7 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375
7 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375
7 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001
7 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001
7 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100
7 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
7 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100
7 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501
Cores | ID | Description | OVRLP3 | Mops
8 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333
8 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375
8 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375
8 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 21978, 21978, 21978, 21978, 22026, 22026, 22026
8 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 21978, 22026, 21978, 21978, 22026, 21978, 21978, 22026
8 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32258, 32362, 32362, 32362, 32362
8 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362
8 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362
8 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5501
8 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5498
8 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5498
8 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501
8 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 10989, 11001
8 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001, 11001
8 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750
8 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750
8 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750
8 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 10989, 11001, 11001
8 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 11001, 11001, 11001
8 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1833, 1834, 1833, 1833
8 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1833, 1834, 1833, 1834, 1833
8 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1834, 1833, 1834, 1833, 1834, 1833
8 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501
8 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5501
8 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375
8 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375
8 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375
8 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001
8 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001
8 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100
8 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
8 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100
8 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501
Cores | ID | Description | OVRLP3 | Mops
9 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333
9 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363
9 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363
9 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5498, 5498, 5501, 5501, 5450
9 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 21978, 21978, 22026, 22026, 22026, 21882
9 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 22026, 21978, 21978, 22026, 21978, 22026, 21882
9 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32258
9 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32154
9 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32154
9 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453
9 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5498, 5453
9 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5501, 5450
9 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5498, 5498, 5501, 5501, 5498, 5450
9 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450
9 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450
9 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5476
9 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 10989, 11001, 10953
9 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 11001, 11001, 11001, 10953
9 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5498, 5501, 5476
9 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737
9 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737
9 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737
9 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 10989, 10989, 11001, 11001, 11001, 11001, 11001, 11001, 10953
9 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10953
9 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476
9 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1825
9 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1833, 1834, 1833, 1834, 1834, 1825
9 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1833, 1833, 1834, 1834, 1833, 1825
9 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450
9 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5498, 5501, 5450
9 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450
9 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369
9 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369
9 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369
9 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10953
9 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10953
9 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5473
9 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090
9 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453
9 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090
9 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450
Cores | ID | Description | OVRLP3 | Mops
10 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8292, 8292
10 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363, 1363
10 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363, 1363
10 | scalar_iadd | Scalar integer adds | 1.000 | 5498, 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5450, 5450
10 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5473, 5473
10 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453, 5453
10 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 22026, 21978, 21978, 22026, 21978, 21786, 21786
10 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 22026, 22026, 21978, 21978, 21978, 22026, 21786, 21786
10 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32051, 32051
10 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32051, 32051
10 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32154, 32154
10 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5450, 5450
10 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5501, 5498, 5501, 5501, 5498, 5450, 5450
10 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5450, 5450
10 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5470, 5470
10 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453, 5453
10 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5473, 5473
10 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10905, 10905
10 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10941, 10941
10 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737, 2737
10 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737, 2737
10 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2725, 2725
10 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001, 10989, 10905, 10905
10 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10989, 10905, 10905
10 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834, 1833, 1834, 1825, 1825
10 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1817, 1817
10 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1833, 1834, 1834, 1834, 1834, 1817, 1817
10 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369, 1369
10 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369, 1369
10 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363, 1363
10 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10905, 10905
10 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10893, 10905
10 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090, 1090
10 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450
10 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090, 1090
10 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453, 5450
Cores | ID | Description | OVRLP3 | Mops
11 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8258, 8258, 8258
11 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350
11 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350
11 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5402, 5402, 5400
11 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 21978, 21978, 21978, 21978, 22026, 21692, 21692, 21692
11 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 22026, 21978, 22026, 22026, 22026, 21692, 21692, 21692
11 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949
11 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949
11 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31847
11 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423
11 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5498, 5498, 5498, 5423, 5423, 5423
11 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5405, 5405, 5402
11 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423
11 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423
11 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846
11 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10989, 10846, 10846, 10846
11 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712
11 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2713, 2712
11 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712
11 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 10989, 10989, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846
11 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846
11 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1833, 1833, 1834, 1834, 1834, 1809, 1809, 1808
11 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1833, 1833, 1833, 1834, 1834, 1809, 1809, 1809
11 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1833, 1833, 1834, 1834, 1834, 1808, 1808, 1809
11 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400
11 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5426, 5426, 5426
11 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5405, 5405, 5405
11 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356
11 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356
11 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356
11 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846
11 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846
11 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080
11 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
11 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080
11 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426
Cores | ID | Description | OVRLP3 | Mops
12 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8217, 8224, 8217, 8224
12 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350
12 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350
12 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 22026, 22026, 22026, 22026, 21978, 21692, 21692, 21692, 21692
12 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 22026, 21978, 22026, 22026, 22026, 21692, 21692, 21692, 21692
12 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949, 31949
12 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949, 31949
12 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949, 31949
12 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846
12 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846
12 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426
12 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712, 2712
12 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712, 2712
12 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712, 2712
12 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846
12 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 10989, 11001, 11001, 10846, 10846, 10846, 10846
12 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5402, 5402, 5402, 5402
12 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1834, 1833, 1833, 1833, 1834, 1808, 1808, 1808, 1808
12 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1808, 1808, 1808, 1808
12 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1833, 1833, 1833, 1833, 1833, 1833, 1808, 1808, 1808, 1808
12 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
12 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356, 1356
12 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356, 1356
12 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356, 1356
12 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846
12 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846
12 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423, 5423
12 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080, 1080
12 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5426, 5426, 5423
12 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080, 1080
12 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400
Cores | ID | Description | OVRLP3 | Mops
13 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8217, 8217, 8217, 8217, 8217
13 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344
13 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344
13 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376
13 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5400, 5400, 5400, 5400, 5400
13 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 21978, 21978, 21978, 21978, 21978, 21978, 21598, 21598, 21598, 21598, 21598
13 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 21978, 21978, 22026, 21978, 22026, 22026, 22026, 22026, 21598, 21598, 21598, 21598, 21598
13 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746
13 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746
13 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31847, 31746, 31847, 31847, 31847
13 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5498, 5400, 5400, 5400, 5400, 5400
13 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5498, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799
13 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799
13 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2701, 2701, 2701, 2701, 2701
13 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700
13 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700
13 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 10989, 11001, 11001, 10799, 10799, 10799, 10799, 10799
13 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799
13 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5397, 5400, 5400
13 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1834, 1834, 1833, 1834, 1834, 1800, 1800, 1800, 1800, 1800
13 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1833, 1834, 1834, 1834, 1833, 1834, 1800, 1800, 1800, 1800, 1800
13 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1834, 1833, 1834, 1834, 1834, 1800, 1800, 1800, 1800, 1800
13 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5498, 5498, 5498, 5501, 5501, 5498, 5376, 5376, 5376, 5376, 5376
13 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5498, 5501, 5501, 5498, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350
13 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350
13 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350
13 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10811, 10799
13 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10811, 10811, 10799, 10799, 10799
13 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5402, 5400, 5400, 5400, 5400
13 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075
13 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400
13 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075
13 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5397, 5397, 5397, 5397, 5397
Cores | ID | Description | OVRLP3 | Mops
14 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8183, 8183, 8183, 8183, 8183, 8183
14 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344
14 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344
14 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 22026, 21978, 22026, 21978, 21978, 22026, 21598, 21598, 21598, 21598, 21598, 21598
14 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 22026, 22026, 21978, 21978, 22026, 22026, 21598, 21598, 21598, 21598, 21598, 21598
14 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746
14 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746
14 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31847, 31847, 31847, 31746, 31847
14 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5498, 5501, 5498, 5501, 5498, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5498, 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5498, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799
14 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799
14 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400
14 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700, 2700
14 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700, 2700
14 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2699, 2699, 2698, 2699, 2699, 2699
14 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 10989, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799
14 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799
14 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1800, 1800, 1800, 1800, 1800, 1800
14 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1833, 1834, 1834, 1833, 1834, 1834, 1800, 1800, 1800, 1800, 1800, 1800
14 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1833, 1834, 1833, 1834, 1800, 1800, 1800, 1800, 1800, 1800
14 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5498, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376
14 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350
14 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350
14 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350
14 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799
14 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10811, 10799, 10799, 10799
14 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373
14 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075, 1075
14 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373
14 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075, 1075
14 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373
Cores | ID | Description | OVRLP3 | Mops
15 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8183, 8183, 8183, 8183, 8183, 8183, 8183
15 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344
15 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1338, 1338, 1338, 1338, 1338, 1338, 1338
15 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 22026, 22026, 21978, 21978, 21978, 21978, 21505, 21505, 21505, 21505, 21505, 21505, 21505
15 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 21978, 21978, 21978, 22026, 22026, 21505, 21505, 21505, 21505, 21505, 21505, 21505
15 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746, 31746
15 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746, 31746
15 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646
15 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5498, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400, 5400
15 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5397, 5400, 5400, 5400, 5400, 5400, 5400
15 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 10989, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10741
15 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10787, 10787, 10787, 10787, 10799, 10787, 10787
15 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2687, 2687, 2687, 2687, 2688, 2688
15 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2688, 2688, 2688, 2688, 2688, 2688
15 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2687, 2687, 2688, 2687, 2688, 2687
15 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10741, 10753, 10753, 10753, 10753, 10753
15 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 10989, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10741, 10753
15 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1800, 1800, 1800, 1800, 1800, 1800, 1800
15 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1834, 1833, 1833, 1833, 1834, 1800, 1800, 1800, 1800, 1800, 1800, 1800
15 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1792, 1792, 1792, 1792, 1792, 1792, 1792
15 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5498, 5501, 5498, 5501, 5501, 5498, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376
15 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350, 1350
15 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344
15 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344
15 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753
15 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10695, 10695, 10695, 10695, 10695, 10695, 10695
15 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373
15 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070
15 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373
15 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070
15 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373
Cores | ID | Description | OVRLP3 | Mops
16 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8183, 8183, 8183, 8183, 8183, 8183, 8183, 8183
16 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1338, 1338, 1338, 1338, 1338, 1338, 1338, 1338
16 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1338, 1337, 1338, 1338, 1338, 1338, 1338, 1338
16 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5350, 5350, 5350, 5350, 5350, 5350, 5350, 5350
16 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 21978, 21978, 21978, 22026, 21978, 22026, 22026, 21505, 21505, 21505, 21505, 21505, 21505, 21505, 21505
16 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 21978, 22026, 21978, 22026, 22026, 21505, 21505, 21505, 21459, 21505, 21505, 21505, 21505
16 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646, 31646
16 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646, 31646
16 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646, 31646
16 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5498, 5501, 5501, 5498, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373, 5373
16 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5373, 5376, 5376, 5376, 5376
16 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 10989, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753
16 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 10989, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753
16 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2687, 2687, 2687, 2687, 2687, 2687, 2687
16 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2687, 2687, 2687, 2687, 2687, 2687, 2687, 2687
16 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2687, 2687, 2687, 2687, 2687, 2687, 2687, 2687
16 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10741, 10753, 10753, 10753
16 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10741, 10753, 10753, 10753, 10753, 10753, 10753
16 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1834, 1834, 1834, 1833, 1833, 1792, 1792, 1792, 1792, 1792, 1792, 1792, 1792
16 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1834, 1833, 1834, 1834, 1834, 1833, 1792, 1792, 1792, 1792, 1792, 1792, 1792, 1792
16 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1834, 1834, 1834, 1834, 1834, 1792, 1792, 1792, 1792, 1792, 1792, 1792, 1792
16 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5373, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5498, 5501, 5501, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376
16 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344, 1344
16 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344, 1344
16 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344, 1344
16 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753
16 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753
16 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5356, 5356, 5356, 5373, 5353, 5373, 5373
16 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070, 1070
16 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5350, 5350, 5350, 5350, 5350, 5350, 5350, 5350
16 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070, 1070
16 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373, 5373