Created September 23, 2018 17:20
# JMH version: 1.20 | |
# VM version: JDK 12-internal, VM 12-internal+0-adhoc.root.dev | |
# VM invoker: /home/richard/workspace/dev/build/linux-x86_64-normal-server-release/images/jdk/bin/java | |
# VM options: --add-modules=jdk.incubator.vector -Djdk.incubator.vector.VECTOR_ACCESS_OOB_CHECK=0 | |
# Warmup: 20 iterations, 1 s each | |
# Measurement: 20 iterations, 1 s each | |
# Timeout: 10 min per iteration | |
# Threads: 1 thread, will synchronize iterations | |
# Benchmark mode: Throughput, ops/time | |
# Benchmark: com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.popcnt | |
# Parameters: (size = 1024) | |
# Run progress: 0.00% complete, ETA 00:04:00 | |
# Fork: 1 of 1 | |
# Preparing profilers: LinuxPerfAsmProfiler | |
# Profilers consume stdout and stderr from target VM, use -v EXTRA to copy to console | |
# Warmup Iteration 1: 1928.739 ops/ms | |
# Warmup Iteration 2: 1888.794 ops/ms | |
# Warmup Iteration 3: 1889.059 ops/ms | |
# Warmup Iteration 4: 1937.593 ops/ms | |
# Warmup Iteration 5: 1701.530 ops/ms | |
# Warmup Iteration 6: 1711.947 ops/ms | |
# Warmup Iteration 7: 1719.288 ops/ms | |
# Warmup Iteration 8: 1613.266 ops/ms | |
# Warmup Iteration 9: 1719.744 ops/ms | |
# Warmup Iteration 10: 1778.811 ops/ms | |
# Warmup Iteration 11: 1732.071 ops/ms | |
# Warmup Iteration 12: 1781.464 ops/ms | |
# Warmup Iteration 13: 1886.609 ops/ms | |
# Warmup Iteration 14: 1847.048 ops/ms | |
# Warmup Iteration 15: 2062.539 ops/ms | |
# Warmup Iteration 16: 2115.832 ops/ms | |
# Warmup Iteration 17: 1998.139 ops/ms | |
# Warmup Iteration 18: 2027.166 ops/ms | |
# Warmup Iteration 19: 1885.706 ops/ms | |
# Warmup Iteration 20: 1818.164 ops/ms | |
Iteration 1: 1933.133 ops/ms | |
Iteration 2: 1884.957 ops/ms | |
Iteration 3: 1931.504 ops/ms | |
Iteration 4: 2004.028 ops/ms | |
Iteration 5: 1943.589 ops/ms | |
Iteration 6: 1860.841 ops/ms | |
Iteration 7: 1938.185 ops/ms | |
Iteration 8: 1666.135 ops/ms | |
Iteration 9: 1870.358 ops/ms | |
Iteration 10: 2032.499 ops/ms | |
Iteration 11: 1953.443 ops/ms | |
Iteration 12: 2089.342 ops/ms | |
Iteration 13: 2078.010 ops/ms | |
Iteration 14: 1831.790 ops/ms | |
Iteration 15: 1986.962 ops/ms | |
Iteration 16: 1719.909 ops/ms | |
Iteration 17: 1788.452 ops/ms | |
Iteration 18: 1909.800 ops/ms | |
Iteration 19: 1900.186 ops/ms | |
Iteration 20: 1767.354 ops/ms | |
# Processing profiler results: LinuxPerfAsmProfiler | |
Result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.popcnt": | |
1904.524 ±(99.9%) 96.763 ops/ms [Average] | |
(min, avg, max) = (1666.135, 1904.524, 2089.342), stdev = 111.432 | |
CI (99.9%): [1807.761, 2001.286] (assumes normal distribution) | |
Secondary result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.popcnt:·asm": | |
PrintAssembly processed: 165704 total address lines. | |
Perf output processed (skipped 22.710 seconds): | |
Column 1: cycles (20513 events) | |
Hottest code regions (>10.00% "cycles" events): | |
....[Hottest Region 1].............................................................................. | |
c2, level 4, com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub, version 565 (423 bytes) | |
0x00007fe418240c31: movzbl 0x94(%r8),%r11d ;*getfield isDone {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@30 (line 160) | |
; implicit exception: dispatches to 0x00007fe418240f8e | |
0x00007fe418240c39: test %r11d,%r11d | |
╭ 0x00007fe418240c3c: jne 0x00007fe418240e09 ;*ifeq {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@33 (line 160) | |
│ 0x00007fe418240c42: mov $0x1,%ebp | |
│╭ 0x00007fe418240c47: jmpq 0x00007fe418240d34 | |
││ ↗ 0x00007fe418240c4c: vmovq %xmm0,%r9 | |
││ │ 0x00007fe418240c51: vmovq %xmm1,%r8 | |
0.00% ││ │ ↗ 0x00007fe418240c56: vmovq %r9,%xmm0 | |
0.04% ││ │ │ 0x00007fe418240c5b: vmovq %r8,%xmm1 ;*iload_2 {reexecute=0 rethrow=0 return_oop=0} | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@12 (line 35) | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
1.71% ││↗│ │ 0x00007fe418240c60: movslq %ecx,%r9 | |
1.42% ││││ │ 0x00007fe418240c63: mov 0x10(%rbx,%rcx,8),%r10 | |
8.98% ││││ │ 0x00007fe418240c68: and 0x10(%rdi,%rcx,8),%r10 | |
3.51% ││││ │ 0x00007fe418240c6d: popcnt %r10,%r8 | |
3.21% ││││ │ 0x00007fe418240c72: add %r8d,%edx | |
2.48% ││││ │ 0x00007fe418240c75: mov 0x28(%rbx,%r9,8),%r10 | |
8.19% ││││ │ 0x00007fe418240c7a: and 0x28(%rdi,%r9,8),%r10 | |
3.59% ││││ │ 0x00007fe418240c7f: popcnt %r10,%r10 | |
3.73% ││││ │ 0x00007fe418240c84: mov 0x20(%rbx,%r9,8),%r8 | |
2.16% ││││ │ 0x00007fe418240c89: and 0x20(%rdi,%r9,8),%r8 | |
7.53% ││││ │ 0x00007fe418240c8e: popcnt %r8,%rsi | |
6.21% ││││ │ 0x00007fe418240c93: mov 0x18(%rbx,%r9,8),%r8 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
││││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@42 (line 36) | |
││││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
2.30% ││││ │ 0x00007fe418240c98: and 0x18(%rdi,%r9,8),%r8 | |
2.07% ││││ │ 0x00007fe418240c9d: popcnt %r8,%r9 | |
12.75% ││││ │ 0x00007fe418240ca2: add %r9d,%edx | |
6.01% ││││ │ 0x00007fe418240ca5: add %esi,%edx | |
5.70% ││││ │ 0x00007fe418240ca7: add %r10d,%edx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
││││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@47 (line 36) | |
││││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
8.60% ││││ │ 0x00007fe418240caa: add $0x4,%ecx ;*iinc {reexecute=0 rethrow=0 return_oop=0} | |
││││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@49 (line 35) | |
││││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
3.58% ││││ │ 0x00007fe418240cad: cmp %r11d,%ecx | |
││╰│ │ 0x00007fe418240cb0: jl 0x00007fe418240c60 ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@52 (line 35) | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.04% ││ │ │ 0x00007fe418240cb2: mov 0x108(%r15),%r10 ; ImmutableOopMap{rbx=Oop rdi=Oop xmm0=Oop xmm1=Oop [96]=Oop [16]=Oop } | |
││ │ │ ;*goto {reexecute=1 rethrow=0 return_oop=0} | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@52 (line 35) | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.05% ││ │ │ 0x00007fe418240cb9: test %eax,(%r10) ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@52 (line 35) | |
││ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
││ │ │ ; {poll} | |
0.29% ││ │ │ 0x00007fe418240cbc: cmp %r11d,%ecx | |
││ ╰ │ 0x00007fe418240cbf: jl 0x00007fe418240c4c | |
││ │ 0x00007fe418240cc1: vmovq %xmm0,%r9 | |
││ │ 0x00007fe418240cc6: vmovq %xmm1,%r8 | |
0.00% ││ ↗│ 0x00007fe418240ccb: cmp %r14d,%ecx | |
││ ╭ ││ 0x00007fe418240cce: jge 0x00007fe418240cfa ;*iload_2 {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@12 (line 35) | |
││ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.08% ││ │↗ ││ 0x00007fe418240cd0: cmp %r13d,%ecx | |
││ ││ ││ 0x00007fe418240cd3: jge 0x00007fe418240e78 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@18 (line 35) | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.00% ││ ││ ││ 0x00007fe418240cd9: cmp %eax,%ecx | |
││ ││ ││ 0x00007fe418240cdb: jge 0x00007fe418240ea4 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@27 (line 35) | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.20% ││ ││ ││ 0x00007fe418240ce1: mov 0x10(%rbx,%rcx,8),%r10 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@42 (line 36) | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.01% ││ ││ ││ 0x00007fe418240ce6: and 0x10(%rdi,%rcx,8),%r10 | |
0.06% ││ ││ ││ 0x00007fe418240ceb: popcnt %r10,%r11 | |
0.02% ││ ││ ││ 0x00007fe418240cf0: add %r11d,%edx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@47 (line 36) | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.19% ││ ││ ││ 0x00007fe418240cf3: inc %ecx ;*iinc {reexecute=0 rethrow=0 return_oop=0} | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@49 (line 35) | |
││ ││ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
││ ││ ││ 0x00007fe418240cf5: cmp %r14d,%ecx | |
││ │╰ ││ 0x00007fe418240cf8: jl 0x00007fe418240cd0 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@9 (line 35) | |
││ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.00% ││ ↘ ││↗ 0x00007fe418240cfa: mov %r8,0x58(%rsp) | |
││ │││ 0x00007fe418240cff: mov %r9,0x50(%rsp) | |
0.04% ││ │││ 0x00007fe418240d04: mov 0x10(%rsp),%rsi | |
││ │││ 0x00007fe418240d09: xchg %ax,%ax | |
││ │││ 0x00007fe418240d0b: callq 0x00007fe410765f00 ; ImmutableOopMap{[80]=Oop [88]=Oop [96]=Oop [16]=Oop } | |
││ │││ ;*invokevirtual consume {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@20 (line 158) | |
││ │││ ; {optimized virtual_call} | |
0.46% ││ │││ 0x00007fe418240d10: mov 0x58(%rsp),%r8 | |
││ │││ 0x00007fe418240d15: movzbl 0x94(%r8),%r11d ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@52 (line 35) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.01% ││ │││ 0x00007fe418240d1d: mov 0x108(%r15),%r10 | |
0.06% ││ │││ 0x00007fe418240d24: add $0x1,%rbp ; ImmutableOopMap{r8=Oop [80]=Oop [96]=Oop [16]=Oop } | |
││ │││ ;*ifeq {reexecute=1 rethrow=0 return_oop=0} | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@33 (line 160) | |
││ │││ 0x00007fe418240d28: test %eax,(%r10) ; {poll} | |
││ │││ 0x00007fe418240d2b: test %r11d,%r11d | |
││ ╭ │││ 0x00007fe418240d2e: jne 0x00007fe418240e0e ;*aload_1 {reexecute=0 rethrow=0 return_oop=0} | |
││ │ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@36 (line 161) | |
│↘ │ │││ 0x00007fe418240d34: mov 0x50(%rsp),%r9 | |
0.06% │ │ │││ 0x00007fe418240d39: mov 0xc(%r9),%r14d ;*getfield size {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@6 (line 35) | |
│ │ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │ │ │││ 0x00007fe418240d3d: test %r14d,%r14d | |
0.00% │ │╭ │││ 0x00007fe418240d40: jle 0x00007fe418240e02 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@9 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.02% │ ││ │││ 0x00007fe418240d46: mov 0x10(%r9),%r10d ;*getfield left {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@14 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.03% │ ││ │││ 0x00007fe418240d4a: mov 0xc(%r12,%r10,8),%r13d ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@17 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
│ ││ │││ ; implicit exception: dispatches to 0x00007fe418240e40 | |
0.07% │ ││ │││ 0x00007fe418240d4f: mov 0x14(%r9),%ebx ;*getfield right {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@23 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
│ ││ │││ 0x00007fe418240d53: mov 0xc(%r12,%rbx,8),%eax ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@26 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
│ ││ │││ ; implicit exception: dispatches to 0x00007fe418240e40 | |
0.00% │ ││ │││ 0x00007fe418240d58: test %r13d,%r13d | |
│ ││ │││ 0x00007fe418240d5b: jbe 0x00007fe418240e40 | |
0.04% │ ││ │││ 0x00007fe418240d61: movslq %r13d,%r11 | |
0.01% │ ││ │││ 0x00007fe418240d64: movslq %r14d,%rcx | |
│ ││ │││ 0x00007fe418240d67: dec %rcx | |
0.01% │ ││ │││ 0x00007fe418240d6a: cmp %r11,%rcx | |
│ ││ │││ 0x00007fe418240d6d: jae 0x00007fe418240e40 | |
0.06% │ ││ │││ 0x00007fe418240d73: test %eax,%eax | |
│ ││ │││ 0x00007fe418240d75: jbe 0x00007fe418240e40 | |
0.00% │ ││ │││ 0x00007fe418240d7b: movslq %eax,%r11 | |
│ ││ │││ 0x00007fe418240d7e: cmp %r11,%rcx | |
│ ││ │││ 0x00007fe418240d81: jae 0x00007fe418240e40 | |
│ ││ │││ 0x00007fe418240d87: mov $0x1,%ecx | |
0.05% │ ││ │││ 0x00007fe418240d8c: cmp %r14d,%ecx | |
0.00% │ ││ │││ 0x00007fe418240d8f: mov $0x1,%r11d | |
│ ││ │││ 0x00007fe418240d95: cmovg %r14d,%r11d | |
0.01% │ ││ │││ 0x00007fe418240d99: shl $0x3,%rbx | |
0.05% │ ││ │││ 0x00007fe418240d9d: lea (%r12,%r10,8),%rdi | |
│ ││ │││ 0x00007fe418240da1: xor %ecx,%ecx | |
│ ││ │││ 0x00007fe418240da3: xor %edx,%edx ;*iload_2 {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@12 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.01% │ ││↗│││ 0x00007fe418240da5: cmp %r13d,%ecx | |
│ ││││││ 0x00007fe418240da8: jge 0x00007fe418240e72 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@18 (line 35) | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.07% │ ││││││ 0x00007fe418240dae: cmp %eax,%ecx | |
│ ││││││ 0x00007fe418240db0: jge 0x00007fe418240e9e ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@27 (line 35) | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │ ││││││ 0x00007fe418240db6: mov 0x10(%rbx,%rcx,8),%r10 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@42 (line 36) | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
│ ││││││ 0x00007fe418240dbb: and 0x10(%rdi,%rcx,8),%r10 | |
0.00% │ ││││││ 0x00007fe418240dc0: popcnt %r10,%r10 | |
0.02% │ ││││││ 0x00007fe418240dc5: add %r10d,%edx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@47 (line 36) | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
0.01% │ ││││││ 0x00007fe418240dc8: inc %ecx ;*iinc {reexecute=0 rethrow=0 return_oop=0} | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@49 (line 35) | |
│ ││││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
│ ││││││ 0x00007fe418240dca: cmp %r11d,%ecx | |
│ ││╰│││ 0x00007fe418240dcd: jl 0x00007fe418240da5 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::popcnt@9 (line 35) | |
│ ││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@17 (line 158) | |
│ ││ │││ 0x00007fe418240dcf: cmp %eax,%r13d | |
0.05% │ ││ │││ 0x00007fe418240dd2: mov %r13d,%r10d | |
0.01% │ ││ │││ 0x00007fe418240dd5: cmovg %eax,%r10d | |
│ ││ │││ 0x00007fe418240dd9: cmp %r14d,%r10d | |
│ ││ │││ 0x00007fe418240ddc: cmovg %r14d,%r10d | |
0.04% │ ││ │││ 0x00007fe418240de0: mov %r10d,%r11d | |
0.02% │ ││ │││ 0x00007fe418240de3: add $0xfffffffd,%r11d | |
│ ││ │││ 0x00007fe418240de7: cmp %r11d,%r10d | |
0.01% │ ││ │││ 0x00007fe418240dea: mov $0x80000000,%r10d | |
0.03% │ ││ │││ 0x00007fe418240df0: cmovl %r10d,%r11d | |
0.01% │ ││ │││ 0x00007fe418240df4: cmp %r11d,%ecx | |
│ ││ ╰││ 0x00007fe418240df7: jge 0x00007fe418240ccb | |
0.01% │ ││ ╰│ 0x00007fe418240dfd: jmpq 0x00007fe418240c56 | |
│ │↘ │ 0x00007fe418240e02: xor %edx,%edx | |
│ │ ╰ 0x00007fe418240e04: jmpq 0x00007fe418240cfa | |
↘ │ 0x00007fe418240e09: mov $0x1,%ebp ;*aload_1 {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@36 (line 161) | |
↘ 0x00007fe418240e0e: movabs $0x7fe42e776390,%r10 | |
0x00007fe418240e18: callq *%r10 ;*invokestatic nanoTime {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@37 (line 161) | |
0x00007fe418240e1b: mov 0x60(%rsp),%r10 | |
0x00007fe418240e20: mov %rbp,0x18(%r10) ;*putfield measuredOps {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub@52 (line 163) | |
.................................................................................................... | |
96.10% <total for region 1> | |
....[Hottest Regions]............................................................................... | |
96.10% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub, version 565 (423 bytes) | |
1.87% [kernel.kallsyms] native_write_msr (5 bytes) | |
0.34% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 538 (55 bytes) | |
0.09% [kernel.kallsyms] syscall_return_via_sysret (3 bytes) | |
0.07% kernel [unknown] (36 bytes) | |
0.07% [kernel.kallsyms] __intel_pmu_enable_all (0 bytes) | |
0.05% kernel [unknown] (39 bytes) | |
0.05% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (35 bytes) | |
0.04% libjvm.so _ZN10fileStream5writeEPKcm (74 bytes) | |
0.03% [kernel.kallsyms] mutex_lock (0 bytes) | |
0.03% [kernel.kallsyms] __intel_pmu_enable_all (26 bytes) | |
0.02% [kernel.kallsyms] __fget (23 bytes) | |
0.02% [kernel.kallsyms] mutex_unlock (0 bytes) | |
0.02% [kernel.kallsyms] entry_SYSCALL_64_after_hwframe (43 bytes) | |
0.02% libjvm.so _ZN9xmlStream10write_textEPKcm (47 bytes) | |
0.02% kernel [unknown] (0 bytes) | |
0.02% kernel [unknown] (15 bytes) | |
0.02% [kernel.kallsyms] __fget_light (2 bytes) | |
0.02% interpreter method entry point (kind = zerolocals) (7 bytes) | |
0.01% libc-2.23.so _IO_default_xsputn (30 bytes) | |
1.06% <...other 190 warm regions...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Hottest Methods (after inlining)].............................................................. | |
96.10% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_popcnt_jmhTest::popcnt_thrpt_jmhStub, version 565 | |
1.87% [kernel.kallsyms] native_write_msr | |
0.34% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 538 | |
0.18% kernel [unknown] | |
0.10% [kernel.kallsyms] syscall_return_via_sysret | |
0.10% [kernel.kallsyms] __intel_pmu_enable_all | |
0.05% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable | |
0.04% libjvm.so _ZN10fileStream5writeEPKcm | |
0.04% libjvm.so _ZN9xmlStream10write_textEPKcm | |
0.03% [kernel.kallsyms] mutex_lock | |
0.03% interpreter method entry point (kind = zerolocals) | |
0.03% [kernel.kallsyms] entry_SYSCALL_64_after_hwframe | |
0.02% [kernel.kallsyms] mutex_unlock | |
0.02% [kernel.kallsyms] __fget | |
0.02% libc-2.23.so vfprintf | |
0.02% libc-2.23.so _IO_fwrite | |
0.02% [kernel.kallsyms] __task_pid_nr_ns | |
0.02% [kernel.kallsyms] copy_page_from_iter | |
0.02% [kernel.kallsyms] __fget_light | |
0.01% [kernel.kallsyms] do_syscall_64 | |
0.92% <...other 138 warm methods...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Distribution by Source]........................................................................ | |
96.44% c2, level 4 | |
2.73% [kernel.kallsyms] | |
0.37% libjvm.so | |
0.18% kernel | |
0.18% libc-2.23.so | |
0.05% interpreter | |
0.02% hsdis-amd64.so | |
0.01% ld-2.23.so | |
0.01% perf-1904.map | |
0.00% c1, level 3 | |
.................................................................................................... | |
100.00% <totals> | |
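
For orientation when reading the hot region above: the repeated `mov` / `and` / `popcnt` / `add` sequence, together with the bytecode annotations (`getfield size`, `getfield left`, `getfield right`, `arraylength`, `laload`, `iadd`, `iinc`), is consistent with a scalar loop of roughly the following shape, which C2 has unrolled four times (`add $0x4,%ecx`). This is a minimal sketch reconstructed from those annotations, not the benchmark's actual source. The class name, the method signature, and passing `size` as a parameter are assumptions made so the example is self-contained.

```java
// Hypothetical reconstruction; names and signature are assumptions, not the gist author's code.
final class IntersectionCardinalitySketch {

    // Counts the bits set in both bitmaps over the first `size` words.
    // The three loop conditions mirror the three compares seen in the
    // pre- and post-loops of the disassembly (size, left.length, right.length).
    static int popcnt(long[] left, long[] right, int size) {
        int cardinality = 0;
        for (int i = 0; i < size && i < left.length && i < right.length; ++i) {
            // Long.bitCount is intrinsified to the popcnt instruction on x86-64 CPUs that support it.
            cardinality += Long.bitCount(left[i] & right[i]);
        }
        return cardinality;
    }
}
```

Calling `IntersectionCardinalitySketch.popcnt(a, b, a.length)` on two `long[]` bitmaps of equal length returns the cardinality of their intersection. With `size = 1024`, as in the run above, each call processes 1024 words per array.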
# JMH version: 1.20 | |
# VM version: JDK 12-internal, VM 12-internal+0-adhoc.root.dev | |
# VM invoker: /home/richard/workspace/dev/build/linux-x86_64-normal-server-release/images/jdk/bin/java | |
# VM options: --add-modules=jdk.incubator.vector -Djdk.incubator.vector.VECTOR_ACCESS_OOB_CHECK=0 | |
# Warmup: 20 iterations, 1 s each | |
# Measurement: 20 iterations, 1 s each | |
# Timeout: 10 min per iteration | |
# Threads: 1 thread, will synchronize iterations | |
# Benchmark mode: Throughput, ops/time | |
# Benchmark: com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.unrolledPopcnt | |
# Parameters: (size = 1024) | |
# Run progress: 16.67% complete, ETA 00:03:48 | |
# Fork: 1 of 1 | |
# Preparing profilers: LinuxPerfAsmProfiler | |
# Profilers consume stdout and stderr from target VM, use -v EXTRA to copy to console | |
# Warmup Iteration 1: 1997.094 ops/ms | |
# Warmup Iteration 2: 2039.923 ops/ms | |
# Warmup Iteration 3: 2072.548 ops/ms | |
# Warmup Iteration 4: 2075.326 ops/ms | |
# Warmup Iteration 5: 2015.018 ops/ms | |
# Warmup Iteration 6: 1964.247 ops/ms | |
# Warmup Iteration 7: 2088.821 ops/ms | |
# Warmup Iteration 8: 2051.734 ops/ms | |
# Warmup Iteration 9: 2067.739 ops/ms | |
# Warmup Iteration 10: 2093.326 ops/ms | |
# Warmup Iteration 11: 2020.883 ops/ms | |
# Warmup Iteration 12: 2006.645 ops/ms | |
# Warmup Iteration 13: 2065.898 ops/ms | |
# Warmup Iteration 14: 2047.837 ops/ms | |
# Warmup Iteration 15: 2056.739 ops/ms | |
# Warmup Iteration 16: 2056.714 ops/ms | |
# Warmup Iteration 17: 2070.151 ops/ms | |
# Warmup Iteration 18: 2084.163 ops/ms | |
# Warmup Iteration 19: 2097.975 ops/ms | |
# Warmup Iteration 20: 2013.559 ops/ms | |
Iteration 1: 1986.134 ops/ms | |
Iteration 2: 2033.465 ops/ms | |
Iteration 3: 2016.875 ops/ms | |
Iteration 4: 1979.354 ops/ms | |
Iteration 5: 2032.891 ops/ms | |
Iteration 6: 1994.260 ops/ms | |
Iteration 7: 2018.565 ops/ms | |
Iteration 8: 2017.612 ops/ms | |
Iteration 9: 2030.119 ops/ms | |
Iteration 10: 2005.845 ops/ms | |
Iteration 11: 1991.231 ops/ms | |
Iteration 12: 1981.082 ops/ms | |
Iteration 13: 2016.942 ops/ms | |
Iteration 14: 2017.264 ops/ms | |
Iteration 15: 2022.235 ops/ms | |
Iteration 16: 2052.385 ops/ms | |
Iteration 17: 2094.790 ops/ms | |
Iteration 18: 1876.700 ops/ms | |
Iteration 19: 1931.889 ops/ms | |
Iteration 20: 1932.585 ops/ms | |
# Processing profiler results: LinuxPerfAsmProfiler | |
Result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.unrolledPopcnt": | |
2001.611 ±(99.9%) 40.978 ops/ms [Average] | |
(min, avg, max) = (1876.700, 2001.611, 2094.790), stdev = 47.191 | |
CI (99.9%): [1960.633, 2042.590] (assumes normal distribution) | |
Secondary result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.unrolledPopcnt:·asm": | |
PrintAssembly processed: 163538 total address lines. | |
Perf output processed (skipped 22.643 seconds): | |
Column 1: cycles (20522 events) | |
Hottest code regions (>10.00% "cycles" events): | |
....[Hottest Region 1].............................................................................. | |
c2, level 4, com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub, version 553 (540 bytes) | |
0x00007fe9cc027f95: movzbl 0x94(%rdi),%r10d ;*getfield isDone {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@30 (line 160) | |
; implicit exception: dispatches to 0x00007fe9cc028472 | |
0x00007fe9cc027f9d: test %r10d,%r10d | |
0x00007fe9cc027fa0: jne 0x00007fe9cc02824d ;*ifeq {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@33 (line 160) | |
0x00007fe9cc027fa6: mov $0x1,%ebp | |
╭ 0x00007fe9cc027fab: jmp 0x00007fe9cc027ff4 | |
│ ↗ 0x00007fe9cc027fad: add %r11d,%ebx | |
│ │ 0x00007fe9cc027fb0: add %ecx,%ebx | |
0.05% │ │ 0x00007fe9cc027fb2: add %ebx,%edx | |
│ │ 0x00007fe9cc027fb4: vmovq %xmm0,%rdi | |
│ │ 0x00007fe9cc027fb9: vmovq %xmm1,%rbp ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@156 (line 53) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc027fbe: mov %rdi,0x60(%rsp) | |
0.04% │ │ 0x00007fe9cc027fc3: mov 0x10(%rsp),%rsi | |
│ │ 0x00007fe9cc027fc8: data16 xchg %ax,%ax | |
│ │ 0x00007fe9cc027fcb: callq 0x00007fe9c4548f00 ; ImmutableOopMap{[96]=Oop [112]=Oop [0]=Oop [16]=Oop } | |
│ │ ;*invokevirtual consume {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@20 (line 158) | |
│ │ ; {optimized virtual_call} | |
0.40% │ │ 0x00007fe9cc027fd0: mov 0x60(%rsp),%rdi | |
│ │ 0x00007fe9cc027fd5: movzbl 0x94(%rdi),%r10d ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@146 (line 47) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc027fdd: mov 0x108(%r15),%r11 | |
0.07% │ │ 0x00007fe9cc027fe4: add $0x1,%rbp ; ImmutableOopMap{rdi=Oop [112]=Oop [0]=Oop [16]=Oop } | |
│ │ ;*ifeq {reexecute=1 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@33 (line 160) | |
│ │ 0x00007fe9cc027fe8: test %eax,(%r11) ; {poll} | |
│ │ 0x00007fe9cc027feb: test %r10d,%r10d | |
│ │ 0x00007fe9cc027fee: jne 0x00007fe9cc028252 ;*aload_1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@36 (line 161) | |
↘ │ 0x00007fe9cc027ff4: mov 0x70(%rsp),%r10 | |
0.05% │ 0x00007fe9cc027ff9: mov 0xc(%r10),%r14d ;*getfield size {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@15 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ 0x00007fe9cc027ffd: test %r14d,%r14d | |
│ 0x00007fe9cc028000: jle 0x00007fe9cc028246 ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@18 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0.01% │ 0x00007fe9cc028006: mov 0x10(%r10),%r9d ;*getfield left {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@24 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0.05% │ 0x00007fe9cc02800a: mov 0xc(%r12,%r9,8),%r13d ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@27 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ ; implicit exception: dispatches to 0x00007fe9cc028283 | |
0.05% │ 0x00007fe9cc02800f: mov 0x14(%r10),%r11d ;*getfield right {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@34 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ 0x00007fe9cc028013: mov 0xc(%r12,%r11,8),%r10d ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@135 (line 51) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ ; implicit exception: dispatches to 0x00007fe9cc028283 | |
0.02% │ 0x00007fe9cc028018: test %r13d,%r13d | |
│ 0x00007fe9cc02801b: jbe 0x00007fe9cc028283 | |
0.06% │ 0x00007fe9cc028021: movslq %r14d,%r8 | |
│ 0x00007fe9cc028024: movslq %r13d,%rbx | |
│ 0x00007fe9cc028027: add $0x3,%r8 | |
│ 0x00007fe9cc02802b: and $0xfffffffffffffffc,%r8 | |
0.06% │ 0x00007fe9cc02802f: mov %r8d,%r8d | |
│ 0x00007fe9cc028032: movslq %r8d,%r8 | |
│ 0x00007fe9cc028035: mov %r8,%rcx | |
0.04% │ 0x00007fe9cc028038: add $0xfffffffffffffffc,%rcx | |
0.03% │ 0x00007fe9cc02803c: cmp %rbx,%rcx | |
│ 0x00007fe9cc02803f: jae 0x00007fe9cc028283 | |
│ 0x00007fe9cc028045: test %r10d,%r10d | |
│ 0x00007fe9cc028048: jbe 0x00007fe9cc028283 | |
│ 0x00007fe9cc02804e: vmovq %rbp,%xmm1 | |
0.01% │ 0x00007fe9cc028053: movslq %r10d,%rdx | |
0.01% │ 0x00007fe9cc028056: cmp %rdx,%rcx | |
│ 0x00007fe9cc028059: jae 0x00007fe9cc028283 | |
│ 0x00007fe9cc02805f: cmp $0x2,%r13d | |
│ 0x00007fe9cc028063: jbe 0x00007fe9cc028283 | |
│ 0x00007fe9cc028069: mov %r8,%rcx | |
0.02% │ 0x00007fe9cc02806c: add $0xfffffffffffffffe,%rcx | |
0.10% │ 0x00007fe9cc028070: cmp %rbx,%rcx | |
│ 0x00007fe9cc028073: jae 0x00007fe9cc028283 | |
0.01% │ 0x00007fe9cc028079: cmp $0x2,%r10d | |
│ 0x00007fe9cc02807d: jbe 0x00007fe9cc028283 | |
│ 0x00007fe9cc028083: cmp %rdx,%rcx | |
│ 0x00007fe9cc028086: jae 0x00007fe9cc028283 | |
0.04% │ 0x00007fe9cc02808c: cmp $0x3,%r13d | |
│ 0x00007fe9cc028090: jbe 0x00007fe9cc028283 | |
0.08% │ 0x00007fe9cc028096: dec %r8 | |
0.01% │ 0x00007fe9cc028099: cmp %rbx,%r8 | |
│ 0x00007fe9cc02809c: jae 0x00007fe9cc028283 | |
0.00% │ 0x00007fe9cc0280a2: cmp $0x3,%r10d | |
│ 0x00007fe9cc0280a6: jbe 0x00007fe9cc028283 | |
0.00% │ 0x00007fe9cc0280ac: cmp %rdx,%r8 | |
│ 0x00007fe9cc0280af: jae 0x00007fe9cc028283 | |
0.07% │ 0x00007fe9cc0280b5: cmp $0x7ffffffc,%r14d | |
│ 0x00007fe9cc0280bc: jg 0x00007fe9cc0282ae ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@18 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ 0x00007fe9cc0280c2: mov $0x4,%r8d | |
│ 0x00007fe9cc0280c8: cmp %r14d,%r8d | |
│ 0x00007fe9cc0280cb: cmovg %r14d,%r8d | |
0.06% │ 0x00007fe9cc0280cf: lea (%r12,%r9,8),%rsi | |
│ 0x00007fe9cc0280d3: lea (%r12,%r11,8),%rax | |
│ 0x00007fe9cc0280d7: xor %r9d,%r9d | |
0.00% │ 0x00007fe9cc0280da: xor %r11d,%r11d | |
0.06% │ 0x00007fe9cc0280dd: xor %ebx,%ebx | |
0.00% │ 0x00007fe9cc0280df: xor %ecx,%ecx | |
│ 0x00007fe9cc0280e1: xor %edx,%edx ;*iload {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@21 (line 47) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
↗ │ 0x00007fe9cc0280e3: cmp %r13d,%r9d | |
│ │ 0x00007fe9cc0280e6: jge 0x00007fe9cc0282da ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@28 (line 47) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0.07% │ │ 0x00007fe9cc0280ec: cmp %r10d,%r9d | |
│ │ 0x00007fe9cc0280ef: jge 0x00007fe9cc02831e ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@38 (line 47) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc0280f5: vmovq %rdi,%xmm0 | |
│ │ 0x00007fe9cc0280fa: mov 0x28(%rax,%r9,8),%rdi ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@135 (line 51) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc0280ff: and 0x28(%rsi,%r9,8),%rdi | |
0.06% │ │ 0x00007fe9cc028104: popcnt %rdi,%rbp | |
0.00% │ │ 0x00007fe9cc028109: add %ebp,%edx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@140 (line 51) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc02810b: mov 0x20(%rax,%r9,8),%rdi ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@109 (line 50) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc028110: and 0x20(%rsi,%r9,8),%rdi | |
0.08% │ │ 0x00007fe9cc028115: popcnt %rdi,%rbp | |
0.00% │ │ 0x00007fe9cc02811a: add %ebp,%ecx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@114 (line 50) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc02811c: mov 0x10(%rax,%r9,8),%rdi ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@59 (line 48) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │ │ 0x00007fe9cc028121: and 0x10(%rsi,%r9,8),%rdi | |
0.05% │ │ 0x00007fe9cc028126: popcnt %rdi,%rbp | |
│ │ 0x00007fe9cc02812b: add %ebp,%r11d ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@64 (line 48) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc02812e: mov 0x18(%rax,%r9,8),%rdi ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@84 (line 49) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc028133: and 0x18(%rsi,%r9,8),%rdi | |
0.09% │ │ 0x00007fe9cc028138: popcnt %rdi,%rbp | |
0.00% │ │ 0x00007fe9cc02813d: add %ebp,%ebx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@89 (line 49) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc02813f: add $0x4,%r9d ;*iinc {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@143 (line 47) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ 0x00007fe9cc028143: cmp %r8d,%r9d | |
╭│ │ 0x00007fe9cc028146: jge 0x00007fe9cc02814f ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@18 (line 47) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
││ │ 0x00007fe9cc028148: vmovq %xmm0,%rdi | |
│╰ │ 0x00007fe9cc02814d: jmp 0x00007fe9cc0280e3 | |
0.04% ↘ │ 0x00007fe9cc02814f: cmp %r10d,%r13d | |
│ 0x00007fe9cc028152: mov %r13d,%ebp | |
0.01% │ 0x00007fe9cc028155: cmovg %r10d,%ebp | |
│ 0x00007fe9cc028159: cmp %r14d,%ebp | |
0.03% │ 0x00007fe9cc02815c: cmovg %r14d,%ebp | |
0.01% │ 0x00007fe9cc028160: cmp %ebp,%r9d | |
╭ │ 0x00007fe9cc028163: jge 0x00007fe9cc0281d3 | |
0.01% │ │ 0x00007fe9cc028165: data16 data16 nopw 0x0(%rax,%rax,1) ;*iload {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@21 (line 47) | |
│ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
1.35% │↗↗│ 0x00007fe9cc028170: movslq %r9d,%rdi | |
0.57% ││││ 0x00007fe9cc028173: mov 0x10(%rax,%r9,8),%r8 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@59 (line 48) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
10.26% ││││ 0x00007fe9cc028178: and 0x10(%rsi,%r9,8),%r8 | |
4.35% ││││ 0x00007fe9cc02817d: popcnt %r8,%r8 | |
3.80% ││││ 0x00007fe9cc028182: add %r8d,%r11d ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@64 (line 48) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
2.58% ││││ 0x00007fe9cc028185: mov 0x28(%rax,%rdi,8),%r8 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@135 (line 51) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
8.42% ││││ 0x00007fe9cc02818a: and 0x28(%rsi,%rdi,8),%r8 | |
4.04% ││││ 0x00007fe9cc02818f: popcnt %r8,%r8 | |
4.62% ││││ 0x00007fe9cc028194: add %r8d,%edx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@140 (line 51) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
5.25% ││││ 0x00007fe9cc028197: mov 0x20(%rax,%rdi,8),%r8 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@109 (line 50) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
6.57% ││││ 0x00007fe9cc02819c: and 0x20(%rsi,%rdi,8),%r8 | |
3.06% ││││ 0x00007fe9cc0281a1: popcnt %r8,%r8 | |
5.06% ││││ 0x00007fe9cc0281a6: add %r8d,%ecx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@114 (line 50) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
7.18% ││││ 0x00007fe9cc0281a9: mov 0x18(%rax,%rdi,8),%r8 ;*laload {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@84 (line 49) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
5.44% ││││ 0x00007fe9cc0281ae: and 0x18(%rsi,%rdi,8),%r8 | |
2.63% ││││ 0x00007fe9cc0281b3: popcnt %r8,%r8 | |
5.07% ││││ 0x00007fe9cc0281b8: add %r8d,%ebx ;*iadd {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@89 (line 49) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
9.33% ││││ 0x00007fe9cc0281bb: add $0x4,%r9d ;*iinc {reexecute=0 rethrow=0 return_oop=0} | |
││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@143 (line 47) | |
││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
3.93% ││││ 0x00007fe9cc0281bf: cmp %ebp,%r9d | |
0.00% │╰││ 0x00007fe9cc0281c2: jl 0x00007fe9cc028170 ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@146 (line 47) | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0.03% │ ││ 0x00007fe9cc0281c4: mov 0x108(%r15),%r8 ; ImmutableOopMap{rsi=Oop rax=Oop xmm0=Oop [112]=Oop [0]=Oop [16]=Oop } | |
│ ││ ;*goto {reexecute=1 rethrow=0 return_oop=0} | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@146 (line 47) | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0.05% │ ││ 0x00007fe9cc0281cb: test %eax,(%r8) ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@146 (line 47) | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
│ ││ ; {poll} | |
0.31% │ ││ 0x00007fe9cc0281ce: cmp %ebp,%r9d | |
│ ╰│ 0x00007fe9cc0281d1: jl 0x00007fe9cc028170 | |
↘ │ 0x00007fe9cc0281d3: cmp %r14d,%r9d | |
╰ 0x00007fe9cc0281d6: jge 0x00007fe9cc027fad ;*iload {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::unrolledPopcnt@21 (line 47) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub@17 (line 158) | |
0x00007fe9cc0281dc: cmp %r13d,%r9d | |
0x00007fe9cc0281df: jge 0x00007fe9cc0282de ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
.................................................................................................... | |
95.83% <total for region 1> | |
....[Hottest Regions]............................................................................... | |
95.83% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub, version 553 (540 bytes) | |
2.20% [kernel.kallsyms] native_write_msr (5 bytes) | |
0.50% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 526 (55 bytes) | |
0.13% [kernel.kallsyms] syscall_return_via_sysret (6 bytes) | |
0.07% kernel [unknown] (37 bytes) | |
0.05% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (35 bytes) | |
0.04% kernel [unknown] (12 bytes) | |
0.03% [kernel.kallsyms] __fget (33 bytes) | |
0.03% [kernel.kallsyms] __intel_pmu_enable_all (0 bytes) | |
0.02% [kernel.kallsyms] mutex_unlock (0 bytes) | |
0.02% [kernel.kallsyms] mutex_lock (0 bytes) | |
0.02% [kernel.kallsyms] sys_write (13 bytes) | |
0.02% [kernel.kallsyms] copy_user_generic_unrolled (25 bytes) | |
0.02% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (11 bytes) | |
0.02% libjvm.so _ZN12outputStream15update_positionEPKcm (43 bytes) | |
0.01% [kernel.kallsyms] do_syscall_64 (23 bytes) | |
0.01% [kernel.kallsyms] sys_gettid (27 bytes) | |
0.01% [kernel.kallsyms] __schedule (0 bytes) | |
0.01% [kernel.kallsyms] _raw_spin_lock (16 bytes) | |
0.01% [kernel.kallsyms] __indirect_thunk_start (12 bytes) | |
0.91% <...other 160 warm regions...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Hottest Methods (after inlining)].............................................................. | |
95.83% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_unrolledPopcnt_jmhTest::unrolledPopcnt_thrpt_jmhStub, version 553 | |
2.20% [kernel.kallsyms] native_write_msr | |
0.50% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 526 | |
0.13% [kernel.kallsyms] syscall_return_via_sysret | |
0.12% kernel [unknown] | |
0.07% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable | |
0.04% libc-2.23.so vfprintf | |
0.03% [kernel.kallsyms] __fget | |
0.03% libc-2.23.so _IO_fwrite | |
0.03% [kernel.kallsyms] __intel_pmu_enable_all | |
0.02% [kernel.kallsyms] do_syscall_64 | |
0.02% libjvm.so _ZN9xmlStream10write_textEPKcm | |
0.02% [kernel.kallsyms] mutex_lock | |
0.02% [kernel.kallsyms] mutex_unlock | |
0.02% libjvm.so _ZN12outputStream15update_positionEPKcm | |
0.02% [kernel.kallsyms] sys_write | |
0.02% [kernel.kallsyms] copy_user_generic_unrolled | |
0.01% [kernel.kallsyms] __indirect_thunk_start | |
0.01% [kernel.kallsyms] __mutex_lock.isra.2 | |
0.01% [kernel.kallsyms] __schedule | |
0.81% <...other 130 warm methods...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Distribution by Source]........................................................................ | |
96.33% c2, level 4 | |
2.95% [kernel.kallsyms] | |
0.30% libjvm.so | |
0.23% libc-2.23.so | |
0.12% kernel | |
0.03% interpreter | |
0.01% c1, level 3 | |
0.01% hsdis-amd64.so | |
0.00% | |
0.00% libjava.so | |
.................................................................................................... | |
100.00% <totals> | |
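For orientation, the hot loop in the unrolledPopcnt listing above (four `and`/`popcnt`/`add` groups, each feeding a separate register, with the index advanced by 4) is consistent with a scalar kernel along the following lines. This is a minimal sketch reconstructed from the disassembly, not the benchmark's actual source: the class name `UnrolledPopcntSketch`, the method name `intersectionCardinality`, and the tail handling are assumptions made for illustration.

```java
// Hypothetical reconstruction; names and tail handling are assumptions, not the gist author's code.
final class UnrolledPopcntSketch {

    // Counts the bits set in both arrays, i.e. the cardinality of the intersection
    // of two bitsets stored as long[] words.
    static int intersectionCardinality(long[] left, long[] right) {
        int n = Math.min(left.length, right.length);
        // Four independent accumulators, matching the four separate
        // 32-bit add targets visible in the profiled loop.
        int c0 = 0, c1 = 0, c2 = 0, c3 = 0;
        int i = 0;
        for (; i + 3 < n; i += 4) {
            c0 += Long.bitCount(left[i] & right[i]);
            c1 += Long.bitCount(left[i + 1] & right[i + 1]);
            c2 += Long.bitCount(left[i + 2] & right[i + 2]);
            c3 += Long.bitCount(left[i + 3] & right[i + 3]);
        }
        // Scalar tail for lengths that are not a multiple of 4 (an assumption;
        // the benchmark runs with size = 1024, so it may not need one).
        int tail = 0;
        for (; i < n; i++) {
            tail += Long.bitCount(left[i] & right[i]);
        }
        return c0 + c1 + c2 + c3 + tail;
    }
}
```

On x86-64 with POPCNT available, C2 typically intrinsifies `Long.bitCount` to a single `popcnt` instruction, which is what the listing shows.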
# JMH version: 1.20 | |
# VM version: JDK 12-internal, VM 12-internal+0-adhoc.root.dev | |
# VM invoker: /home/richard/workspace/dev/build/linux-x86_64-normal-server-release/images/jdk/bin/java | |
# VM options: --add-modules=jdk.incubator.vector -Djdk.incubator.vector.VECTOR_ACCESS_OOB_CHECK=0 | |
# Warmup: 20 iterations, 1 s each | |
# Measurement: 20 iterations, 1 s each | |
# Timeout: 10 min per iteration | |
# Threads: 1 thread, will synchronize iterations | |
# Benchmark mode: Throughput, ops/time | |
# Benchmark: com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandExtractPopcnt | |
# Parameters: (size = 1024) | |
# Run progress: 33.33% complete, ETA 00:02:59 | |
# Fork: 1 of 1 | |
# Preparing profilers: LinuxPerfAsmProfiler | |
# Profilers consume stdout and stderr from target VM, use -v EXTRA to copy to console | |
# Warmup Iteration 1: 1224.200 ops/ms | |
# Warmup Iteration 2: 1878.081 ops/ms | |
# Warmup Iteration 3: 1798.377 ops/ms | |
# Warmup Iteration 4: 1807.368 ops/ms | |
# Warmup Iteration 5: 1775.194 ops/ms | |
# Warmup Iteration 6: 1796.109 ops/ms | |
# Warmup Iteration 7: 1806.375 ops/ms | |
# Warmup Iteration 8: 1750.640 ops/ms | |
# Warmup Iteration 9: 1778.083 ops/ms | |
# Warmup Iteration 10: 1760.837 ops/ms | |
# Warmup Iteration 11: 1686.809 ops/ms | |
# Warmup Iteration 12: 1758.661 ops/ms | |
# Warmup Iteration 13: 1773.591 ops/ms | |
# Warmup Iteration 14: 1751.584 ops/ms | |
# Warmup Iteration 15: 1765.513 ops/ms | |
# Warmup Iteration 16: 1821.195 ops/ms | |
# Warmup Iteration 17: 1822.448 ops/ms | |
# Warmup Iteration 18: 1749.696 ops/ms | |
# Warmup Iteration 19: 1799.353 ops/ms | |
# Warmup Iteration 20: 1742.873 ops/ms | |
Iteration 1: 1795.475 ops/ms | |
Iteration 2: 1801.106 ops/ms | |
Iteration 3: 1763.309 ops/ms | |
Iteration 4: 1808.688 ops/ms | |
Iteration 5: 1825.122 ops/ms | |
Iteration 6: 1808.330 ops/ms | |
Iteration 7: 1819.812 ops/ms | |
Iteration 8: 1801.034 ops/ms | |
Iteration 9: 1833.117 ops/ms | |
Iteration 10: 1767.858 ops/ms | |
Iteration 11: 1772.872 ops/ms | |
Iteration 12: 1779.114 ops/ms | |
Iteration 13: 1787.272 ops/ms | |
Iteration 14: 1728.688 ops/ms | |
Iteration 15: 1763.232 ops/ms | |
Iteration 16: 1749.218 ops/ms | |
Iteration 17: 1676.605 ops/ms | |
Iteration 18: 1647.729 ops/ms | |
Iteration 19: 1686.287 ops/ms | |
Iteration 20: 1689.041 ops/ms | |
# Processing profiler results: LinuxPerfAsmProfiler | |
Result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandExtractPopcnt": | |
1765.195 ±(99.9%) 46.484 ops/ms [Average] | |
(min, avg, max) = (1647.729, 1765.195, 1833.117), stdev = 53.531 | |
CI (99.9%): [1718.711, 1811.679] (assumes normal distribution) | |
Secondary result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandExtractPopcnt:·asm": | |
PrintAssembly processed: 172434 total address lines. | |
Perf output processed (skipped 22.657 seconds): | |
Column 1: cycles (20493 events) | |
Hottest code regions (>10.00% "cycles" events): | |
....[Hottest Region 1].............................................................................. | |
c2, level 4, com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub, version 637 (818 bytes) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0x00007fbfe024bb31: movzbl 0x94(%r9),%r10d ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@17 (line 94) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
; implicit exception: dispatches to 0x00007fbfe024c06a | |
0x00007fbfe024bb39: test %r10d,%r10d | |
0x00007fbfe024bb3c: jne 0x00007fbfe024be98 | |
0x00007fbfe024bb42: mov $0x1,%ebp | |
╭ 0x00007fbfe024bb47: jmpq 0x00007fbfe024bd78 | |
│ 0x00007fbfe024bb4c: nopl 0x0(%rax) | |
0.01% │↗ 0x00007fbfe024bb50: vmovq %xmm2,%rax ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.43% ││ ↗ 0x00007fbfe024bb55: vmovdqu 0x10(%rax,%rcx,8),%ymm2 | |
2.58% ││ │ 0x00007fbfe024bb5b: vpand 0x10(%r13,%rcx,8),%ymm2,%ymm8 ;*ifeq {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@33 (line 160) | |
0.32% ││ │ 0x00007fbfe024bb62: movslq %ecx,%r10 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.43% ││ │ 0x00007fbfe024bb65: vmovdqu 0x70(%rax,%r10,8),%ymm2 ;*invokevirtual fromArray {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
2.65% ││ │ 0x00007fbfe024bb6c: vpand 0x70(%r13,%r10,8),%ymm2,%ymm9 ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
││ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
││ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.84% ││ │ 0x00007fbfe024bb73: vmovdqu 0x30(%rax,%r10,8),%ymm2 | |
0.46% ││ │ 0x00007fbfe024bb7a: vpand 0x30(%r13,%r10,8),%ymm2,%ymm10 ;*ifeq {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@33 (line 160) | |
3.06% ││ │ 0x00007fbfe024bb81: vmovdqu 0x50(%rax,%r10,8),%ymm6 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ │ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@81 (line 98) | |
││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.03% ││ │ 0x00007fbfe024bb88: vmovq %rax,%xmm2 | |
0.42% ││ │ 0x00007fbfe024bb8d: vpand 0x50(%r13,%r10,8),%ymm6,%ymm11 | |
2.60% ││ │ 0x00007fbfe024bb94: vmovq %xmm8,%r10 | |
0.01% ││ │ 0x00007fbfe024bb99: popcnt %r10,%rbp | |
0.51% ││ │ 0x00007fbfe024bb9e: add %edx,%ebp | |
3.86% ││ │ 0x00007fbfe024bba0: vmovq %xmm10,%r10 | |
0.15% ││ │ 0x00007fbfe024bba5: popcnt %r10,%rax | |
0.43% ││ │ 0x00007fbfe024bbaa: vmovq %xmm11,%r10 | |
0.41% ││ │ 0x00007fbfe024bbaf: popcnt %r10,%r14 | |
2.84% ││ │ 0x00007fbfe024bbb4: vmovq %xmm9,%r10 | |
0.18% ││ │ 0x00007fbfe024bbb9: popcnt %r10,%rdx | |
0.35% ││ │ 0x00007fbfe024bbbe: vextracti128 $0x1,%ymm9,%xmm6 | |
0.41% ││ │ 0x00007fbfe024bbc4: vpextrq $0x0,%xmm6,%r10 | |
2.62% ││ │ 0x00007fbfe024bbca: popcnt %r10,%r10 | |
1.45% ││ │ 0x00007fbfe024bbcf: vextracti128 $0x1,%ymm9,%xmm6 | |
0.42% ││ │ 0x00007fbfe024bbd5: vpextrq $0x1,%xmm6,%r11 | |
2.20% ││ │ 0x00007fbfe024bbdb: popcnt %r11,%r8 | |
1.34% ││ │ 0x00007fbfe024bbe0: vpextrq $0x1,%xmm9,%r11 | |
2.44% ││ │ 0x00007fbfe024bbe6: popcnt %r11,%r11 | |
0.21% ││ │ 0x00007fbfe024bbeb: vpextrq $0x1,%xmm10,%r9 | |
0.97% ││ │ 0x00007fbfe024bbf1: popcnt %r9,%r9 | |
2.17% ││ │ 0x00007fbfe024bbf6: vextracti128 $0x1,%ymm8,%xmm6 | |
0.22% ││ │ 0x00007fbfe024bbfc: vpextrq $0x1,%xmm6,%rbx | |
1.10% ││ │ 0x00007fbfe024bc02: popcnt %rbx,%rbx | |
2.46% ││ │ 0x00007fbfe024bc07: vextracti128 $0x1,%ymm8,%xmm6 | |
0.22% ││ │ 0x00007fbfe024bc0d: vpextrq $0x0,%xmm6,%rdi | |
1.00% ││ │ 0x00007fbfe024bc13: popcnt %rdi,%rsi | |
2.64% ││ │ 0x00007fbfe024bc18: vpextrq $0x1,%xmm8,%rdi | |
0.80% ││ │ 0x00007fbfe024bc1e: popcnt %rdi,%rdi | |
0.35% ││ │ 0x00007fbfe024bc23: add %edi,%ebp | |
3.42% ││ │ 0x00007fbfe024bc25: add %esi,%ebp | |
0.38% ││ │ 0x00007fbfe024bc27: add %ebx,%ebp | |
0.70% ││ │ 0x00007fbfe024bc29: add %ebp,%eax | |
0.84% ││ │ 0x00007fbfe024bc2b: add %r9d,%eax | |
2.85% ││ │ 0x00007fbfe024bc2e: vpextrq $0x1,%xmm11,%r9 | |
0.35% ││ │ 0x00007fbfe024bc34: popcnt %r9,%rbx | |
0.21% ││ │ 0x00007fbfe024bc39: vextracti128 $0x1,%ymm10,%xmm6 | |
2.82% ││ │ 0x00007fbfe024bc3f: vpextrq $0x1,%xmm6,%r9 | |
0.34% ││ │ 0x00007fbfe024bc45: popcnt %r9,%r9 | |
0.38% ││ │ 0x00007fbfe024bc4a: vextracti128 $0x1,%ymm10,%xmm6 | |
2.58% ││ │ 0x00007fbfe024bc50: vpextrq $0x0,%xmm6,%rdi | |
0.42% ││ │ 0x00007fbfe024bc56: popcnt %rdi,%rsi | |
0.56% ││ │ 0x00007fbfe024bc5b: add %esi,%eax | |
4.90% ││ │ 0x00007fbfe024bc5d: add %r9d,%eax | |
0.55% ││ │ 0x00007fbfe024bc60: add %eax,%r14d | |
0.87% ││ │ 0x00007fbfe024bc63: add %ebx,%r14d | |
1.46% ││ │ 0x00007fbfe024bc66: vextracti128 $0x1,%ymm11,%xmm6 | |
1.91% ││ │ 0x00007fbfe024bc6c: vpextrq $0x0,%xmm6,%r9 | |
0.12% ││ │ 0x00007fbfe024bc72: popcnt %r9,%r9 | |
1.33% ││ │ 0x00007fbfe024bc77: add %r9d,%r14d | |
2.20% ││ │ 0x00007fbfe024bc7a: vextracti128 $0x1,%ymm11,%xmm6 | |
0.08% ││ │ 0x00007fbfe024bc80: vpextrq $0x1,%xmm6,%r9 | |
2.51% ││ │ 0x00007fbfe024bc86: popcnt %r9,%rbx | |
3.68% ││ │ 0x00007fbfe024bc8b: add %ebx,%r14d | |
0.45% ││ │ 0x00007fbfe024bc8e: add %r14d,%edx | |
1.69% ││ │ 0x00007fbfe024bc91: add %r11d,%edx | |
1.34% ││ │ 0x00007fbfe024bc94: add %r10d,%edx | |
3.71% ││ │ 0x00007fbfe024bc97: add %r8d,%edx | |
4.53% ││ │ 0x00007fbfe024bc9a: add $0x10,%ecx | |
0.20% ││ │ 0x00007fbfe024bc9d: cmp 0x50(%rsp),%ecx | |
│╰ │ 0x00007fbfe024bca1: jl 0x00007fbfe024bb50 | |
│ │ 0x00007fbfe024bca7: vmovq %xmm3,%rbp | |
0.03% │ │ 0x00007fbfe024bcac: vmovd %xmm5,%ebx | |
0.03% │ │ 0x00007fbfe024bcb0: vmovd %xmm4,%r11d | |
0.00% │ │ 0x00007fbfe024bcb5: vmovd %xmm7,%r8d | |
│ │ 0x00007fbfe024bcba: vmovq %xmm2,%rax | |
0.02% │ ↗│ 0x00007fbfe024bcbf: cmp %ebx,%ecx | |
│ ╭ ││ 0x00007fbfe024bcc1: jge 0x00007fbfe024bd33 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.07% │ │ ││ 0x00007fbfe024bcc3: nop ;*invokestatic bitCount {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@62 (line 96) | |
│ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.04% │ │ ↗ ││ 0x00007fbfe024bcc4: cmp %r11d,%ecx | |
│ │ │ ││ 0x00007fbfe024bcc7: jge 0x00007fbfe024bef2 | |
│ │ │ ││ 0x00007fbfe024bccd: cmp %r8d,%ecx | |
│ │ │ ││ 0x00007fbfe024bcd0: jge 0x00007fbfe024bf26 | |
0.01% │ │ │ ││ 0x00007fbfe024bcd6: vmovdqu 0x10(%rax,%rcx,8),%ymm2 | |
0.06% │ │ │ ││ 0x00007fbfe024bcdc: vpand 0x10(%r13,%rcx,8),%ymm2,%ymm2 | |
0.06% │ │ │ ││ 0x00007fbfe024bce3: vextracti128 $0x1,%ymm2,%xmm3 | |
0.01% │ │ │ ││ 0x00007fbfe024bce9: vpextrq $0x1,%xmm3,%r10 | |
0.27% │ │ │ ││ 0x00007fbfe024bcef: popcnt %r10,%rsi | |
0.18% │ │ │ ││ 0x00007fbfe024bcf4: vextracti128 $0x1,%ymm2,%xmm3 | |
0.01% │ │ │ ││ 0x00007fbfe024bcfa: vpextrq $0x0,%xmm3,%r10 | |
0.11% │ │ │ ││ 0x00007fbfe024bd00: popcnt %r10,%r9 | |
0.06% │ │ │ ││ 0x00007fbfe024bd05: vmovq %xmm2,%r10 | |
│ │ │ ││ 0x00007fbfe024bd0a: popcnt %r10,%rdi | |
0.02% │ │ │ ││ 0x00007fbfe024bd0f: add %edx,%edi | |
0.08% │ │ │ ││ 0x00007fbfe024bd11: vpextrq $0x1,%xmm2,%r10 | |
0.06% │ │ │ ││ 0x00007fbfe024bd17: popcnt %r10,%r10 | |
0.08% │ │ │ ││ 0x00007fbfe024bd1c: add %r10d,%edi | |
0.10% │ │ │ ││ 0x00007fbfe024bd1f: add %r9d,%edi | |
0.14% │ │ │ ││ 0x00007fbfe024bd22: add %esi,%edi | |
0.11% │ │ │ ││ 0x00007fbfe024bd24: add $0x4,%ecx | |
0.01% │ │ │ ││ 0x00007fbfe024bd27: cmp %ebx,%ecx ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │╭ │ ││ 0x00007fbfe024bd29: jl 0x00007fbfe024bd2f | |
0.00% │ ││ │ ││ 0x00007fbfe024bd2b: mov %edi,%edx | |
│ ││╭│ ││ 0x00007fbfe024bd2d: jmp 0x00007fbfe024bd33 | |
│ │↘││ ││ 0x00007fbfe024bd2f: mov %edi,%edx | |
0.01% │ │ │╰ ││ 0x00007fbfe024bd31: jmp 0x00007fbfe024bcc4 ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │ │ ││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │ │ ││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.07% │ ↘ ↘ ││ 0x00007fbfe024bd33: vmovq %xmm0,%rdi | |
│ ││ 0x00007fbfe024bd38: vmovq %xmm1,%r9 | |
0.02% │ ││↗ 0x00007fbfe024bd3d: mov %r9,0x50(%rsp) | |
│ │││ 0x00007fbfe024bd42: mov %rdi,0x60(%rsp) | |
0.02% │ │││ 0x00007fbfe024bd47: mov 0x8(%rsp),%rsi | |
│ │││ 0x00007fbfe024bd4c: vzeroupper | |
0.04% │ │││ 0x00007fbfe024bd4f: callq 0x00007fbfd8765f00 ; ImmutableOopMap{[80]=Oop [88]=Oop [96]=Oop [8]=Oop } | |
│ │││ ;*invokevirtual consume {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@20 (line 158) | |
│ │││ ; {optimized virtual_call} | |
0.02% │ │││ 0x00007fbfe024bd54: mov 0x50(%rsp),%r9 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.04% │ │││ 0x00007fbfe024bd59: movzbl 0x94(%r9),%r11d ;*invokestatic load {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@43 (line 1297) | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.02% │ │││ 0x00007fbfe024bd61: mov 0x108(%r15),%r10 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007fbfe024bd68: add $0x1,%rbp ; ImmutableOopMap{r9=Oop [88]=Oop [96]=Oop [8]=Oop } | |
│ │││ ;*ifeq {reexecute=1 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@33 (line 160) | |
0.06% │ │││ 0x00007fbfe024bd6c: test %eax,(%r10) ; {poll} | |
│ │││ 0x00007fbfe024bd6f: test %r11d,%r11d | |
│ │││ 0x00007fbfe024bd72: jne 0x00007fbfe024be9d | |
0.01% ↘ │││ 0x00007fbfe024bd78: mov 0x60(%rsp),%rdi ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │││ 0x00007fbfe024bd7d: mov 0xc(%rdi),%ebx ;*invokevirtual fromArray {reexecute=0 rethrow=0 return_oop=0} | |
│││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
│││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.02% │││ 0x00007fbfe024bd80: test %ebx,%ebx | |
╭ │││ 0x00007fbfe024bd82: jle 0x00007fbfe024be91 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007fbfe024bd88: mov 0x10(%rdi),%r10d ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007fbfe024bd8c: mov 0xc(%r12,%r10,8),%r11d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ ; implicit exception: dispatches to 0x00007fbfe024c006 | |
0.04% │ │││ 0x00007fbfe024bd91: mov 0x14(%rdi),%eax ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@38 (line 95) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.02% │ │││ 0x00007fbfe024bd94: mov 0xc(%r12,%rax,8),%r8d ; implicit exception: dispatches to 0x00007fbfe024c006 | |
0.01% │ │││ 0x00007fbfe024bd99: cmp $0x7ffffffc,%ebx | |
│ │││ 0x00007fbfe024bd9f: jg 0x00007fbfe024bed5 | |
0.01% │ │││ 0x00007fbfe024bda5: mov $0x4,%edx | |
│ │││ 0x00007fbfe024bdaa: cmp %edx,%ebx | |
0.02% │ │││ 0x00007fbfe024bdac: mov %ebx,%esi | |
0.01% │ │││ 0x00007fbfe024bdae: mov $0x4,%ecx | |
0.01% │ │││ 0x00007fbfe024bdb3: cmovg %ecx,%esi | |
│ │││ 0x00007fbfe024bdb6: shl $0x3,%rax | |
0.02% │ │││ 0x00007fbfe024bdba: lea (%r12,%r10,8),%r13 | |
0.00% │ │││ 0x00007fbfe024bdbe: xor %ecx,%ecx | |
0.01% │ │││ 0x00007fbfe024bdc0: xor %r10d,%r10d | |
│ ↗│││ 0x00007fbfe024bdc3: cmp %r11d,%ecx | |
│ ││││ 0x00007fbfe024bdc6: jge 0x00007fbfe024beff | |
0.03% │ ││││ 0x00007fbfe024bdcc: cmp %r8d,%ecx | |
│ ││││ 0x00007fbfe024bdcf: jge 0x00007fbfe024bf33 | |
0.00% │ ││││ 0x00007fbfe024bdd5: vmovq %r9,%xmm1 | |
│ ││││ 0x00007fbfe024bdda: vmovq %rdi,%xmm0 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ ││││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
│ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
│ ││││ 0x00007fbfe024bddf: vmovdqu 0x10(%rax,%rcx,8),%ymm2 ;*synchronization entry | |
│ ││││ ; - jdk.incubator.vector.Long256Vector::get@-1 (line 951) | |
│ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@70 (line 97) | |
│ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0.04% │ ││││ 0x00007fbfe024bde5: vpand 0x10(%r13,%rcx,8),%ymm2,%ymm2 | |
0.11% │ ││││ 0x00007fbfe024bdec: vextracti128 $0x1,%ymm2,%xmm3 | |
0.05% │ ││││ 0x00007fbfe024bdf2: vpextrq $0x1,%xmm3,%r9 | |
0.23% │ ││││ 0x00007fbfe024bdf8: popcnt %r9,%r9 | |
0.12% │ ││││ 0x00007fbfe024bdfd: vextracti128 $0x1,%ymm2,%xmm3 | |
│ ││││ 0x00007fbfe024be03: vpextrq $0x0,%xmm3,%rdi | |
│ ││││ 0x00007fbfe024be09: popcnt %rdi,%rdi | |
0.03% │ ││││ 0x00007fbfe024be0e: vmovq %xmm2,%rdx | |
│ ││││ 0x00007fbfe024be13: popcnt %rdx,%rdx | |
│ ││││ 0x00007fbfe024be18: add %r10d,%edx | |
0.01% │ ││││ 0x00007fbfe024be1b: vpextrq $0x1,%xmm2,%r10 | |
0.10% │ ││││ 0x00007fbfe024be21: popcnt %r10,%r10 | |
0.00% │ ││││ 0x00007fbfe024be26: add %r10d,%edx | |
0.01% │ ││││ 0x00007fbfe024be29: add %edi,%edx | |
0.11% │ ││││ 0x00007fbfe024be2b: add %r9d,%edx | |
0.08% │ ││││ 0x00007fbfe024be2e: add $0x4,%ecx | |
│ ││││ 0x00007fbfe024be31: cmp %esi,%ecx | |
│╭││││ 0x00007fbfe024be33: jge 0x00007fbfe024be47 | |
││││││ 0x00007fbfe024be35: mov %edx,%r10d | |
││││││ 0x00007fbfe024be38: vmovq %xmm0,%rdi | |
││││││ 0x00007fbfe024be3d: vmovq %xmm1,%r9 | |
││╰│││ 0x00007fbfe024be42: jmpq 0x00007fbfe024bdc3 | |
│↘ │││ 0x00007fbfe024be47: cmp %ebx,%r8d | |
0.01% │ │││ 0x00007fbfe024be4a: mov %r8d,%r10d | |
0.06% │ │││ 0x00007fbfe024be4d: cmovg %ebx,%r10d | |
│ │││ 0x00007fbfe024be51: cmp %r11d,%r10d | |
│ │││ 0x00007fbfe024be54: cmovg %r11d,%r10d | |
│ │││ 0x00007fbfe024be58: mov %r10d,%r9d | |
0.03% │ │││ 0x00007fbfe024be5b: add $0xfffffff4,%r9d | |
│ │││ 0x00007fbfe024be5f: cmp %r9d,%r10d | |
0.00% │ │││ 0x00007fbfe024be62: mov $0x80000000,%edi | |
│ │││ 0x00007fbfe024be67: cmovl %edi,%r9d | |
0.04% │ │││ 0x00007fbfe024be6b: cmp %r9d,%ecx | |
│ ╰││ 0x00007fbfe024be6e: jge 0x00007fbfe024bcbf | |
│ ││ 0x00007fbfe024be74: vmovq %rbp,%xmm3 | |
│ ││ 0x00007fbfe024be79: vmovd %ebx,%xmm5 | |
0.00% │ ││ 0x00007fbfe024be7d: vmovd %r11d,%xmm4 | |
0.06% │ ││ 0x00007fbfe024be82: vmovd %r8d,%xmm7 | |
│ ││ 0x00007fbfe024be87: mov %r9d,0x50(%rsp) | |
│ ╰│ 0x00007fbfe024be8c: jmpq 0x00007fbfe024bb55 | |
↘ │ 0x00007fbfe024be91: xor %edx,%edx | |
╰ 0x00007fbfe024be93: jmpq 0x00007fbfe024bd3d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@92 (line 99) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
0x00007fbfe024be98: mov $0x1,%ebp ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcnt@17 (line 94) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub@17 (line 158) | |
.................................................................................................... | |
96.24% <total for region 1> | |
....[Hottest Regions]............................................................................... | |
96.24% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub, version 637 (818 bytes) | |
1.79% [kernel.kallsyms] native_write_msr (5 bytes) | |
0.21% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 610 (55 bytes) | |
0.13% [kernel.kallsyms] __intel_pmu_enable_all (0 bytes) | |
0.11% [kernel.kallsyms] syscall_return_via_sysret (6 bytes) | |
0.07% kernel [unknown] (37 bytes) | |
0.07% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (30 bytes) | |
0.03% kernel [unknown] (39 bytes) | |
0.03% [kernel.kallsyms] common_file_perm (10 bytes) | |
0.03% [kernel.kallsyms] mutex_unlock (0 bytes) | |
0.03% [kernel.kallsyms] mutex_lock (0 bytes) | |
0.03% [kernel.kallsyms] entry_SYSCALL_64_after_hwframe (43 bytes) | |
0.02% [kernel.kallsyms] finish_task_switch (0 bytes) | |
0.02% kernel [unknown] (30 bytes) | |
0.02% [kernel.kallsyms] do_syscall_64 (15 bytes) | |
0.02% libc-2.23.so syscall (0 bytes) | |
0.02% ld-2.23.so __tls_get_addr (21 bytes) | |
0.01% kernel [unknown] (9 bytes) | |
0.01% [kernel.kallsyms] __update_load_avg_se.isra.38 (0 bytes) | |
0.01% [kernel.kallsyms] copy_user_generic_unrolled (4 bytes) | |
1.07% <...other 189 warm regions...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Hottest Methods (after inlining)].............................................................. | |
96.25% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcnt_jmhTest::vpandExtractPopcnt_thrpt_jmhStub, version 637 | |
1.79% [kernel.kallsyms] native_write_msr | |
0.21% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 610 | |
0.15% kernel [unknown] | |
0.14% [kernel.kallsyms] __intel_pmu_enable_all | |
0.12% [kernel.kallsyms] syscall_return_via_sysret | |
0.07% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable | |
0.04% [kernel.kallsyms] common_file_perm | |
0.03% [kernel.kallsyms] entry_SYSCALL_64_after_hwframe | |
0.03% [kernel.kallsyms] mutex_lock | |
0.03% [kernel.kallsyms] mutex_unlock | |
0.03% libc-2.23.so vfprintf | |
0.02% [kernel.kallsyms] do_syscall_64 | |
0.02% [kernel.kallsyms] finish_task_switch | |
0.02% hsdis-amd64.so print_insn | |
0.02% interpreter method entry point (kind = zerolocals) | |
0.02% ld-2.23.so __tls_get_addr | |
0.02% libjvm.so _ZN10fileStream5writeEPKcm | |
0.02% libjvm.so _ZN12outputStream44do_vsnprintf_and_write_with_automatic_bufferEPKcP13__va_list_tagb | |
0.02% libc-2.23.so __strchr_sse2 | |
0.94% <...other 146 warm methods...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Distribution by Source]........................................................................ | |
96.46% c2, level 4 | |
2.71% [kernel.kallsyms] | |
0.36% libjvm.so | |
0.17% libc-2.23.so | |
0.15% kernel | |
0.07% interpreter | |
0.04% hsdis-amd64.so | |
0.02% ld-2.23.so | |
0.02% libpthread-2.23.so | |
0.00% perf-2054.map | |
0.00% Unknown, level 0 | |
.................................................................................................... | |
100.00% <totals> | |
# JMH version: 1.20 | |
# VM version: JDK 12-internal, VM 12-internal+0-adhoc.root.dev | |
# VM invoker: /home/richard/workspace/dev/build/linux-x86_64-normal-server-release/images/jdk/bin/java | |
# VM options: --add-modules=jdk.incubator.vector -Djdk.incubator.vector.VECTOR_ACCESS_OOB_CHECK=0 | |
# Warmup: 20 iterations, 1 s each | |
# Measurement: 20 iterations, 1 s each | |
# Timeout: 10 min per iteration | |
# Threads: 1 thread, will synchronize iterations | |
# Benchmark mode: Throughput, ops/time | |
# Benchmark: com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandExtractPopcntUnrolled | |
# Parameters: (size = 1024) | |
# Run progress: 50.00% complete, ETA 00:02:13 | |
# Fork: 1 of 1 | |
# Preparing profilers: LinuxPerfAsmProfiler | |
# Profilers consume stdout and stderr from target VM, use -v EXTRA to copy to console | |
# Warmup Iteration 1: 1059.855 ops/ms | |
# Warmup Iteration 2: 1582.857 ops/ms | |
# Warmup Iteration 3: 1849.448 ops/ms | |
# Warmup Iteration 4: 1854.264 ops/ms | |
# Warmup Iteration 5: 1772.919 ops/ms | |
# Warmup Iteration 6: 1664.252 ops/ms | |
# Warmup Iteration 7: 1674.879 ops/ms | |
# Warmup Iteration 8: 1586.829 ops/ms | |
# Warmup Iteration 9: 1608.915 ops/ms | |
# Warmup Iteration 10: 1594.792 ops/ms | |
# Warmup Iteration 11: 1607.739 ops/ms | |
# Warmup Iteration 12: 1524.458 ops/ms | |
# Warmup Iteration 13: 1560.806 ops/ms | |
# Warmup Iteration 14: 1638.602 ops/ms | |
# Warmup Iteration 15: 1707.510 ops/ms | |
# Warmup Iteration 16: 1762.003 ops/ms | |
# Warmup Iteration 17: 1783.302 ops/ms | |
# Warmup Iteration 18: 1790.042 ops/ms | |
# Warmup Iteration 19: 1788.938 ops/ms | |
# Warmup Iteration 20: 1817.867 ops/ms | |
Iteration 1: 1859.973 ops/ms | |
Iteration 2: 1866.665 ops/ms | |
Iteration 3: 1848.195 ops/ms | |
Iteration 4: 1812.493 ops/ms | |
Iteration 5: 1859.172 ops/ms | |
Iteration 6: 1817.958 ops/ms | |
Iteration 7: 1827.607 ops/ms | |
Iteration 8: 1866.956 ops/ms | |
Iteration 9: 1858.716 ops/ms | |
Iteration 10: 1851.698 ops/ms | |
Iteration 11: 1852.486 ops/ms | |
Iteration 12: 1848.956 ops/ms | |
Iteration 13: 1850.713 ops/ms | |
Iteration 14: 1854.415 ops/ms | |
Iteration 15: 1865.470 ops/ms | |
Iteration 16: 1872.100 ops/ms | |
Iteration 17: 1858.070 ops/ms | |
Iteration 18: 1816.461 ops/ms | |
Iteration 19: 1851.532 ops/ms | |
Iteration 20: 1790.113 ops/ms | |
# Processing profiler results: LinuxPerfAsmProfiler | |
Result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandExtractPopcntUnrolled": | |
1846.487 ±(99.9%) 18.955 ops/ms [Average] | |
(min, avg, max) = (1790.113, 1846.487, 1872.100), stdev = 21.828 | |
CI (99.9%): [1827.533, 1865.442] (assumes normal distribution) | |
Secondary result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandExtractPopcntUnrolled:·asm": | |
PrintAssembly processed: 179873 total address lines. | |
Perf output processed (skipped 22.614 seconds): | |
Column 1: cycles (20317 events) | |
Hottest code regions (>10.00% "cycles" events): | |
....[Hottest Region 1].............................................................................. | |
c2, level 4, com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub, version 654 (1378 bytes) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
; implicit exception: dispatches to 0x00007f546024dc3e | |
0x00007f546024d3f5: test %r11d,%r11d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0x00007f546024d3f8: jne 0x00007f546024d9fd | |
0x00007f546024d3fe: mov $0x1,%ebp | |
╭ 0x00007f546024d403: jmpq 0x00007f546024d840 | |
│ 0x00007f546024d408: nopl 0x0(%rax,%rax,1) | |
0.55% │↗ 0x00007f546024d410: mov %edi,%ebx | |
0.05% ││ 0x00007f546024d412: mov %edx,%ebp ;*ldc {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@28 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@176 (line 118) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.13% ││ 0x00007f546024d414: vmovq %xmm13,%r10 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.07% ││ 0x00007f546024d419: vmovdqu 0x10(%r10,%r13,8),%ymm0 ;*invokevirtual vpandExtractPopcntUnrolled {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.42% ││ 0x00007f546024d420: vmovq %xmm12,%r9 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.09% ││ 0x00007f546024d425: vpand 0x10(%r9,%r13,8),%ymm0,%ymm1 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@164 (line 117) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.57% ││ 0x00007f546024d42c: mov %r13d,%r8d | |
0.43% ││ 0x00007f546024d42f: add $0x1c,%r8d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.07% ││ 0x00007f546024d433: movslq %r8d,%r11 ;*invokestatic checkIndex {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@11 (line 1296) | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@59 (line 110) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.13% ││ 0x00007f546024d436: vmovdqu 0x10(%r10,%r11,8),%ymm0 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ 0x00007f546024d43d: vpand 0x10(%r9,%r11,8),%ymm0,%ymm8 ;*new {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@0 | |
││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@59 (line 110) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.80% ││ 0x00007f546024d444: mov %r13d,%r11d | |
1.08% ││ 0x00007f546024d447: add $0x14,%r11d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ 0x00007f546024d44b: movslq %r11d,%r11 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@128 (line 114) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.54% ││ 0x00007f546024d44e: vmovdqu 0x10(%r10,%r11,8),%ymm0 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ 0x00007f546024d455: vpand 0x10(%r9,%r11,8),%ymm0,%ymm2 ;*invokevirtual invokeBasic {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@4 | |
││ ; - jdk.incubator.vector.Long256Vector::get@38 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@128 (line 114) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.99% ││ 0x00007f546024d45c: mov %r13d,%r8d | |
0.46% ││ 0x00007f546024d45f: add $0xc,%r8d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.09% ││ 0x00007f546024d463: movslq %r8d,%r11 ;*iflt {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@1 (line 951) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@176 (line 118) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.18% ││ 0x00007f546024d466: vmovdqu 0x10(%r10,%r11,8),%ymm0 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ 0x00007f546024d46d: vpand 0x10(%r9,%r11,8),%ymm0,%ymm3 ;*aload_0 {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - java.lang.invoke.LambdaForm$MH/0x0000000800060440::invoke@0 | |
││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@4 | |
││ ; - jdk.incubator.vector.Long256Vector::get@38 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@164 (line 117) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.74% ││ 0x00007f546024d474: mov %r13d,%r11d | |
1.05% ││ 0x00007f546024d477: add $0x4,%r11d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% ││ 0x00007f546024d47b: movslq %r11d,%r11 ;*iload_1 {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@0 (line 951) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@176 (line 118) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.51% ││ 0x00007f546024d47e: vmovdqu 0x10(%r10,%r11,8),%ymm0 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.15% ││ 0x00007f546024d485: vpand 0x10(%r9,%r11,8),%ymm0,%ymm4 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@164 (line 117) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.13% ││ 0x00007f546024d48c: movslq %r13d,%r11 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.58% ││ 0x00007f546024d48f: vmovdqu 0xd0(%r10,%r11,8),%ymm0 ;*new {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@0 | |
││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@47 (line 110) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ 0x00007f546024d499: vpand 0xd0(%r9,%r11,8),%ymm0,%ymm5 ;*getfield left {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@42 (line 110) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.26% ││ 0x00007f546024d4a3: vmovdqu 0x50(%r10,%r11,8),%ymm0 ;*invokevirtual get {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@176 (line 118) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.48% ││ 0x00007f546024d4aa: vpand 0x50(%r9,%r11,8),%ymm0,%ymm6 ;*invokevirtual invokeBasic {reexecute=0 rethrow=0 return_oop=0} | |
││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@4 | |
││ ; - jdk.incubator.vector.Long256Vector::get@38 (line 954) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@164 (line 117) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.44% ││ 0x00007f546024d4b1: vmovdqu 0x90(%r10,%r11,8),%ymm0 ;*synchronization entry | |
││ ; - jdk.incubator.vector.Long256Vector::get@-1 (line 951) | |
││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@128 (line 114) | |
││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ 0x00007f546024d4bb: vpand 0x90(%r9,%r11,8),%ymm0,%ymm7 | |
1.08% ││ 0x00007f546024d4c5: vextracti128 $0x1,%ymm8,%xmm0 | |
0.64% ││ 0x00007f546024d4cb: vpextrq $0x0,%xmm0,%r10 | |
0.78% ││ 0x00007f546024d4d1: popcnt %r10,%rax | |
0.23% ││ 0x00007f546024d4d6: vpextrq $0x1,%xmm2,%r10 | |
0.79% ││ 0x00007f546024d4dc: popcnt %r10,%r10 | |
1.06% ││ 0x00007f546024d4e1: vmovd %r10d,%xmm15 | |
0.70% ││ 0x00007f546024d4e6: vextracti128 $0x1,%ymm7,%xmm0 | |
1.84% ││ 0x00007f546024d4ec: vpextrq $0x0,%xmm0,%r10 | |
1.42% ││ 0x00007f546024d4f2: popcnt %r10,%rsi | |
3.16% ││ 0x00007f546024d4f7: vpextrq $0x1,%xmm3,%r10 | |
0.29% ││ 0x00007f546024d4fd: popcnt %r10,%r9 | |
0.12% ││ 0x00007f546024d502: vpextrq $0x1,%xmm4,%r10 | |
1.55% ││ 0x00007f546024d508: popcnt %r10,%r14 | |
0.01% ││ 0x00007f546024d50d: vextracti128 $0x1,%ymm1,%xmm0 | |
0.11% ││ 0x00007f546024d513: vpextrq $0x0,%xmm0,%r10 | |
1.40% ││ 0x00007f546024d519: popcnt %r10,%r10 | |
0.01% ││ 0x00007f546024d51e: add %ecx,%r10d | |
0.09% ││ 0x00007f546024d521: add %r10d,%r14d | |
1.12% ││ 0x00007f546024d524: vpextrq $0x1,%xmm5,%r10 | |
0.29% ││ 0x00007f546024d52a: popcnt %r10,%rdi | |
0.10% ││ 0x00007f546024d52f: vmovq %xmm2,%r10 | |
1.29% ││ 0x00007f546024d534: popcnt %r10,%r8 | |
0.30% ││ 0x00007f546024d539: vpextrq $0x1,%xmm7,%r10 | |
0.10% ││ 0x00007f546024d53f: popcnt %r10,%r11 | |
1.28% ││ 0x00007f546024d544: vmovq %xmm3,%r10 | |
0.29% ││ 0x00007f546024d549: popcnt %r10,%rdx | |
0.02% ││ 0x00007f546024d54e: vmovq %xmm4,%r10 | |
0.08% ││ 0x00007f546024d553: popcnt %r10,%rcx | |
1.24% ││ 0x00007f546024d558: vpextrq $0x1,%xmm1,%r10 | |
0.27% ││ 0x00007f546024d55e: popcnt %r10,%r10 | |
0.09% ││ 0x00007f546024d563: add %ebx,%r10d | |
1.21% ││ 0x00007f546024d566: add %r10d,%ecx | |
0.25% ││ 0x00007f546024d569: vextracti128 $0x1,%ymm4,%xmm0 | |
0.00% ││ 0x00007f546024d56f: vpextrq $0x1,%xmm0,%r10 | |
1.42% ││ 0x00007f546024d575: popcnt %r10,%r10 | |
0.28% ││ 0x00007f546024d57a: add %r10d,%ecx | |
0.02% ││ 0x00007f546024d57d: vpextrq $0x1,%xmm6,%r10 | |
1.35% ││ 0x00007f546024d583: popcnt %r10,%rbx | |
0.33% ││ 0x00007f546024d588: add %ebx,%ecx | |
0.01% ││ 0x00007f546024d58a: add %ecx,%edx | |
0.11% ││ 0x00007f546024d58c: vextracti128 $0x1,%ymm3,%xmm0 | |
1.24% ││ 0x00007f546024d592: vpextrq $0x1,%xmm0,%r10 | |
0.32% ││ 0x00007f546024d598: popcnt %r10,%rcx | |
0.58% ││ 0x00007f546024d59d: add %ecx,%edx | |
1.12% ││ 0x00007f546024d59f: add %edx,%r11d | |
0.55% ││ 0x00007f546024d5a2: add %r11d,%r8d | |
0.66% ││ 0x00007f546024d5a5: vextracti128 $0x1,%ymm2,%xmm0 | |
0.31% ││ 0x00007f546024d5ab: vpextrq $0x1,%xmm0,%r10 | |
1.64% ││ 0x00007f546024d5b1: popcnt %r10,%r11 | |
1.31% ││ 0x00007f546024d5b6: add %r11d,%r8d | |
0.79% ││ 0x00007f546024d5b9: add %r8d,%edi | |
2.73% ││ 0x00007f546024d5bc: vmovq %xmm8,%r10 | |
0.17% ││ 0x00007f546024d5c1: popcnt %r10,%r10 | |
││ 0x00007f546024d5c6: add %r10d,%edi | |
││ 0x00007f546024d5c9: vextracti128 $0x1,%ymm8,%xmm0 | |
1.61% ││ 0x00007f546024d5cf: vpextrq $0x1,%xmm0,%r10 | |
0.22% ││ 0x00007f546024d5d5: popcnt %r10,%r11 | |
0.02% ││ 0x00007f546024d5da: add %r11d,%edi | |
2.50% ││ 0x00007f546024d5dd: vmovq %xmm5,%r10 | |
0.16% ││ 0x00007f546024d5e2: popcnt %r10,%rdx | |
││ 0x00007f546024d5e7: vmovq %xmm7,%r10 | |
0.00% ││ 0x00007f546024d5ec: popcnt %r10,%r10 | |
1.48% ││ 0x00007f546024d5f1: vmovq %xmm6,%r11 | |
0.17% ││ 0x00007f546024d5f6: popcnt %r11,%r11 | |
0.00% ││ 0x00007f546024d5fb: vmovq %xmm1,%r8 | |
0.01% ││ 0x00007f546024d600: popcnt %r8,%r8 | |
1.53% ││ 0x00007f546024d605: add %ebp,%r8d | |
0.18% ││ 0x00007f546024d608: vextracti128 $0x1,%ymm1,%xmm0 | |
0.00% ││ 0x00007f546024d60e: vpextrq $0x1,%xmm0,%rcx | |
1.46% ││ 0x00007f546024d614: popcnt %rcx,%rbx | |
0.15% ││ 0x00007f546024d619: add %ebx,%r8d | |
││ 0x00007f546024d61c: vextracti128 $0x1,%ymm4,%xmm0 | |
0.01% ││ 0x00007f546024d622: vpextrq $0x0,%xmm0,%rcx | |
1.76% ││ 0x00007f546024d628: popcnt %rcx,%rcx | |
0.01% ││ 0x00007f546024d62d: add %ecx,%r8d | |
0.00% ││ 0x00007f546024d630: add %r8d,%r11d | |
1.61% ││ 0x00007f546024d633: vextracti128 $0x1,%ymm6,%xmm0 | |
0.16% ││ 0x00007f546024d639: vpextrq $0x1,%xmm0,%r8 | |
0.00% ││ 0x00007f546024d63f: popcnt %r8,%rcx | |
1.39% ││ 0x00007f546024d644: add %ecx,%r11d | |
0.40% ││ 0x00007f546024d647: vextracti128 $0x1,%ymm3,%xmm0 | |
││ 0x00007f546024d64d: vpextrq $0x0,%xmm0,%r8 | |
1.70% ││ 0x00007f546024d653: popcnt %r8,%r8 | |
0.28% ││ 0x00007f546024d658: add %r8d,%r11d | |
0.04% ││ 0x00007f546024d65b: add %r11d,%r10d | |
0.18% ││ 0x00007f546024d65e: vextracti128 $0x1,%ymm7,%xmm0 | |
1.45% ││ 0x00007f546024d664: vpextrq $0x1,%xmm0,%r11 | |
0.11% ││ 0x00007f546024d66a: popcnt %r11,%r11 | |
0.19% ││ 0x00007f546024d66f: add %r11d,%r10d | |
1.35% ││ 0x00007f546024d672: vextracti128 $0x1,%ymm2,%xmm0 | |
0.18% ││ 0x00007f546024d678: vpextrq $0x0,%xmm0,%r11 | |
0.27% ││ 0x00007f546024d67e: popcnt %r11,%r8 | |
1.42% ││ 0x00007f546024d683: add %r8d,%r10d | |
0.15% ││ 0x00007f546024d686: add %r10d,%edx | |
0.36% ││ 0x00007f546024d689: vextracti128 $0x1,%ymm5,%xmm0 | |
0.14% ││ 0x00007f546024d68f: vpextrq $0x1,%xmm0,%r10 | |
1.35% ││ 0x00007f546024d695: popcnt %r10,%r11 | |
0.39% ││ 0x00007f546024d69a: add %r11d,%edx | |
0.08% ││ 0x00007f546024d69d: add %eax,%edx | |
1.58% ││ 0x00007f546024d69f: vextracti128 $0x1,%ymm5,%xmm0 | |
0.12% ││ 0x00007f546024d6a5: vpextrq $0x0,%xmm0,%r10 | |
0.07% ││ 0x00007f546024d6ab: popcnt %r10,%rcx | |
1.62% ││ 0x00007f546024d6b0: vpextrq $0x1,%xmm8,%r10 | |
0.10% ││ 0x00007f546024d6b6: popcnt %r10,%r10 | |
0.09% ││ 0x00007f546024d6bb: vextracti128 $0x1,%ymm6,%xmm0 | |
1.62% ││ 0x00007f546024d6c1: vpextrq $0x0,%xmm0,%r11 | |
0.16% ││ 0x00007f546024d6c7: popcnt %r11,%r8 | |
0.19% ││ 0x00007f546024d6cc: add %r8d,%r14d | |
1.50% ││ 0x00007f546024d6cf: add %r14d,%r9d | |
0.66% ││ 0x00007f546024d6d2: add %r9d,%esi | |
0.66% ││ 0x00007f546024d6d5: vmovd %xmm15,%r11d | |
0.13% ││ 0x00007f546024d6da: add %esi,%r11d | |
1.39% ││ 0x00007f546024d6dd: add %r11d,%ecx | |
0.63% ││ 0x00007f546024d6e0: add %r10d,%ecx | |
1.07% ││ 0x00007f546024d6e3: add $0x20,%r13d | |
0.08% ││ 0x00007f546024d6e7: cmp 0x60(%rsp),%r13d | |
│╰ 0x00007f546024d6ec: jl 0x00007f546024d410 | |
0.00% │ 0x00007f546024d6f2: vmovd %xmm14,%esi | |
│ 0x00007f546024d6f6: mov 0x50(%rsp),%r10d | |
0.03% │ 0x00007f546024d6fb: mov 0x54(%rsp),%r11d | |
0.01% │ 0x00007f546024d700: cmp %esi,%r13d | |
│ ╭ 0x00007f546024d703: jge 0x00007f546024d7f2 | |
0.01% │ │ 0x00007f546024d709: data16 xchg %ax,%ax | |
0.02% │ │ ↗ 0x00007f546024d70c: cmp %r10d,%r13d | |
│ │ │ 0x00007f546024d70f: jge 0x00007f546024da67 | |
0.09% │ │ │ 0x00007f546024d715: cmp %r11d,%r13d | |
│ │ │ 0x00007f546024d718: jge 0x00007f546024daae | |
0.02% │ │ │ 0x00007f546024d71e: vmovq %xmm13,%rax ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% │ │ │ 0x00007f546024d723: vmovdqu 0x10(%rax,%r13,8),%ymm0 ;*checkcast {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@46 (line 1297) | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@92 (line 111) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.04% │ │ │ 0x00007f546024d72a: vmovq %xmm12,%rbp ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.08% │ │ │ 0x00007f546024d72f: vpand 0x10(%rbp,%r13,8),%ymm0,%ymm0 ;*checkcast {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@46 (line 1297) | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@92 (line 111) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.08% │ │ │ 0x00007f546024d736: mov %r13d,%r9d | |
0.01% │ │ │ 0x00007f546024d739: add $0x4,%r9d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.08% │ │ │ 0x00007f546024d73d: movslq %r9d,%r8 ;*invokevirtual fromArray {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@92 (line 111) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% │ │ │ 0x00007f546024d740: vmovdqu 0x10(%rax,%r8,8),%ymm1 | |
0.03% │ │ │ 0x00007f546024d747: vpand 0x10(%rbp,%r8,8),%ymm1,%ymm1 | |
0.11% │ │ │ 0x00007f546024d74e: vpextrq $0x1,%xmm1,%r8 | |
0.08% │ │ │ 0x00007f546024d754: popcnt %r8,%r8 | |
0.12% │ │ │ 0x00007f546024d759: vextracti128 $0x1,%ymm0,%xmm2 | |
0.07% │ │ │ 0x00007f546024d75f: vpextrq $0x0,%xmm2,%r9 | |
0.10% │ │ │ 0x00007f546024d765: popcnt %r9,%r9 | |
0.09% │ │ │ 0x00007f546024d76a: add %ecx,%r9d | |
0.14% │ │ │ 0x00007f546024d76d: add %r9d,%r8d | |
0.09% │ │ │ 0x00007f546024d770: vmovq %xmm1,%r9 | |
0.00% │ │ │ 0x00007f546024d775: popcnt %r9,%rbx | |
0.01% │ │ │ 0x00007f546024d77a: vpextrq $0x1,%xmm0,%r9 | |
0.12% │ │ │ 0x00007f546024d780: popcnt %r9,%rcx | |
0.02% │ │ │ 0x00007f546024d785: add %edi,%ecx | |
0.01% │ │ │ 0x00007f546024d787: add %ecx,%ebx | |
0.04% │ │ │ 0x00007f546024d789: vextracti128 $0x1,%ymm1,%xmm2 | |
0.10% │ │ │ 0x00007f546024d78f: vpextrq $0x1,%xmm2,%r9 | |
0.03% │ │ │ 0x00007f546024d795: popcnt %r9,%r9 | |
0.06% │ │ │ 0x00007f546024d79a: add %r9d,%ebx | |
0.07% │ │ │ 0x00007f546024d79d: vmovq %xmm0,%r9 | |
0.02% │ │ │ 0x00007f546024d7a2: popcnt %r9,%r9 | |
│ │ │ 0x00007f546024d7a7: add %edx,%r9d | |
0.03% │ │ │ 0x00007f546024d7aa: vextracti128 $0x1,%ymm0,%xmm2 | |
0.08% │ │ │ 0x00007f546024d7b0: vpextrq $0x1,%xmm2,%rcx | |
0.00% │ │ │ 0x00007f546024d7b6: popcnt %rcx,%rdi | |
0.03% │ │ │ 0x00007f546024d7bb: add %edi,%r9d | |
0.09% │ │ │ 0x00007f546024d7be: vextracti128 $0x1,%ymm1,%xmm0 | |
0.01% │ │ │ 0x00007f546024d7c4: vpextrq $0x0,%xmm0,%rcx | |
0.03% │ │ │ 0x00007f546024d7ca: popcnt %rcx,%rcx | |
0.08% │ │ │ 0x00007f546024d7cf: add %ecx,%r9d | |
0.06% │ │ │ 0x00007f546024d7d2: add $0x8,%r13d | |
0.01% │ │ │ 0x00007f546024d7d6: cmp %esi,%r13d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
│ │╭ │ 0x00007f546024d7d9: jl 0x00007f546024d7e5 | |
│ ││ │ 0x00007f546024d7db: mov %r8d,%ecx | |
0.04% │ ││ │ 0x00007f546024d7de: mov %ebx,%edi | |
│ ││ │ 0x00007f546024d7e0: mov %r9d,%edx ; {no_reloc} | |
0.00% │ ││╭│ 0x00007f546024d7e3: jmp 0x00007f546024d7f2 | |
0.02% │ │↘││ 0x00007f546024d7e5: mov %r8d,%ecx | |
0.08% │ │ ││ 0x00007f546024d7e8: mov %ebx,%edi | |
0.04% │ │ ││ 0x00007f546024d7ea: mov %r9d,%edx | |
│ │ │╰ 0x00007f546024d7ed: jmpq 0x00007f546024d70c ;*invokevirtual fromArray {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@59 (line 110) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% │ ↘ ↘ 0x00007f546024d7f2: add %edi,%edx ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% │ 0x00007f546024d7f4: add %ecx,%edx | |
0.01% │ 0x00007f546024d7f6: vmovq %xmm9,%rbx | |
│ 0x00007f546024d7fb: vmovq %xmm10,%r8 | |
0.00% │ 0x00007f546024d800: vmovq %xmm11,%rbp | |
0.04% │ 0x00007f546024d805: mov %r8,0x50(%rsp) | |
0.01% │ 0x00007f546024d80a: mov %rbx,0x60(%rsp) | |
│ 0x00007f546024d80f: mov 0x8(%rsp),%rsi | |
0.00% │ 0x00007f546024d814: vzeroupper | |
0.05% │ 0x00007f546024d817: callq 0x00007f5458765f00 ; ImmutableOopMap{[80]=Oop [88]=Oop [96]=Oop [8]=Oop } | |
│ ;*invokevirtual consume {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@20 (line 158) | |
│ ; {optimized virtual_call} | |
0.03% │ 0x00007f546024d81c: mov 0x50(%rsp),%r8 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.00% │ 0x00007f546024d821: movzbl 0x94(%r8),%r10d ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@59 (line 110) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.05% │ 0x00007f546024d829: mov 0x108(%r15),%r11 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
│ ; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
│ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% │ 0x00007f546024d830: add $0x1,%rbp ; ImmutableOopMap{r8=Oop [88]=Oop [96]=Oop [8]=Oop } | |
│ ;*ifeq {reexecute=1 rethrow=0 return_oop=0} | |
│ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@33 (line 160) | |
│ 0x00007f546024d834: test %eax,(%r11) ; {poll} | |
0.00% │ 0x00007f546024d837: test %r10d,%r10d | |
│ 0x00007f546024d83a: jne 0x00007f546024da02 | |
0.02% ↘ 0x00007f546024d840: mov 0x60(%rsp),%rbx ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% 0x00007f546024d845: mov 0xc(%rbx),%ecx ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@34 (line 109) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0x00007f546024d848: test %ecx,%ecx | |
0x00007f546024d84a: jle 0x00007f546024d9f6 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% 0x00007f546024d850: mov 0x10(%rbx),%edi ;*new {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@0 | |
; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@47 (line 110) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% 0x00007f546024d853: mov 0xc(%r12,%rdi,8),%r10d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
; implicit exception: dispatches to 0x00007f546024dbda | |
0.06% 0x00007f546024d858: mov 0x14(%rbx),%r9d ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@47 (line 110) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0x00007f546024d85c: mov 0xc(%r12,%r9,8),%r11d ; implicit exception: dispatches to 0x00007f546024dbda | |
0.02% 0x00007f546024d861: cmp $0x7ffffff8,%ecx | |
0x00007f546024d867: jg 0x00007f546024da3a | |
0.00% 0x00007f546024d86d: mov $0x8,%edx | |
0.00% 0x00007f546024d872: cmp %edx,%ecx | |
0x00007f546024d874: mov %ecx,%esi | |
0.03% 0x00007f546024d876: mov $0x8,%eax | |
0.00% 0x00007f546024d87b: cmovg %eax,%esi | |
0.01% 0x00007f546024d87e: lea (%r12,%rdi,8),%rax | |
0x00007f546024d882: lea (%r12,%r9,8),%r14 | |
0.02% 0x00007f546024d886: xor %r13d,%r13d | |
0.02% 0x00007f546024d889: xor %r9d,%r9d | |
0.00% 0x00007f546024d88c: xor %edi,%edi | |
0.00% 0x00007f546024d88e: xor %edx,%edx | |
0.02% 0x00007f546024d890: cmp %r10d,%r13d | |
0x00007f546024d893: jge 0x00007f546024da63 | |
0.00% 0x00007f546024d899: cmp %r11d,%r13d | |
0x00007f546024d89c: jge 0x00007f546024daaa | |
0.02% 0x00007f546024d8a2: mov %r11d,0x54(%rsp) | |
0x00007f546024d8a7: vmovd %r10d,%xmm1 | |
0.02% 0x00007f546024d8ac: mov %ecx,0x50(%rsp) | |
0.04% 0x00007f546024d8b0: vmovq %rbp,%xmm11 | |
0.01% 0x00007f546024d8b5: vmovq %r8,%xmm10 | |
0x00007f546024d8ba: vmovq %rbx,%xmm9 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% 0x00007f546024d8bf: vmovdqu 0x10(%r14,%r13,8),%ymm0 ;*checkcast {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::and@30 (line 401) | |
; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@95 (line 111) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.17% 0x00007f546024d8c6: vpand 0x10(%rax,%r13,8),%ymm0,%ymm3 ;*invokevirtual and {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@95 (line 111) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.30% 0x00007f546024d8cd: mov %r13d,%r11d | |
0x00007f546024d8d0: add $0x4,%r11d ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0x00007f546024d8d4: movslq %r11d,%r10 ;*checkcast {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::and@30 (line 401) | |
; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@95 (line 111) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.05% 0x00007f546024d8d7: vmovdqu 0x10(%r14,%r10,8),%ymm0 | |
0x00007f546024d8de: vmovq %r14,%xmm13 ;*invokestatic extract {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::get@43 (line 954) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@188 (line 119) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0x00007f546024d8e3: vpand 0x10(%rax,%r10,8),%ymm0,%ymm0 ;*checkcast {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::and@30 (line 401) | |
; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandExtractPopcntUnrolled@95 (line 111) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% 0x00007f546024d8ea: vmovq %rax,%xmm12 | |
0x00007f546024d8ef: vextracti128 $0x1,%ymm0,%xmm2 | |
0x00007f546024d8f5: vpextrq $0x0,%xmm2,%r10 | |
0.04% 0x00007f546024d8fb: popcnt %r10,%r8 | |
0x00007f546024d900: vpextrq $0x1,%xmm0,%r10 | |
0x00007f546024d906: popcnt %r10,%rcx | |
0.05% 0x00007f546024d90b: vextracti128 $0x1,%ymm3,%xmm2 | |
0.02% 0x00007f546024d911: vpextrq $0x0,%xmm2,%r10 | |
0x00007f546024d917: popcnt %r10,%r10 | |
0.09% 0x00007f546024d91c: add %edx,%r10d | |
0.06% 0x00007f546024d91f: add %r10d,%ecx | |
0.08% 0x00007f546024d922: vmovq %xmm0,%r10 | |
0x00007f546024d927: popcnt %r10,%rbx | |
0x00007f546024d92c: vpextrq $0x1,%xmm3,%r10 | |
0.05% 0x00007f546024d932: popcnt %r10,%r11 | |
0x00007f546024d937: add %edi,%r11d | |
0x00007f546024d93a: add %r11d,%ebx | |
0x00007f546024d93d: vextracti128 $0x1,%ymm0,%xmm2 | |
0.05% 0x00007f546024d943: vpextrq $0x1,%xmm2,%r10 | |
0x00007f546024d949: popcnt %r10,%r11 | |
0.00% 0x00007f546024d94e: add %r11d,%ebx | |
0.04% 0x00007f546024d951: vmovq %xmm3,%r10 | |
0x00007f546024d956: popcnt %r10,%rbp | |
0x00007f546024d95b: add %r9d,%ebp | |
0x00007f546024d95e: vextracti128 $0x1,%ymm3,%xmm0 | |
0.04% 0x00007f546024d964: vpextrq $0x1,%xmm0,%r10 | |
0x00007f546024d96a: popcnt %r10,%r10 | |
0x00007f546024d96f: add %r10d,%ebp | |
0.04% 0x00007f546024d972: add %r8d,%ebp | |
0x00007f546024d975: add $0x8,%r13d | |
0x00007f546024d979: cmp %esi,%r13d | |
0x00007f546024d97c: jge 0x00007f546024d9a7 | |
0x00007f546024d97e: mov %ecx,%edx | |
0x00007f546024d980: mov %ebx,%edi | |
0x00007f546024d982: mov %ebp,%r9d | |
0x00007f546024d985: vmovq %xmm9,%rbx | |
0x00007f546024d98a: vmovq %xmm10,%r8 | |
0x00007f546024d98f: vmovq %xmm11,%rbp | |
0x00007f546024d994: mov 0x50(%rsp),%ecx | |
.................................................................................................... | |
97.02% <total for region 1> | |
....[Hottest Regions]............................................................................... | |
97.02% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub, version 654 (1378 bytes) | |
1.06% [kernel.kallsyms] native_write_msr (0 bytes) | |
0.29% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub, version 654 (69 bytes) | |
0.18% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 626 (55 bytes) | |
0.12% kernel [unknown] (39 bytes) | |
0.11% [kernel.kallsyms] syscall_return_via_sysret (27 bytes) | |
0.06% [kernel.kallsyms] x86_pmu_enable (0 bytes) | |
0.04% [kernel.kallsyms] intel_pmu_enable_all (0 bytes) | |
0.04% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (40 bytes) | |
0.04% [kernel.kallsyms] __intel_pmu_enable_all (0 bytes) | |
0.02% [kernel.kallsyms] __perf_event_task_sched_in (34 bytes) | |
0.02% [kernel.kallsyms] mutex_unlock (0 bytes) | |
0.02% libjvm.so _ZN10fileStream5writeEPKcm (43 bytes) | |
0.01% kernel [unknown] (27 bytes) | |
0.01% [kernel.kallsyms] __perf_event_task_sched_in (0 bytes) | |
0.01% [kernel.kallsyms] __fget (19 bytes) | |
0.01% [kernel.kallsyms] mutex_lock (24 bytes) | |
0.01% [kernel.kallsyms] _raw_spin_lock_irqsave (0 bytes) | |
0.01% [kernel.kallsyms] entry_SYSCALL_64_stage2 (0 bytes) | |
0.01% interpreter method entry point (kind = zerolocals) (4 bytes) | |
0.86% <...other 149 warm regions...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Hottest Methods (after inlining)].............................................................. | |
97.31% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandExtractPopcntUnrolled_jmhTest::vpandExtractPopcntUnrolled_thrpt_jmhStub, version 654 | |
1.06% [kernel.kallsyms] native_write_msr | |
0.18% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 626 | |
0.13% kernel [unknown] | |
0.11% [kernel.kallsyms] syscall_return_via_sysret | |
0.06% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable | |
0.06% [kernel.kallsyms] x86_pmu_enable | |
0.04% [kernel.kallsyms] intel_pmu_enable_all | |
0.04% [kernel.kallsyms] __perf_event_task_sched_in | |
0.04% [kernel.kallsyms] __intel_pmu_enable_all | |
0.02% [kernel.kallsyms] mutex_unlock | |
0.02% libjvm.so _Z23resource_allocate_bytesmN17AllocFailStrategy13AllocFailEnumE | |
0.02% libc-2.23.so _IO_fwrite | |
0.02% libjvm.so _ZN10fileStream5writeEPKcm | |
0.02% interpreter method entry point (kind = zerolocals) | |
0.02% [kernel.kallsyms] pipe_write | |
0.02% libc-2.23.so vfprintf | |
0.02% [kernel.kallsyms] __task_pid_nr_ns | |
0.02% [kernel.kallsyms] fsnotify | |
0.02% [kernel.kallsyms] start_this_handle | |
0.74% <...other 116 warm methods...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Distribution by Source]........................................................................ | |
97.49% c2, level 4 | |
1.81% [kernel.kallsyms] | |
0.32% libjvm.so | |
0.15% libc-2.23.so | |
0.13% kernel | |
0.06% interpreter | |
0.02% hsdis-amd64.so | |
0.01% libpthread-2.23.so | |
0.01% c1, level 3 | |
.................................................................................................... | |
100.00% <totals> | |
# JMH version: 1.20 | |
# VM version: JDK 12-internal, VM 12-internal+0-adhoc.root.dev | |
# VM invoker: /home/richard/workspace/dev/build/linux-x86_64-normal-server-release/images/jdk/bin/java | |
# VM options: --add-modules=jdk.incubator.vector -Djdk.incubator.vector.VECTOR_ACCESS_OOB_CHECK=0 | |
# Warmup: 20 iterations, 1 s each | |
# Measurement: 20 iterations, 1 s each | |
# Timeout: 10 min per iteration | |
# Threads: 1 thread, will synchronize iterations | |
# Benchmark mode: Throughput, ops/time | |
# Benchmark: com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandStorePopcnt | |
# Parameters: (size = 1024) | |
# Run progress: 66.67% complete, ETA 00:01:28 | |
# Fork: 1 of 1 | |
# Preparing profilers: LinuxPerfAsmProfiler | |
# Profilers consume stdout and stderr from target VM, use -v EXTRA to copy to console | |
# Warmup Iteration 1: 1261.246 ops/ms | |
# Warmup Iteration 2: 2278.408 ops/ms | |
# Warmup Iteration 3: 2338.440 ops/ms | |
# Warmup Iteration 4: 2306.587 ops/ms | |
# Warmup Iteration 5: 2319.717 ops/ms | |
# Warmup Iteration 6: 2296.683 ops/ms | |
# Warmup Iteration 7: 2259.671 ops/ms | |
# Warmup Iteration 8: 2294.211 ops/ms | |
# Warmup Iteration 9: 2307.339 ops/ms | |
# Warmup Iteration 10: 2266.060 ops/ms | |
# Warmup Iteration 11: 2327.654 ops/ms | |
# Warmup Iteration 12: 2205.767 ops/ms | |
# Warmup Iteration 13: 2102.444 ops/ms | |
# Warmup Iteration 14: 2028.330 ops/ms | |
# Warmup Iteration 15: 2129.482 ops/ms | |
# Warmup Iteration 16: 2162.807 ops/ms | |
# Warmup Iteration 17: 1916.340 ops/ms | |
# Warmup Iteration 18: 2052.144 ops/ms | |
# Warmup Iteration 19: 2084.914 ops/ms | |
# Warmup Iteration 20: 2233.801 ops/ms | |
Iteration 1: 2151.215 ops/ms | |
Iteration 2: 2045.747 ops/ms | |
Iteration 3: 2061.618 ops/ms | |
Iteration 4: 2021.660 ops/ms | |
Iteration 5: 1961.469 ops/ms | |
Iteration 6: 2080.741 ops/ms | |
Iteration 7: 2022.129 ops/ms | |
Iteration 8: 2154.693 ops/ms | |
Iteration 9: 2036.206 ops/ms | |
Iteration 10: 2122.045 ops/ms | |
Iteration 11: 2338.628 ops/ms | |
Iteration 12: 2064.550 ops/ms | |
Iteration 13: 2105.848 ops/ms | |
Iteration 14: 2234.730 ops/ms | |
Iteration 15: 2154.765 ops/ms | |
Iteration 16: 2033.454 ops/ms | |
Iteration 17: 2077.859 ops/ms | |
Iteration 18: 2206.773 ops/ms | |
Iteration 19: 2181.564 ops/ms | |
Iteration 20: 2338.851 ops/ms | |
# Processing profiler results: LinuxPerfAsmProfiler | |
Result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandStorePopcnt": | |
2119.727 ±(99.9%) 88.639 ops/ms [Average] | |
(min, avg, max) = (1961.469, 2119.727, 2338.851), stdev = 102.077 | |
CI (99.9%): [2031.088, 2208.366] (assumes normal distribution) | |
Secondary result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandStorePopcnt:·asm": | |
PrintAssembly processed: 180362 total address lines. | |
Perf output processed (skipped 22.602 seconds): | |
Column 1: cycles (20383 events) | |
Hottest code regions (>10.00% "cycles" events): | |
....[Hottest Region 1].............................................................................. | |
c2, level 4, com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub, version 662 (712 bytes) | |
; implicit exception: dispatches to 0x00007f8f9824dc16 | |
0x00007f8f9824d6f5: test %r11d,%r11d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0x00007f8f9824d6f8: jne 0x00007f8f9824d9f2 | |
0x00007f8f9824d6fe: mov $0x1,%ebp | |
╭ 0x00007f8f9824d703: jmpq 0x00007f8f9824d8e4 | |
│ ↗ 0x00007f8f9824d708: vmovq %xmm2,%rbp | |
│ │ 0x00007f8f9824d70d: vmovd %xmm4,%r11d | |
0.06% │ │ ↗ 0x00007f8f9824d712: vmovq %rbp,%xmm2 | |
│ │ │ 0x00007f8f9824d717: vmovd %r11d,%xmm4 | |
│ │ │ 0x00007f8f9824d71c: nopl 0x0(%rax) ;*iload_3 {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@17 (line 60) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │↗│ │ 0x00007f8f9824d720: vmovdqu 0x10(%rax,%rbx,8),%ymm3 | |
0.25% │││ │ 0x00007f8f9824d726: vpand 0x10(%r13,%rbx,8),%ymm3,%ymm3 | |
4.35% │││ │ 0x00007f8f9824d72d: vmovdqu %ymm3,0x10(%r12,%r9,8) | |
0.06% │││ │ 0x00007f8f9824d734: vmovdqu 0x30(%rax,%rbx,8),%ymm3 | |
0.35% │││ │ 0x00007f8f9824d73a: vpand 0x30(%r13,%rbx,8),%ymm3,%ymm3 | |
4.82% │││ │ 0x00007f8f9824d741: movslq %ebx,%rbp | |
0.00% │││ │ 0x00007f8f9824d744: popcnt 0x10(%r12,%r9,8),%r10 | |
0.34% │││ │ 0x00007f8f9824d74b: add %ecx,%r10d | |
4.08% │││ │ 0x00007f8f9824d74e: popcnt 0x20(%r12,%r9,8),%rcx | |
0.04% │││ │ 0x00007f8f9824d755: popcnt 0x28(%r12,%r9,8),%r8 | |
0.09% │││ │ 0x00007f8f9824d75c: popcnt 0x18(%r12,%r9,8),%r11 | |
0.10% │││ │ 0x00007f8f9824d763: add %r11d,%r10d | |
4.29% │││ │ 0x00007f8f9824d766: add %r10d,%ecx | |
0.15% │││ │ 0x00007f8f9824d769: add %ecx,%r8d | |
4.18% │││ │ 0x00007f8f9824d76c: vmovdqu %ymm3,0x10(%r12,%r9,8) | |
0.04% │││ │ 0x00007f8f9824d773: vmovdqu 0x50(%rax,%rbp,8),%ymm3 | |
0.02% │││ │ 0x00007f8f9824d779: vpand 0x50(%r13,%rbp,8),%ymm3,%ymm3 | |
4.91% │││ │ 0x00007f8f9824d780: popcnt 0x20(%r12,%r9,8),%rcx | |
4.11% │││ │ 0x00007f8f9824d787: popcnt 0x18(%r12,%r9,8),%r10 | |
0.06% │││ │ 0x00007f8f9824d78e: popcnt 0x10(%r12,%r9,8),%r11 | |
0.26% │││ │ 0x00007f8f9824d795: add %r11d,%r8d | |
0.09% │││ │ 0x00007f8f9824d798: add %r10d,%r8d | |
3.92% │││ │ 0x00007f8f9824d79b: add %ecx,%r8d | |
0.19% │││ │ 0x00007f8f9824d79e: popcnt 0x28(%r12,%r9,8),%rsi | |
0.10% │││ │ 0x00007f8f9824d7a5: add %r8d,%esi | |
4.07% │││ │ 0x00007f8f9824d7a8: vmovdqu %ymm3,0x10(%r12,%r9,8) | |
0.11% │││ │ 0x00007f8f9824d7af: vmovdqu 0x70(%rax,%rbp,8),%ymm3 | |
0.09% │││ │ 0x00007f8f9824d7b5: vpand 0x70(%r13,%rbp,8),%ymm3,%ymm3 | |
5.07% │││ │ 0x00007f8f9824d7bc: popcnt 0x18(%r12,%r9,8),%r11 | |
0.25% │││ │ 0x00007f8f9824d7c3: popcnt 0x10(%r12,%r9,8),%r10 | |
0.11% │││ │ 0x00007f8f9824d7ca: add %esi,%r10d | |
0.13% │││ │ 0x00007f8f9824d7cd: add %r11d,%r10d | |
3.96% │││ │ 0x00007f8f9824d7d0: popcnt 0x20(%r12,%r9,8),%r8 | |
0.08% │││ │ 0x00007f8f9824d7d7: add %r10d,%r8d | |
3.89% │││ │ 0x00007f8f9824d7da: popcnt 0x28(%r12,%r9,8),%r10 | |
8.26% │││ │ 0x00007f8f9824d7e1: add %r8d,%r10d | |
4.08% │││ │ 0x00007f8f9824d7e4: vmovdqu %ymm3,0x10(%r12,%r9,8) | |
0.01% │││ │ 0x00007f8f9824d7eb: popcnt 0x10(%r12,%r9,8),%rcx | |
0.52% │││ │ 0x00007f8f9824d7f2: add %r10d,%ecx | |
4.36% │││ │ 0x00007f8f9824d7f5: popcnt 0x28(%r12,%r9,8),%r8 | |
0.06% │││ │ 0x00007f8f9824d7fc: popcnt 0x20(%r12,%r9,8),%rsi | |
0.10% │││ │ 0x00007f8f9824d803: popcnt 0x18(%r12,%r9,8),%r10 | |
8.00% │││ │ 0x00007f8f9824d80a: add %r10d,%ecx | |
4.38% │││ │ 0x00007f8f9824d80d: add %esi,%ecx | |
4.35% │││ │ 0x00007f8f9824d80f: add %r8d,%ecx ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
4.04% │││ │ 0x00007f8f9824d812: add $0x10,%ebx | |
0.00% │││ │ 0x00007f8f9824d815: cmp %edi,%ebx | |
│╰│ │ 0x00007f8f9824d817: jl 0x00007f8f9824d720 ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@43 (line 61) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ │ 0x00007f8f9824d81d: mov 0x108(%r15),%r10 ; ImmutableOopMap{r9=NarrowOop rdx=Oop rax=Oop r13=Oop xmm0=Oop xmm1=Oop [88]=Oop [8]=Oop } | |
│ │ │ ;*goto {reexecute=1 rethrow=0 return_oop=0} | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@104 (line 60) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │ │ 0x00007f8f9824d824: test %eax,(%r10) ; {poll} | |
0.10% │ │ │ 0x00007f8f9824d827: cmp %edi,%ebx | |
│ ╰ │ 0x00007f8f9824d829: jl 0x00007f8f9824d708 | |
│ │ 0x00007f8f9824d82f: vmovq %xmm2,%rbp | |
│ │ 0x00007f8f9824d834: vmovd %xmm4,%r11d | |
│ ↗│ 0x00007f8f9824d839: cmp %r14d,%ebx | |
│ ╭ ││ 0x00007f8f9824d83c: jge 0x00007f8f9824d89a | |
0.07% │ │ ││ 0x00007f8f9824d83e: xchg %ax,%ax ;*iload_3 {reexecute=0 rethrow=0 return_oop=0} | |
│ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@17 (line 60) | |
│ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.07% │ │ ↗ ││ 0x00007f8f9824d840: cmp %r11d,%ebx | |
│ │ │ ││ 0x00007f8f9824d843: jge 0x00007f8f9824da7a | |
0.08% │ │ │ ││ 0x00007f8f9824d849: cmp 0x28(%rsp),%ebx | |
│ │ │ ││ 0x00007f8f9824d84d: jge 0x00007f8f9824dab6 | |
│ │ │ ││ 0x00007f8f9824d853: vmovdqu 0x10(%rax,%rbx,8),%ymm2 | |
0.07% │ │ │ ││ 0x00007f8f9824d859: vpand 0x10(%r13,%rbx,8),%ymm2,%ymm2 | |
0.17% │ │ │ ││ 0x00007f8f9824d860: vmovdqu %ymm2,0x10(%r12,%r9,8) | |
0.01% │ │ │ ││ 0x00007f8f9824d867: popcnt 0x28(%r12,%r9,8),%rdx | |
0.14% │ │ │ ││ 0x00007f8f9824d86e: popcnt 0x18(%r12,%r9,8),%r8 | |
0.01% │ │ │ ││ 0x00007f8f9824d875: popcnt 0x10(%r12,%r9,8),%rsi | |
0.07% │ │ │ ││ 0x00007f8f9824d87c: add %ecx,%esi | |
0.01% │ │ │ ││ 0x00007f8f9824d87e: add %r8d,%esi | |
0.14% │ │ │ ││ 0x00007f8f9824d881: popcnt 0x20(%r12,%r9,8),%r8 | |
0.11% │ │ │ ││ 0x00007f8f9824d888: add %esi,%r8d | |
0.15% │ │ │ ││ 0x00007f8f9824d88b: add %r8d,%edx | |
0.11% │ │ │ ││ 0x00007f8f9824d88e: add $0x4,%ebx | |
0.05% │ │ │ ││ 0x00007f8f9824d891: cmp %r14d,%ebx | |
│ │╭│ ││ 0x00007f8f9824d894: jge 0x00007f8f9824d89c | |
│ │││ ││ 0x00007f8f9824d896: mov %edx,%ecx | |
│ ││╰ ││ 0x00007f8f9824d898: jmp 0x00007f8f9824d840 | |
│ ↘│ ││ 0x00007f8f9824d89a: mov %ecx,%edx | |
│ ↘ ││ 0x00007f8f9824d89c: vmovq %xmm0,%rdi | |
│ ││ 0x00007f8f9824d8a1: vmovq %xmm1,%r10 | |
0.07% │ ││↗ 0x00007f8f9824d8a6: mov %r10,0x50(%rsp) | |
│ │││ 0x00007f8f9824d8ab: mov %rdi,0x60(%rsp) | |
│ │││ 0x00007f8f9824d8b0: mov 0x8(%rsp),%rsi ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007f8f9824d8b5: data16 xchg %ax,%ax | |
0.06% │ │││ 0x00007f8f9824d8b8: vzeroupper | |
0.08% │ │││ 0x00007f8f9824d8bb: callq 0x00007f8f90765f00 ; ImmutableOopMap{[80]=Oop [88]=Oop [96]=Oop [8]=Oop } | |
│ │││ ;*invokevirtual consume {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@20 (line 158) | |
│ │││ ; {optimized virtual_call} | |
0.01% │ │││ 0x00007f8f9824d8c0: mov 0x50(%rsp),%r10 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.07% │ │││ 0x00007f8f9824d8c5: movzbl 0x94(%r10),%r11d ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@43 (line 61) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007f8f9824d8cd: mov 0x108(%r15),%r8 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007f8f9824d8d4: add $0x1,%rbp ; ImmutableOopMap{r10=Oop [88]=Oop [96]=Oop [8]=Oop } | |
│ │││ ;*ifeq {reexecute=1 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@33 (line 160) | |
│ │││ 0x00007f8f9824d8d8: test %eax,(%r8) ; {poll} | |
0.05% │ │││ 0x00007f8f9824d8db: test %r11d,%r11d | |
│ │││ 0x00007f8f9824d8de: jne 0x00007f8f9824d9f7 | |
↘ │││ 0x00007f8f9824d8e4: mov 0x60(%rsp),%rdi ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │││ 0x00007f8f9824d8e9: mov 0xc(%rdi),%r14d ;*getfield size {reexecute=0 rethrow=0 return_oop=0} | |
│││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@11 (line 60) | |
│││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│││ 0x00007f8f9824d8ed: test %r14d,%r14d | |
╭ │││ 0x00007f8f9824d8f0: jle 0x00007f8f9824d9eb ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.08% │ │││ 0x00007f8f9824d8f6: mov 0x18(%rdi),%r9d ;*getfield buffer {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@1 (line 58) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ 0x00007f8f9824d8fa: mov 0x10(%rdi),%esi ;*getfield left {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@19 (line 60) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.00% │ │││ 0x00007f8f9824d8fd: mov 0xc(%r12,%rsi,8),%r11d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
│ │││ ; implicit exception: dispatches to 0x00007f8f9824da52 | |
0.01% │ │││ 0x00007f8f9824d902: mov 0x14(%rdi),%ecx ;*getfield right {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@28 (line 60) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0.06% │ │││ 0x00007f8f9824d905: mov 0xc(%r12,%rcx,8),%r8d ; implicit exception: dispatches to 0x00007f8f9824da52 | |
0.01% │ │││ 0x00007f8f9824d90a: mov %r8d,0x28(%rsp) | |
│ │││ 0x00007f8f9824d90f: mov 0xc(%r12,%r9,8),%ebx ; implicit exception: dispatches to 0x00007f8f9824da52 | |
│ │││ 0x00007f8f9824d914: cmp $0x3,%ebx | |
│ │││ 0x00007f8f9824d917: jbe 0x00007f8f9824da52 | |
0.08% │ │││ 0x00007f8f9824d91d: cmp $0x7ffffffc,%r14d | |
│ │││ 0x00007f8f9824d924: jg 0x00007f8f9824da2f | |
│ │││ 0x00007f8f9824d92a: mov $0x4,%r8d | |
│ │││ 0x00007f8f9824d930: cmp %r14d,%r8d | |
│ │││ 0x00007f8f9824d933: cmovg %r14d,%r8d | |
0.06% │ │││ 0x00007f8f9824d937: lea (%r12,%rsi,8),%r13 | |
│ │││ 0x00007f8f9824d93b: lea (%r12,%rcx,8),%rax | |
│ │││ 0x00007f8f9824d93f: xor %ebx,%ebx | |
0.00% │ │││ 0x00007f8f9824d941: xor %esi,%esi | |
0.05% │ ↗│││ 0x00007f8f9824d943: cmp %r11d,%ebx | |
│ ││││ 0x00007f8f9824d946: jge 0x00007f8f9824da76 | |
│ ││││ 0x00007f8f9824d94c: cmp 0x28(%rsp),%ebx | |
│ ││││ 0x00007f8f9824d950: jge 0x00007f8f9824dab2 | |
│ ││││ 0x00007f8f9824d956: vmovq %r10,%xmm1 | |
│ ││││ 0x00007f8f9824d95b: vmovq %rdi,%xmm0 | |
0.05% │ ││││ 0x00007f8f9824d960: vmovdqu 0x10(%rax,%rbx,8),%ymm2 | |
0.00% │ ││││ 0x00007f8f9824d966: vpand 0x10(%r13,%rbx,8),%ymm2,%ymm2 | |
0.08% │ ││││ 0x00007f8f9824d96d: vmovdqu %ymm2,0x10(%r12,%r9,8) | |
0.06% │ ││││ 0x00007f8f9824d974: popcnt 0x28(%r12,%r9,8),%rcx | |
0.34% │ ││││ 0x00007f8f9824d97b: popcnt 0x18(%r12,%r9,8),%r10 | |
0.06% │ ││││ 0x00007f8f9824d982: popcnt 0x10(%r12,%r9,8),%rdi | |
0.02% │ ││││ 0x00007f8f9824d989: add %esi,%edi | |
0.08% │ ││││ 0x00007f8f9824d98b: add %r10d,%edi | |
0.07% │ ││││ 0x00007f8f9824d98e: popcnt 0x20(%r12,%r9,8),%rsi | |
│ ││││ 0x00007f8f9824d995: add %edi,%esi | |
0.07% │ ││││ 0x00007f8f9824d997: add %esi,%ecx | |
0.08% │ ││││ 0x00007f8f9824d999: add $0x4,%ebx | |
│ ││││ 0x00007f8f9824d99c: cmp %r8d,%ebx | |
│╭││││ 0x00007f8f9824d99f: jge 0x00007f8f9824d9af | |
││││││ 0x00007f8f9824d9a1: mov %ecx,%esi | |
││││││ 0x00007f8f9824d9a3: vmovq %xmm0,%rdi | |
││││││ 0x00007f8f9824d9a8: vmovq %xmm1,%r10 | |
││╰│││ 0x00007f8f9824d9ad: jmp 0x00007f8f9824d943 | |
│↘ │││ 0x00007f8f9824d9af: cmp 0x28(%rsp),%r11d | |
│ │││ 0x00007f8f9824d9b4: mov %r11d,%r8d | |
0.06% │ │││ 0x00007f8f9824d9b7: mov 0x28(%rsp),%r10d | |
0.00% │ │││ 0x00007f8f9824d9bc: cmovg %r10d,%r8d | |
0.00% │ │││ 0x00007f8f9824d9c0: cmp %r14d,%r8d | |
│ │││ 0x00007f8f9824d9c3: cmovg %r14d,%r8d | |
0.07% │ │││ 0x00007f8f9824d9c7: mov %r8d,%edi | |
│ │││ 0x00007f8f9824d9ca: add $0xfffffff4,%edi | |
│ │││ 0x00007f8f9824d9cd: cmp %edi,%r8d | |
│ │││ 0x00007f8f9824d9d0: mov $0x80000000,%r10d | |
0.06% │ │││ 0x00007f8f9824d9d6: cmovl %r10d,%edi | |
0.00% │ │││ 0x00007f8f9824d9da: cmp %edi,%ebx | |
│ ╰││ 0x00007f8f9824d9dc: jge 0x00007f8f9824d839 | |
│ ││ 0x00007f8f9824d9e2: lea (%r12,%r9,8),%rdx | |
│ ╰│ 0x00007f8f9824d9e6: jmpq 0x00007f8f9824d712 | |
↘ │ 0x00007f8f9824d9eb: xor %edx,%edx | |
╰ 0x00007f8f9824d9ed: jmpq 0x00007f8f9824d8a6 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcnt@62 (line 61) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@17 (line 158) | |
0x00007f8f9824d9f2: mov $0x1,%ebp ;*aload_1 {reexecute=0 rethrow=0 return_oop=0} | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub@36 (line 161) | |
.................................................................................................... | |
96.05% <total for region 1> | |
....[Hottest Regions]............................................................................... | |
96.05% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub, version 662 (712 bytes) | |
1.04% [kernel.kallsyms] native_write_msr (5 bytes) | |
0.68% [kernel.kallsyms] dw_readl (0 bytes) | |
0.30% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 634 (48 bytes) | |
0.12% [kernel.kallsyms] syscall_return_via_sysret (1 bytes) | |
0.11% kernel [unknown] (70 bytes) | |
0.11% [kernel.kallsyms] idma64_irq (13 bytes) | |
0.06% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (30 bytes) | |
0.04% [kernel.kallsyms] __intel_pmu_enable_all (0 bytes) | |
0.03% [kernel.kallsyms] _raw_spin_lock_irqsave (7 bytes) | |
0.03% [kernel.kallsyms] do_syscall_64 (38 bytes) | |
0.03% [kernel.kallsyms] irq_entries_start (0 bytes) | |
0.03% [kernel.kallsyms] switch_to_thread_stack (3 bytes) | |
0.02% [kernel.kallsyms] mutex_unlock (0 bytes) | |
0.02% [kernel.kallsyms] swapgs_restore_regs_and_return_to_usermode (2 bytes) | |
0.02% libjvm.so _ZN10fileStream5writeEPKcm (25 bytes) | |
0.02% [kernel.kallsyms] __task_pid_nr_ns (8 bytes) | |
0.02% [kernel.kallsyms] __wake_up_common (44 bytes) | |
0.02% [kernel.kallsyms] __fget (23 bytes) | |
0.02% [kernel.kallsyms] mutex_lock (0 bytes) | |
1.20% <...other 192 warm regions...> | |
.................................................................................................... | |
99.99% <totals> | |
....[Hottest Methods (after inlining)].............................................................. | |
96.05% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcnt_jmhTest::vpandStorePopcnt_thrpt_jmhStub, version 662 | |
1.04% [kernel.kallsyms] native_write_msr | |
0.68% [kernel.kallsyms] dw_readl | |
0.30% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 634 | |
0.13% [kernel.kallsyms] syscall_return_via_sysret | |
0.12% kernel [unknown] | |
0.11% [kernel.kallsyms] idma64_irq | |
0.07% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable | |
0.04% [kernel.kallsyms] __intel_pmu_enable_all | |
0.04% hsdis-amd64.so print_insn | |
0.03% libc-2.23.so vfprintf | |
0.03% [kernel.kallsyms] _raw_spin_lock_irqsave | |
0.03% [kernel.kallsyms] do_syscall_64 | |
0.03% [kernel.kallsyms] switch_to_thread_stack | |
0.03% [kernel.kallsyms] irq_entries_start | |
0.03% [kernel.kallsyms] swapgs_restore_regs_and_return_to_usermode | |
0.02% [kernel.kallsyms] mutex_unlock | |
0.02% [kernel.kallsyms] __wake_up_common | |
0.02% libc-2.23.so __GI___libc_write | |
0.02% [kernel.kallsyms] __task_pid_nr_ns | |
1.12% <...other 158 warm methods...> | |
.................................................................................................... | |
99.99% <totals> | |
....[Distribution by Source]........................................................................ | |
96.35% c2, level 4 | |
2.82% [kernel.kallsyms] | |
0.40% libjvm.so | |
0.16% libc-2.23.so | |
0.12% kernel | |
0.06% hsdis-amd64.so | |
0.02% interpreter | |
0.01% ld-2.23.so | |
0.01% libpthread-2.23.so | |
0.00% | |
0.00% c1, level 3 | |
0.00% [vdso] | |
0.00% perf-2165.map | |
.................................................................................................... | |
99.99% <totals> | |
# JMH version: 1.20 | |
# VM version: JDK 12-internal, VM 12-internal+0-adhoc.root.dev | |
# VM invoker: /home/richard/workspace/dev/build/linux-x86_64-normal-server-release/images/jdk/bin/java | |
# VM options: --add-modules=jdk.incubator.vector -Djdk.incubator.vector.VECTOR_ACCESS_OOB_CHECK=0 | |
# Warmup: 20 iterations, 1 s each | |
# Measurement: 20 iterations, 1 s each | |
# Timeout: 10 min per iteration | |
# Threads: 1 thread, will synchronize iterations | |
# Benchmark mode: Throughput, ops/time | |
# Benchmark: com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandStorePopcntUnrolled | |
# Parameters: (size = 1024) | |
# Run progress: 83.33% complete, ETA 00:00:44 | |
# Fork: 1 of 1 | |
# Preparing profilers: LinuxPerfAsmProfiler | |
# Profilers consume stdout and stderr from target VM, use -v EXTRA to copy to console | |
# Warmup Iteration 1: 1274.947 ops/ms | |
# Warmup Iteration 2: 2165.198 ops/ms | |
# Warmup Iteration 3: 2243.023 ops/ms | |
# Warmup Iteration 4: 2428.626 ops/ms | |
# Warmup Iteration 5: 2481.234 ops/ms | |
# Warmup Iteration 6: 2283.325 ops/ms | |
# Warmup Iteration 7: 2209.925 ops/ms | |
# Warmup Iteration 8: 2155.251 ops/ms | |
# Warmup Iteration 9: 2125.849 ops/ms | |
# Warmup Iteration 10: 2061.766 ops/ms | |
# Warmup Iteration 11: 2203.729 ops/ms | |
# Warmup Iteration 12: 2281.168 ops/ms | |
# Warmup Iteration 13: 2467.025 ops/ms | |
# Warmup Iteration 14: 2410.949 ops/ms | |
# Warmup Iteration 15: 2312.173 ops/ms | |
# Warmup Iteration 16: 2257.778 ops/ms | |
# Warmup Iteration 17: 2251.022 ops/ms | |
# Warmup Iteration 18: 2224.523 ops/ms | |
# Warmup Iteration 19: 2164.816 ops/ms | |
# Warmup Iteration 20: 2251.604 ops/ms | |
Iteration 1: 2280.876 ops/ms | |
Iteration 2: 2285.586 ops/ms | |
Iteration 3: 2234.384 ops/ms | |
Iteration 4: 2209.777 ops/ms | |
Iteration 5: 2254.041 ops/ms | |
Iteration 6: 2345.371 ops/ms | |
Iteration 7: 2457.299 ops/ms | |
Iteration 8: 2332.266 ops/ms | |
Iteration 9: 2361.203 ops/ms | |
Iteration 10: 2168.786 ops/ms | |
Iteration 11: 2401.188 ops/ms | |
Iteration 12: 2276.684 ops/ms | |
Iteration 13: 2198.189 ops/ms | |
Iteration 14: 2147.710 ops/ms | |
Iteration 15: 2295.649 ops/ms | |
Iteration 16: 2265.977 ops/ms | |
Iteration 17: 2150.995 ops/ms | |
Iteration 18: 2208.336 ops/ms | |
Iteration 19: 2004.620 ops/ms | |
Iteration 20: 2047.058 ops/ms | |
# Processing profiler results: LinuxPerfAsmProfiler | |
Result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandStorePopcntUnrolled": | |
2246.300 ±(99.9%) 96.287 ops/ms [Average] | |
(min, avg, max) = (2004.620, 2246.300, 2457.299), stdev = 110.885 | |
CI (99.9%): [2150.012, 2342.587] (assumes normal distribution) | |
Secondary result "com.openkappa.panama.vectorbenchmarks.IntersectionCardinality.vpandStorePopcntUnrolled:·asm": | |
PrintAssembly processed: 184834 total address lines. | |
Perf output processed (skipped 22.610 seconds): | |
Column 1: cycles (20425 events) | |
Hottest code regions (>10.00% "cycles" events): | |
....[Hottest Region 1].............................................................................. | |
c2, level 4, com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub, version 665 (1253 bytes) | |
0x00007f6d50030f20: jne 0x00007f6d5003143e | |
0x00007f6d50030f26: mov $0x1,%ebp | |
╭ 0x00007f6d50030f2b: jmpq 0x00007f6d500312c0 | |
│ ↗ 0x00007f6d50030f30: vmovd %xmm3,%esi | |
│ │ 0x00007f6d50030f34: vmovd %xmm2,%edx | |
│ │ 0x00007f6d50030f38: vmovd %xmm5,%ebx | |
│ │ 0x00007f6d50030f3c: mov 0x60(%rsp),%ebp | |
│ │ 0x00007f6d50030f40: vmovq %xmm4,%r10 | |
│ │ ↗ 0x00007f6d50030f45: vmovd %esi,%xmm3 | |
│ │ │ 0x00007f6d50030f49: vmovd %edx,%xmm2 | |
0.05% │ │ │ 0x00007f6d50030f4d: vmovd %ebx,%xmm5 | |
│ │ │ 0x00007f6d50030f51: mov %ebp,0x60(%rsp) | |
│ │ │ 0x00007f6d50030f55: vmovq %r10,%xmm4 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
│ │ │ 0x00007f6d50030f5a: nopw 0x0(%rax,%rax,1) ;*invokevirtual fromArray {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@53 (line 77) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.09% │↗│ │ 0x00007f6d50030f60: vmovdqu 0x10(%rax,%r9,8),%ymm6 | |
0.01% │││ │ 0x00007f6d50030f67: vpand 0x10(%r13,%r9,8),%ymm6,%ymm6 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
2.19% │││ │ 0x00007f6d50030f6e: vmovdqu %ymm6,0x10(%r12,%rcx,8) ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@40 (line 76) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.10% │││ │ 0x00007f6d50030f75: mov %r9d,%r10d | |
0.00% │││ │ 0x00007f6d50030f78: add $0x1c,%r10d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
2.07% │││ │ 0x00007f6d50030f7c: movslq %r10d,%r10 ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│││ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│││ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@65 (line 77) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.04% │││ │ 0x00007f6d50030f7f: vmovq %r10,%xmm6 | |
0.08% │││ │ 0x00007f6d50030f84: mov %r9d,%ebx | |
│││ │ 0x00007f6d50030f87: add $0x4,%ebx ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
2.15% │││ │ 0x00007f6d50030f8a: movslq %ebx,%r10 ;*getfield left {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@48 (line 77) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
│││ │ 0x00007f6d50030f8d: vmovdqu 0x10(%rax,%r10,8),%ymm7 | |
0.14% │││ │ 0x00007f6d50030f94: vpand 0x10(%r13,%r10,8),%ymm7,%ymm7 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
3.10% │││ │ 0x00007f6d50030f9b: vmovdqu %ymm7,0x30(%r12,%rcx,8) ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@40 (line 76) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.07% │││ │ 0x00007f6d50030fa2: movslq %r9d,%rbx | |
0.06% │││ │ 0x00007f6d50030fa5: vmovdqu 0x50(%rax,%rbx,8),%ymm7 | |
0.15% │││ │ 0x00007f6d50030fab: vpand 0x50(%r13,%rbx,8),%ymm7,%ymm7 | |
2.66% │││ │ 0x00007f6d50030fb2: popcnt 0x20(%r12,%rcx,8),%rsi | |
0.04% │││ │ 0x00007f6d50030fb9: add %r8d,%esi | |
0.14% │││ │ 0x00007f6d50030fbc: popcnt 0x38(%r12,%rcx,8),%r10 | |
3.25% │││ │ 0x00007f6d50030fc3: add %esi,%r10d | |
0.66% │││ │ 0x00007f6d50030fc6: popcnt 0x30(%r12,%rcx,8),%rsi | |
0.18% │││ │ 0x00007f6d50030fcd: popcnt 0x18(%r12,%rcx,8),%r8 | |
0.00% │││ │ 0x00007f6d50030fd4: add %r11d,%r8d | |
1.47% │││ │ 0x00007f6d50030fd7: add %esi,%r8d | |
0.72% │││ │ 0x00007f6d50030fda: popcnt 0x48(%r12,%rcx,8),%rdx | |
0.03% │││ │ 0x00007f6d50030fe1: add %r8d,%edx | |
0.47% │││ │ 0x00007f6d50030fe4: popcnt 0x40(%r12,%rcx,8),%rsi | |
1.57% │││ │ 0x00007f6d50030feb: popcnt 0x10(%r12,%rcx,8),%r11 | |
0.31% │││ │ 0x00007f6d50030ff2: add %edi,%r11d | |
0.06% │││ │ 0x00007f6d50030ff5: popcnt 0x28(%r12,%rcx,8),%r8 | |
0.53% │││ │ 0x00007f6d50030ffc: add %r11d,%r8d | |
1.97% │││ │ 0x00007f6d50030fff: add %esi,%r8d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.15% │││ │ 0x00007f6d50031002: vmovdqu %ymm7,0x10(%r12,%rcx,8) ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@31 (line 76) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% │││ │ 0x00007f6d50031009: mov %r9d,%r11d | |
0.02% │││ │ 0x00007f6d5003100c: add $0xc,%r11d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.81% │││ │ 0x00007f6d50031010: movslq %r11d,%r11 ;*invokevirtual fromArray {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@53 (line 77) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.16% │││ │ 0x00007f6d50031013: vmovdqu 0x10(%rax,%r11,8),%ymm7 | |
0.06% │││ │ 0x00007f6d5003101a: vpand 0x10(%r13,%r11,8),%ymm7,%ymm7 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.00% │││ │ 0x00007f6d50031021: vmovdqu %ymm7,0x30(%r12,%rcx,8) ;*arraylength {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@30 (line 76) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.12% │││ │ 0x00007f6d50031028: vmovdqu 0x90(%rax,%rbx,8),%ymm7 | |
0.10% │││ │ 0x00007f6d50031031: vpand 0x90(%r13,%rbx,8),%ymm7,%ymm7 | |
1.11% │││ │ 0x00007f6d5003103b: mov %r9d,%edi | |
1.13% │││ │ 0x00007f6d5003103e: add $0x14,%edi ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% │││ │ 0x00007f6d50031041: movslq %edi,%rbp ;*if_icmpge {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@21 (line 76) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.21% │││ │ 0x00007f6d50031044: popcnt 0x38(%r12,%rcx,8),%rdi | |
2.36% │││ │ 0x00007f6d5003104b: popcnt 0x30(%r12,%rcx,8),%r11 | |
1.53% │││ │ 0x00007f6d50031052: popcnt 0x18(%r12,%rcx,8),%rsi | |
0.04% │││ │ 0x00007f6d50031059: add %esi,%edx | |
0.09% │││ │ 0x00007f6d5003105b: add %r11d,%edx | |
1.45% │││ │ 0x00007f6d5003105e: popcnt 0x48(%r12,%rcx,8),%r11 | |
2.73% │││ │ 0x00007f6d50031065: add %edx,%r11d | |
1.30% │││ │ 0x00007f6d50031068: popcnt 0x40(%r12,%rcx,8),%rdx | |
0.58% │││ │ 0x00007f6d5003106f: popcnt 0x10(%r12,%rcx,8),%rsi | |
0.27% │││ │ 0x00007f6d50031076: add %esi,%r8d | |
0.21% │││ │ 0x00007f6d50031079: popcnt 0x28(%r12,%rcx,8),%rsi | |
1.44% │││ │ 0x00007f6d50031080: add %r8d,%esi | |
1.27% │││ │ 0x00007f6d50031083: add %edx,%esi | |
1.25% │││ │ 0x00007f6d50031085: popcnt 0x20(%r12,%rcx,8),%r8 | |
0.02% │││ │ 0x00007f6d5003108c: add %r10d,%r8d | |
1.69% │││ │ 0x00007f6d5003108f: add %r8d,%edi ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
1.18% │││ │ 0x00007f6d50031092: vmovdqu %ymm7,0x10(%r12,%rcx,8) ;*goto {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@193 (line 76) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.35% │││ │ 0x00007f6d50031099: vmovdqu 0x10(%rax,%rbp,8),%ymm7 | |
0.44% │││ │ 0x00007f6d5003109f: vpand 0x10(%r13,%rbp,8),%ymm7,%ymm7 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
2.28% │││ │ 0x00007f6d500310a6: vmovdqu %ymm7,0x30(%r12,%rcx,8) ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│││ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│││ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@53 (line 77) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.42% │││ │ 0x00007f6d500310ad: vmovdqu 0xd0(%rax,%rbx,8),%ymm7 | |
0.02% │││ │ 0x00007f6d500310b6: vpand 0xd0(%r13,%rbx,8),%ymm7,%ymm7 | |
1.79% │││ │ 0x00007f6d500310c0: popcnt 0x20(%r12,%rcx,8),%r10 | |
0.39% │││ │ 0x00007f6d500310c7: add %edi,%r10d | |
0.01% │││ │ 0x00007f6d500310ca: popcnt 0x38(%r12,%rcx,8),%rbx | |
1.65% │││ │ 0x00007f6d500310d1: add %r10d,%ebx | |
1.40% │││ │ 0x00007f6d500310d4: popcnt 0x30(%r12,%rcx,8),%r10 | |
1.50% │││ │ 0x00007f6d500310db: popcnt 0x18(%r12,%rcx,8),%r8 | |
0.00% │││ │ 0x00007f6d500310e2: add %r11d,%r8d | |
0.53% │││ │ 0x00007f6d500310e5: add %r10d,%r8d | |
1.58% │││ │ 0x00007f6d500310e8: popcnt 0x48(%r12,%rcx,8),%rbp | |
0.04% │││ │ 0x00007f6d500310ef: add %r8d,%ebp | |
1.79% │││ │ 0x00007f6d500310f2: popcnt 0x40(%r12,%rcx,8),%r10 | |
1.98% │││ │ 0x00007f6d500310f9: popcnt 0x10(%r12,%rcx,8),%rdi | |
0.07% │││ │ 0x00007f6d50031100: add %esi,%edi | |
0.04% │││ │ 0x00007f6d50031102: popcnt 0x28(%r12,%rcx,8),%r11 | |
0.05% │││ │ 0x00007f6d50031109: add %edi,%r11d | |
2.00% │││ │ 0x00007f6d5003110c: add %r10d,%r11d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.36% │││ │ 0x00007f6d5003110f: vmovdqu %ymm7,0x10(%r12,%rcx,8) ;*invokestatic linkToStatic {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│││ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@53 (line 77) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% │││ │ 0x00007f6d50031116: vmovq %xmm6,%r10 | |
0.01% │││ │ 0x00007f6d5003111b: vmovdqu 0x10(%rax,%r10,8),%ymm7 | |
1.78% │││ │ 0x00007f6d50031122: vpand 0x10(%r13,%r10,8),%ymm7,%ymm6 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.53% │││ │ 0x00007f6d50031129: vmovdqu %ymm6,0x30(%r12,%rcx,8) ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│││ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│││ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│││ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@53 (line 77) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% │││ │ 0x00007f6d50031130: popcnt 0x10(%r12,%rcx,8),%rdi | |
1.77% │││ │ 0x00007f6d50031137: add %r11d,%edi | |
0.45% │││ │ 0x00007f6d5003113a: popcnt 0x38(%r12,%rcx,8),%r11 | |
2.70% │││ │ 0x00007f6d50031141: popcnt 0x20(%r12,%rcx,8),%r8 | |
0.01% │││ │ 0x00007f6d50031148: add %ebx,%r8d | |
0.06% │││ │ 0x00007f6d5003114b: add %r11d,%r8d | |
2.14% │││ │ 0x00007f6d5003114e: popcnt 0x48(%r12,%rcx,8),%r10 | |
0.07% │││ │ 0x00007f6d50031155: popcnt 0x30(%r12,%rcx,8),%rdx | |
0.20% │││ │ 0x00007f6d5003115c: popcnt 0x18(%r12,%rcx,8),%r11 | |
4.15% │││ │ 0x00007f6d50031163: add %ebp,%r11d | |
2.32% │││ │ 0x00007f6d50031166: add %edx,%r11d | |
2.08% │││ │ 0x00007f6d50031169: add %r10d,%r11d | |
2.26% │││ │ 0x00007f6d5003116c: popcnt 0x40(%r12,%rcx,8),%r10 | |
0.00% │││ │ 0x00007f6d50031173: popcnt 0x28(%r12,%rcx,8),%rdx | |
0.02% │││ │ 0x00007f6d5003117a: add %edx,%edi | |
0.03% │││ │ 0x00007f6d5003117c: add %r10d,%edi ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│││ │ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│││ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
2.20% │││ │ 0x00007f6d5003117f: add $0x20,%r9d | |
0.00% │││ │ 0x00007f6d50031183: cmp 0x60(%rsp),%r9d | |
│╰│ │ 0x00007f6d50031188: jl 0x00007f6d50030f60 ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │ │ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │ │ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │ │ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@65 (line 77) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
│ │ │ 0x00007f6d5003118e: mov 0x108(%r15),%r10 ; ImmutableOopMap{rcx=NarrowOop rax=Oop r13=Oop r14=Oop xmm0=Oop xmm4=Oop [104]=Oop [8]=Oop } | |
│ │ │ ;*goto {reexecute=1 rethrow=0 return_oop=0} | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@193 (line 76) | |
│ │ │ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% │ │ │ 0x00007f6d50031195: test %eax,(%r10) ; {poll} | |
0.06% │ │ │ 0x00007f6d50031198: cmp 0x60(%rsp),%r9d | |
│ ╰ │ 0x00007f6d5003119d: jl 0x00007f6d50030f30 | |
│ │ 0x00007f6d500311a3: vmovd %xmm3,%esi | |
│ │ 0x00007f6d500311a7: vmovd %xmm2,%edx | |
│ │ 0x00007f6d500311ab: vmovd %xmm5,%ebx | |
0.07% │ ↗│ 0x00007f6d500311af: cmp %esi,%r9d | |
│ ╭ ││ 0x00007f6d500311b2: jge 0x00007f6d50031270 | |
0.03% │ │ ↗ ││ 0x00007f6d500311b8: cmp %edx,%r9d | |
│ │ │ ││ 0x00007f6d500311bb: jge 0x00007f6d500314c6 | |
0.09% │ │ │ ││ 0x00007f6d500311c1: cmp %ebx,%r9d | |
│ │ │ ││ 0x00007f6d500311c4: jge 0x00007f6d50031506 | |
0.02% │ │ │ ││ 0x00007f6d500311ca: vmovd %ebx,%xmm4 | |
0.04% │ │ │ ││ 0x00007f6d500311ce: vmovd %edx,%xmm2 | |
0.02% │ │ │ ││ 0x00007f6d500311d2: mov %esi,%ebp | |
0.08% │ │ │ ││ 0x00007f6d500311d4: vmovdqu 0x10(%rax,%r9,8),%ymm3 | |
0.03% │ │ │ ││ 0x00007f6d500311db: vpand 0x10(%r13,%r9,8),%ymm3,%ymm3 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.10% │ │ │ ││ 0x00007f6d500311e2: vmovdqu %ymm3,0x10(%r12,%rcx,8) ;*ifnonnull {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - java.util.Objects::requireNonNull@1 (line 220) | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::and@1 (line 399) | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@104 (line 78) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.09% │ │ │ ││ 0x00007f6d500311e9: mov %r9d,%r10d | |
0.03% │ │ │ ││ 0x00007f6d500311ec: add $0x4,%r10d ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.05% │ │ │ ││ 0x00007f6d500311f0: movslq %r10d,%r10 ;*invokedynamic {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::and@22 (line 401) | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@104 (line 78) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
│ │ │ ││ 0x00007f6d500311f3: vmovdqu 0x10(%rax,%r10,8),%ymm3 | |
0.09% │ │ │ ││ 0x00007f6d500311fa: vpand 0x10(%r13,%r10,8),%ymm3,%ymm3 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.21% │ │ │ ││ 0x00007f6d50031201: vmovdqu %ymm3,0x30(%r12,%rcx,8) ;*invokestatic requireNonNull {reexecute=0 rethrow=0 return_oop=0} | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::and@1 (line 399) | |
│ │ │ ││ ; - jdk.incubator.vector.Long256Vector::and@2 (line 39) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@104 (line 78) | |
│ │ │ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% │ │ │ ││ 0x00007f6d50031208: popcnt 0x28(%r12,%rcx,8),%rdx | |
0.10% │ │ │ ││ 0x00007f6d5003120f: popcnt 0x20(%r12,%rcx,8),%r10 | |
0.03% │ │ │ ││ 0x00007f6d50031216: add %r8d,%r10d | |
0.06% │ │ │ ││ 0x00007f6d50031219: popcnt 0x38(%r12,%rcx,8),%r8 | |
0.28% │ │ │ ││ 0x00007f6d50031220: add %r10d,%r8d | |
0.14% │ │ │ ││ 0x00007f6d50031223: popcnt 0x30(%r12,%rcx,8),%r10 | |
0.07% │ │ │ ││ 0x00007f6d5003122a: popcnt 0x18(%r12,%rcx,8),%rsi | |
0.01% │ │ │ ││ 0x00007f6d50031231: add %r11d,%esi | |
0.01% │ │ │ ││ 0x00007f6d50031234: add %r10d,%esi | |
0.16% │ │ │ ││ 0x00007f6d50031237: popcnt 0x48(%r12,%rcx,8),%r11 | |
0.04% │ │ │ ││ 0x00007f6d5003123e: add %esi,%r11d | |
0.12% │ │ │ ││ 0x00007f6d50031241: popcnt 0x40(%r12,%rcx,8),%r10 | |
0.11% │ │ │ ││ 0x00007f6d50031248: popcnt 0x10(%r12,%rcx,8),%rbx | |
0.04% │ │ │ ││ 0x00007f6d5003124f: add %edi,%ebx | |
0.01% │ │ │ ││ 0x00007f6d50031251: add %ebx,%edx | |
0.03% │ │ │ ││ 0x00007f6d50031253: add %r10d,%edx | |
0.11% │ │ │ ││ 0x00007f6d50031256: add $0x8,%r9d | |
0.04% │ │ │ ││ 0x00007f6d5003125a: cmp %ebp,%r9d | |
│ │╭│ ││ 0x00007f6d5003125d: jge 0x00007f6d50031272 | |
0.00% │ │││ ││ 0x00007f6d5003125f: mov %edx,%edi | |
0.03% │ │││ ││ 0x00007f6d50031261: mov %ebp,%esi | |
0.14% │ │││ ││ 0x00007f6d50031263: vmovd %xmm2,%edx | |
0.03% │ │││ ││ 0x00007f6d50031267: vmovd %xmm4,%ebx | |
0.00% │ ││╰ ││ 0x00007f6d5003126b: jmpq 0x00007f6d500311b8 | |
│ ↘│ ││ 0x00007f6d50031270: mov %edi,%edx | |
0.02% │ ↘ ││ 0x00007f6d50031272: add %r11d,%edx ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ ││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ ││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.04% │ ││ 0x00007f6d50031275: add %r8d,%edx | |
0.05% │ ││ 0x00007f6d50031278: mov %r14,%r10 | |
0.01% │ ││ 0x00007f6d5003127b: vmovq %xmm0,%r8 | |
0.00% │ ││ 0x00007f6d50031280: vmovq %xmm1,%rbp | |
0.00% │ ││↗ 0x00007f6d50031285: mov %r8,0x60(%rsp) | |
0.05% │ │││ 0x00007f6d5003128a: mov %r10,0x70(%rsp) | |
0.01% │ │││ 0x00007f6d5003128f: mov 0x8(%rsp),%rsi | |
0.01% │ │││ 0x00007f6d50031294: vzeroupper | |
0.09% │ │││ 0x00007f6d50031297: callq 0x00007f6d48548f00 ; ImmutableOopMap{[96]=Oop [104]=Oop [112]=Oop [8]=Oop } | |
│ │││ ;*invokevirtual consume {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@20 (line 158) | |
│ │││ ; {optimized virtual_call} | |
0.04% │ │││ 0x00007f6d5003129c: mov 0x60(%rsp),%r8 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% │ │││ 0x00007f6d500312a1: movzbl 0x94(%r8),%r10d ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
│ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
│ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
│ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@65 (line 77) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% │ │││ 0x00007f6d500312a9: mov 0x108(%r15),%r11 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% │ │││ 0x00007f6d500312b0: add $0x1,%rbp ; ImmutableOopMap{r8=Oop [104]=Oop [112]=Oop [8]=Oop } | |
│ │││ ;*ifeq {reexecute=1 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@33 (line 160) | |
0.00% │ │││ 0x00007f6d500312b4: test %eax,(%r11) ; {poll} | |
│ │││ 0x00007f6d500312b7: test %r10d,%r10d | |
│ ╭ │││ 0x00007f6d500312ba: jne 0x00007f6d50031443 | |
0.01% ↘ │ │││ 0x00007f6d500312c0: mov 0x70(%rsp),%r10 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.05% │ │││ 0x00007f6d500312c5: mov 0xc(%r10),%edi ;*invokevirtual vpandStorePopcntUnrolled {reexecute=0 rethrow=0 return_oop=0} | |
│ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% │ │││ 0x00007f6d500312c9: test %edi,%edi | |
│╭ │││ 0x00007f6d500312cb: jle 0x00007f6d50031437 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.00% ││ │││ 0x00007f6d500312d1: mov 0x18(%r10),%ecx ;*new {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@0 | |
││ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
││ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@53 (line 77) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% ││ │││ 0x00007f6d500312d5: mov 0x10(%r10),%esi ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
││ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
││ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@65 (line 77) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.04% ││ │││ 0x00007f6d500312d9: mov 0xc(%r12,%rsi,8),%edx ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
││ │││ ; implicit exception: dispatches to 0x00007f6d5003149e | |
0.02% ││ │││ 0x00007f6d500312de: mov 0x14(%r10),%r9d ;*putfield arg$1 {reexecute=0 rethrow=0 return_oop=0} | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::<init>@6 | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species$$Lambda$46/0x00000008000dbc40::get$Lambda@5 | |
││ │││ ; - java.lang.invoke.DirectMethodHandle$Holder::invokeStatic@10 | |
││ │││ ; - java.lang.invoke.Invokers$Holder::linkToTargetMethod@5 | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@38 (line 1297) | |
││ │││ ; - jdk.incubator.vector.Long256Vector$Long256Species::fromArray@3 (line 1150) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@65 (line 77) | |
││ │││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.01% ││ │││ 0x00007f6d500312e2: mov 0xc(%r12,%r9,8),%ebx ; implicit exception: dispatches to 0x00007f6d5003149e | |
0.00% ││ │││ 0x00007f6d500312e7: mov 0xc(%r12,%rcx,8),%r11d ; implicit exception: dispatches to 0x00007f6d5003149e | |
0.04% ││ │││ 0x00007f6d500312ec: cmp $0x7,%r11d | |
││ │││ 0x00007f6d500312f0: jbe 0x00007f6d5003149e | |
0.01% ││ │││ 0x00007f6d500312f6: cmp $0x7ffffff8,%edi | |
││ │││ 0x00007f6d500312fc: jg 0x00007f6d5003147b | |
0.02% ││ │││ 0x00007f6d50031302: vmovq %r8,%xmm0 | |
││ │││ 0x00007f6d50031307: mov 0x70(%rsp),%r14 | |
0.02% ││ │││ 0x00007f6d5003130c: mov $0x8,%r11d | |
0.01% ││ │││ 0x00007f6d50031312: cmp %r11d,%edi | |
││ │││ 0x00007f6d50031315: mov %edi,%r10d | |
││ │││ 0x00007f6d50031318: mov $0x8,%r8d | |
0.05% ││ │││ 0x00007f6d5003131e: cmovg %r8d,%r10d | |
0.03% ││ │││ 0x00007f6d50031322: lea (%r12,%rsi,8),%r13 | |
││ │││ 0x00007f6d50031326: lea (%r12,%r9,8),%rax | |
││ │││ 0x00007f6d5003132a: xor %r9d,%r9d | |
0.03% ││ │││ 0x00007f6d5003132d: xor %esi,%esi | |
0.01% ││ │││ 0x00007f6d5003132f: xor %r11d,%r11d | |
0.00% ││ │││ 0x00007f6d50031332: xor %r8d,%r8d | |
││ ↗│││ 0x00007f6d50031335: cmp %edx,%r9d | |
││ ││││ 0x00007f6d50031338: jge 0x00007f6d500314c2 | |
0.03% ││ ││││ 0x00007f6d5003133e: cmp %ebx,%r9d | |
││ ││││ 0x00007f6d50031341: jge 0x00007f6d50031502 | |
0.03% ││ ││││ 0x00007f6d50031347: mov %ebx,0x64(%rsp) | |
0.01% ││ ││││ 0x00007f6d5003134b: vmovd %edx,%xmm3 | |
││ ││││ 0x00007f6d5003134f: mov %edi,0x60(%rsp) | |
0.05% ││ ││││ 0x00007f6d50031353: vmovq %rbp,%xmm1 | |
0.03% ││ ││││ 0x00007f6d50031358: vmovdqu 0x10(%rax,%r9,8),%ymm2 | |
0.02% ││ ││││ 0x00007f6d5003135f: vpand 0x10(%r13,%r9,8),%ymm2,%ymm2 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
││ ││││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.06% ││ ││││ 0x00007f6d50031366: vmovdqu %ymm2,0x10(%r12,%rcx,8) ;*ifnonnull {reexecute=0 rethrow=0 return_oop=0} | |
││ ││││ ; - java.util.Objects::requireNonNull@1 (line 220) | |
││ ││││ ; - jdk.incubator.vector.Long256Vector::intoArray@1 (line 653) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% ││ ││││ 0x00007f6d5003136d: mov %r9d,%edi | |
0.00% ││ ││││ 0x00007f6d50031370: add $0x4,%edi ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
││ ││││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.00% ││ ││││ 0x00007f6d50031373: movslq %edi,%rbx ;*getstatic VECTOR_ACCESS_OOB_CHECK {reexecute=0 rethrow=0 return_oop=0} | |
││ ││││ ; - jdk.incubator.vector.VectorIntrinsics::checkIndex@0 (line 366) | |
││ ││││ ; - jdk.incubator.vector.Long256Vector::intoArray@11 (line 654) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.03% ││ ││││ 0x00007f6d50031376: vmovdqu 0x10(%rax,%rbx,8),%ymm2 | |
0.00% ││ ││││ 0x00007f6d5003137c: vpand 0x10(%r13,%rbx,8),%ymm2,%ymm2 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
││ ││││ ; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.10% ││ ││││ 0x00007f6d50031383: vmovdqu %ymm2,0x30(%r12,%rcx,8) ;*invokestatic requireNonNull {reexecute=0 rethrow=0 return_oop=0} | |
││ ││││ ; - jdk.incubator.vector.Long256Vector::intoArray@1 (line 653) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
││ ││││ ; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
0.02% ││ ││││ 0x00007f6d5003138a: popcnt 0x28(%r12,%rcx,8),%rdi | |
0.03% ││ ││││ 0x00007f6d50031391: popcnt 0x20(%r12,%rcx,8),%rdx | |
0.04% ││ ││││ 0x00007f6d50031398: add %r8d,%edx | |
0.02% ││ ││││ 0x00007f6d5003139b: popcnt 0x38(%r12,%rcx,8),%r8 | |
0.22% ││ ││││ 0x00007f6d500313a2: add %edx,%r8d | |
0.08% ││ ││││ 0x00007f6d500313a5: popcnt 0x30(%r12,%rcx,8),%rdx | |
0.03% ││ ││││ 0x00007f6d500313ac: popcnt 0x18(%r12,%rcx,8),%rbx | |
││ ││││ 0x00007f6d500313b3: add %r11d,%ebx | |
0.04% ││ ││││ 0x00007f6d500313b6: add %edx,%ebx | |
0.15% ││ ││││ 0x00007f6d500313b8: popcnt 0x48(%r12,%rcx,8),%r11 | |
││ ││││ 0x00007f6d500313bf: add %ebx,%r11d | |
0.04% ││ ││││ 0x00007f6d500313c2: popcnt 0x40(%r12,%rcx,8),%rdx | |
0.03% ││ ││││ 0x00007f6d500313c9: popcnt 0x10(%r12,%rcx,8),%rbx | |
0.10% ││ ││││ 0x00007f6d500313d0: add %esi,%ebx | |
0.06% ││ ││││ 0x00007f6d500313d2: add %ebx,%edi | |
0.03% ││ ││││ 0x00007f6d500313d4: add %edx,%edi | |
0.08% ││ ││││ 0x00007f6d500313d6: add $0x8,%r9d | |
0.00% ││ ││││ 0x00007f6d500313da: cmp %r10d,%r9d | |
││╭││││ 0x00007f6d500313dd: jge 0x00007f6d500313f2 | |
│││││││ 0x00007f6d500313df: mov %edi,%esi | |
│││││││ 0x00007f6d500313e1: mov 0x60(%rsp),%edi | |
│││││││ 0x00007f6d500313e5: vmovd %xmm3,%edx | |
│││││││ 0x00007f6d500313e9: mov 0x64(%rsp),%ebx | |
│││╰│││ 0x00007f6d500313ed: jmpq 0x00007f6d50031335 | |
││↘ │││ 0x00007f6d500313f2: vmovd %xmm3,%edx | |
0.00% ││ │││ 0x00007f6d500313f6: cmp 0x64(%rsp),%edx | |
0.06% ││ │││ 0x00007f6d500313fa: mov %edx,%r10d | |
││ │││ 0x00007f6d500313fd: mov 0x64(%rsp),%ebx | |
││ │││ 0x00007f6d50031401: cmovg %ebx,%r10d | |
││ │││ 0x00007f6d50031405: cmp 0x60(%rsp),%r10d | |
0.07% ││ │││ 0x00007f6d5003140a: mov 0x60(%rsp),%esi | |
0.01% ││ │││ 0x00007f6d5003140e: cmovg %esi,%r10d | |
││ │││ 0x00007f6d50031412: mov %r10d,%ebp | |
0.00% ││ │││ 0x00007f6d50031415: add $0xffffffe8,%ebp | |
0.10% ││ │││ 0x00007f6d50031418: cmp %ebp,%r10d | |
0.16% ││ │││ 0x00007f6d5003141b: mov $0x80000000,%r10d | |
││ │││ 0x00007f6d50031421: cmovl %r10d,%ebp | |
0.00% ││ │││ 0x00007f6d50031425: cmp %ebp,%r9d | |
││ ╰││ 0x00007f6d50031428: jge 0x00007f6d500311af | |
0.07% ││ ││ 0x00007f6d5003142e: lea (%r12,%rcx,8),%r10 | |
0.01% ││ ╰│ 0x00007f6d50031432: jmpq 0x00007f6d50030f45 | |
│↘ │ 0x00007f6d50031437: xor %edx,%edx | |
│ ╰ 0x00007f6d50031439: jmpq 0x00007f6d50031285 | |
│ 0x00007f6d5003143e: mov $0x1,%ebp | |
↘ 0x00007f6d50031443: vzeroupper | |
0x00007f6d50031446: movabs $0x7f6d66344390,%r10 | |
0x00007f6d50031450: callq *%r10 | |
0x00007f6d50031453: mov 0x68(%rsp),%r10 ;*invokestatic store {reexecute=0 rethrow=0 return_oop=0} | |
; - jdk.incubator.vector.Long256Vector::intoArray@43 (line 655) | |
; - com.openkappa.panama.vectorbenchmarks.IntersectionCardinality::vpandStorePopcntUnrolled@109 (line 78) | |
; - com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub@17 (line 158) | |
.................................................................................................... | |
96.64% <total for region 1> | |
....[Hottest Regions]............................................................................... | |
96.64% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub, version 665 (1253 bytes) | |
1.22% [kernel.kallsyms] native_write_msr (5 bytes) | |
0.29% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 637 (55 bytes) | |
0.15% [kernel.kallsyms] dw_readl (0 bytes) | |
0.13% [kernel.kallsyms] syscall_return_via_sysret (3 bytes) | |
0.09% [kernel.kallsyms] __intel_pmu_enable_all (0 bytes) | |
0.07% kernel [unknown] (37 bytes) | |
0.05% [kernel.kallsyms] mutex_unlock (0 bytes) | |
0.05% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable (29 bytes) | |
0.04% kernel [unknown] (37 bytes) | |
0.03% kernel [unknown] (0 bytes) | |
0.03% [kernel.kallsyms] mutex_lock (0 bytes) | |
0.02% libjvm.so _ZN9xmlStream10write_textEPKcm (22 bytes) | |
0.02% libc-2.23.so __GI___libc_write (15 bytes) | |
0.02% [kernel.kallsyms] entry_SYSCALL_64_stage2 (1 bytes) | |
0.02% [kernel.kallsyms] idma64_irq (0 bytes) | |
0.02% libc-2.23.so __libc_disable_asynccancel (5 bytes) | |
0.01% kernel [unknown] (30 bytes) | |
0.01% [kernel.kallsyms] io_apic_sync (0 bytes) | |
0.01% [kernel.kallsyms] sys_futex (0 bytes) | |
1.05% <...other 183 warm regions...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Hottest Methods (after inlining)].............................................................. | |
96.64% c2, level 4 com.openkappa.panama.vectorbenchmarks.generated.IntersectionCardinality_vpandStorePopcntUnrolled_jmhTest::vpandStorePopcntUnrolled_thrpt_jmhStub, version 665 | |
1.22% [kernel.kallsyms] native_write_msr | |
0.29% c2, level 4 org.openjdk.jmh.infra.Blackhole::consume, version 637 | |
0.17% kernel [unknown] | |
0.15% [kernel.kallsyms] dw_readl | |
0.14% [kernel.kallsyms] syscall_return_via_sysret | |
0.09% [kernel.kallsyms] __intel_pmu_enable_all | |
0.05% [kernel.kallsyms] mutex_unlock | |
0.05% libjvm.so _ZN14ElfSymbolTable6lookupEPhPiS1_S1_P16ElfFuncDescTable | |
0.03% libjvm.so _ZN9xmlStream10write_textEPKcm | |
0.03% [kernel.kallsyms] mutex_lock | |
0.03% [kernel.kallsyms] pipe_write | |
0.02% libc-2.23.so __GI___libc_write | |
0.02% libc-2.23.so __libc_disable_asynccancel | |
0.02% [kernel.kallsyms] entry_SYSCALL_64_stage2 | |
0.02% libc-2.23.so _IO_fwrite | |
0.02% libjvm.so _ZN10fileStream5writeEPKcm | |
0.02% [kernel.kallsyms] idma64_irq | |
0.02% [kernel.kallsyms] copy_page_from_iter | |
0.02% [kernel.kallsyms] sys_futex | |
0.94% <...other 147 warm methods...> | |
.................................................................................................... | |
100.00% <totals> | |
....[Distribution by Source]........................................................................ | |
96.94% c2, level 4 | |
2.24% [kernel.kallsyms] | |
0.34% libjvm.so | |
0.18% libc-2.23.so | |
0.17% kernel | |
0.05% interpreter | |
0.03% hsdis-amd64.so | |
0.02% libpthread-2.23.so | |
0.01% ld-2.23.so | |
0.01% c1, level 3 | |
0.00% libjimage.so | |
.................................................................................................... | |
100.00% <totals> | |
# Run complete. Total time: 00:04:24 | |
Benchmark                                                (size)   Mode  Cnt     Score    Error   Units | |
IntersectionCardinality.popcnt                             1024  thrpt   20  1904.524 ± 96.763  ops/ms | |
IntersectionCardinality.popcnt:·asm                        1024  thrpt            NaN              --- | |
IntersectionCardinality.unrolledPopcnt                     1024  thrpt   20  2001.611 ± 40.978  ops/ms | |
IntersectionCardinality.unrolledPopcnt:·asm                1024  thrpt            NaN              --- | |
IntersectionCardinality.vpandExtractPopcnt                 1024  thrpt   20  1765.195 ± 46.484  ops/ms | |
IntersectionCardinality.vpandExtractPopcnt:·asm            1024  thrpt            NaN              --- | |
IntersectionCardinality.vpandExtractPopcntUnrolled         1024  thrpt   20  1846.487 ± 18.955  ops/ms | |
IntersectionCardinality.vpandExtractPopcntUnrolled:·asm    1024  thrpt            NaN              --- | |
IntersectionCardinality.vpandStorePopcnt                   1024  thrpt   20  2119.727 ± 88.639  ops/ms | |
IntersectionCardinality.vpandStorePopcnt:·asm              1024  thrpt            NaN              --- | |
IntersectionCardinality.vpandStorePopcntUnrolled           1024  thrpt   20  2246.300 ± 96.287  ops/ms | |
IntersectionCardinality.vpandStorePopcntUnrolled:·asm      1024  thrpt            NaN              --- |