@dougallj
Last active November 6, 2023 19:20
Raw data. These were dumped from iPhones/iPads using wall-clock timers,
not performance counters. They likely contain some issues and
inconsistencies that haven't been fully investigated. The numbers are
mostly correct, but it's worth double-checking anything odd. (For
example, "TBL (two register table)" can sometimes have better throughput
than is listed, as can some other three-operand SIMD instructions, iirc.)
The goal is to find the fastest rate at which an instruction can run. If
there are multiple rows with the same label, the "correct" value is the
minimum. For example:
ADCS (64-bit) 1->2 1.000200
ADCS (64-bit) 1->3 1.001804
ADCS (64-bit) 1->4 1.000273
ADCS (64-bit) 4->2 0.998992
ADCS (64-bit) 4->3 1.003760
ADCS (64-bit) 4->4 1.000002
ADCS (64-bit) throughput 0.668505
ADCS (64-bit) throughput 0.334375
The first six rows show the latencies from each output operand to each
input operand. In this case every path takes one cycle (plus some
noise), regardless of operand, so ADCS has 1c latency.
The last two rows have the same label because the script used two
different methods to try to measure the throughput. In that case, take
the minimum. Here, that's 0.334375 cycles per instruction (3 operations
per cycle, plus some noise).
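The "take the minimum per label" rule can be applied mechanically. The following is a minimal sketch, not the script that produced this data. It assumes every data row has the shape "label, then either N->M or the word throughput, then a decimal number", and the names ROW and summarize are made up for illustration.

```python
import re
from collections import defaultdict

# Sample rows copied from the ADCS example above.
SAMPLE = """\
ADCS (64-bit) 1->2 1.000200
ADCS (64-bit) 1->3 1.001804
ADCS (64-bit) 1->4 1.000273
ADCS (64-bit) 4->2 0.998992
ADCS (64-bit) 4->3 1.003760
ADCS (64-bit) 4->4 1.000002
ADCS (64-bit) throughput 0.668505
ADCS (64-bit) throughput 0.334375
"""

# Assumed row shape: "<label> <N->M | throughput> <value>".
ROW = re.compile(
    r"^(?P<name>.+?) (?P<kind>\d+->\d+|throughput) (?P<value>\d+\.\d+)$"
)


def summarize(text):
    """Return {(label, kind): minimum value} over rows sharing a label."""
    best = defaultdict(lambda: float("inf"))
    for line in text.splitlines():
        m = ROW.match(line.strip())
        if m is None:
            continue  # skip prose, URLs, and blank lines
        key = (m["name"], m["kind"])
        best[key] = min(best[key], float(m["value"]))
    return dict(best)


summary = summarize(SAMPLE)
tput = summary[("ADCS (64-bit)", "throughput")]
ops_per_cycle = round(1 / tput)   # reciprocal of cycles per instruction
latency = round(summary[("ADCS (64-bit)", "1->2")])
print(tput, ops_per_cycle, latency)
```

Rounding hides the measurement noise, so it is only a reading aid. Values that sit far from a clean fraction (for example 1.47 or 3.10) are better inspected by hand than rounded.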
The experiment details will typically match those linked at:
https://dougallj.github.io/applecpu/firestorm-int.html
For some more useful notes on A15/A16, see:
https://twitter.com/dougallj/status/1534002276091629569
https://twitter.com/dougallj/status/1581109455269556224
Some operations added since A14/M1 are listed in a separate file at the
top (not for any particular reason). I recall checking the uop counts
using Instruments.app, but the results weren't surprising: they were
just throughput/0.25.
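As a rough illustration of the throughput/0.25 rule of thumb, the sketch below applies it to a few of the newer SIMD operations. The throughput values are the per-label minimums from the listing that follows. The function name estimated_uops is hypothetical, and the results are estimates from that rule rather than measured uop counts.

```python
# Minimum throughput per label, taken from the rows listed in this file.
THROUGHPUT = {
    "USDOT (vector, 16B)": 0.250307,
    "SMMLA (vector)": 0.501499,
    "BFDOT (vector, 4S)": 1.001788,
    "BFMMLA": 2.002597,
}


def estimated_uops(throughput, per_uop=0.25):
    """Estimate uop count as throughput / 0.25, rounded to hide noise."""
    return round(throughput / per_uop)


uops = {name: estimated_uops(t) for name, t in THROUGHPUT.items()}
for name, count in uops.items():
    print(f"{name}: ~{count} uop(s)")
```

This only restates the note above in code. It assumes the rule holds for the operations in that separate list and says nothing about other instructions.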
USDOT (vector, 8B) 1->1 3.011408
USDOT (vector, 8B) 1->2 2.999861
USDOT (vector, 8B) 1->3 3.000007
USDOT (vector, 8B) throughput 0.250289
USDOT (vector, 8B) throughput 0.249937
USDOT (vector, 16B) 1->1 3.011412
USDOT (vector, 16B) 1->2 3.011416
USDOT (vector, 16B) 1->3 3.005572
USDOT (vector, 16B) throughput 0.250307
USDOT (vector, 16B) throughput 0.250705
USDOT (by element, 8B) 1->1 2.999560
USDOT (by element, 8B) 1->2 2.999421
USDOT (by element, 8B) 1->3 2.999712
USDOT (by element, 8B) throughput 0.250379
USDOT (by element, 8B) throughput 0.250677
USDOT (by element, 16B) 1->1 2.999133
USDOT (by element, 16B) 1->2 3.011844
USDOT (by element, 16B) 1->3 3.000281
USDOT (by element, 16B) throughput 0.251697
USDOT (by element, 16B) throughput 0.250786
SUDOT (by element, 8B) 1->1 2.999716
SUDOT (by element, 8B) 1->2 3.011997
SUDOT (by element, 8B) 1->3 3.012864
SUDOT (by element, 8B) throughput 0.250307
SUDOT (by element, 8B) throughput 0.250786
SUDOT (by element, 16B) 1->1 2.999560
SUDOT (by element, 16B) 1->2 2.999851
SUDOT (by element, 16B) 1->3 3.000149
SUDOT (by element, 16B) throughput 0.250362
SUDOT (by element, 16B) throughput 0.250749
SMMLA (vector) 1->1 6.012858
SMMLA (vector) 1->2 6.011124
SMMLA (vector) 1->3 6.013146
SMMLA (vector) throughput 0.501499
SMMLA (vector) throughput 0.526893
UMMLA (vector) 1->1 6.012425
UMMLA (vector) 1->2 6.014019
UMMLA (vector) 1->3 6.011566
UMMLA (vector) throughput 0.501572
UMMLA (vector) throughput 0.533232
USMMLA (vector) 1->1 6.013001
USMMLA (vector) 1->2 6.024128
USMMLA (vector) 1->3 6.013001
USMMLA (vector) throughput 0.501571
USMMLA (vector) throughput 0.524005
BFCVT (scalar) 1->2 3.000572
BFCVT (scalar) throughput 0.250072
BFCVTN (vector, 4H) 1->2 2.998987
BFCVTN (vector, 4H) throughput 0.249982
BFCVTN2 (vector, 8H) 1->1 3.011275
BFCVTN2 (vector, 8H) 1->2 3.011997
BFCVTN2 (vector, 8H) throughput 0.250397
BFCVTN2 (vector, 8H) throughput 0.250750
BFDOT (by element, 2S) 1->1 3.000295
BFDOT (by element, 2S) 1->2 10.022082
BFDOT (by element, 2S) 1->3 10.010851
BFDOT (by element, 2S) throughput 0.751477
BFDOT (by element, 2S) throughput 0.752962
BFDOT (by element, 4S) 1->1 3.011704
BFDOT (by element, 4S) 1->2 10.023098
BFDOT (by element, 4S) 1->3 10.057951
BFDOT (by element, 4S) throughput 1.001733
BFDOT (by element, 4S) throughput 1.002404
BFDOT (vector, 2S) 1->1 3.001285
BFDOT (vector, 2S) 1->2 10.016038
BFDOT (vector, 2S) 1->3 10.024926
BFDOT (vector, 2S) throughput 0.751645
BFDOT (vector, 2S) throughput 0.746704
BFDOT (vector, 4S) 1->1 3.000149
BFDOT (vector, 4S) 1->2 10.015276
BFDOT (vector, 4S) 1->3 10.013981
BFDOT (vector, 4S) throughput 1.002420
BFDOT (vector, 4S) throughput 1.001788
BFMLALB 1->1 4.010544
BFMLALB 1->2 4.003002
BFMLALB 1->3 4.005643
BFMLALB throughput 0.250342
BFMLALB throughput 0.250740
BFMLALT 1->1 4.000135
BFMLALT 1->2 4.011854
BFMLALT 1->3 4.011854
BFMLALT throughput 0.250307
BFMLALT throughput 0.250823
BFMMLA 1->1 6.011416
BFMMLA 1->2 13.012826
BFMMLA 1->3 13.018100
BFMMLA throughput 2.002597
BFMMLA throughput 2.003052
ADC (32-bit) 1->2 1.000328
ADC (32-bit) 1->3 1.065060
ADC (32-bit) 1->4 1.026448
ADC (32-bit) throughput 0.501056
ADC (64-bit) 1->2 1.018105
ADC (64-bit) 1->3 1.000130
ADC (64-bit) 1->4 1.014143
ADC (64-bit) throughput 0.529629
ADCS (32-bit) 1->2 1.000064
ADCS (32-bit) 1->3 1.185754
ADCS (32-bit) 1->4 0.992562
ADCS (32-bit) 4->2 1.010281
ADCS (32-bit) 4->3 1.101900
ADCS (32-bit) 4->4 0.979480
ADCS (32-bit) throughput 1.470419
ADCS (32-bit) throughput 3.098103
ADCS (64-bit) 1->2 1.000459
ADCS (64-bit) 1->3 1.206819
ADCS (64-bit) 1->4 1.137327
ADCS (64-bit) 4->2 1.016239
ADCS (64-bit) 4->3 1.065384
ADCS (64-bit) 4->4 0.993688
ADCS (64-bit) throughput 1.628992
ADCS (64-bit) throughput 1.216955
ADD (sxtb, 32-bit) 1->2 2.305780
ADD (sxtb, 32-bit) 1->3 2.069723
ADD (sxtb, 32-bit) throughput 1.141637
ADD (sxtb, 64-bit) 1->2 2.008446
ADD (sxtb, 64-bit) 1->3 1.996458
ADD (sxtb, 64-bit) throughput 1.026466
ADD (uxtb, 32-bit) 1->2 2.045769
ADD (uxtb, 32-bit) 1->3 2.267341
ADD (uxtb, 32-bit) throughput 1.076846
ADD (uxtb, 64-bit) 1->2 2.248260
ADD (uxtb, 64-bit) 1->3 2.244789
ADD (uxtb, 64-bit) throughput 1.018928
ADD (sxth, 32-bit) 1->2 1.968989
ADD (sxth, 32-bit) 1->3 2.146923
ADD (sxth, 32-bit) throughput 1.025622
ADD (sxth, 64-bit) 1->2 1.638829
ADD (sxth, 64-bit) 1->3 5.096116
ADD (sxth, 64-bit) throughput 1.819292
ADD (uxth, 32-bit) 1->2 2.783994
ADD (uxth, 32-bit) 1->3 2.041513
ADD (uxth, 32-bit) throughput 1.338542
ADD (uxth, 64-bit) 1->2 2.267933
ADD (uxth, 64-bit) 1->3 2.148907
ADD (uxth, 64-bit) throughput 1.123890
ADD (sxtw, 32-bit) 1->2 1.118829
ADD (sxtw, 32-bit) 1->3 1.287855
ADD (sxtw, 32-bit) throughput 0.672615
ADD (sxtw, 64-bit) 1->2 1.981503
ADD (sxtw, 64-bit) 1->3 2.075520
ADD (sxtw, 64-bit) throughput 1.029882
ADD (uxtw, 32-bit) 1->2 1.003474
ADD (uxtw, 32-bit) 1->3 1.005366
ADD (uxtw, 32-bit) throughput 0.530983
ADD (uxtw, 64-bit) 1->2 1.954458
ADD (uxtw, 64-bit) 1->3 2.023042
ADD (uxtw, 64-bit) throughput 0.981864
ADD (uxtx, 64-bit) 1->2 1.011896
ADD (uxtx, 64-bit) 1->3 1.031497
ADD (uxtx, 64-bit) throughput 0.501686
ADD (sxtx, 64-bit) 1->2 1.019828
ADD (sxtx, 64-bit) 1->3 1.002947
ADD (sxtx, 64-bit) throughput 0.506215
ADD (immediate, 32-bit) 1->2 1.005660
ADD (immediate, 32-bit) throughput 0.507280
ADD (immediate, 64-bit) 1->2 1.005872
ADD (immediate, 64-bit) throughput 0.649187
ADD (shifted immediate, 32-bit) 1->2 1.245265
ADD (shifted immediate, 32-bit) throughput 0.454882
ADD (shifted immediate, 64-bit) 1->2 0.862306
ADD (shifted immediate, 64-bit) throughput 0.503656
ADD (register, 32-bit) 1->2 1.003475
ADD (register, 32-bit) 1->3 1.008386
ADD (register, 32-bit) throughput 0.519958
ADD (register, 64-bit) 1->2 0.991370
ADD (register, 64-bit) 1->3 1.000461
ADD (register, 64-bit) throughput 0.497339
ADD (register, lsl, 32-bit) 1->2 2.135982
ADD (register, lsl, 32-bit) 1->3 2.009442
ADD (register, lsl, 32-bit) throughput 1.024767
ADD (register, lsl, 64-bit) 1->2 1.977639
ADD (register, lsl, 64-bit) 1->3 2.055476
ADD (register, lsl, 64-bit) throughput 1.011385
ADD (register, lsr, 32-bit) 1->2 2.035674
ADD (register, lsr, 32-bit) 1->3 1.928535
ADD (register, lsr, 32-bit) throughput 1.010973
ADD (register, lsr, 64-bit) 1->2 1.969817
ADD (register, lsr, 64-bit) 1->3 2.064701
ADD (register, lsr, 64-bit) throughput 0.987366
ADD (register, asr, 32-bit) 1->2 1.972884
ADD (register, asr, 32-bit) 1->3 1.938710
ADD (register, asr, 32-bit) throughput 1.011145
ADD (register, asr, 64-bit) 1->2 1.997084
ADD (register, asr, 64-bit) 1->3 2.192536
ADD (register, asr, 64-bit) throughput 1.002014
ADDS (sxtb, 32-bit) 1->2 2.003509
ADDS (sxtb, 32-bit) 1->3 2.012358
ADDS (sxtb, 32-bit) 4->2 2.369218
ADDS (sxtb, 32-bit) 4->3 2.681993
ADDS (sxtb, 32-bit) throughput 0.972342
ADDS (sxtb, 64-bit) 1->2 1.887766
ADDS (sxtb, 64-bit) 1->3 2.017153
ADDS (sxtb, 64-bit) 4->2 2.006051
ADDS (sxtb, 64-bit) 4->3 2.034557
ADDS (sxtb, 64-bit) throughput 1.013898
ADDS (uxtb, 32-bit) 1->2 2.041362
ADDS (uxtb, 32-bit) 1->3 2.023481
ADDS (uxtb, 32-bit) 4->2 2.047022
ADDS (uxtb, 32-bit) 4->3 2.010306
ADDS (uxtb, 32-bit) throughput 0.968183
ADDS (uxtb, 64-bit) 1->2 1.969487
ADDS (uxtb, 64-bit) 1->3 2.030044
ADDS (uxtb, 64-bit) 4->2 2.020575
ADDS (uxtb, 64-bit) 4->3 2.028597
ADDS (uxtb, 64-bit) throughput 0.991116
ADDS (sxth, 32-bit) 1->2 1.961602
ADDS (sxth, 32-bit) 1->3 2.029689
ADDS (sxth, 32-bit) 4->2 2.060730
ADDS (sxth, 32-bit) 4->3 2.049295
ADDS (sxth, 32-bit) throughput 1.009880
ADDS (sxth, 64-bit) 1->2 2.021403
ADDS (sxth, 64-bit) 1->3 2.005044
ADDS (sxth, 64-bit) 4->2 1.999518
ADDS (sxth, 64-bit) 4->3 1.471938
ADDS (sxth, 64-bit) throughput 0.994147
ADDS (uxth, 32-bit) 1->2 1.715751
ADDS (uxth, 32-bit) 1->3 2.060796
ADDS (uxth, 32-bit) 4->2 1.737027
ADDS (uxth, 32-bit) 4->3 1.717218
ADDS (uxth, 32-bit) throughput 1.365911
ADDS (uxth, 64-bit) 1->2 1.715717
ADDS (uxth, 64-bit) 1->3 2.009413
ADDS (uxth, 64-bit) 4->2 1.996251
ADDS (uxth, 64-bit) 4->3 2.155195
ADDS (uxth, 64-bit) throughput 1.178284
ADDS (sxtw, 32-bit) 1->2 0.840275
ADDS (sxtw, 32-bit) 1->3 0.968779
ADDS (sxtw, 32-bit) 4->2 1.412684
ADDS (sxtw, 32-bit) 4->3 1.101602
ADDS (sxtw, 32-bit) throughput 0.746584
ADDS (sxtw, 64-bit) 1->2 2.239178
ADDS (sxtw, 64-bit) 1->3 2.223924
ADDS (sxtw, 64-bit) 4->2 2.230632
ADDS (sxtw, 64-bit) 4->3 2.093078
ADDS (sxtw, 64-bit) throughput 1.000601
ADDS (uxtw, 32-bit) 1->2 0.977771
ADDS (uxtw, 32-bit) 1->3 0.999934
ADDS (uxtw, 32-bit) 4->2 1.050133
ADDS (uxtw, 32-bit) 4->3 0.993755
ADDS (uxtw, 32-bit) throughput 0.711428
ADDS (uxtw, 64-bit) 1->2 2.139346
ADDS (uxtw, 64-bit) 1->3 2.027802
ADDS (uxtw, 64-bit) 4->2 2.050998
ADDS (uxtw, 64-bit) 4->3 1.964709
ADDS (uxtw, 64-bit) throughput 0.992627
ADDS (uxtx, 64-bit) 1->2 0.989438
ADDS (uxtx, 64-bit) 1->3 1.004255
ADDS (uxtx, 64-bit) 4->2 1.018702
ADDS (uxtx, 64-bit) 4->3 1.008356
ADDS (uxtx, 64-bit) throughput 0.696606
ADDS (sxtx, 64-bit) 1->2 0.972288
ADDS (sxtx, 64-bit) 1->3 1.079045
ADDS (sxtx, 64-bit) 4->2 1.006488
ADDS (sxtx, 64-bit) 4->3 1.019511
ADDS (sxtx, 64-bit) throughput 0.690283
ADDS (immediate, 32-bit) 1->2 1.002903
ADDS (immediate, 32-bit) 3->2 1.006094
ADDS (immediate, 32-bit) throughput 0.706096
ADDS (immediate, 64-bit) 1->2 0.996735
ADDS (immediate, 64-bit) 3->2 1.005236
ADDS (immediate, 64-bit) throughput 0.705760
ADDS (shifted immediate, 32-bit) 1->2 0.990086
ADDS (shifted immediate, 32-bit) 3->2 1.021732
ADDS (shifted immediate, 32-bit) throughput 0.696755
ADDS (shifted immediate, 64-bit) 1->2 0.996352
ADDS (shifted immediate, 64-bit) 3->2 1.041453
ADDS (shifted immediate, 64-bit) throughput 0.698475
ADDS (register, 32-bit) 1->2 1.000044
ADDS (register, 32-bit) 1->3 1.012763
ADDS (register, 32-bit) 4->2 1.064203
ADDS (register, 32-bit) 4->3 1.002127
ADDS (register, 32-bit) throughput 0.697693
ADDS (register, 64-bit) 1->2 0.980761
ADDS (register, 64-bit) 1->3 0.999978
ADDS (register, 64-bit) 4->2 1.041001
ADDS (register, 64-bit) 4->3 1.001272
ADDS (register, 64-bit) throughput 0.695792
ADDS (register, lsl, 32-bit) 1->2 1.996622
ADDS (register, lsl, 32-bit) 1->3 2.006598
ADDS (register, lsl, 32-bit) 4->2 2.009544
ADDS (register, lsl, 32-bit) 4->3 2.097717
ADDS (register, lsl, 32-bit) throughput 1.076571
ADDS (register, lsl, 64-bit) 1->2 2.620855
ADDS (register, lsl, 64-bit) 1->3 1.912899
ADDS (register, lsl, 64-bit) 4->2 2.019897
ADDS (register, lsl, 64-bit) 4->3 2.026723
ADDS (register, lsl, 64-bit) throughput 1.008496
ADDS (register, lsr, 32-bit) 1->2 1.925519
ADDS (register, lsr, 32-bit) 1->3 2.014567
ADDS (register, lsr, 32-bit) 4->2 2.055343
ADDS (register, lsr, 32-bit) 4->3 2.009699
ADDS (register, lsr, 32-bit) throughput 1.019827
ADDS (register, lsr, 64-bit) 1->2 1.994836
ADDS (register, lsr, 64-bit) 1->3 2.012895
ADDS (register, lsr, 64-bit) 4->2 2.039148
ADDS (register, lsr, 64-bit) 4->3 2.122175
ADDS (register, lsr, 64-bit) throughput 1.008376
ADDS (register, asr, 32-bit) 1->2 1.997324
ADDS (register, asr, 32-bit) 1->3 1.978939
ADDS (register, asr, 32-bit) 4->2 2.032429
ADDS (register, asr, 32-bit) 4->3 1.992028
ADDS (register, asr, 32-bit) throughput 1.007937
ADDS (register, asr, 64-bit) 1->2 2.001048
ADDS (register, asr, 64-bit) 1->3 2.017696
ADDS (register, asr, 64-bit) 4->2 2.002177
ADDS (register, asr, 64-bit) 4->3 2.007157
ADDS (register, asr, 64-bit) throughput 0.997310
ADR throughput 0.500687
ADRP throughput 0.507934
AND (immediate, 32-bit) 1->2 1.002492
AND (immediate, 32-bit) throughput 0.503703
AND (immediate, 64-bit) 1->2 1.021403
AND (immediate, 64-bit) throughput 0.501949
AND (register, 32-bit) 1->2 1.025722
AND (register, 32-bit) 1->3 0.994129
AND (register, 32-bit) throughput 0.505942
AND (register, 64-bit) 1->2 1.011855
AND (register, 64-bit) 1->3 1.003968
AND (register, 64-bit) throughput 0.503544
AND (register, lsl, 32-bit) 1->2 2.012762
AND (register, lsl, 32-bit) 1->3 1.985060
AND (register, lsl, 32-bit) throughput 1.006270
AND (register, lsl, 64-bit) 1->2 2.000220
AND (register, lsl, 64-bit) 1->3 1.999609
AND (register, lsl, 64-bit) throughput 0.859892
AND (register, lsr, 32-bit) 1->2 1.993760
AND (register, lsr, 32-bit) 1->3 1.980518
AND (register, lsr, 32-bit) throughput 1.018310
AND (register, lsr, 64-bit) 1->2 1.984590
AND (register, lsr, 64-bit) 1->3 2.000064
AND (register, lsr, 64-bit) throughput 0.985867
AND (register, asr, 32-bit) 1->2 2.073742
AND (register, asr, 32-bit) 1->3 2.001008
AND (register, asr, 32-bit) throughput 1.012320
AND (register, asr, 64-bit) 1->2 2.038101
AND (register, asr, 64-bit) 1->3 1.999871
AND (register, asr, 64-bit) throughput 1.022001
AND (register, ror, 32-bit) 1->2 2.021421
AND (register, ror, 32-bit) 1->3 2.032253
AND (register, ror, 32-bit) throughput 1.005094
AND (register, ror, 64-bit) 1->2 1.946970
AND (register, ror, 64-bit) 1->3 2.026964
AND (register, ror, 64-bit) throughput 0.999297
ADDS (immediate, 32-bit) 1->2 1.004686
ADDS (immediate, 32-bit) 3->2 1.011073
ADDS (immediate, 32-bit) throughput 0.691243
ADDS (immediate, 64-bit) 1->2 0.997127
ADDS (immediate, 64-bit) 3->2 0.998684
ADDS (immediate, 64-bit) throughput 0.701587
ANDS (register, 32-bit) 1->2 1.000260
ANDS (register, 32-bit) 1->3 1.013965
ANDS (register, 32-bit) 4->2 1.025220
ANDS (register, 32-bit) 4->3 1.013095
ANDS (register, 32-bit) throughput 0.693797
ANDS (register, 64-bit) 1->2 1.013034
ANDS (register, 64-bit) 1->3 0.985058
ANDS (register, 64-bit) 4->2 1.052209
ANDS (register, 64-bit) 4->3 0.986524
ANDS (register, 64-bit) throughput 0.700339
ANDS (register, lsl, 32-bit) 1->2 2.031674
ANDS (register, lsl, 32-bit) 1->3 1.996120
ANDS (register, lsl, 32-bit) 4->2 2.033221
ANDS (register, lsl, 32-bit) 4->3 1.998323
ANDS (register, lsl, 32-bit) throughput 1.166100
ANDS (register, lsl, 64-bit) 1->2 2.050727
ANDS (register, lsl, 64-bit) 1->3 1.933125
ANDS (register, lsl, 64-bit) 4->2 1.950764
ANDS (register, lsl, 64-bit) 4->3 2.076381
ANDS (register, lsl, 64-bit) throughput 0.991434
ANDS (register, lsr, 32-bit) 1->2 2.013980
ANDS (register, lsr, 32-bit) 1->3 1.993236
ANDS (register, lsr, 32-bit) 4->2 2.007127
ANDS (register, lsr, 32-bit) 4->3 2.000964
ANDS (register, lsr, 32-bit) throughput 1.009948
ANDS (register, lsr, 64-bit) 1->2 2.001183
ANDS (register, lsr, 64-bit) 1->3 1.998653
ANDS (register, lsr, 64-bit) 4->2 2.101789
ANDS (register, lsr, 64-bit) 4->3 2.041889
ANDS (register, lsr, 64-bit) throughput 1.006352
ANDS (register, asr, 32-bit) 1->2 1.956188
ANDS (register, asr, 32-bit) 1->3 2.007627
ANDS (register, asr, 32-bit) 4->2 2.021969
ANDS (register, asr, 32-bit) 4->3 2.561366
ANDS (register, asr, 32-bit) throughput 0.999221
ANDS (register, asr, 64-bit) 1->2 2.076139
ANDS (register, asr, 64-bit) 1->3 1.985512
ANDS (register, asr, 64-bit) 4->2 2.025977
ANDS (register, asr, 64-bit) 4->3 2.039046
ANDS (register, asr, 64-bit) throughput 1.008617
ANDS (register, ror, 32-bit) 1->2 2.013392
ANDS (register, ror, 32-bit) 1->3 1.955192
ANDS (register, ror, 32-bit) 4->2 2.061305
ANDS (register, ror, 32-bit) 4->3 1.999389
ANDS (register, ror, 32-bit) throughput 1.014483
ANDS (register, ror, 64-bit) 1->2 1.999495
ANDS (register, ror, 64-bit) 1->3 2.006040
ANDS (register, ror, 64-bit) 4->2 1.986745
ANDS (register, ror, 64-bit) 4->3 2.002430
ANDS (register, ror, 64-bit) throughput 1.010648
ASR (immediate, 32-bit) 1->2 1.003795
ASR (immediate, 32-bit) throughput 0.378440
ASR (immediate, 64-bit) 1->2 1.000786
ASR (immediate, 64-bit) throughput 0.514935
ASR (register, 32-bit) 1->2 1.000044
ASR (register, 32-bit) 1->3 1.003242
ASR (register, 32-bit) throughput 0.536185
ASR (register, 64-bit) 1->2 0.999946
ASR (register, 64-bit) 1->3 0.988798
ASR (register, 64-bit) throughput 0.625165
B throughput 2.957976
B.cc (not taken) throughput 0.656999
B.cc (taken) throughput 3.936628
BFC (32-bit) 1->1 1.018429
BFC (32-bit) throughput 1.011443
BFC (64-bit) 1->1 1.011030
BFC (64-bit) throughput 1.010017
BFI (32-bit) 1->1 1.001702
BFI (32-bit) 1->2 1.010644
BFI (32-bit) throughput 1.009519
BFI (64-bit) 1->1 0.978997
BFI (64-bit) 1->2 1.053014
BFI (64-bit) throughput 1.003776
BFXIL (32-bit) 1->1 1.004178
BFXIL (32-bit) 1->2 0.984636
BFXIL (32-bit) throughput 0.991258
BFXIL (64-bit) 1->1 1.003853
BFXIL (64-bit) 1->2 0.994393
BFXIL (64-bit) throughput 1.004087
BIC (register, 32-bit) 1->2 0.986266
BIC (register, 32-bit) 1->3 1.008514
BIC (register, 32-bit) throughput 0.502199
BIC (register, 64-bit) 1->2 0.991586
BIC (register, 64-bit) 1->3 1.001294
BIC (register, 64-bit) throughput 0.503478
BIC (register, lsl, 32-bit) 1->2 2.064351
BIC (register, lsl, 32-bit) 1->3 1.982097
BIC (register, lsl, 32-bit) throughput 1.001350
BIC (register, lsl, 64-bit) 1->2 1.970603
BIC (register, lsl, 64-bit) 1->3 2.041813
BIC (register, lsl, 64-bit) throughput 0.990221
BIC (register, lsr, 32-bit) 1->2 1.998182
BIC (register, lsr, 32-bit) 1->3 2.027896
BIC (register, lsr, 32-bit) throughput 1.008181
BIC (register, lsr, 64-bit) 1->2 1.998270
BIC (register, lsr, 64-bit) 1->3 2.010987
BIC (register, lsr, 64-bit) throughput 1.013852
BIC (register, asr, 32-bit) 1->2 2.046867
BIC (register, asr, 32-bit) 1->3 2.061921
BIC (register, asr, 32-bit) throughput 0.993025
BIC (register, asr, 64-bit) 1->2 2.006994
BIC (register, asr, 64-bit) 1->3 1.998007
BIC (register, asr, 64-bit) throughput 1.007149
BIC (register, ror, 32-bit) 1->2 1.993305
BIC (register, ror, 32-bit) 1->3 2.011490
BIC (register, ror, 32-bit) throughput 0.970330
BIC (register, ror, 64-bit) 1->2 1.969328
BIC (register, ror, 64-bit) 1->3 2.016551
BIC (register, ror, 64-bit) throughput 1.018243
BICS (register, 32-bit) 1->2 0.993881
BICS (register, 32-bit) 1->3 0.997237
BICS (register, 32-bit) 4->2 1.037582
BICS (register, 32-bit) 4->3 1.019124
BICS (register, 32-bit) throughput 0.692911
BICS (register, 64-bit) 1->2 1.019357
BICS (register, 64-bit) 1->3 0.985674
BICS (register, 64-bit) 4->2 1.000168
BICS (register, 64-bit) 4->3 0.959482
BICS (register, 64-bit) throughput 0.700310
BICS (register, lsl, 32-bit) 1->2 2.063021
BICS (register, lsl, 32-bit) 1->3 1.993847
BICS (register, lsl, 32-bit) 4->2 2.009846
BICS (register, lsl, 32-bit) 4->3 1.996666
BICS (register, lsl, 32-bit) throughput 1.003766
BICS (register, lsl, 64-bit) 1->2 2.006840
BICS (register, lsl, 64-bit) 1->3 2.028959
BICS (register, lsl, 64-bit) 4->2 2.028586
BICS (register, lsl, 64-bit) 4->3 1.920076
BICS (register, lsl, 64-bit) throughput 1.007291
BICS (register, lsr, 32-bit) 1->2 1.974773
BICS (register, lsr, 32-bit) 1->3 2.001184
BICS (register, lsr, 32-bit) 4->2 2.056956
BICS (register, lsr, 32-bit) 4->3 1.997960
BICS (register, lsr, 32-bit) throughput 0.993692
BICS (register, lsr, 64-bit) 1->2 1.990053
BICS (register, lsr, 64-bit) 1->3 2.015915
BICS (register, lsr, 64-bit) 4->2 1.993354
BICS (register, lsr, 64-bit) 4->3 1.999038
BICS (register, lsr, 64-bit) throughput 1.010170
BICS (register, asr, 32-bit) 1->2 2.001161
BICS (register, asr, 32-bit) 1->3 2.046813
BICS (register, asr, 32-bit) 4->2 2.010782
BICS (register, asr, 32-bit) 4->3 2.022168
BICS (register, asr, 32-bit) throughput 1.014096
BICS (register, asr, 64-bit) 1->2 2.040863
BICS (register, asr, 64-bit) 1->3 2.232306
BICS (register, asr, 64-bit) 4->2 2.002359
BICS (register, asr, 64-bit) 4->3 1.994684
BICS (register, asr, 64-bit) throughput 1.017351
BICS (register, ror, 32-bit) 1->2 2.022315
BICS (register, ror, 32-bit) 1->3 2.007930
BICS (register, ror, 32-bit) 4->2 2.014116
BICS (register, ror, 32-bit) 4->3 1.919306
BICS (register, ror, 32-bit) throughput 1.004646
BICS (register, ror, 64-bit) 1->2 2.001445
BICS (register, ror, 64-bit) 1->3 2.002983
BICS (register, ror, 64-bit) 4->2 2.072538
BICS (register, ror, 64-bit) 4->3 2.048727
BICS (register, ror, 64-bit) throughput 1.011328
BL throughput 3.213531
CBNZ (not taken) throughput 0.713315
CBNZ (taken) throughput 3.901890
CBZ (not taken) throughput 0.716811
CBZ (taken) throughput 3.927248
CCMN (immediate, 32-bit) 2->1 1.060855
CCMN (immediate, 32-bit) 2->2 0.998708
CCMN (immediate, 32-bit) throughput 1.386021
CCMN (immediate, 32-bit) throughput 1.196692
CCMN (immediate, 64-bit) 2->1 0.981321
CCMN (immediate, 64-bit) 2->2 1.004255
CCMN (immediate, 64-bit) throughput 1.404276
CCMN (immediate, 64-bit) throughput 1.201834
CCMN (register, 32-bit) 3->1 1.017644
CCMN (register, 32-bit) 3->2 0.977802
CCMN (register, 32-bit) 3->3 1.002969
CCMN (register, 32-bit) throughput 1.406677
CCMN (register, 32-bit) throughput 1.188983
CCMN (register, 64-bit) 3->1 0.990562
CCMN (register, 64-bit) 3->2 1.012457
CCMN (register, 64-bit) 3->3 0.998708
CCMN (register, 64-bit) throughput 1.369157
CCMN (register, 64-bit) throughput 1.200490
CCMP (immediate, 32-bit) 2->1 1.015986
CCMP (immediate, 32-bit) 2->2 1.001798
CCMP (immediate, 32-bit) throughput 1.396633
CCMP (immediate, 32-bit) throughput 1.211639
CCMP (immediate, 64-bit) 2->1 1.068745
CCMP (immediate, 64-bit) 2->2 0.986029
CCMP (immediate, 64-bit) throughput 1.354471
CCMP (immediate, 64-bit) throughput 1.209199
CCMP (register, 32-bit) 3->1 1.001644
CCMP (register, 32-bit) 3->2 1.065740
CCMP (register, 32-bit) 3->3 1.001869
CCMP (register, 32-bit) throughput 1.395859
CCMP (register, 32-bit) throughput 1.206721
CCMP (register, 64-bit) 3->1 0.997215
CCMP (register, 64-bit) 3->2 1.015132
CCMP (register, 64-bit) 3->3 1.003267
CCMP (register, 64-bit) throughput 1.403281
CCMP (register, 64-bit) throughput 1.208350
CINC (32-bit) 1->2 1.051189
CINC (32-bit) 1->3 1.073586
CINC (32-bit) throughput 0.529602
CINC (64-bit) 1->2 1.000000
CINC (64-bit) 1->3 1.051228
CINC (64-bit) throughput 0.499346
CINV (32-bit) 1->2 0.996329
CINV (32-bit) 1->3 1.046101
CINV (32-bit) throughput 0.505966
CINV (64-bit) 1->2 0.985391
CINV (64-bit) 1->3 0.989185
CINV (64-bit) throughput 0.495994
CLREX throughput 5.080461
CLREX throughput 5.037555
CLS (32-bit) 1->2 1.002947
CLS (32-bit) throughput 0.508090
CLS (64-bit) 1->2 1.009779
CLS (64-bit) throughput 0.505589
CLZ (32-bit) 1->2 1.012262
CLZ (32-bit) throughput 0.495622
CLZ (64-bit) 1->2 1.003211
CLZ (64-bit) throughput 0.504741
CMN (sxtb, 32-bit) 3->1 2.027740
CMN (sxtb, 32-bit) 3->2 1.940013
CMN (sxtb, 32-bit) throughput 1.008648
CMN (sxtb, 64-bit) 3->1 2.013335
CMN (sxtb, 64-bit) 3->2 2.002066
CMN (sxtb, 64-bit) throughput 0.980217
CMN (uxtb, 32-bit) 3->1 1.962957
CMN (uxtb, 32-bit) 3->2 2.025733
CMN (uxtb, 32-bit) throughput 1.048162
CMN (uxtb, 64-bit) 3->1 2.067263
CMN (uxtb, 64-bit) 3->2 1.961703
CMN (uxtb, 64-bit) throughput 0.976382
CMN (sxth, 32-bit) 3->1 2.009017
CMN (sxth, 32-bit) 3->2 1.909491
CMN (sxth, 32-bit) throughput 0.993616
CMN (sxth, 64-bit) 3->1 2.000088
CMN (sxth, 64-bit) 3->2 2.059811
CMN (sxth, 64-bit) throughput 1.011895
CMN (uxth, 32-bit) 3->1 2.050756
CMN (uxth, 32-bit) 3->2 2.008979
CMN (uxth, 32-bit) throughput 1.001007
CMN (uxth, 64-bit) 3->1 1.975705
CMN (uxth, 64-bit) 3->2 1.993955
CMN (uxth, 64-bit) throughput 1.015369
CMN (sxtw, 32-bit) 3->1 0.971203
CMN (sxtw, 32-bit) 3->2 1.023594
CMN (sxtw, 32-bit) throughput 0.679017
CMN (sxtw, 64-bit) 3->1 2.056970
CMN (sxtw, 64-bit) 3->2 2.008730
CMN (sxtw, 64-bit) throughput 1.005698
CMN (uxtw, 32-bit) 3->1 0.963861
CMN (uxtw, 32-bit) 3->2 1.048408
CMN (uxtw, 32-bit) throughput 0.673679
CMN (uxtw, 64-bit) 3->1 1.978097
CMN (uxtw, 64-bit) 3->2 2.048142
CMN (uxtw, 64-bit) throughput 1.005932
CMN (uxtx, 64-bit) 3->1 0.975829
CMN (uxtx, 64-bit) 3->2 1.013634
CMN (uxtx, 64-bit) throughput 0.679276
CMN (sxtx, 64-bit) 3->1 1.029008
CMN (sxtx, 64-bit) 3->2 1.008693
CMN (sxtx, 64-bit) throughput 0.675731
CMN (immediate, 32-bit) 2->1 0.932579
CMN (immediate, 32-bit) throughput 0.665653
CMN (immediate, 64-bit) 2->1 0.998686
CMN (immediate, 64-bit) throughput 0.674255
CMN (shifted immediate, 32-bit) 2->1 1.003443
CMN (shifted immediate, 32-bit) throughput 0.681622
CMN (shifted immediate, 64-bit) 2->1 0.994525
CMN (shifted immediate, 64-bit) throughput 0.670779
CMN (register, 32-bit) 3->1 1.004304
CMN (register, 32-bit) 3->2 0.961878
CMN (register, 32-bit) throughput 0.665655
CMN (register, 64-bit) 3->1 1.023289
CMN (register, 64-bit) 3->2 1.006833
CMN (register, 64-bit) throughput 0.676722
CMN (register, lsl, 32-bit) 3->1 2.018392
CMN (register, lsl, 32-bit) 3->2 2.009321
CMN (register, lsl, 32-bit) throughput 1.001064
CMN (register, lsl, 64-bit) 3->1 1.999715
CMN (register, lsl, 64-bit) 3->2 2.025291
CMN (register, lsl, 64-bit) throughput 1.009126
CMN (register, lsr, 32-bit) 3->1 2.021829
CMN (register, lsr, 32-bit) 3->2 2.016429
CMN (register, lsr, 32-bit) throughput 1.008137
CMN (register, lsr, 64-bit) 3->1 2.024138
CMN (register, lsr, 64-bit) 3->2 2.007664
CMN (register, lsr, 64-bit) throughput 1.005756
CMN (register, asr, 32-bit) 3->1 2.029997
CMN (register, asr, 32-bit) 3->2 2.077151
CMN (register, asr, 32-bit) throughput 1.007367
CMN (register, asr, 64-bit) 3->1 1.997116
CMN (register, asr, 64-bit) 3->2 1.997284
CMN (register, asr, 64-bit) throughput 1.112375
CMP (sxtb, 32-bit) 3->1 2.002555
CMP (sxtb, 32-bit) 3->2 1.994041
CMP (sxtb, 32-bit) throughput 1.008776
CMP (sxtb, 64-bit) 3->1 2.024136
CMP (sxtb, 64-bit) 3->2 2.018441
CMP (sxtb, 64-bit) throughput 1.010538
CMP (uxtb, 32-bit) 3->1 2.022679
CMP (uxtb, 32-bit) 3->2 2.018249
CMP (uxtb, 32-bit) throughput 1.018622
CMP (uxtb, 64-bit) 3->1 2.000634
CMP (uxtb, 64-bit) 3->2 1.998026
CMP (uxtb, 64-bit) throughput 0.999044
CMP (sxth, 32-bit) 3->1 2.032000
CMP (sxth, 32-bit) 3->2 2.017935
CMP (sxth, 32-bit) throughput 1.007475
CMP (sxth, 64-bit) 3->1 2.250734
CMP (sxth, 64-bit) 3->2 2.034505
CMP (sxth, 64-bit) throughput 1.102485
CMP (uxth, 32-bit) 3->1 1.994480
CMP (uxth, 32-bit) 3->2 2.093771
CMP (uxth, 32-bit) throughput 1.007628
CMP (uxth, 64-bit) 3->1 2.032084
CMP (uxth, 64-bit) 3->2 2.069161
CMP (uxth, 64-bit) throughput 0.987011
CMP (sxtw, 32-bit) 3->1 1.003742
CMP (sxtw, 32-bit) 3->2 1.011842
CMP (sxtw, 32-bit) throughput 0.672947
CMP (sxtw, 64-bit) 3->1 1.999430
CMP (sxtw, 64-bit) 3->2 1.998926
CMP (sxtw, 64-bit) throughput 1.006540
CMP (uxtw, 32-bit) 3->1 1.040665
CMP (uxtw, 32-bit) 3->2 1.006774
CMP (uxtw, 32-bit) throughput 0.667814
CMP (uxtw, 64-bit) 3->1 2.089135
CMP (uxtw, 64-bit) 3->2 2.004452
CMP (uxtw, 64-bit) throughput 1.016573
CMP (uxtx, 64-bit) 3->1 1.000943
CMP (uxtx, 64-bit) 3->2 1.080769
CMP (uxtx, 64-bit) throughput 0.670455
CMP (sxtx, 64-bit) 3->1 1.001009
CMP (sxtx, 64-bit) 3->2 0.989583
CMP (sxtx, 64-bit) throughput 0.675626
CMP (immediate, 32-bit) 2->1 1.019167
CMP (immediate, 32-bit) throughput 0.670765
CMP (immediate, 64-bit) 2->1 0.981843
CMP (immediate, 64-bit) throughput 0.670153
CMP (shifted immediate, 32-bit) 2->1 1.459658
CMP (shifted immediate, 32-bit) throughput 0.673227
CMP (shifted immediate, 64-bit) 2->1 0.944707
CMP (shifted immediate, 64-bit) throughput 0.672941
CMP (register, 32-bit) 3->1 1.002300
CMP (register, 32-bit) 3->2 0.962852
CMP (register, 32-bit) throughput 0.680460
CMP (register, 64-bit) 3->1 0.955695
CMP (register, 64-bit) 3->2 1.003602
CMP (register, 64-bit) throughput 0.675589
CMP (register, lsl, 32-bit) 3->1 2.042338
CMP (register, lsl, 32-bit) 3->2 2.001645
CMP (register, lsl, 32-bit) throughput 1.022590
CMP (register, lsl, 64-bit) 3->1 2.016142
CMP (register, lsl, 64-bit) 3->2 2.038679
CMP (register, lsl, 64-bit) throughput 0.995140
CMP (register, lsr, 32-bit) 3->1 2.026776
CMP (register, lsr, 32-bit) 3->2 2.059450
CMP (register, lsr, 32-bit) throughput 1.004158
CMP (register, lsr, 64-bit) 3->1 2.000154
CMP (register, lsr, 64-bit) 3->2 2.057775
CMP (register, lsr, 64-bit) throughput 1.005062
CMP (register, asr, 32-bit) 3->1 2.013027
CMP (register, asr, 32-bit) 3->2 1.939340
CMP (register, asr, 32-bit) throughput 1.004357
CMP (register, asr, 64-bit) 3->1 2.073163
CMP (register, asr, 64-bit) 3->2 1.988741
CMP (register, asr, 64-bit) throughput 1.001130
CNEG (32-bit) 1->2 1.022786
CNEG (32-bit) 1->3 1.008123
CNEG (32-bit) throughput 0.502571
CNEG (64-bit) 1->2 0.997128
CNEG (64-bit) 1->3 0.998621
CNEG (64-bit) throughput 0.501432
CSDB throughput 0.339313
CSEL (32-bit) 1->2 1.003387
CSEL (32-bit) 1->3 1.003706
CSEL (32-bit) 1->4 1.039943
CSEL (32-bit) throughput 0.496596
CSEL (64-bit) 1->2 1.002434
CSEL (64-bit) 1->3 1.015220
CSEL (64-bit) 1->4 0.960055
CSEL (64-bit) throughput 0.506917
CSET (32-bit) 1->2 1.018177
CSET (32-bit) throughput 0.503708
CSET (64-bit) 1->2 1.006950
CSET (64-bit) throughput 0.528508
CSETM (32-bit) 1->2 1.002684
CSETM (32-bit) throughput 0.500933
CSETM (64-bit) 1->2 1.021662
CSETM (64-bit) throughput 0.495254
CSINC (32-bit) 1->2 1.004266
CSINC (32-bit) 1->3 0.999003
CSINC (32-bit) 1->4 1.036318
CSINC (32-bit) throughput 0.481779
CSINC (64-bit) 1->2 1.019120
CSINC (64-bit) 1->3 1.010315
CSINC (64-bit) 1->4 1.017646
CSINC (64-bit) throughput 0.503925
CSINV (32-bit) 1->2 1.025028
CSINV (32-bit) 1->3 1.009668
CSINV (32-bit) 1->4 1.042550
CSINV (32-bit) throughput 0.500829
CSINV (64-bit) 1->2 1.008961
CSINV (64-bit) 1->3 0.992428
CSINV (64-bit) 1->4 1.095996
CSINV (64-bit) throughput 0.504084
CSNEG (32-bit) 1->2 0.993489
CSNEG (32-bit) 1->3 0.965814
CSNEG (32-bit) 1->4 1.006920
CSNEG (32-bit) throughput 0.503568
CSNEG (64-bit) 1->2 1.005702
CSNEG (64-bit) 1->3 1.005402
CSNEG (64-bit) 1->4 1.039318
CSNEG (64-bit) throughput 0.500810
DMB (SY) throughput 8.443930
DMB (ST) throughput 9.034754
DMB (LD) throughput 8.922060
DMB (ISH) throughput 8.911093
DMB (ISHST) throughput 9.028898
DMB (ISHLD) throughput 9.033158
DMB (NSH) throughput 8.840704
DMB (NSHST) throughput 9.008664
DMB (NSHLD) throughput 9.014649
DMB (OSH) throughput 8.984112
DMB (OSHST) throughput 8.954245
DMB (OSHLD) throughput 9.040394
DSB (SY) throughput 17.760345
DSB (ST) throughput 17.974107
DSB (LD) throughput 17.590773
DSB (ISH) throughput 17.751116
DSB (ISHST) throughput 17.919762
DSB (ISHLD) throughput 17.614600
DSB (NSH) throughput 16.760877
DSB (NSHST) throughput 17.931979
DSB (NSHLD) throughput 17.722172
DSB (OSH) throughput 17.428704
DSB (OSHST) throughput 17.693256
DSB (OSHLD) throughput 17.724142
EON (register, 32-bit) 1->2 0.977097
EON (register, 32-bit) 1->3 1.032401
EON (register, 32-bit) throughput 0.504260
EON (register, 64-bit) 1->2 1.015548
EON (register, 64-bit) 1->3 0.998948
EON (register, 64-bit) throughput 0.507833
EON (register, lsl, 32-bit) 1->2 2.051940
EON (register, lsl, 32-bit) 1->3 2.023947
EON (register, lsl, 32-bit) throughput 1.011491
EON (register, lsl, 64-bit) 1->2 1.986951
EON (register, lsl, 64-bit) 1->3 2.022498
EON (register, lsl, 64-bit) throughput 1.016258
EON (register, lsr, 32-bit) 1->2 1.999673
EON (register, lsr, 32-bit) 1->3 2.033105
EON (register, lsr, 32-bit) throughput 1.001836
EON (register, lsr, 64-bit) 1->2 1.983962
EON (register, lsr, 64-bit) 1->3 2.001118
EON (register, lsr, 64-bit) throughput 0.992152
EON (register, asr, 32-bit) 1->2 1.984804
EON (register, asr, 32-bit) 1->3 2.017043
EON (register, asr, 32-bit) throughput 1.007630
EON (register, asr, 64-bit) 1->2 1.990773
EON (register, asr, 64-bit) 1->3 2.009911
EON (register, asr, 64-bit) throughput 1.011806
EON (register, ror, 32-bit) 1->2 2.068936
EON (register, ror, 32-bit) 1->3 2.048219
EON (register, ror, 32-bit) throughput 0.995500
EON (register, ror, 64-bit) 1->2 1.999430
EON (register, ror, 64-bit) 1->3 2.016683
EON (register, ror, 64-bit) throughput 1.181655
EOR (immediate, 32-bit) 1->2 0.997128
EOR (immediate, 32-bit) throughput 0.504190
EOR (immediate, 64-bit) 1->2 0.998839
EOR (immediate, 64-bit) throughput 0.501308
EOR (register, 32-bit) 1->2 0.988429
EOR (register, 32-bit) 1->3 1.002926
EOR (register, 32-bit) throughput 0.504486
EOR (register, 64-bit) 1->2 0.990176
EOR (register, 64-bit) 1->3 0.998664
EOR (register, 64-bit) throughput 0.507206
EOR (register, lsl, 32-bit) 1->2 2.008577
EOR (register, lsl, 32-bit) 1->3 2.024034
EOR (register, lsl, 32-bit) throughput 1.038339
EOR (register, lsl, 64-bit) 1->2 2.081573
EOR (register, lsl, 64-bit) 1->3 2.012720
EOR (register, lsl, 64-bit) throughput 1.017374
EOR (register, lsr, 32-bit) 1->2 2.030325
EOR (register, lsr, 32-bit) 1->3 2.008929
EOR (register, lsr, 32-bit) throughput 0.998852
EOR (register, lsr, 64-bit) 1->2 2.008022
EOR (register, lsr, 64-bit) 1->3 2.074639
EOR (register, lsr, 64-bit) throughput 1.004551
EOR (register, asr, 32-bit) 1->2 2.027857
EOR (register, asr, 32-bit) 1->3 2.014910
EOR (register, asr, 32-bit) throughput 0.970906
EOR (register, asr, 64-bit) 1->2 2.047870
EOR (register, asr, 64-bit) 1->3 2.002250
EOR (register, asr, 64-bit) throughput 1.004677
EOR (register, ror, 32-bit) 1->2 2.030404
EOR (register, ror, 32-bit) 1->3 2.000920
EOR (register, ror, 32-bit) throughput 1.001243
EOR (register, ror, 64-bit) 1->2 1.996160
EOR (register, ror, 64-bit) 1->3 2.042421
EOR (register, ror, 64-bit) throughput 1.016226
EXTR (register, 32-bit) 1->2 1.010227
EXTR (register, 32-bit) 1->3 2.005497
EXTR (register, 32-bit) throughput 1.035548
EXTR (register, 64-bit) 1->2 0.991818
EXTR (register, 64-bit) 1->3 2.008446
EXTR (register, 64-bit) throughput 1.019914
ISB (SY) throughput 25.581788
LDNP (32-bit) 1->3 (with chain penalty) 4.044393
LDNP (32-bit) 2->3 (with chain penalty) 4.070474
LDNP (32-bit) throughput 1.003932
LDNP (64-bit) 1->3 (with chain penalty) 4.046541
LDNP (64-bit) 2->3 (with chain penalty) 4.026934
LDNP (64-bit) throughput 1.009622
LDP (32-bit) 1->3 (with chain penalty) 4.047958
LDP (32-bit) 2->3 (with chain penalty) 4.878055
LDP (32-bit) throughput 1.004853
LDP (64-bit) 1->3 (with chain penalty) 4.174227
LDP (64-bit) 2->3 (with chain penalty) 4.087767
LDP (64-bit) throughput 1.005140
LDP (post-index, 32-bit) 1->3 (with chain penalty) 3.903878
LDP (post-index, 32-bit) 2->3 (with chain penalty) 4.143860
LDP (post-index, 32-bit) throughput 1.084549
LDP (post-index, 64-bit) 1->3 (with chain penalty) 12.546301
LDP (post-index, 64-bit) 2->3 (with chain penalty) 11.669506
LDP (post-index, 64-bit) throughput 1.074776
LDP (pre-index, 32-bit) 1->3 (with chain penalty) 4.230391
LDP (pre-index, 32-bit) 2->3 (with chain penalty) 4.139245
LDP (pre-index, 32-bit) throughput 1.081937
LDP (pre-index, 64-bit) 1->3 (with chain penalty) 12.194854
LDP (pre-index, 64-bit) 2->3 (with chain penalty) 12.465035
LDP (pre-index, 64-bit) throughput 1.081925
LDP (signed offset, 32-bit) 1->3 (with chain penalty) 3.841953
LDP (signed offset, 32-bit) 2->3 (with chain penalty) 4.108388
LDP (signed offset, 32-bit) throughput 1.006257
LDP (signed offset, 64-bit) 1->3 (with chain penalty) 4.057196
LDP (signed offset, 64-bit) 2->3 (with chain penalty) 4.163538
LDP (signed offset, 64-bit) throughput 1.012624
LDPSW (post-index) 1->3 (with chain penalty) 4.179043
LDPSW (post-index) 2->3 (with chain penalty) 4.079035
LDPSW (post-index) throughput 1.075887
LDPSW (pre-index) 1->3 (with chain penalty) 4.177090
LDPSW (pre-index) 2->3 (with chain penalty) 4.218474
LDPSW (pre-index) throughput 1.026233
LDPSW (signed offset) 1->3 (with chain penalty) 4.209227
LDPSW (signed offset) 2->3 (with chain penalty) 4.013489
LDPSW (signed offset) throughput 1.022170
LDR (32-bit) 1->2 (with chain penalty) 3.971448
LDR (32-bit) throughput 1.011684
LDR (64-bit) 1->2 (with chain penalty) 4.072593
LDR (64-bit) throughput 0.998471
LDR (post-index, 32-bit) 1->2 (with chain penalty) 4.351603
LDR (post-index, 32-bit) throughput 1.083341
LDR (post-index, 64-bit) 1->2 (with chain penalty) 4.229440
LDR (post-index, 64-bit) throughput 1.033304
LDR (pre-index, 32-bit) 1->2 (with chain penalty) 4.243514
LDR (pre-index, 32-bit) throughput 1.070808
LDR (pre-index, 64-bit) 1->2 (with chain penalty) 4.253933
LDR (pre-index, 64-bit) throughput 1.080068
LDR (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.024049
LDR (unsigned offset, 32-bit) throughput 1.017205
LDR (unsigned offset, 64-bit) 1->2 (with chain penalty) 3.969632
LDR (unsigned offset, 64-bit) throughput 1.007177
LDR (literal, 32-bit) throughput 1.011905
LDR (literal, 64-bit) throughput 1.015191
LDR (register, 32-bit) 1->2 (with chain penalty) 4.092763
LDR (register, 32-bit) 1->3 (with chain penalty) 4.017211
LDR (register, 32-bit) throughput 0.992980
LDR (register, 64-bit) 1->2 (with chain penalty) 4.057472
LDR (register, 64-bit) 1->3 (with chain penalty) 4.090575
LDR (register, 64-bit) throughput 1.010506
LDR (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.093121
LDR (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.078751
LDR (register, uxtw, 32-bit) throughput 0.993836
LDR (register, uxtw, 64-bit) 1->2 (with chain penalty) 3.815059
LDR (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.122432
LDR (register, uxtw, 64-bit) throughput 1.007306
LDR (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.058102
LDR (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.003536
LDR (register, sxtw, 32-bit) throughput 1.004945
LDR (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.047149
LDR (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.089495
LDR (register, sxtw, 64-bit) throughput 0.997869
LDR (register, lsl, 32-bit) 1->2 (with chain penalty) 4.106239
LDR (register, lsl, 32-bit) 1->3 (with chain penalty) 4.148664
LDR (register, lsl, 32-bit) throughput 1.006492
LDR (register, lsl, 64-bit) 1->2 (with chain penalty) 4.064046
LDR (register, lsl, 64-bit) 1->3 (with chain penalty) 4.006514
LDR (register, lsl, 64-bit) throughput 1.005486
LDRB 1->2 (with chain penalty) 3.938439
LDRB throughput 0.987648
LDRB (post-index) 1->2 (with chain penalty) 4.391652
LDRB (post-index) throughput 1.083751
LDRB (pre-index) 1->2 (with chain penalty) 4.209724
LDRB (pre-index) throughput 1.025403
LDRB (unsigned offset) 1->2 (with chain penalty) 4.067285
LDRB (unsigned offset) throughput 0.841919
LDRB (register) 1->2 (with chain penalty) 4.078150
LDRB (register) 1->3 (with chain penalty) 4.075915
LDRB (register) throughput 1.010469
LDRB (register, uxtw) 1->2 (with chain penalty) 4.121407
LDRB (register, uxtw) 1->3 (with chain penalty) 3.981325
LDRB (register, uxtw) throughput 1.003415
LDRB (register, sxtw) 1->2 (with chain penalty) 4.057791
LDRB (register, sxtw) 1->3 (with chain penalty) 4.060007
LDRB (register, sxtw) throughput 1.027547
LDRH 1->2 (with chain penalty) 4.032823
LDRH throughput 1.013230
LDRH (post-index) 1->2 (with chain penalty) 4.189693
LDRH (post-index) throughput 1.085782
LDRH (pre-index) 1->2 (with chain penalty) 4.286687
LDRH (pre-index) throughput 1.023389
LDRH (unsigned offset) 1->2 (with chain penalty) 4.001576
LDRH (unsigned offset) throughput 1.014868
LDRH (register) 1->2 (with chain penalty) 4.143153
LDRH (register) 1->3 (with chain penalty) 3.881328
LDRH (register) throughput 1.005329
LDRH (register, uxtw) 1->2 (with chain penalty) 4.039911
LDRH (register, uxtw) 1->3 (with chain penalty) 4.044468
LDRH (register, uxtw) throughput 1.005168
LDRH (register, sxtw) 1->2 (with chain penalty) 4.116180
LDRH (register, sxtw) 1->3 (with chain penalty) 4.128620
LDRH (register, sxtw) throughput 1.014622
LDRH (register, lsl) 1->2 (with chain penalty) 5.110571
LDRH (register, lsl) 1->3 (with chain penalty) 4.053479
LDRH (register, lsl) throughput 0.992300
LDRSB (32-bit) 1->2 (with chain penalty) 4.060524
LDRSB (32-bit) throughput 1.005136
LDRSB (64-bit) 1->2 (with chain penalty) 4.040987
LDRSB (64-bit) throughput 0.997351
LDRSB (post-index, 32-bit) 1->2 (with chain penalty) 4.304442
LDRSB (post-index, 32-bit) throughput 1.098438
LDRSB (post-index, 64-bit) 1->2 (with chain penalty) 4.300424
LDRSB (post-index, 64-bit) throughput 1.076438
LDRSB (pre-index, 32-bit) 1->2 (with chain penalty) 4.476465
LDRSB (pre-index, 32-bit) throughput 0.988187
LDRSB (pre-index, 64-bit) 1->2 (with chain penalty) 4.214619
LDRSB (pre-index, 64-bit) throughput 1.060600
LDRSB (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.021815
LDRSB (unsigned offset, 32-bit) throughput 1.003236
LDRSB (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.058586
LDRSB (unsigned offset, 64-bit) throughput 1.000011
LDRSB (register, 32-bit) 1->2 (with chain penalty) 4.059067
LDRSB (register, 32-bit) 1->3 (with chain penalty) 4.025416
LDRSB (register, 32-bit) throughput 0.971683
LDRSB (register, 64-bit) 1->2 (with chain penalty) 4.452249
LDRSB (register, 64-bit) 1->3 (with chain penalty) 3.696003
LDRSB (register, 64-bit) throughput 1.003044
LDRSB (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.017409
LDRSB (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.109331
LDRSB (register, uxtw, 32-bit) throughput 1.005799
LDRSB (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.065447
LDRSB (register, uxtw, 64-bit) 1->3 (with chain penalty) 3.746108
LDRSB (register, uxtw, 64-bit) throughput 0.994660
LDRSB (register, sxtw, 32-bit) 1->2 (with chain penalty) 3.869439
LDRSB (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.098361
LDRSB (register, sxtw, 32-bit) throughput 1.022140
LDRSB (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.108026
LDRSB (register, sxtw, 64-bit) 1->3 (with chain penalty) 3.999254
LDRSB (register, sxtw, 64-bit) throughput 0.978809
LDRSH (32-bit) 1->2 (with chain penalty) 4.041448
LDRSH (32-bit) throughput 1.008837
LDRSH (64-bit) 1->2 (with chain penalty) 4.055635
LDRSH (64-bit) throughput 1.008577
LDRSH (post-index, 32-bit) 1->2 (with chain penalty) 4.134653
LDRSH (post-index, 32-bit) throughput 1.087029
LDRSH (post-index, 64-bit) 1->2 (with chain penalty) 4.317774
LDRSH (post-index, 64-bit) throughput 1.091839
LDRSH (pre-index, 32-bit) 1->2 (with chain penalty) 4.128648
LDRSH (pre-index, 32-bit) throughput 1.087311
LDRSH (pre-index, 64-bit) 1->2 (with chain penalty) 4.047829
LDRSH (pre-index, 64-bit) throughput 1.049689
LDRSH (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.038525
LDRSH (unsigned offset, 32-bit) throughput 1.000895
LDRSH (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.063464
LDRSH (unsigned offset, 64-bit) throughput 1.011666
LDRSH (register, 32-bit) 1->2 (with chain penalty) 4.034206
LDRSH (register, 32-bit) 1->3 (with chain penalty) 4.130790
LDRSH (register, 32-bit) throughput 0.991864
LDRSH (register, 64-bit) 1->2 (with chain penalty) 4.729945
LDRSH (register, 64-bit) 1->3 (with chain penalty) 4.023932
LDRSH (register, 64-bit) throughput 1.007128
LDRSH (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.111685
LDRSH (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.121450
LDRSH (register, uxtw, 32-bit) throughput 1.010207
LDRSH (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.103386
LDRSH (register, uxtw, 64-bit) 1->3 (with chain penalty) 3.974509
LDRSH (register, uxtw, 64-bit) throughput 1.007329
LDRSH (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.028790
LDRSH (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.934001
LDRSH (register, sxtw, 32-bit) throughput 1.013163
LDRSH (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.005696
LDRSH (register, sxtw, 64-bit) 1->3 (with chain penalty) 3.831607
LDRSH (register, sxtw, 64-bit) throughput 1.009420
LDRSH (register, lsl, 32-bit) 1->2 (with chain penalty) 3.819916
LDRSH (register, lsl, 32-bit) 1->3 (with chain penalty) 4.060493
LDRSH (register, lsl, 32-bit) throughput 0.995939
LDRSH (register, lsl, 64-bit) 1->2 (with chain penalty) 4.039633
LDRSH (register, lsl, 64-bit) 1->3 (with chain penalty) 4.054429
LDRSH (register, lsl, 64-bit) throughput 1.010285
LDRSW 1->2 (with chain penalty) 4.056849
LDRSW throughput 1.004890
LDRSW (post-index) 1->2 (with chain penalty) 4.221380
LDRSW (post-index) throughput 1.043827
LDRSW (pre-index) 1->2 (with chain penalty) 4.272394
LDRSW (pre-index) throughput 1.101287
LDRSW (unsigned offset) 1->2 (with chain penalty) 3.883019
LDRSW (unsigned offset) throughput 1.014983
LDRSW (literal) throughput 1.009622
LDRSW (register) 1->2 (with chain penalty) 3.981519
LDRSW (register) 1->3 (with chain penalty) 4.068463
LDRSW (register) throughput 0.965625
LDRSW (register, uxtw) 1->2 (with chain penalty) 4.106054
LDRSW (register, uxtw) 1->3 (with chain penalty) 3.942762
LDRSW (register, uxtw) throughput 1.000514
LDRSW (register, sxtw) 1->2 (with chain penalty) 3.810174
LDRSW (register, sxtw) 1->3 (with chain penalty) 4.046573
LDRSW (register, sxtw) throughput 1.005332
LDRSW (register, lsl) 1->2 (with chain penalty) 3.953119
LDRSW (register, lsl) 1->3 (with chain penalty) 4.076571
LDRSW (register, lsl) throughput 0.995241
LDUR (32-bit) 1->2 (with chain penalty) 4.104774
LDUR (32-bit) throughput 1.001598
LDUR (64-bit) 1->2 (with chain penalty) 4.080134
LDUR (64-bit) throughput 1.009448
LDURB 1->2 (with chain penalty) 4.139776
LDURB throughput 1.015552
LDURH 1->2 (with chain penalty) 4.025947
LDURH throughput 1.018075
LDURSB (32-bit) 1->2 (with chain penalty) 4.085610
LDURSB (32-bit) throughput 1.006289
LDURSB (64-bit) 1->2 (with chain penalty) 4.080201
LDURSB (64-bit) throughput 1.007633
LDURSH (32-bit) 1->2 (with chain penalty) 4.068606
LDURSH (32-bit) throughput 1.004755
LDURSH (64-bit) 1->2 (with chain penalty) 4.048513
LDURSH (64-bit) throughput 0.991765
LDURSW 1->2 (with chain penalty) 4.019505
LDURSW throughput 0.979031
LSL (immediate, 32-bit) 1->2 0.997302
LSL (immediate, 32-bit) throughput 0.503338
LSL (immediate, 64-bit) 1->2 1.015301
LSL (immediate, 64-bit) throughput 0.503262
LSL (register, 32-bit) 1->2 1.022436
LSL (register, 32-bit) 1->3 1.016407
LSL (register, 32-bit) throughput 0.499357
LSL (register, 64-bit) 1->2 0.989473
LSL (register, 64-bit) 1->3 1.030173
LSL (register, 64-bit) throughput 0.499789
LSR (immediate, 32-bit) 1->2 0.988585
LSR (immediate, 32-bit) throughput 0.501864
LSR (immediate, 64-bit) 1->2 1.002246
LSR (immediate, 64-bit) throughput 0.511032
LSR (register, 32-bit) 1->2 1.002903
LSR (register, 32-bit) 1->3 0.998532
LSR (register, 32-bit) throughput 0.501050
LSR (register, 64-bit) 1->2 1.020321
LSR (register, 64-bit) 1->3 0.998730
LSR (register, 64-bit) throughput 0.501147
MADD (32-bit) 1->2 3.067119
MADD (32-bit) 1->3 3.002686
MADD (32-bit) 1->4 1.000504
MADD (32-bit) throughput 1.007380
MADD (64-bit) 1->2 4.102906
MADD (64-bit) 1->3 4.026845
MADD (64-bit) 1->4 1.994087
MADD (64-bit) throughput 2.009217
MNEG (32-bit) 1->2 2.981975
MNEG (32-bit) 1->3 2.990850
MNEG (32-bit) throughput 1.009844
MNEG (64-bit) 1->2 4.170855
MNEG (64-bit) 1->3 3.963927
MNEG (64-bit) throughput 2.042000
MOV (bitmask immediate, 32-bit) throughput 0.503976
MOV (bitmask immediate, 64-bit) throughput 0.512462
MOV (from sp, 32-bit) throughput 0.518603
MOV (from sp, 64-bit) throughput 0.336146
MOVK (32-bit) 1->1 1.003659
MOVK (32-bit) throughput 0.511871
MOVK (64-bit) 1->1 0.996474
MOVK (64-bit) throughput 0.498991
MOVN (32-bit) throughput 0.504761
MOVN (64-bit) throughput 0.500471
MOVZ (32-bit) throughput 0.501836
MOVZ (64-bit) throughput 0.502475
MRS (CNTFRQ_EL0) throughput 15.925776
MRS (CNTPCT_EL0) throughput 15.943505
MRS (CNTVCT_EL0) throughput 16.165839
MRS (DCZID_EL0) throughput 1.269430
MRS (FPCR) throughput 1.257801
MRS (FPSR) throughput 7.062337
MRS (NZCV) throughput 0.507324
MRS (TPIDRRO_EL0) throughput 1.256763
MRS (TPIDR_EL0) throughput 1.287417
MSR (FPCR) throughput 8.550029
MSR (FPSR) throughput 8.252713
MSR (TPIDR_EL0) throughput 10.792015
MSR (NZCV) throughput 0.713828
MSUB (32-bit) 1->2 2.960646
MSUB (32-bit) 1->3 2.967032
MSUB (32-bit) 1->4 1.010344
MSUB (32-bit) throughput 1.009293
MSUB (64-bit) 1->2 4.047751
MSUB (64-bit) 1->3 4.025354
MSUB (64-bit) 1->4 1.992074
MSUB (64-bit) throughput 2.034131
MUL (32-bit) 1->2 2.976164
MUL (32-bit) 1->3 3.076357
MUL (32-bit) throughput 1.002403
MUL (64-bit) 1->2 4.051503
MUL (64-bit) 1->3 4.064561
MUL (64-bit) throughput 2.042069
MVN (register, 32-bit) 1->2 0.987009
MVN (register, 32-bit) throughput 0.508398
MVN (register, 64-bit) 1->2 1.001342
MVN (register, 64-bit) throughput 0.495858
MVN (register, lsl, 32-bit) 1->2 1.994074
MVN (register, lsl, 32-bit) throughput 1.009698
MVN (register, lsl, 64-bit) 1->2 2.016489
MVN (register, lsl, 64-bit) throughput 1.014539
MVN (register, lsr, 32-bit) 1->2 2.004031
MVN (register, lsr, 32-bit) throughput 0.882336
MVN (register, lsr, 64-bit) 1->2 1.988706
MVN (register, lsr, 64-bit) throughput 0.997821
MVN (register, asr, 32-bit) 1->2 1.931022
MVN (register, asr, 32-bit) throughput 1.010703
MVN (register, asr, 64-bit) 1->2 2.098831
MVN (register, asr, 64-bit) throughput 0.976565
MVN (register, ror, 32-bit) 1->2 2.024842
MVN (register, ror, 32-bit) throughput 0.972185
MVN (register, ror, 64-bit) 1->2 2.001163
MVN (register, ror, 64-bit) throughput 1.019052
NEG (register, 32-bit) 1->2 0.996874
NEG (register, 32-bit) throughput 0.504799
NEG (register, 64-bit) 1->2 1.009016
NEG (register, 64-bit) throughput 0.494766
NEG (register, lsl, 32-bit) 1->2 2.002033
NEG (register, lsl, 32-bit) throughput 1.010907
NEG (register, lsl, 64-bit) 1->2 2.006840
NEG (register, lsl, 64-bit) throughput 0.997460
NEG (register, lsr, 32-bit) 1->2 1.994174
NEG (register, lsr, 32-bit) throughput 0.998729
NEG (register, lsr, 64-bit) 1->2 1.972403
NEG (register, lsr, 64-bit) throughput 1.010143
NEG (register, asr, 32-bit) 1->2 2.019339
NEG (register, asr, 32-bit) throughput 1.011852
NEG (register, asr, 64-bit) 1->2 2.009973
NEG (register, asr, 64-bit) throughput 0.989676
NEGS (register, 32-bit) 1->2 1.022263
NEGS (register, 32-bit) 3->2 1.059463
NEGS (register, 32-bit) throughput 0.694140
NEGS (register, 64-bit) 1->2 0.990512
NEGS (register, 64-bit) 3->2 1.015714
NEGS (register, 64-bit) throughput 0.704142
NEGS (register, lsl, 32-bit) 1->2 1.999871
NEGS (register, lsl, 32-bit) 3->2 1.928226
NEGS (register, lsl, 32-bit) throughput 0.996164
NEGS (register, lsl, 64-bit) 1->2 1.993877
NEGS (register, lsl, 64-bit) 3->2 2.017083
NEGS (register, lsl, 64-bit) throughput 1.010918
NEGS (register, lsr, 32-bit) 1->2 2.038549
NEGS (register, lsr, 32-bit) 3->2 2.018510
NEGS (register, lsr, 32-bit) throughput 1.013227
NEGS (register, lsr, 64-bit) 1->2 2.029009
NEGS (register, lsr, 64-bit) 3->2 1.980736
NEGS (register, lsr, 64-bit) throughput 1.010742
NEGS (register, asr, 32-bit) 1->2 2.007103
NEGS (register, asr, 32-bit) 3->2 1.998093
NEGS (register, asr, 32-bit) throughput 1.005952
NEGS (register, asr, 64-bit) 1->2 2.059078
NEGS (register, asr, 64-bit) 3->2 2.073004
NEGS (register, asr, 64-bit) throughput 0.978139
NGC (register, 32-bit) 1->2 0.997031
NGC (register, 32-bit) 1->3 1.034986
NGC (register, 32-bit) throughput 0.503635
NGC (register, 64-bit) 1->2 0.984671
NGC (register, 64-bit) 1->3 1.037006
NGC (register, 64-bit) throughput 0.468107
NGCS (register, 32-bit) 1->2 1.101673
NGCS (register, 32-bit) 1->3 1.047002
NGCS (register, 32-bit) 3->2 0.962422
NGCS (register, 32-bit) 3->3 1.019873
NGCS (register, 32-bit) throughput 1.324228
NGCS (register, 32-bit) throughput 1.215830
NGCS (register, 64-bit) 1->2 0.976148
NGCS (register, 64-bit) 1->3 1.023024
NGCS (register, 64-bit) 3->2 1.019287
NGCS (register, 64-bit) 3->3 0.994054
NGCS (register, 64-bit) throughput 1.400749
NGCS (register, 64-bit) throughput 1.206793
NOP throughput 0.344755
ORN (register, 32-bit) 1->2 0.988865
ORN (register, 32-bit) 1->3 0.996737
ORN (register, 32-bit) throughput 0.502343
ORN (register, 64-bit) 1->2 1.004090
ORN (register, 64-bit) 1->3 1.002838
ORN (register, 64-bit) throughput 0.507296
ORN (register, lsl, 32-bit) 1->2 2.002500
ORN (register, lsl, 32-bit) 1->3 2.031274
ORN (register, lsl, 32-bit) throughput 1.003409
ORN (register, lsl, 64-bit) 1->2 2.000262
ORN (register, lsl, 64-bit) 1->3 2.058051
ORN (register, lsl, 64-bit) throughput 1.006899
ORN (register, lsr, 32-bit) 1->2 2.031945
ORN (register, lsr, 32-bit) 1->3 2.011339
ORN (register, lsr, 32-bit) throughput 1.009045
ORN (register, lsr, 64-bit) 1->2 1.969180
ORN (register, lsr, 64-bit) 1->3 2.058440
ORN (register, lsr, 64-bit) throughput 1.012134
ORN (register, asr, 32-bit) 1->2 2.044316
ORN (register, asr, 32-bit) 1->3 2.058993
ORN (register, asr, 32-bit) throughput 0.999094
ORN (register, asr, 64-bit) 1->2 2.049511
ORN (register, asr, 64-bit) 1->3 2.000569
ORN (register, asr, 64-bit) throughput 1.003979
ORN (register, ror, 32-bit) 1->2 2.028455
ORN (register, ror, 32-bit) 1->3 1.970518
ORN (register, ror, 32-bit) throughput 0.997469
ORN (register, ror, 64-bit) 1->2 1.998223
ORN (register, ror, 64-bit) 1->3 1.994437
ORN (register, ror, 64-bit) throughput 1.008887
ORR (immediate, 32-bit) 1->2 0.997324
ORR (immediate, 32-bit) throughput 0.506800
ORR (immediate, 64-bit) 1->2 1.006224
ORR (immediate, 64-bit) throughput 0.501054
ORR (register, 32-bit) 1->2 0.997120
ORR (register, 32-bit) 1->3 1.010859
ORR (register, 32-bit) throughput 0.502856
ORR (register, 64-bit) 1->2 0.998905
ORR (register, 64-bit) 1->3 1.007217
ORR (register, 64-bit) throughput 0.495476
ORR (register, lsl, 32-bit) 1->2 2.110541
ORR (register, lsl, 32-bit) 1->3 2.009559
ORR (register, lsl, 32-bit) throughput 0.961392
ORR (register, lsl, 64-bit) 1->2 1.998378
ORR (register, lsl, 64-bit) 1->3 2.036976
ORR (register, lsl, 64-bit) throughput 1.005420
ORR (register, lsr, 32-bit) 1->2 2.000473
ORR (register, lsr, 32-bit) 1->3 2.005124
ORR (register, lsr, 32-bit) throughput 0.988392
ORR (register, lsr, 64-bit) 1->2 2.018910
ORR (register, lsr, 64-bit) 1->3 2.030744
ORR (register, lsr, 64-bit) throughput 1.004932
ORR (register, asr, 32-bit) 1->2 2.013788
ORR (register, asr, 32-bit) 1->3 2.001206
ORR (register, asr, 32-bit) throughput 1.014340
ORR (register, asr, 64-bit) 1->2 2.032318
ORR (register, asr, 64-bit) 1->3 2.000304
ORR (register, asr, 64-bit) throughput 1.013947
ORR (register, ror, 32-bit) 1->2 2.002750
ORR (register, ror, 32-bit) 1->3 2.008731
ORR (register, ror, 32-bit) throughput 1.007724
ORR (register, ror, 64-bit) 1->2 2.033155
ORR (register, ror, 64-bit) 1->3 1.998070
ORR (register, ror, 64-bit) throughput 1.007996
PRFM (register, PLDL1KEEP) throughput 3.986847
PRFM (register, PLDL1KEEP) throughput 1.007763
PRFM (register, PLDL1STRM) throughput 4.597682
PRFM (register, PLDL1STRM) throughput 1.002881
PRFM (register, PLDL2KEEP) throughput 3.515471
PRFM (register, PLDL2KEEP) throughput 2.174587
PRFM (register, PLDL2STRM) throughput 3.485936
PRFM (register, PLDL2STRM) throughput 2.146856
PRFM (register, PLDL3KEEP) throughput 3.335165
PRFM (register, PLDL3KEEP) throughput 2.023664
PRFM (register, PLDL3STRM) throughput 3.290304
PRFM (register, PLDL3STRM) throughput 2.009286
PRFM (register, PLIL1KEEP) throughput 3.515846
PRFM (register, PLIL1KEEP) throughput 2.220636
PRFM (register, PLIL1STRM) throughput 3.974945
PRFM (register, PLIL1STRM) throughput 2.164438
PRFM (register, PLIL2KEEP) throughput 3.601024
PRFM (register, PLIL2KEEP) throughput 2.111450
PRFM (register, PLIL2STRM) throughput 3.479894
PRFM (register, PLIL2STRM) throughput 2.165973
PRFM (register, PLIL3KEEP) throughput 3.336266
PRFM (register, PLIL3KEEP) throughput 2.149180
PRFM (register, PLIL3STRM) throughput 3.368316
PRFM (register, PLIL3STRM) throughput 2.156700
PRFM (register, PSTL1KEEP) throughput 3.943530
PRFM (register, PSTL1KEEP) throughput 1.004560
PRFM (register, PSTL1STRM) throughput 4.335877
PRFM (register, PSTL1STRM) throughput 1.025412
PRFM (register, PSTL2KEEP) throughput 3.498117
PRFM (register, PSTL2KEEP) throughput 2.236191
PRFM (register, PSTL2STRM) throughput 3.493368
PRFM (register, PSTL2STRM) throughput 2.175155
PRFM (register, PSTL3KEEP) throughput 3.400666
PRFM (register, PSTL3KEEP) throughput 2.021109
PRFM (register, PSTL3STRM) throughput 3.370850
PRFM (register, PSTL3STRM) throughput 2.044863
PSSBB throughput 17.742926
RBIT (32-bit) 1->2 1.001773
RBIT (32-bit) throughput 0.491451
RBIT (64-bit) 1->2 0.995512
RBIT (64-bit) throughput 0.499946
REV (32-bit) 1->2 1.007455
REV (32-bit) throughput 0.500317
REV (64-bit) 1->2 0.998839
REV (64-bit) throughput 0.504681
REV16 (32-bit) 1->2 1.012279
REV16 (32-bit) throughput 0.506150
REV16 (64-bit) 1->2 0.998773
REV16 (64-bit) throughput 0.494626
REV32 1->2 1.004292
REV32 throughput 0.504542
ROR (immediate, 32-bit) 1->2 1.001425
ROR (immediate, 32-bit) throughput 0.505124
ROR (immediate, 64-bit) 1->2 0.998751
ROR (immediate, 64-bit) throughput 0.511169
ROR (register, 32-bit) 1->2 1.002706
ROR (register, 32-bit) 1->3 1.000129
ROR (register, 32-bit) throughput 0.538705
ROR (register, 64-bit) 1->2 1.000066
ROR (register, 64-bit) 1->3 1.000416
ROR (register, 64-bit) throughput 0.504569
SBC (32-bit) 1->2 1.006299
SBC (32-bit) 1->3 1.018122
SBC (32-bit) 1->4 1.050022
SBC (32-bit) throughput 0.505852
SBC (64-bit) 1->2 0.995534
SBC (64-bit) 1->3 1.041184
SBC (64-bit) 1->4 0.999745
SBC (64-bit) throughput 0.503993
SBCS (32-bit) 1->2 0.994929
SBCS (32-bit) 1->3 1.030072
SBCS (32-bit) 1->4 0.999718
SBCS (32-bit) 4->2 0.998038
SBCS (32-bit) 4->3 1.002144
SBCS (32-bit) 4->4 1.003968
SBCS (32-bit) throughput 1.393591
SBCS (32-bit) throughput 1.221082
SBCS (64-bit) 1->2 0.996983
SBCS (64-bit) 1->3 0.998774
SBCS (64-bit) 1->4 1.018107
SBCS (64-bit) 4->2 1.004970
SBCS (64-bit) 4->3 1.027721
SBCS (64-bit) 4->4 1.043388
SBCS (64-bit) throughput 1.394769
SBCS (64-bit) throughput 1.214808
SBFIZ (32-bit) 1->2 0.997150
SBFIZ (32-bit) throughput 0.501254
SBFIZ (64-bit) 1->2 0.990069
SBFIZ (64-bit) throughput 0.501864
SBFX (32-bit) 1->2 0.991299
SBFX (32-bit) throughput 0.500232
SBFX (64-bit) 1->2 0.977471
SBFX (64-bit) throughput 0.504415
SDIV (fast, 32-bit) 1->2 7.091602
SDIV (fast, 32-bit) 1->3 6.889388
SDIV (fast, 32-bit) throughput 6.959454
SDIV (slow, 32-bit) 1->2 20.155832
SDIV (slow, 32-bit) 1->3 20.143558
SDIV (slow, 32-bit) throughput 20.043998
SDIV (slow, 32-bit) 1->2 20.093463
SDIV (slow, 32-bit) 1->3 20.131564
SDIV (slow, 32-bit) throughput 20.181549
SDIV (fast, 64-bit) 1->2 7.041108
SDIV (fast, 64-bit) 1->3 7.023381
SDIV (fast, 64-bit) throughput 6.987216
SDIV (medium, 64-bit) 1->2 19.394689
SDIV (medium, 64-bit) 1->3 20.205211
SDIV (medium, 64-bit) throughput 20.179404
SDIV (medium, 64-bit) 1->2 20.178163
SDIV (medium, 64-bit) 1->3 19.559730
SDIV (medium, 64-bit) throughput 20.251444
SDIV (slow, 64-bit) 1->2 36.004227
SDIV (slow, 64-bit) 1->3 36.812225
SDIV (slow, 64-bit) throughput 35.062594
SDIV (slow, 64-bit) 1->2 36.267698
SDIV (slow, 64-bit) 1->3 35.982284
SDIV (slow, 64-bit) throughput 35.348887
SMADDL 1->2 3.012322
SMADDL 1->3 2.984304
SMADDL 1->4 0.992845
SMADDL throughput 1.011557
SMNEGL 1->2 3.051580
SMNEGL 1->3 3.008761
SMNEGL throughput 1.002101
SMSUBL 1->2 3.052942
SMSUBL 1->3 3.095829
SMSUBL 1->4 0.997583
SMSUBL throughput 0.988684
SMULH 1->2 3.981830
SMULH 1->3 4.058964
SMULH throughput 1.989850
SMULL 1->2 3.031260
SMULL 1->3 2.987869
SMULL throughput 1.000167
SSBB throughput 17.954255
STLRB throughput 4.993618
STLRH throughput 5.018081
STLXP (32-bit) throughput 3.265198
STLXP (32-bit) throughput 2.983491
STLXP (64-bit) throughput 3.756547
STLXP (64-bit) throughput 3.002096
STLXR (32-bit) throughput 3.357687
STLXR (32-bit) throughput 3.032819
STLXR (64-bit) throughput 3.543574
STLXR (64-bit) throughput 2.996474
STLXRB throughput 3.030090
STLXRB throughput 2.969336
STLXRH throughput 3.067380
STLXRH throughput 3.017659
STNP (32-bit) throughput 1.623523
STNP (32-bit) throughput 1.629658
STNP (64-bit) throughput 1.654872
STNP (64-bit) throughput 1.605381
STP (32-bit) throughput 1.301634
STP (64-bit) throughput 1.310045
STP (post-index, 32-bit) 3->3 1.325378
STP (post-index, 32-bit) throughput 1.312650
STP (post-index, 64-bit) 3->3 1.341117
STP (post-index, 64-bit) throughput 1.131411
STP (pre-index, 32-bit) 3->3 1.307511
STP (pre-index, 32-bit) throughput 1.343719
STP (pre-index, 64-bit) 3->3 1.351292
STP (pre-index, 64-bit) throughput 1.360616
STP (signed offset, 32-bit) throughput 1.303443
STP (signed offset, 64-bit) throughput 1.309789
STR (32-bit) throughput 1.317332
STR (64-bit) throughput 1.304332
STR (post-index, 32-bit) 2->2 1.317997
STR (post-index, 32-bit) throughput 1.389737
STR (post-index, 64-bit) 2->2 1.296906
STR (post-index, 64-bit) throughput 1.313127
STR (pre-index, 32-bit) 2->2 1.326055
STR (pre-index, 32-bit) throughput 1.329539
STR (pre-index, 64-bit) 2->2 1.316946
STR (pre-index, 64-bit) throughput 1.325778
STR (unsigned offset, 32-bit) throughput 1.299059
STR (unsigned offset, 64-bit) throughput 1.302797
STR (register, 32-bit) throughput 1.309067
STR (register, 64-bit) throughput 1.299706
STR (register, uxtw, 32-bit) throughput 1.297272
STR (register, uxtw, 64-bit) throughput 1.307501
STR (register, sxtw, 32-bit) throughput 1.315486
STR (register, sxtw, 64-bit) throughput 1.314574
STR (register, lsl, 32-bit) throughput 1.309826
STR (register, lsl, 64-bit) throughput 1.247463
STRB throughput 1.305440
STRB (post-index) 2->2 1.317801
STRB (post-index) throughput 1.313126
STRB (pre-index) 2->2 1.308737
STRB (pre-index) throughput 1.332777
STRB (unsigned offset) throughput 1.318997
STRB (register) throughput 1.296288
STRB (register, uxtw) throughput 1.303047
STRB (register, sxtw) throughput 1.293752
STRH throughput 1.311604
STRH (post-index) 2->2 1.322000
STRH (post-index) throughput 1.334765
STRH (pre-index) 2->2 1.301857
STRH (pre-index) throughput 1.335888
STRH (unsigned offset) throughput 1.301120
STRH (register) throughput 1.308421
STRH (register, uxtw) throughput 1.297228
STRH (register, sxtw) throughput 1.316197
STUR (32-bit) throughput 1.311619
STUR (64-bit) throughput 1.315147
STURB throughput 1.299996
STURH throughput 1.295586
STXP (64-bit) throughput 2.968561
STXP (64-bit) throughput 2.988119
STXR (32-bit) throughput 2.316949
STXR (32-bit) throughput 3.051856
STXR (64-bit) throughput 2.523231
STXR (64-bit) throughput 2.958454
STXRB throughput 2.154190
STXRB throughput 3.026010
STXRH throughput 2.061933
STXRH throughput 3.037008
SUB (sxtb, 32-bit) 1->2 2.028968
SUB (sxtb, 32-bit) 1->3 2.022052
SUB (sxtb, 32-bit) throughput 1.003985
SUB (sxtb, 64-bit) 1->2 2.062456
SUB (sxtb, 64-bit) 1->3 2.007082
SUB (sxtb, 64-bit) throughput 1.008219
SUB (uxtb, 32-bit) 1->2 2.013572
SUB (uxtb, 32-bit) 1->3 1.998071
SUB (uxtb, 32-bit) throughput 1.011570
SUB (uxtb, 64-bit) 1->2 2.020628
SUB (uxtb, 64-bit) 1->3 2.002365
SUB (uxtb, 64-bit) throughput 1.010586
SUB (sxth, 32-bit) 1->2 1.908815
SUB (sxth, 32-bit) 1->3 2.068727
SUB (sxth, 32-bit) throughput 1.008354
SUB (sxth, 64-bit) 1->2 2.015370
SUB (sxth, 64-bit) 1->3 2.016440
SUB (sxth, 64-bit) throughput 1.012354
SUB (uxth, 32-bit) 1->2 2.011405
SUB (uxth, 32-bit) 1->3 2.000417
SUB (uxth, 32-bit) throughput 1.014511
SUB (uxth, 64-bit) 1->2 1.998246
SUB (uxth, 64-bit) 1->3 2.018493
SUB (uxth, 64-bit) throughput 1.004645
SUB (sxtw, 32-bit) 1->2 0.995900
SUB (sxtw, 32-bit) 1->3 0.985586
SUB (sxtw, 32-bit) throughput 0.507496
SUB (sxtw, 64-bit) 1->2 1.997109
SUB (sxtw, 64-bit) 1->3 2.082160
SUB (sxtw, 64-bit) throughput 0.995020
SUB (uxtw, 32-bit) 1->2 1.225960
SUB (uxtw, 32-bit) 1->3 1.003079
SUB (uxtw, 32-bit) throughput 0.500851
SUB (uxtw, 64-bit) 1->2 2.028259
SUB (uxtw, 64-bit) 1->3 2.016029
SUB (uxtw, 64-bit) throughput 1.016495
SUB (uxtx, 64-bit) 1->2 0.998970
SUB (uxtx, 64-bit) 1->3 0.994604
SUB (uxtx, 64-bit) throughput 0.504904
SUB (sxtx, 64-bit) 1->2 1.003541
SUB (sxtx, 64-bit) 1->3 1.050796
SUB (sxtx, 64-bit) throughput 0.509152
SUB (immediate, 32-bit) 1->2 0.984442
SUB (immediate, 32-bit) throughput 0.500609
SUB (immediate, 64-bit) 1->2 0.986519
SUB (immediate, 64-bit) throughput 0.513428
SUB (shifted immediate, 32-bit) 1->2 1.021567
SUB (shifted immediate, 32-bit) throughput 0.505276
SUB (shifted immediate, 64-bit) 1->2 1.008930
SUB (shifted immediate, 64-bit) throughput 0.505390
SUB (register, 32-bit) 1->2 0.986456
SUB (register, 32-bit) 1->3 1.000613
SUB (register, 32-bit) throughput 0.499673
SUB (register, 64-bit) 1->2 0.994728
SUB (register, 64-bit) 1->3 0.995186
SUB (register, 64-bit) throughput 0.549387
SUB (register, lsl, 32-bit) 1->2 2.007060
SUB (register, lsl, 32-bit) 1->3 2.007201
SUB (register, lsl, 32-bit) throughput 1.011864
SUB (register, lsl, 64-bit) 1->2 2.004065
SUB (register, lsl, 64-bit) 1->3 2.015465
SUB (register, lsl, 64-bit) throughput 0.993700
SUB (register, lsr, 32-bit) 1->2 2.018297
SUB (register, lsr, 32-bit) 1->3 2.014869
SUB (register, lsr, 32-bit) throughput 1.015836
SUB (register, lsr, 64-bit) 1->2 2.032766
SUB (register, lsr, 64-bit) 1->3 1.995118
SUB (register, lsr, 64-bit) throughput 1.006461
SUB (register, asr, 32-bit) 1->2 2.045744
SUB (register, asr, 32-bit) 1->3 1.994463
SUB (register, asr, 32-bit) throughput 1.006441
SUB (register, asr, 64-bit) 1->2 2.004473
SUB (register, asr, 64-bit) 1->3 2.006773
SUB (register, asr, 64-bit) throughput 1.023835
SUBS (sxtb, 32-bit) 1->2 1.994482
SUBS (sxtb, 32-bit) 1->3 1.941896
SUBS (sxtb, 32-bit) 4->2 2.011105
SUBS (sxtb, 32-bit) 4->3 2.020295
SUBS (sxtb, 32-bit) throughput 1.011824
SUBS (sxtb, 64-bit) 1->2 2.011528
SUBS (sxtb, 64-bit) 1->3 2.015547
SUBS (sxtb, 64-bit) 4->2 2.015461
SUBS (sxtb, 64-bit) 4->3 2.025203
SUBS (sxtb, 64-bit) throughput 1.018512
SUBS (uxtb, 32-bit) 1->2 1.957742
SUBS (uxtb, 32-bit) 1->3 2.007600
SUBS (uxtb, 32-bit) 4->2 2.009017
SUBS (uxtb, 32-bit) 4->3 2.029060
SUBS (uxtb, 32-bit) throughput 0.993527
SUBS (uxtb, 64-bit) 1->2 2.028840
SUBS (uxtb, 64-bit) 1->3 2.001294
SUBS (uxtb, 64-bit) 4->2 2.022637
SUBS (uxtb, 64-bit) 4->3 2.181614
SUBS (uxtb, 64-bit) throughput 1.011328
SUBS (sxth, 32-bit) 1->2 2.001489
SUBS (sxth, 32-bit) 1->3 2.029052
SUBS (sxth, 32-bit) 4->2 2.022827
SUBS (sxth, 32-bit) 4->3 2.132333
SUBS (sxth, 32-bit) throughput 0.997502
SUBS (sxth, 64-bit) 1->2 1.940230
SUBS (sxth, 64-bit) 1->3 2.006950
SUBS (sxth, 64-bit) 4->2 2.026225
SUBS (sxth, 64-bit) 4->3 2.032140
SUBS (sxth, 64-bit) throughput 1.006579
SUBS (uxth, 32-bit) 1->2 2.024235
SUBS (uxth, 32-bit) 1->3 1.983973
SUBS (uxth, 32-bit) 4->2 2.039366
SUBS (uxth, 32-bit) 4->3 2.031664
SUBS (uxth, 32-bit) throughput 1.007411
SUBS (uxth, 64-bit) 1->2 1.990545
SUBS (uxth, 64-bit) 1->3 2.009302
SUBS (uxth, 64-bit) 4->2 1.998488
SUBS (uxth, 64-bit) 4->3 2.006288
SUBS (uxth, 64-bit) throughput 1.011150
SUBS (sxtw, 32-bit) 1->2 0.993160
SUBS (sxtw, 32-bit) 1->3 0.987266
SUBS (sxtw, 32-bit) 4->2 1.027440
SUBS (sxtw, 32-bit) 4->3 1.010220
SUBS (sxtw, 32-bit) throughput 0.692947
SUBS (sxtw, 64-bit) 1->2 2.028740
SUBS (sxtw, 64-bit) 1->3 2.016122
SUBS (sxtw, 64-bit) 4->2 2.019401
SUBS (sxtw, 64-bit) 4->3 1.971867
SUBS (sxtw, 64-bit) throughput 1.014582
SUBS (uxtw, 32-bit) 1->2 0.999246
SUBS (uxtw, 32-bit) 1->3 0.991171
SUBS (uxtw, 32-bit) 4->2 1.011213
SUBS (uxtw, 32-bit) 4->3 0.998113
SUBS (uxtw, 32-bit) throughput 0.695417
SUBS (uxtw, 64-bit) 1->2 1.989032
SUBS (uxtw, 64-bit) 1->3 1.995315
SUBS (uxtw, 64-bit) 4->2 2.026973
SUBS (uxtw, 64-bit) 4->3 2.033164
SUBS (uxtw, 64-bit) throughput 1.000916
SUBS (uxtx, 64-bit) 1->2 1.005220
SUBS (uxtx, 64-bit) 1->3 1.003167
SUBS (uxtx, 64-bit) 4->2 1.040271
SUBS (uxtx, 64-bit) 4->3 1.063937
SUBS (uxtx, 64-bit) throughput 0.748907
SUBS (sxtx, 64-bit) 1->2 1.162885
SUBS (sxtx, 64-bit) 1->3 0.932625
SUBS (sxtx, 64-bit) 4->2 0.966853
SUBS (sxtx, 64-bit) 4->3 1.027689
SUBS (sxtx, 64-bit) throughput 0.715815
SUBS (immediate, 32-bit) 1->2 0.990309
SUBS (immediate, 32-bit) 3->2 1.017803
SUBS (immediate, 32-bit) throughput 0.693814
SUBS (immediate, 64-bit) 1->2 0.996059
SUBS (immediate, 64-bit) 3->2 1.040072
SUBS (immediate, 64-bit) throughput 0.697542
SUBS (shifted immediate, 32-bit) 1->2 1.004537
SUBS (shifted immediate, 32-bit) 3->2 1.029406
SUBS (shifted immediate, 32-bit) throughput 0.692940
SUBS (shifted immediate, 64-bit) 1->2 0.998861
SUBS (shifted immediate, 64-bit) 3->2 0.997523
SUBS (shifted immediate, 64-bit) throughput 0.699309
SUBS (register, 32-bit) 1->2 0.993789
SUBS (register, 32-bit) 1->3 1.006864
SUBS (register, 32-bit) 4->2 1.368141
SUBS (register, 32-bit) 4->3 1.002423
SUBS (register, 32-bit) throughput 0.688424
SUBS (register, 64-bit) 1->2 0.997193
SUBS (register, 64-bit) 1->3 1.001820
SUBS (register, 64-bit) 4->2 0.972127
SUBS (register, 64-bit) 4->3 1.024015
SUBS (register, 64-bit) throughput 0.689549
SUBS (register, lsl, 32-bit) 1->2 1.969119
SUBS (register, lsl, 32-bit) 1->3 2.036333
SUBS (register, lsl, 32-bit) 4->2 2.027163
SUBS (register, lsl, 32-bit) 4->3 1.904781
SUBS (register, lsl, 32-bit) throughput 1.013826
SUBS (register, lsl, 64-bit) 1->2 2.007545
SUBS (register, lsl, 64-bit) 1->3 2.006664
SUBS (register, lsl, 64-bit) 4->2 2.012141
SUBS (register, lsl, 64-bit) 4->3 1.947231
SUBS (register, lsl, 64-bit) throughput 1.006104
SUBS (register, lsr, 32-bit) 1->2 2.001251
SUBS (register, lsr, 32-bit) 1->3 1.998289
SUBS (register, lsr, 32-bit) 4->2 1.995401
SUBS (register, lsr, 32-bit) 4->3 2.087861
SUBS (register, lsr, 32-bit) throughput 0.983165
SUBS (register, lsr, 64-bit) 1->2 1.986526
SUBS (register, lsr, 64-bit) 1->3 2.003114
SUBS (register, lsr, 64-bit) 4->2 2.064560
SUBS (register, lsr, 64-bit) 4->3 2.056404
SUBS (register, lsr, 64-bit) throughput 1.009377
SUBS (register, asr, 32-bit) 1->2 2.055502
SUBS (register, asr, 32-bit) 1->3 2.000811
SUBS (register, asr, 32-bit) 4->2 2.051997
SUBS (register, asr, 32-bit) 4->3 2.041100
SUBS (register, asr, 32-bit) throughput 1.001872
SUBS (register, asr, 64-bit) 1->2 1.946877
SUBS (register, asr, 64-bit) 1->3 2.016297
SUBS (register, asr, 64-bit) 4->2 2.002303
SUBS (register, asr, 64-bit) 4->3 2.001711
SUBS (register, asr, 64-bit) throughput 0.998324
SXTB (32-bit) 1->2 0.962689
SXTB (32-bit) throughput 0.510085
SXTB (64-bit) 1->2 1.004468
SXTB (64-bit) throughput 0.505368
SXTH (32-bit) 1->2 1.004553
SXTH (32-bit) throughput 0.506993
SXTH (64-bit) 1->2 1.000132
SXTH (64-bit) throughput 0.501385
SXTW 1->2 1.003048
SXTW throughput 0.491165
TBNZ (not taken) throughput 0.718616
TBNZ (taken) throughput 3.941054
TBZ (not taken) throughput 0.715419
TBZ (taken) throughput 3.975367
TST (immediate, 32-bit) 2->1 1.032814
TST (immediate, 32-bit) throughput 0.667282
TST (immediate, 64-bit) 2->1 0.977192
TST (immediate, 64-bit) throughput 0.669894
TST (register, 32-bit) 3->1 0.998292
TST (register, 32-bit) 3->2 0.998267
TST (register, 32-bit) throughput 0.678333
TST (register, 64-bit) 3->1 1.022280
TST (register, 64-bit) 3->2 0.990647
TST (register, 64-bit) throughput 0.670197
TST (register, lsl, 32-bit) 3->1 2.057407
TST (register, lsl, 32-bit) 3->2 1.997828
TST (register, lsl, 32-bit) throughput 1.000786
TST (register, lsl, 64-bit) 3->1 2.031343
TST (register, lsl, 64-bit) 3->2 2.008092
TST (register, lsl, 64-bit) throughput 1.007465
TST (register, lsr, 32-bit) 3->1 2.027355
TST (register, lsr, 32-bit) 3->2 1.990792
TST (register, lsr, 32-bit) throughput 0.998200
TST (register, lsr, 64-bit) 3->1 2.022151
TST (register, lsr, 64-bit) 3->2 1.970962
TST (register, lsr, 64-bit) throughput 1.011328
TST (register, asr, 32-bit) 3->1 1.993933
TST (register, asr, 32-bit) 3->2 2.040365
TST (register, asr, 32-bit) throughput 1.010991
TST (register, asr, 64-bit) 3->1 2.069563
TST (register, asr, 64-bit) 3->2 2.014430
TST (register, asr, 64-bit) throughput 1.056777
UBFIZ (32-bit) 1->2 1.006937
UBFIZ (32-bit) throughput 0.501135
UBFIZ (64-bit) 1->2 1.003400
UBFIZ (64-bit) throughput 0.508060
UBFX (32-bit) 1->2 1.030094
UBFX (32-bit) throughput 0.503354
UBFX (64-bit) 1->2 1.000439
UBFX (64-bit) throughput 0.509494
UDIV (fast, 32-bit) 1->2 6.987534
UDIV (fast, 32-bit) 1->3 7.080969
UDIV (fast, 32-bit) throughput 6.906448
UDIV (slow, 32-bit) 1->2 20.033958
UDIV (slow, 32-bit) 1->3 20.136273
UDIV (slow, 32-bit) throughput 20.152933
UDIV (fast, 64-bit) 1->2 7.084540
UDIV (fast, 64-bit) 1->3 6.904647
UDIV (fast, 64-bit) throughput 7.069000
UDIV (medium, 64-bit) 1->2 20.153040
UDIV (medium, 64-bit) 1->3 20.176154
UDIV (medium, 64-bit) throughput 19.176891
UDIV (slow, 64-bit) 1->2 36.615219
UDIV (slow, 64-bit) 1->3 35.385052
UDIV (slow, 64-bit) throughput 36.007720
UMADDL 1->2 3.007907
UMADDL 1->3 3.046113
UMADDL 1->4 0.998301
UMADDL throughput 0.991808
UMNEGL 1->2 3.073124
UMNEGL 1->3 3.027567
UMNEGL throughput 1.006605
UMSUBL 1->2 3.004029
UMSUBL 1->3 3.007454
UMSUBL 1->4 1.012594
UMSUBL throughput 1.006393
UMULH 1->2 3.533482
UMULH 1->3 4.018509
UMULH throughput 2.017083
UMULL 1->2 3.042159
UMULL 1->3 3.013704
UMULL throughput 1.002097
UXTB 1->2 1.008607
UXTB throughput 0.524309
UXTH 1->2 1.001381
UXTH throughput 0.498791
YIELD throughput 0.338875
ABS (vector, 8B) 1->2 3.007250
ABS (vector, 8B) throughput 1.015256
ABS (vector, 16B) 1->2 2.997544
ABS (vector, 16B) throughput 1.005269
ABS (vector, 4H) 1->2 2.939025
ABS (vector, 4H) throughput 1.002923
ABS (vector, 8H) 1->2 3.025956
ABS (vector, 8H) throughput 1.020965
ABS (vector, 2S) 1->2 2.999697
ABS (vector, 2S) throughput 1.022875
ABS (vector, 4S) 1->2 2.996147
ABS (vector, 4S) throughput 1.068286
ABS (vector, 2D) 1->2 2.991116
ABS (vector, 2D) throughput 1.007554
ABS (vector, D) 1->2 3.007523
ABS (vector, D) throughput 1.013336
ADD (vector, 8B) 1->2 2.033059
ADD (vector, 8B) 1->3 2.028099
ADD (vector, 8B) throughput 1.007672
ADD (vector, 16B) 1->2 2.036088
ADD (vector, 16B) 1->3 1.998401
ADD (vector, 16B) throughput 1.011445
ADD (vector, 4H) 1->2 1.996914
ADD (vector, 4H) 1->3 2.018701
ADD (vector, 4H) throughput 1.013600
ADD (vector, 8H) 1->2 2.020899
ADD (vector, 8H) 1->3 2.024322
ADD (vector, 8H) throughput 1.010555
ADD (vector, 2S) 1->2 1.999805
ADD (vector, 2S) 1->3 2.015027
ADD (vector, 2S) throughput 1.008079
ADD (vector, 4S) 1->2 2.001659
ADD (vector, 4S) 1->3 1.989961
ADD (vector, 4S) throughput 1.018138
ADD (vector, 2D) 1->2 2.032458
ADD (vector, 2D) 1->3 1.994197
ADD (vector, 2D) throughput 1.002314
ADD (vector, D) 1->2 2.159004
ADD (vector, D) 1->3 1.995849
ADD (vector, D) throughput 1.012542
ADDHN (vector, 8H) 1->2 3.983935
ADDHN (vector, 8H) 1->3 4.059027
ADDHN (vector, 8H) throughput 1.013663
ADDHN (vector, 4S) 1->2 4.004232
ADDHN (vector, 4S) 1->3 4.038881
ADDHN (vector, 4S) throughput 1.003135
ADDHN (vector, 2D) 1->2 4.015611
ADDHN (vector, 2D) 1->3 4.106365
ADDHN (vector, 2D) throughput 1.009300
ADDHN2 (vector, 8H) 1->1 4.003108
ADDHN2 (vector, 8H) 1->2 3.985854
ADDHN2 (vector, 8H) 1->3 4.035528
ADDHN2 (vector, 8H) throughput 1.002281
ADDHN2 (vector, 8H) throughput 1.006784
ADDHN2 (vector, 4S) 1->1 3.937920
ADDHN2 (vector, 4S) 1->2 4.019440
ADDHN2 (vector, 4S) 1->3 4.030841
ADDHN2 (vector, 4S) throughput 1.012815
ADDHN2 (vector, 4S) throughput 0.924417
ADDHN2 (vector, 2D) 1->1 3.999506
ADDHN2 (vector, 2D) 1->2 4.010053
ADDHN2 (vector, 2D) 1->3 4.030814
ADDHN2 (vector, 2D) throughput 0.991461
ADDHN2 (vector, 2D) throughput 1.010206
ADDP (scalar) 1->2 2.003254
ADDP (scalar) throughput 1.006389
ADDP (vector, 8B) 1->2 1.979064
ADDP (vector, 8B) 1->3 2.053154
ADDP (vector, 8B) throughput 1.005871
ADDP (vector, 16B) 1->2 2.037051
ADDP (vector, 16B) 1->3 2.017942
ADDP (vector, 16B) throughput 1.006277
ADDP (vector, 4H) 1->2 1.997258
ADDP (vector, 4H) 1->3 1.971871
ADDP (vector, 4H) throughput 0.984254
ADDP (vector, 8H) 1->2 2.005846
ADDP (vector, 8H) 1->3 2.010812
ADDP (vector, 8H) throughput 0.993541
ADDP (vector, 2S) 1->2 2.007545
ADDP (vector, 2S) 1->3 2.053019
ADDP (vector, 2S) throughput 0.985483
ADDP (vector, 4S) 1->2 2.034625
ADDP (vector, 4S) 1->3 2.053833
ADDP (vector, 4S) throughput 0.983663
ADDP (vector, 2D) 1->2 1.998465
ADDP (vector, 2D) 1->3 2.020057
ADDP (vector, 2D) throughput 1.007501
ADDV (8B) 1->2 2.971574
ADDV (8B) throughput 1.007575
ADDV (16B) 1->2 2.979880
ADDV (16B) throughput 1.007326
ADDV (4H) 1->2 2.952905
ADDV (4H) throughput 1.015763
ADDV (8H) 1->2 3.019495
ADDV (8H) throughput 0.994723
ADDV (4S) 1->2 2.998971
ADDV (4S) throughput 1.010202
AESD 1->1 5.039451
AESD 1->2 5.004992
AESD throughput 2.022245
AESD throughput 2.003695
AESD + AESIMC 1->1 5.034084
AESD + AESIMC 1->2 5.022808
AESD + AESIMC throughput 2.018294
AESD + AESIMC throughput 2.018709
AESE 1->1 4.996964
AESE 1->2 5.038355
AESE throughput 2.031805
AESE throughput 2.012394
AESE + AESMC 1->1 5.032591
AESE + AESMC 1->2 4.920683
AESE + AESMC throughput 1.995480
AESE + AESMC throughput 1.979431
AESIMC 1->2 2.014255
AESIMC throughput 0.987556
AESMC 1->2 2.006111
AESMC throughput 1.004387
AND (vector, 8B) 1->2 1.957855
AND (vector, 8B) 1->3 2.032406
AND (vector, 8B) throughput 1.008990
AND (vector, 16B) 1->2 2.018490
AND (vector, 16B) 1->3 2.062440
AND (vector, 16B) throughput 1.242809
BIC (vector, immediate, 4H) 1->1 2.025167
BIC (vector, immediate, 4H) throughput 1.182451
BIC (vector, immediate, 4H) throughput 0.945917
BIC (vector, immediate, 8H) 1->1 2.614613
BIC (vector, immediate, 8H) throughput 0.877937
BIC (vector, immediate, 8H) throughput 1.046293
BIC (vector, immediate, 2S) 1->1 2.014907
BIC (vector, immediate, 2S) throughput 1.021426
BIC (vector, immediate, 2S) throughput 1.002870
BIC (vector, immediate, 4S) 1->1 2.029656
BIC (vector, immediate, 4S) throughput 0.998721
BIC (vector, immediate, 4S) throughput 1.011562
BIC (vector, register, 8B) 1->2 2.018957
BIC (vector, register, 8B) 1->3 1.919775
BIC (vector, register, 8B) throughput 0.996863
BIC (vector, register, 16B) 1->2 1.985062
BIC (vector, register, 16B) 1->3 2.006509
BIC (vector, register, 16B) throughput 1.001146
BIF (vector, 8B) 1->1 2.005091
BIF (vector, 8B) 1->2 2.006887
BIF (vector, 8B) 1->3 2.013004
BIF (vector, 8B) throughput 0.987802
BIF (vector, 8B) throughput 1.010952
BIF (vector, 16B) 1->1 2.037882
BIF (vector, 16B) 1->2 1.998377
BIF (vector, 16B) 1->3 1.993803
BIF (vector, 16B) throughput 1.011696
BIF (vector, 16B) throughput 0.988295
BIT (vector, 8B) 1->1 2.010047
BIT (vector, 8B) 1->2 2.010549
BIT (vector, 8B) 1->3 2.120874
BIT (vector, 8B) throughput 1.004719
BIT (vector, 8B) throughput 0.937619
BIT (vector, 16B) 1->1 1.998202
BIT (vector, 16B) 1->2 2.029733
BIT (vector, 16B) 1->3 2.005593
BIT (vector, 16B) throughput 1.007276
BIT (vector, 16B) throughput 1.005503
BSL (vector, 8B) 1->1 1.957963
BSL (vector, 8B) 1->2 1.987802
BSL (vector, 8B) 1->3 2.098683
BSL (vector, 8B) throughput 1.002056
BSL (vector, 8B) throughput 1.023296
BSL (vector, 16B) 1->1 2.042226
BSL (vector, 16B) 1->2 2.008775
BSL (vector, 16B) 1->3 2.055397
BSL (vector, 16B) throughput 0.835919
BSL (vector, 16B) throughput 1.005752
CLS (vector, 8B) 1->2 2.002580
CLS (vector, 8B) throughput 1.009771
CLS (vector, 16B) 1->2 2.022808
CLS (vector, 16B) throughput 1.003902
CLS (vector, 4H) 1->2 1.979671
CLS (vector, 4H) throughput 0.988978
CLS (vector, 8H) 1->2 2.045748
CLS (vector, 8H) throughput 0.993252
CLS (vector, 2S) 1->2 1.993378
CLS (vector, 2S) throughput 1.008890
CLS (vector, 4S) 1->2 1.973815
CLS (vector, 4S) throughput 0.998547
CLZ (vector, 8B) 1->2 2.026835
CLZ (vector, 8B) throughput 0.994124
CLZ (vector, 16B) 1->2 2.039297
CLZ (vector, 16B) throughput 1.006855
CLZ (vector, 4H) 1->2 1.884179
CLZ (vector, 4H) throughput 1.008123
CLZ (vector, 8H) 1->2 2.070992
CLZ (vector, 8H) throughput 1.013810
CLZ (vector, 2S) 1->2 2.011339
CLZ (vector, 2S) throughput 1.011075
CLZ (vector, 4S) 1->2 2.020322
CLZ (vector, 4S) throughput 0.989674
CMEQ (register, 8B) 1->2 3.017637
CMEQ (register, 8B) 1->3 3.001862
CMEQ (register, 8B) throughput 1.001866
CMEQ (register, 16B) 1->2 3.503537
CMEQ (register, 16B) 1->3 3.003091
CMEQ (register, 16B) throughput 0.980771
CMEQ (register, 4H) 1->2 3.051864
CMEQ (register, 4H) 1->3 3.051241
CMEQ (register, 4H) throughput 1.000321
CMEQ (register, 8H) 1->2 3.066953
CMEQ (register, 8H) 1->3 2.998652
CMEQ (register, 8H) throughput 1.007762
CMEQ (register, 2S) 1->2 2.948785
CMEQ (register, 2S) 1->3 2.937869
CMEQ (register, 2S) throughput 1.041115
CMEQ (register, 4S) 1->2 2.966070
CMEQ (register, 4S) 1->3 2.995876
CMEQ (register, 4S) throughput 1.004902
CMEQ (register, 2D) 1->2 2.991247
CMEQ (register, 2D) 1->3 3.026451
CMEQ (register, 2D) throughput 1.013013
CMEQ (register, D) 1->2 3.014670
CMEQ (register, D) 1->3 3.029199
CMEQ (register, D) throughput 1.010273
CMEQ (zero, 8B) 1->2 3.044741
CMEQ (zero, 8B) throughput 0.988561
CMEQ (zero, 16B) 1->2 3.015411
CMEQ (zero, 16B) throughput 1.010857
CMEQ (zero, 4H) 1->2 3.023272
CMEQ (zero, 4H) throughput 1.003591
CMEQ (zero, 8H) 1->2 2.998620
CMEQ (zero, 8H) throughput 1.012617
CMEQ (zero, 2S) 1->2 3.048844
CMEQ (zero, 2S) throughput 1.002140
CMEQ (zero, 4S) 1->2 3.010727
CMEQ (zero, 4S) throughput 1.007749
CMEQ (zero, 2D) 1->2 3.021259
CMEQ (zero, 2D) throughput 1.006377
CMEQ (zero, D) 1->2 3.015079
CMEQ (zero, D) throughput 0.993009
CMGE (register, 8B) 1->2 3.003197
CMGE (register, 8B) 1->3 2.994217
CMGE (register, 8B) throughput 1.003785
CMGE (register, 16B) 1->2 3.021376
CMGE (register, 16B) 1->3 3.125078
CMGE (register, 16B) throughput 0.996632
CMGE (register, 4H) 1->2 2.963498
CMGE (register, 4H) 1->3 3.048888
CMGE (register, 4H) throughput 1.023741
CMGE (register, 8H) 1->2 3.024323
CMGE (register, 8H) 1->3 3.006578
CMGE (register, 8H) throughput 0.998227
CMGE (register, 2S) 1->2 2.958471
CMGE (register, 2S) 1->3 3.057998
CMGE (register, 2S) throughput 1.017653
CMGE (register, 4S) 1->2 3.004656
CMGE (register, 4S) 1->3 3.019552
CMGE (register, 4S) throughput 0.996919
CMGE (register, 2D) 1->2 3.074881
CMGE (register, 2D) 1->3 3.014300
CMGE (register, 2D) throughput 1.006788
CMGE (register, D) 1->2 3.014212
CMGE (register, D) 1->3 3.083429
CMGE (register, D) throughput 1.170342
CMGE (zero, 8B) 1->2 2.716496
CMGE (zero, 8B) throughput 1.002402
CMGE (zero, 16B) 1->2 3.001949
CMGE (zero, 16B) throughput 1.009566
CMGE (zero, 4H) 1->2 3.021817
CMGE (zero, 4H) throughput 1.008171
CMGE (zero, 8H) 1->2 3.029009
CMGE (zero, 8H) throughput 1.019968
CMGE (zero, 2S) 1->2 2.947408
CMGE (zero, 2S) throughput 1.050242
CMGE (zero, 4S) 1->2 2.298464
CMGE (zero, 4S) throughput 1.058524
CMGE (zero, 2D) 1->2 3.025326
CMGE (zero, 2D) throughput 1.013574
CMGE (zero, D) 1->2 3.045478
CMGE (zero, D) throughput 1.039020
CMGT (register, 8B) 1->2 3.121036
CMGT (register, 8B) 1->3 3.013067
CMGT (register, 8B) throughput 1.010115
CMGT (register, 16B) 1->2 3.037771
CMGT (register, 16B) 1->3 3.023097
CMGT (register, 16B) throughput 1.012334
CMGT (register, 4H) 1->2 3.016516
CMGT (register, 4H) 1->3 3.025002
CMGT (register, 4H) throughput 0.997583
CMGT (register, 8H) 1->2 3.009520
CMGT (register, 8H) 1->3 2.998078
CMGT (register, 8H) throughput 1.004880
CMGT (register, 2S) 1->2 3.017148
CMGT (register, 2S) 1->3 3.016181
CMGT (register, 2S) throughput 1.027698
CMGT (register, 4S) 1->2 3.006284
CMGT (register, 4S) 1->3 3.098666
CMGT (register, 4S) throughput 1.004115
CMGT (register, 2D) 1->2 3.012786
CMGT (register, 2D) 1->3 3.024081
CMGT (register, 2D) throughput 1.007721
CMGT (register, D) 1->2 3.004547
CMGT (register, D) 1->3 2.994373
CMGT (register, D) throughput 1.000727
CMGT (zero, 8B) 1->2 3.090299
CMGT (zero, 8B) throughput 1.008163
CMGT (zero, 16B) 1->2 3.004462
CMGT (zero, 16B) throughput 1.010336
CMGT (zero, 4H) 1->2 2.971203
CMGT (zero, 4H) throughput 1.014112
CMGT (zero, 8H) 1->2 2.991186
CMGT (zero, 8H) throughput 0.980895
CMGT (zero, 2S) 1->2 2.924302
CMGT (zero, 2S) throughput 1.018588
CMGT (zero, 4S) 1->2 3.041080
CMGT (zero, 4S) throughput 1.017192
CMGT (zero, 2D) 1->2 2.932733
CMGT (zero, 2D) throughput 1.007295
CMGT (zero, D) 1->2 3.032086
CMGT (zero, D) throughput 1.020047
CMHI (register, 8B) 1->2 3.034346
CMHI (register, 8B) 1->3 2.997679
CMHI (register, 8B) throughput 1.012243
CMHI (register, 16B) 1->2 3.012712
CMHI (register, 16B) 1->3 3.675759
CMHI (register, 16B) throughput 1.117940
CMHI (register, 4H) 1->2 3.069672
CMHI (register, 4H) 1->3 3.136666
CMHI (register, 4H) throughput 1.120472
CMHI (register, 8H) 1->2 3.050667
CMHI (register, 8H) 1->3 3.011052
CMHI (register, 8H) throughput 1.135903
CMHI (register, 2S) 1->2 2.977206
CMHI (register, 2S) 1->3 2.635471
CMHI (register, 2S) throughput 1.020468
CMHI (register, 4S) 1->2 3.065508
CMHI (register, 4S) 1->3 2.993282
CMHI (register, 4S) throughput 1.128183
CMHI (register, 2D) 1->2 3.103759
CMHI (register, 2D) 1->3 3.002096
CMHI (register, 2D) throughput 1.127916
CMHI (register, D) 1->2 3.030767
CMHI (register, D) 1->3 2.642310
CMHI (register, D) throughput 1.046791
CMHS (register, 8B) 1->2 2.996946
CMHS (register, 8B) 1->3 3.005851
CMHS (register, 8B) throughput 1.156565
CMHS (register, 16B) 1->2 3.011669
CMHS (register, 16B) 1->3 3.019267
CMHS (register, 16B) throughput 1.092231
CMHS (register, 4H) 1->2 2.987132
CMHS (register, 4H) 1->3 3.022592
CMHS (register, 4H) throughput 1.277792
CMHS (register, 8H) 1->2 3.140781
CMHS (register, 8H) 1->3 3.057998
CMHS (register, 8H) throughput 1.067974
CMHS (register, 2S) 1->2 3.095741
CMHS (register, 2S) 1->3 3.188748
CMHS (register, 2S) throughput 1.085919
CMHS (register, 4S) 1->2 2.902468
CMHS (register, 4S) 1->3 3.045540
CMHS (register, 4S) throughput 1.115180
CMHS (register, 2D) 1->2 3.977539
CMHS (register, 2D) 1->3 4.083088
CMHS (register, 2D) throughput 1.118848
CMHS (register, D) 1->2 2.942961
CMHS (register, D) 1->3 2.919819
CMHS (register, D) throughput 0.928489
CMLE (zero, 8B) 1->2 3.020901
CMLE (zero, 8B) throughput 1.145843
CMLE (zero, 16B) 1->2 3.032013
CMLE (zero, 16B) throughput 1.113597
CMLE (zero, 4H) 1->2 2.840190
CMLE (zero, 4H) throughput 1.155493
CMLE (zero, 8H) 1->2 3.024597
CMLE (zero, 8H) throughput 1.138283
CMLE (zero, 2S) 1->2 3.094546
CMLE (zero, 2S) throughput 1.026355
CMLE (zero, 4S) 1->2 3.059740
CMLE (zero, 4S) throughput 1.030018
CMLE (zero, 2D) 1->2 3.002029
CMLE (zero, 2D) throughput 1.019380
CMLE (zero, D) 1->2 3.095277
CMLE (zero, D) throughput 1.046601
CMLT (zero, 8B) 1->2 3.019816
CMLT (zero, 8B) throughput 1.002855
CMLT (zero, 16B) 1->2 2.998929
CMLT (zero, 16B) throughput 1.037633
CMLT (zero, 4H) 1->2 3.101886
CMLT (zero, 4H) throughput 1.030073
CMLT (zero, 8H) 1->2 3.032422
CMLT (zero, 8H) throughput 0.886951
CMLT (zero, 2S) 1->2 3.000154
CMLT (zero, 2S) throughput 0.985943
CMLT (zero, 4S) 1->2 2.938466
CMLT (zero, 4S) throughput 0.997628
CMLT (zero, 2D) 1->2 3.048376
CMLT (zero, 2D) throughput 1.018682
CMLT (zero, D) 1->2 3.001443
CMLT (zero, D) throughput 1.012767
CMTST (register, 8B) 1->2 2.038135
CMTST (register, 8B) 1->3 2.043341
CMTST (register, 8B) throughput 1.015485
CMTST (register, 16B) 1->2 1.996475
CMTST (register, 16B) 1->3 1.977750
CMTST (register, 16B) throughput 1.003309
CMTST (register, 4H) 1->2 2.097947
CMTST (register, 4H) 1->3 2.014694
CMTST (register, 4H) throughput 1.007663
CMTST (register, 8H) 1->2 2.229339
CMTST (register, 8H) 1->3 2.002552
CMTST (register, 8H) throughput 1.033817
CMTST (register, 2S) 1->2 1.992121
CMTST (register, 2S) 1->3 2.023400
CMTST (register, 2S) throughput 1.006595
CMTST (register, 4S) 1->2 2.014266
CMTST (register, 4S) 1->3 1.966606
CMTST (register, 4S) throughput 1.020447
CMTST (register, 2D) 1->2 2.034957
CMTST (register, 2D) 1->3 2.011556
CMTST (register, 2D) throughput 1.009277
CMTST (register, D) 1->2 2.026841
CMTST (register, D) 1->3 1.973072
CMTST (register, D) throughput 1.021966
CNT (8B) 1->2 2.016255
CNT (8B) throughput 1.007778
CNT (16B) 1->2 2.067843
CNT (16B) throughput 0.991455
DUP (element, vector, 8B) 1->2 2.012469
DUP (element, vector, 8B) throughput 1.006932
DUP (element, vector, 16B) 1->2 2.027843
DUP (element, vector, 16B) throughput 1.013051
DUP (element, vector, 4H) 1->2 1.984385
DUP (element, vector, 4H) throughput 0.993924
DUP (element, vector, 8H) 1->2 2.089872
DUP (element, vector, 8H) throughput 1.005545
DUP (element, vector, 2S) 1->2 2.030527
DUP (element, vector, 2S) throughput 1.013533
DUP (element, vector, 4S) 1->2 2.008498
DUP (element, vector, 4S) throughput 1.005530
DUP (element, vector, 2D) 1->2 1.994050
DUP (element, vector, 2D) throughput 1.009440
DUP (element, scalar, B) 1->2 2.013076
DUP (element, scalar, B) throughput 1.009509
DUP (element, scalar, H) 1->2 1.991251
DUP (element, scalar, H) throughput 1.005150
DUP (element, scalar, S) 1->2 2.011195
DUP (element, scalar, S) throughput 0.994445
DUP (element, scalar, D) 1->2 2.005768
DUP (element, scalar, D) throughput 1.010762
DUP (general, 8B) 1->2 roundtrip 9.077797
DUP (general, 8B) throughput 1.011090
DUP (general, 16B) 1->2 roundtrip 9.070450
DUP (general, 16B) throughput 1.007342
DUP (general, 4H) 1->2 roundtrip 9.107848
DUP (general, 4H) throughput 1.018217
DUP (general, 8H) 1->2 roundtrip 8.951617
DUP (general, 8H) throughput 1.007653
DUP (general, 2S) 1->2 roundtrip 8.921037
DUP (general, 2S) throughput 1.009119
DUP (general, 4S) 1->2 roundtrip 9.172240
DUP (general, 4S) throughput 1.012397
DUP (general, 2D) 1->2 roundtrip 9.059852
DUP (general, 2D) throughput 1.008670
EOR (vector, 8B) 1->2 1.998796
EOR (vector, 8B) 1->3 2.023973
EOR (vector, 8B) throughput 1.002293
EOR (vector, 16B) 1->2 1.932684
EOR (vector, 16B) 1->3 2.011900
EOR (vector, 16B) throughput 1.045204
EXT (vector, 8B) 1->2 2.009566
EXT (vector, 8B) 1->3 2.018625
EXT (vector, 8B) throughput 1.004671
EXT (vector, 16B) 1->2 1.974049
EXT (vector, 16B) 1->3 2.017323
EXT (vector, 16B) throughput 1.003258
FABD (vector, 2S) 1->2 4.020232
FABD (vector, 2S) 1->3 4.062504
FABD (vector, 2S) throughput 0.992576
FABD (vector, 4S) 1->2 4.058655
FABD (vector, 4S) 1->3 3.973624
FABD (vector, 4S) throughput 1.010846
FABD (vector, 2D) 1->2 4.013813
FABD (vector, 2D) 1->3 4.087835
FABD (vector, 2D) throughput 1.013977
FABD (scalar, S) 1->2 4.011514
FABD (scalar, S) 1->3 4.064278
FABD (scalar, S) throughput 1.000141
FABD (scalar, D) 1->2 4.014078
FABD (scalar, D) 1->3 4.007827
FABD (scalar, D) throughput 0.952025
FABS (scalar, S) 1->2 2.079151
FABS (scalar, S) throughput 1.045693
FABS (scalar, D) 1->2 2.051216
FABS (scalar, D) throughput 1.002666
FABS (vector, 2S) 1->2 1.991951
FABS (vector, 2S) throughput 1.025060
FABS (vector, 4S) 1->2 1.999739
FABS (vector, 4S) throughput 0.998324
FABS (vector, 2D) 1->2 1.998181
FABS (vector, 2D) throughput 0.984424
FACGE (vector, 2S) 1->2 2.019408
FACGE (vector, 2S) 1->3 2.004124
FACGE (vector, 2S) throughput 1.006681
FACGE (vector, 4S) 1->2 2.050559
FACGE (vector, 4S) 1->3 1.997645
FACGE (vector, 4S) throughput 1.004183
FACGE (vector, 2D) 1->2 2.006489
FACGE (vector, 2D) 1->3 2.005144
FACGE (vector, 2D) throughput 1.019028
FACGE (scalar, S) 1->2 1.993269
FACGE (scalar, S) 1->3 1.982889
FACGE (scalar, S) throughput 1.006143
FACGE (scalar, D) 1->2 2.030526
FACGE (scalar, D) 1->3 2.028616
FACGE (scalar, D) throughput 0.973117
FACGT (vector, 2S) 1->2 2.018682
FACGT (vector, 2S) 1->3 2.000251
FACGT (vector, 2S) throughput 1.009477
FACGT (vector, 4S) 1->2 1.962496
FACGT (vector, 4S) 1->3 2.029663
FACGT (vector, 4S) throughput 0.993334
FACGT (vector, 2D) 1->2 1.985457
FACGT (vector, 2D) 1->3 2.018949
FACGT (vector, 2D) throughput 1.007604
FACGT (scalar, S) 1->2 2.006470
FACGT (scalar, S) 1->3 2.008751
FACGT (scalar, S) throughput 1.014913
FACGT (scalar, D) 1->2 2.005913
FACGT (scalar, D) 1->3 2.015529
FACGT (scalar, D) throughput 0.999891
FADD (vector, 2S) 1->2 4.186355
FADD (vector, 2S) 1->3 4.004781
FADD (vector, 2S) throughput 0.991373
FADD (vector, 4S) 1->2 4.032909
FADD (vector, 4S) 1->3 4.028926
FADD (vector, 4S) throughput 1.013261
FADD (vector, 2D) 1->2 4.015857
FADD (vector, 2D) 1->3 3.977230
FADD (vector, 2D) throughput 1.020233
FADD (scalar, S) 1->2 4.023823
FADD (scalar, S) 1->3 4.030685
FADD (scalar, S) throughput 1.003422
FADD (scalar, D) 1->2 4.018040
FADD (scalar, D) 1->3 3.997376
FADD (scalar, D) throughput 1.017668
FADDP (scalar, S) 1->2 4.078243
FADDP (scalar, S) throughput 1.009822
FADDP (scalar, D) 1->2 4.000109
FADDP (scalar, D) throughput 1.009813
FADDP (vector, 2S) 1->2 4.021927
FADDP (vector, 2S) 1->3 3.977572
FADDP (vector, 2S) throughput 1.000250
FADDP (vector, 4S) 1->2 4.047988
FADDP (vector, 4S) 1->3 4.005085
FADDP (vector, 4S) throughput 1.007634
FADDP (vector, 2D) 1->2 4.019850
FADDP (vector, 2D) 1->3 4.064151
FADDP (vector, 2D) throughput 1.007084
FCCMP (scalar, S) 3->1 2.026000
FCCMP (scalar, S) 3->2 1.990341
FCCMP (scalar, S) 3->3 2.007662
FCCMP (scalar, S) throughput 1.961695
FCCMP (scalar, D) 3->1 2.041952
FCCMP (scalar, D) 3->2 2.024981
FCCMP (scalar, D) 3->3 2.037518
FCCMP (scalar, D) throughput 1.962348
FCCMPE (scalar, S) 3->1 2.010596
FCCMPE (scalar, S) 3->2 2.035419
FCCMPE (scalar, S) 3->3 1.893853
FCCMPE (scalar, S) throughput 2.011223
FCCMPE (scalar, D) 3->1 2.221593
FCCMPE (scalar, D) 3->2 2.031535
FCCMPE (scalar, D) 3->3 1.979717
FCCMPE (scalar, D) throughput 1.962478
FCMEQ (vector, 2S) 1->2 1.994281
FCMEQ (vector, 2S) 1->3 2.019096
FCMEQ (vector, 2S) throughput 0.985184
FCMEQ (vector, 4S) 1->2 2.006378
FCMEQ (vector, 4S) 1->3 2.002407
FCMEQ (vector, 4S) throughput 1.023163
FCMEQ (vector, 2D) 1->2 1.998442
FCMEQ (vector, 2D) 1->3 2.021900
FCMEQ (vector, 2D) throughput 1.008448
FCMEQ (scalar, S) 1->2 2.044424
FCMEQ (scalar, S) 1->3 1.941633
FCMEQ (scalar, S) throughput 0.993240
FCMEQ (scalar, D) 1->2 2.034857
FCMEQ (scalar, D) 1->3 1.993322
FCMEQ (scalar, D) throughput 1.010470
FCMEQ (vector, zero, 2S) 1->2 1.999609
FCMEQ (vector, zero, 2S) throughput 0.958830
FCMEQ (vector, zero, 4S) 1->2 2.022439
FCMEQ (vector, zero, 4S) throughput 1.005521
FCMEQ (vector, zero, 2D) 1->2 2.000299
FCMEQ (vector, zero, 2D) throughput 1.004661
FCMEQ (scalar, zero, S) 1->2 1.997701
FCMEQ (scalar, zero, S) throughput 1.014353
FCMEQ (scalar, zero, D) 1->2 2.016910
FCMEQ (scalar, zero, D) throughput 1.010781
FCMGE (vector, 2S) 1->2 2.052891
FCMGE (vector, 2S) 1->3 2.017369
FCMGE (vector, 2S) throughput 1.009269
FCMGE (vector, 4S) 1->2 2.042709
FCMGE (vector, 4S) 1->3 2.051338
FCMGE (vector, 4S) throughput 1.007357
FCMGE (vector, 2D) 1->2 1.999868
FCMGE (vector, 2D) 1->3 2.003926
FCMGE (vector, 2D) throughput 1.001823
FCMGE (scalar, S) 1->2 2.013379
FCMGE (scalar, S) 1->3 2.074096
FCMGE (scalar, S) throughput 1.010846
FCMGE (scalar, D) 1->2 2.024731
FCMGE (scalar, D) 1->3 1.998662
FCMGE (scalar, D) throughput 1.003274
FCMGE (vector, zero, 2S) 1->2 2.019012
FCMGE (vector, zero, 2S) throughput 0.998160
FCMGE (vector, zero, 4S) 1->2 2.088223
FCMGE (vector, zero, 4S) throughput 1.003577
FCMGE (vector, zero, 2D) 1->2 1.981894
FCMGE (vector, zero, 2D) throughput 1.005990
FCMGE (scalar, zero, S) 1->2 1.998820
FCMGE (scalar, zero, S) throughput 0.997829
FCMGE (scalar, zero, D) 1->2 1.955271
FCMGE (scalar, zero, D) throughput 1.007795
FCMGT (vector, 2S) 1->2 2.033346
FCMGT (vector, 2S) 1->3 2.060248
FCMGT (vector, 2S) throughput 1.005667
FCMGT (vector, 4S) 1->2 2.063154
FCMGT (vector, 4S) 1->3 2.018164
FCMGT (vector, 4S) throughput 0.999238
FCMGT (vector, 2D) 1->2 2.000109
FCMGT (vector, 2D) 1->3 1.996513
FCMGT (vector, 2D) throughput 1.006019
FCMGT (scalar, S) 1->2 1.998377
FCMGT (scalar, S) 1->3 1.718885
FCMGT (scalar, S) throughput 1.019332
FCMGT (scalar, D) 1->2 1.974903
FCMGT (scalar, D) 1->3 2.010380
FCMGT (scalar, D) throughput 0.969292
FCMGT (vector, zero, 2S) 1->2 2.002879
FCMGT (vector, zero, 2S) throughput 0.997094
FCMGT (vector, zero, 4S) 1->2 2.020541
FCMGT (vector, zero, 4S) throughput 0.988926
FCMGT (vector, zero, 2D) 1->2 2.006422
FCMGT (vector, zero, 2D) throughput 1.007257
FCMGT (scalar, zero, S) 1->2 2.023738
FCMGT (scalar, zero, S) throughput 0.990396
FCMGT (scalar, zero, D) 1->2 2.038861
FCMGT (scalar, zero, D) throughput 1.020140
FCMLE (vector, zero, 2S) 1->2 2.002696
FCMLE (vector, zero, 2S) throughput 1.013642
FCMLE (vector, zero, 4S) 1->2 2.000965
FCMLE (vector, zero, 4S) throughput 1.008865
FCMLE (vector, zero, 2D) 1->2 1.994207
FCMLE (vector, zero, 2D) throughput 1.015452
FCMLE (scalar, zero, S) 1->2 1.950953
FCMLE (scalar, zero, S) throughput 0.988528
FCMLE (scalar, zero, D) 1->2 2.060134
FCMLE (scalar, zero, D) throughput 1.019344
FCMLT (vector, zero, 2S) 1->2 1.953004
FCMLT (vector, zero, 2S) throughput 1.005601
FCMLT (vector, zero, 4S) 1->2 2.004144
FCMLT (vector, zero, 4S) throughput 1.011701
FCMLT (vector, zero, 2D) 1->2 2.042188
FCMLT (vector, zero, 2D) throughput 0.997285
FCMLT (scalar, zero, S) 1->2 2.003444
FCMLT (scalar, zero, S) throughput 0.994145
FCMLT (scalar, zero, D) 1->2 2.001075
FCMLT (scalar, zero, D) throughput 1.013383
FCMP (scalar, S) 3->1 2.259199
FCMP (scalar, S) 3->2 1.994852
FCMP (scalar, S) throughput 1.053492
FCMP (scalar, D) 3->1 2.031906
FCMP (scalar, D) 3->2 2.031709
FCMP (scalar, D) throughput 1.051698
FCMPE (scalar, S) 3->1 2.000606
FCMPE (scalar, S) 3->2 2.026060
FCMPE (scalar, S) throughput 1.054464
FCMPE (scalar, D) 3->1 1.978239
FCMPE (scalar, D) 3->2 2.015925
FCMPE (scalar, D) throughput 1.057230
FCSEL (scalar, S) 1->2 2.006553
FCSEL (scalar, S) 1->3 1.994042
FCSEL (scalar, S) 1->4 2.128829
FCSEL (scalar, S) throughput 1.119522
FCSEL (scalar, D) 1->2 2.008249
FCSEL (scalar, D) 1->3 1.998399
FCSEL (scalar, D) 1->4 2.030249
FCSEL (scalar, D) throughput 0.999905
FCVT (S to D) 1->2 2.006682
FCVT (S to D) throughput 1.006180
FCVT (D to S) 1->2 3.039506
FCVT (D to S) throughput 0.988332
FCVTAS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.061319
FCVTAS (scalar, S to 32-bit GPR) throughput 2.138660
FCVTAS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.071778
FCVTAS (scalar, S to 64-bit GPR) throughput 2.343706
FCVTAS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.630054
FCVTAS (scalar, D to 32-bit GPR) throughput 2.138890
FCVTAS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.997730
FCVTAS (scalar, D to 64-bit GPR) throughput 2.159988
FCVTAS (scalar, S to S) 1->2 3.011380
FCVTAS (scalar, S to S) throughput 1.007939
FCVTAS (scalar, S to S) 1->2 3.035954
FCVTAS (scalar, S to S) throughput 1.008043
FCVTAS (scalar, D to D) 1->2 3.543495
FCVTAS (scalar, D to D) throughput 0.966808
FCVTAS (scalar, D to D) 1->2 2.994436
FCVTAS (scalar, D to D) throughput 1.008486
FCVTAS (vector, 2S) 1->2 3.006437
FCVTAS (vector, 2S) throughput 0.990713
FCVTAS (vector, 4S) 1->2 2.995379
FCVTAS (vector, 4S) throughput 1.008809
FCVTAS (vector, 2D) 1->2 3.043062
FCVTAS (vector, 2D) throughput 1.021970
FCVTAU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.031291
FCVTAU (scalar, S to 32-bit GPR) throughput 2.134542
FCVTAU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.005041
FCVTAU (scalar, S to 64-bit GPR) throughput 2.138267
FCVTAU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.077352
FCVTAU (scalar, D to 32-bit GPR) throughput 2.140064
FCVTAU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.268318
FCVTAU (scalar, D to 64-bit GPR) throughput 2.116247
FCVTAU (scalar, S to S) 1->2 2.962183
FCVTAU (scalar, S to S) throughput 1.016910
FCVTAU (scalar, S to S) 1->2 3.002125
FCVTAU (scalar, S to S) throughput 1.011476
FCVTAU (scalar, D to D) 1->2 3.016996
FCVTAU (scalar, D to D) throughput 1.010180
FCVTAU (scalar, D to D) 1->2 3.010248
FCVTAU (scalar, D to D) throughput 0.997116
FCVTAU (vector, 2S) 1->2 3.005802
FCVTAU (vector, 2S) throughput 1.009040
FCVTAU (vector, 4S) 1->2 2.934530
FCVTAU (vector, 4S) throughput 1.003303
FCVTAU (vector, 2D) 1->2 3.018729
FCVTAU (vector, 2D) throughput 1.006175
FCVTL (vector, 4S) 1->2 2.000548
FCVTL (vector, 4S) throughput 1.012766
FCVTL (vector, 2D) 1->2 2.018332
FCVTL (vector, 2D) throughput 0.957642
FCVTL2 (vector, 4S) 1->2 2.020370
FCVTL2 (vector, 4S) throughput 0.991783
FCVTL2 (vector, 2D) 1->2 1.896799
FCVTL2 (vector, 2D) throughput 1.005209
FCVTMS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.092557
FCVTMS (scalar, S to 32-bit GPR) throughput 2.136479
FCVTMS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.068027
FCVTMS (scalar, S to 64-bit GPR) throughput 2.144211
FCVTMS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.968189
FCVTMS (scalar, D to 32-bit GPR) throughput 2.078078
FCVTMS (scalar, D to 64-bit GPR) 1->2 roundtrip 10.049909
FCVTMS (scalar, D to 64-bit GPR) throughput 2.131093
FCVTMS (scalar, S to S) 1->2 2.995375
FCVTMS (scalar, S to S) throughput 1.034098
FCVTMS (scalar, S to S) 1->2 2.983652
FCVTMS (scalar, S to S) throughput 1.005147
FCVTMS (scalar, D to D) 1->2 3.014731
FCVTMS (scalar, D to D) throughput 0.980925
FCVTMS (scalar, D to D) 1->2 3.048954
FCVTMS (scalar, D to D) throughput 1.007998
FCVTMS (vector, 2S) 1->2 2.996018
FCVTMS (vector, 2S) throughput 1.001127
FCVTMS (vector, 4S) 1->2 3.035517
FCVTMS (vector, 4S) throughput 1.012515
FCVTMS (vector, 2D) 1->2 3.044227
FCVTMS (vector, 2D) throughput 1.010762
FCVTMU (scalar, S to 32-bit GPR) 1->2 roundtrip 9.657614
FCVTMU (scalar, S to 32-bit GPR) throughput 2.137284
FCVTMU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.035573
FCVTMU (scalar, S to 64-bit GPR) throughput 3.986068
FCVTMU (scalar, D to 32-bit GPR) 1->2 roundtrip 9.969303
FCVTMU (scalar, D to 32-bit GPR) throughput 2.156926
FCVTMU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.176627
FCVTMU (scalar, D to 64-bit GPR) throughput 2.120289
FCVTMU (scalar, S to S) 1->2 3.154726
FCVTMU (scalar, S to S) throughput 0.990315
FCVTMU (scalar, S to S) 1->2 2.850040
FCVTMU (scalar, S to S) throughput 1.004179
FCVTMU (scalar, D to D) 1->2 3.011799
FCVTMU (scalar, D to D) throughput 1.023759
FCVTMU (scalar, D to D) 1->2 3.019573
FCVTMU (scalar, D to D) throughput 0.996169
FCVTMU (vector, 2S) 1->2 2.965877
FCVTMU (vector, 2S) throughput 1.005303
FCVTMU (vector, 4S) 1->2 3.021353
FCVTMU (vector, 4S) throughput 1.003691
FCVTMU (vector, 2D) 1->2 3.008732
FCVTMU (vector, 2D) throughput 1.006584
FCVTN (vector, 4S) 1->2 3.008211
FCVTN (vector, 4S) throughput 0.967936
FCVTN (vector, 2D) 1->2 3.035143
FCVTN (vector, 2D) throughput 1.011916
FCVTN2 (vector, 4S) 1->1 3.034493
FCVTN2 (vector, 4S) 1->2 3.066599
FCVTN2 (vector, 4S) throughput 0.997495
FCVTN2 (vector, 4S) throughput 0.990466
FCVTN2 (vector, 2D) 1->1 2.930046
FCVTN2 (vector, 2D) 1->2 3.027742
FCVTN2 (vector, 2D) throughput 1.016622
FCVTN2 (vector, 2D) throughput 1.018647
FCVTNS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.106616
FCVTNS (scalar, S to 32-bit GPR) throughput 2.137110
FCVTNS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.169629
FCVTNS (scalar, S to 64-bit GPR) throughput 2.142879
FCVTNS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.910934
FCVTNS (scalar, D to 32-bit GPR) throughput 2.128367
FCVTNS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.935200
FCVTNS (scalar, D to 64-bit GPR) throughput 2.110051
FCVTNS (scalar, S to S) 1->2 3.085578
FCVTNS (scalar, S to S) throughput 1.004500
FCVTNS (scalar, S to S) 1->2 3.021324
FCVTNS (scalar, S to S) throughput 1.010441
FCVTNS (scalar, D to D) 1->2 2.918875
FCVTNS (scalar, D to D) throughput 1.009264
FCVTNS (scalar, D to D) 1->2 3.057155
FCVTNS (scalar, D to D) throughput 1.004902
FCVTNS (vector, 2S) 1->2 3.055068
FCVTNS (vector, 2S) throughput 1.006224
FCVTNS (vector, 4S) 1->2 3.005351
FCVTNS (vector, 4S) throughput 1.005869
FCVTNS (vector, 2D) 1->2 2.998025
FCVTNS (vector, 2D) throughput 1.008698
FCVTNU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.023124
FCVTNU (scalar, S to 32-bit GPR) throughput 2.168133
FCVTNU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.064284
FCVTNU (scalar, S to 64-bit GPR) throughput 2.153950
FCVTNU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.088122
FCVTNU (scalar, D to 32-bit GPR) throughput 2.140951
FCVTNU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.031406
FCVTNU (scalar, D to 64-bit GPR) throughput 2.155204
FCVTNU (scalar, S to S) 1->2 3.002529
FCVTNU (scalar, S to S) throughput 0.985668
FCVTNU (scalar, S to S) 1->2 3.015989
FCVTNU (scalar, S to S) throughput 1.007821
FCVTNU (scalar, D to D) 1->2 2.956014
FCVTNU (scalar, D to D) throughput 1.012010
FCVTNU (scalar, D to D) 1->2 3.004490
FCVTNU (scalar, D to D) throughput 1.007665
FCVTNU (vector, 2S) 1->2 3.009386
FCVTNU (vector, 2S) throughput 1.009267
FCVTNU (vector, 4S) 1->2 3.013269
FCVTNU (vector, 4S) throughput 1.005183
FCVTNU (vector, 2D) 1->2 2.994656
FCVTNU (vector, 2D) throughput 1.011827
FCVTPS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.115852
FCVTPS (scalar, S to 32-bit GPR) throughput 2.119694
FCVTPS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.060751
FCVTPS (scalar, S to 64-bit GPR) throughput 2.139101
FCVTPS (scalar, D to 32-bit GPR) 1->2 roundtrip 10.129125
FCVTPS (scalar, D to 32-bit GPR) throughput 2.135400
FCVTPS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.901799
FCVTPS (scalar, D to 64-bit GPR) throughput 2.144636
FCVTPS (scalar, S to S) 1->2 3.054144
FCVTPS (scalar, S to S) throughput 1.016800
FCVTPS (scalar, S to S) 1->2 2.970039
FCVTPS (scalar, S to S) throughput 1.015167
FCVTPS (scalar, D to D) 1->2 2.992329
FCVTPS (scalar, D to D) throughput 1.007597
FCVTPS (scalar, D to D) 1->2 3.011568
FCVTPS (scalar, D to D) throughput 0.993552
FCVTPS (vector, 2S) 1->2 2.996585
FCVTPS (vector, 2S) throughput 1.027355
FCVTPS (vector, 4S) 1->2 2.876506
FCVTPS (vector, 4S) throughput 1.009946
FCVTPS (vector, 2D) 1->2 3.018495
FCVTPS (vector, 2D) throughput 1.004521
FCVTPU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.102810
FCVTPU (scalar, S to 32-bit GPR) throughput 2.139521
FCVTPU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.057561
FCVTPU (scalar, S to 64-bit GPR) throughput 2.137785
FCVTPU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.068706
FCVTPU (scalar, D to 32-bit GPR) throughput 2.118206
FCVTPU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.078257
FCVTPU (scalar, D to 64-bit GPR) throughput 2.136209
FCVTPU (scalar, S to S) 1->2 3.019607
FCVTPU (scalar, S to S) throughput 0.995515
FCVTPU (scalar, S to S) 1->2 3.023934
FCVTPU (scalar, S to S) throughput 1.005547
FCVTPU (scalar, D to D) 1->2 2.996166
FCVTPU (scalar, D to D) throughput 1.012636
FCVTPU (scalar, D to D) 1->2 3.068843
FCVTPU (scalar, D to D) throughput 1.025063
FCVTPU (vector, 2S) 1->2 2.998772
FCVTPU (vector, 2S) throughput 1.007055
FCVTPU (vector, 4S) 1->2 3.064272
FCVTPU (vector, 4S) throughput 1.007256
FCVTPU (vector, 2D) 1->2 2.999999
FCVTPU (vector, 2D) throughput 1.000789
FCVTXN (scalar, D) 1->2 3.036707
FCVTXN (scalar, D) throughput 1.010853
FCVTXN (vector, 2D) 1->2 3.044755
FCVTXN (vector, 2D) throughput 1.005045
FCVTXN2 (vector, 2D) 1->1 3.021704
FCVTXN2 (vector, 2D) 1->2 3.028494
FCVTXN2 (vector, 2D) throughput 0.996499
FCVTXN2 (vector, 2D) throughput 0.978953
FCVTZS (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 9.957923
FCVTZS (scalar, integer, S to 32-bit GPR) throughput 2.117176
FCVTZS (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 10.053207
FCVTZS (scalar, integer, S to 64-bit GPR) throughput 2.145150
FCVTZS (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 9.974343
FCVTZS (scalar, integer, D to 32-bit GPR) throughput 2.155256
FCVTZS (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 10.107432
FCVTZS (scalar, integer, D to 64-bit GPR) throughput 2.169202
FCVTZS (scalar, integer, S to S) 1->2 3.020957
FCVTZS (scalar, integer, S to S) throughput 1.002141
FCVTZS (scalar, integer, S to S) 1->2 3.049770
FCVTZS (scalar, integer, S to S) throughput 1.007672
FCVTZS (scalar, integer, D to D) 1->2 3.025703
FCVTZS (scalar, integer, D to D) throughput 0.991487
FCVTZS (scalar, integer, D to D) 1->2 3.029313
FCVTZS (scalar, integer, D to D) throughput 1.011726
FCVTZS (vector, integer, 2S) 1->2 3.037974
FCVTZS (vector, integer, 2S) throughput 1.002672
FCVTZS (vector, integer, 4S) 1->2 3.008892
FCVTZS (vector, integer, 4S) throughput 1.010439
FCVTZS (vector, integer, 2D) 1->2 2.977815
FCVTZS (vector, integer, 2D) throughput 1.005455
FCVTZS (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 10.117074
FCVTZS (scalar, fixed-point, S to 32-bit GPR) throughput 2.115324
FCVTZS (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.090627
FCVTZS (scalar, fixed-point, S to 64-bit GPR) throughput 2.128979
FCVTZS (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 9.970467
FCVTZS (scalar, fixed-point, D to 32-bit GPR) throughput 2.139603
FCVTZS (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 10.108589
FCVTZS (scalar, fixed-point, D to 64-bit GPR) throughput 2.136684
FCVTZS (scalar, fixed-point, S to S) 1->2 2.967562
FCVTZS (scalar, fixed-point, S to S) throughput 0.991106
FCVTZS (scalar, fixed-point, S to S) 1->2 3.006859
FCVTZS (scalar, fixed-point, S to S) throughput 1.010855
FCVTZS (scalar, fixed-point, D to D) 1->2 3.008519
FCVTZS (scalar, fixed-point, D to D) throughput 1.004345
FCVTZS (scalar, fixed-point, D to D) 1->2 3.025204
FCVTZS (scalar, fixed-point, D to D) throughput 1.011902
FCVTZS (vector, fixed-point, 2S) 1->2 3.010901
FCVTZS (vector, fixed-point, 2S) throughput 1.002275
FCVTZS (vector, fixed-point, 4S) 1->2 3.027401
FCVTZS (vector, fixed-point, 4S) throughput 1.021032
FCVTZS (vector, fixed-point, 2D) 1->2 3.021803
FCVTZS (vector, fixed-point, 2D) throughput 1.013874
FCVTZU (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 10.127905
FCVTZU (scalar, integer, S to 32-bit GPR) throughput 2.144489
FCVTZU (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 9.924366
FCVTZU (scalar, integer, S to 64-bit GPR) throughput 2.145475
FCVTZU (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 10.004019
FCVTZU (scalar, integer, D to 32-bit GPR) throughput 2.144170
FCVTZU (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 10.009939
FCVTZU (scalar, integer, D to 64-bit GPR) throughput 2.137347
FCVTZU (scalar, integer, S to S) 1->2 2.996139
FCVTZU (scalar, integer, S to S) throughput 0.989946
FCVTZU (scalar, integer, S to S) 1->2 3.028813
FCVTZU (scalar, integer, S to S) throughput 1.006595
FCVTZU (scalar, integer, D to D) 1->2 2.997894
FCVTZU (scalar, integer, D to D) throughput 1.008997
FCVTZU (scalar, integer, D to D) 1->2 3.009142
FCVTZU (scalar, integer, D to D) throughput 1.016126
FCVTZU (vector, integer, 2S) 1->2 3.001444
FCVTZU (vector, integer, 2S) throughput 1.027382
FCVTZU (vector, integer, 4S) 1->2 3.062963
FCVTZU (vector, integer, 4S) throughput 0.987467
FCVTZU (vector, integer, 2D) 1->2 3.003308
FCVTZU (vector, integer, 2D) throughput 1.010305
FCVTZU (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 10.035348
FCVTZU (scalar, fixed-point, S to 32-bit GPR) throughput 2.148109
FCVTZU (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.132975
FCVTZU (scalar, fixed-point, S to 64-bit GPR) throughput 2.143680
FCVTZU (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 8.016477
FCVTZU (scalar, fixed-point, D to 32-bit GPR) throughput 2.093145
FCVTZU (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 9.948674
FCVTZU (scalar, fixed-point, D to 64-bit GPR) throughput 2.133497
FCVTZU (scalar, fixed-point, S to S) 1->2 2.975704
FCVTZU (scalar, fixed-point, S to S) throughput 1.010735
FCVTZU (scalar, fixed-point, S to S) 1->2 3.024078
FCVTZU (scalar, fixed-point, S to S) throughput 1.004389
FCVTZU (scalar, fixed-point, D to D) 1->2 3.055684
FCVTZU (scalar, fixed-point, D to D) throughput 1.007572
FCVTZU (scalar, fixed-point, D to D) 1->2 2.957352
FCVTZU (scalar, fixed-point, D to D) throughput 1.000410
FCVTZU (vector, fixed-point, 2S) 1->2 3.025273
FCVTZU (vector, fixed-point, 2S) throughput 1.008710
FCVTZU (vector, fixed-point, 4S) 1->2 3.004430
FCVTZU (vector, fixed-point, 4S) throughput 0.896252
FCVTZU (vector, fixed-point, 2D) 1->2 3.008015
FCVTZU (vector, fixed-point, 2D) throughput 1.008929
FDIV (vector, 2S) 1->2 11.883940
FDIV (vector, 2S) 1->3 12.123440
FDIV (vector, 2S) throughput 11.168964
FDIV (vector, 4S) 1->2 23.266495
FDIV (vector, 4S) 1->3 23.078169
FDIV (vector, 4S) throughput 21.705846
FDIV (vector, 2D) 1->2 37.311787
FDIV (vector, 2D) 1->3 36.626648
FDIV (vector, 2D) throughput 33.593491
FDIV (scalar, S) 1->2 11.873748
FDIV (scalar, S) 1->3 11.978797
FDIV (scalar, S) throughput 10.935595
FDIV (scalar, D) 1->2 19.266442
FDIV (scalar, D) 1->3 18.849536
FDIV (scalar, D) throughput 18.058017
FMADD (scalar, S) 1->2 4.082473
FMADD (scalar, S) 1->3 4.741659
FMADD (scalar, S) 1->4 4.070124
FMADD (scalar, S) throughput 1.003896
FMADD (scalar, D) 1->2 4.034459
FMADD (scalar, D) 1->3 4.000263
FMADD (scalar, D) 1->4 3.996803
FMADD (scalar, D) throughput 1.007117
FMAX (vector, 2S) 1->2 2.058751
FMAX (vector, 2S) 1->3 1.994328
FMAX (vector, 2S) throughput 0.997165
FMAX (vector, 4S) 1->2 2.017198
FMAX (vector, 4S) 1->3 1.968115
FMAX (vector, 4S) throughput 0.864791
FMAX (vector, 2D) 1->2 2.043467
FMAX (vector, 2D) 1->3 1.952274
FMAX (vector, 2D) throughput 1.009664
FMAX (scalar, S) 1->2 1.994350
FMAX (scalar, S) 1->3 2.029510
FMAX (scalar, S) throughput 1.088810
FMAX (scalar, D) 1->2 1.998907
FMAX (scalar, D) 1->3 2.024255
FMAX (scalar, D) throughput 0.993129
FMAXNM (vector, 2S) 1->2 1.998202
FMAXNM (vector, 2S) 1->3 2.006422
FMAXNM (vector, 2S) throughput 1.011905
FMAXNM (vector, 4S) 1->2 2.006369
FMAXNM (vector, 4S) 1->3 1.996996
FMAXNM (vector, 4S) throughput 1.007701
FMAXNM (vector, 2D) 1->2 1.993662
FMAXNM (vector, 2D) 1->3 2.020417
FMAXNM (vector, 2D) throughput 0.983012
FMAXNM (scalar, S) 1->2 2.031086
FMAXNM (scalar, S) 1->3 2.010665
FMAXNM (scalar, S) throughput 1.008022
FMAXNM (scalar, D) 1->2 2.018790
FMAXNM (scalar, D) 1->3 2.007214
FMAXNM (scalar, D) throughput 1.011180
FMAXNMP (scalar) 1->2 2.039425
FMAXNMP (scalar) throughput 0.989602
FMAXNMP (vector, 2S) 1->2 2.001535
FMAXNMP (vector, 2S) 1->3 1.994370
FMAXNMP (vector, 2S) throughput 1.003765
FMAXNMP (vector, 4S) 1->2 1.950738
FMAXNMP (vector, 4S) 1->3 1.999036
FMAXNMP (vector, 4S) throughput 1.002633
FMAXNMP (vector, 2D) 1->2 2.013135
FMAXNMP (vector, 2D) 1->3 2.027319
FMAXNMP (vector, 2D) throughput 1.001035
FMAXNMV (4S) 1->2 3.055475
FMAXNMV (4S) throughput 0.992706
FMAXP (scalar) 1->2 2.024463
FMAXP (scalar) throughput 1.009954
FMAXP (vector, 2S) 1->2 1.999738
FMAXP (vector, 2S) 1->3 1.994195
FMAXP (vector, 2S) throughput 0.985423
FMAXP (vector, 4S) 1->2 2.015513
FMAXP (vector, 4S) 1->3 1.968863
FMAXP (vector, 4S) throughput 1.006171
FMAXP (vector, 2D) 1->2 1.991650
FMAXP (vector, 2D) 1->3 2.023224
FMAXP (vector, 2D) throughput 1.010471
FMAXV (4S) 1->2 3.008885
FMAXV (4S) throughput 1.007646
FMIN (vector, 2S) 1->2 1.999013
FMIN (vector, 2S) 1->3 2.005168
FMIN (vector, 2S) throughput 1.019480
FMIN (vector, 4S) 1->2 2.001579
FMIN (vector, 4S) 1->3 1.993973
FMIN (vector, 4S) throughput 1.014117
FMIN (vector, 2D) 1->2 2.008661
FMIN (vector, 2D) 1->3 1.992335
FMIN (vector, 2D) throughput 1.007207
FMIN (scalar, S) 1->2 1.954927
FMIN (scalar, S) 1->3 2.038642
FMIN (scalar, S) throughput 1.007022
FMIN (scalar, D) 1->2 2.000483
FMIN (scalar, D) 1->3 2.022813
FMIN (scalar, D) throughput 0.996324
FMINNM (vector, 2S) 1->2 2.008665
FMINNM (vector, 2S) 1->3 1.935620
FMINNM (vector, 2S) throughput 1.012933
FMINNM (vector, 4S) 1->2 1.998422
FMINNM (vector, 4S) 1->3 2.035447
FMINNM (vector, 4S) throughput 1.008743
FMINNM (vector, 2D) 1->2 2.024284
FMINNM (vector, 2D) 1->3 1.973579
FMINNM (vector, 2D) throughput 1.029452
FMINNM (scalar, S) 1->2 1.982217
FMINNM (scalar, S) 1->3 2.010575
FMINNM (scalar, S) throughput 0.992056
FMINNM (scalar, D) 1->2 2.036164
FMINNM (scalar, D) 1->3 1.994406
FMINNM (scalar, D) throughput 1.008352
FMINNMP (scalar) 1->2 1.999873
FMINNMP (scalar) throughput 1.004880
FMINNMP (vector, 2S) 1->2 2.017897
FMINNMP (vector, 2S) 1->3 2.016706
FMINNMP (vector, 2S) throughput 1.009574
FMINNMP (vector, 4S) 1->2 2.013970
FMINNMP (vector, 4S) 1->3 1.971694
FMINNMP (vector, 4S) throughput 1.016926
FMINNMP (vector, 2D) 1->2 2.016093
FMINNMP (vector, 2D) 1->3 2.020953
FMINNMP (vector, 2D) throughput 0.988013
FMINNMV (4S) 1->2 3.373177
FMINNMV (4S) throughput 0.920609
FMINP (scalar) 1->2 1.999483
FMINP (scalar) throughput 1.014325
FMINP (vector, 2S) 1->2 2.035823
FMINP (vector, 2S) 1->3 2.014370
FMINP (vector, 2S) throughput 1.007597
FMINP (vector, 4S) 1->2 2.007389
FMINP (vector, 4S) 1->3 2.014145
FMINP (vector, 4S) throughput 1.028193
FMINP (vector, 2D) 1->2 2.000921
FMINP (vector, 2D) 1->3 2.005968
FMINP (vector, 2D) throughput 1.005843
FMINV (4S) 1->2 3.029391
FMINV (4S) throughput 1.010122
FMLA (by element, S) 1->1 4.006175
FMLA (by element, S) 1->2 4.984882
FMLA (by element, S) 1->3 3.991444
FMLA (by element, S) throughput 1.006591
FMLA (by element, S) throughput 1.010021
FMLA (by element, D) 1->1 3.975259
FMLA (by element, D) 1->2 3.940885
FMLA (by element, D) 1->3 4.038907
FMLA (by element, D) throughput 1.008316
FMLA (by element, D) throughput 0.991591
FMLA (by element, 2S) 1->1 4.037765
FMLA (by element, 2S) 1->2 3.995187
FMLA (by element, 2S) 1->3 3.998575
FMLA (by element, 2S) throughput 1.012940
FMLA (by element, 2S) throughput 0.970028
FMLA (by element, 4S) 1->1 3.940953
FMLA (by element, 4S) 1->2 4.026599
FMLA (by element, 4S) 1->3 4.010439
FMLA (by element, 4S) throughput 1.008649
FMLA (by element, 4S) throughput 1.004096
FMLA (by element, 2D) 1->1 4.037325
FMLA (by element, 2D) 1->2 3.922274
FMLA (by element, 2D) 1->3 4.004814
FMLA (by element, 2D) throughput 1.007994
FMLA (by element, 2D) throughput 1.004828
FMLA (vector, 2S) 1->1 3.828071
FMLA (vector, 2S) 1->2 3.854801
FMLA (vector, 2S) 1->3 4.049547
FMLA (vector, 2S) throughput 1.014533
FMLA (vector, 2S) throughput 1.012293
FMLA (vector, 4S) 1->1 4.011280
FMLA (vector, 4S) 1->2 4.148114
FMLA (vector, 4S) 1->3 4.065493
FMLA (vector, 4S) throughput 0.985554
FMLA (vector, 4S) throughput 0.991961
FMLA (vector, 2D) 1->1 4.034901
FMLA (vector, 2D) 1->2 4.042159
FMLA (vector, 2D) 1->3 3.985739
FMLA (vector, 2D) throughput 1.051972
FMLA (vector, 2D) throughput 0.997011
FMLS (by element, S) 1->1 3.989459
FMLS (by element, S) 1->2 4.018233
FMLS (by element, S) 1->3 3.999159
FMLS (by element, S) throughput 1.007718
FMLS (by element, S) throughput 1.012744
FMLS (by element, D) 1->1 4.013681
FMLS (by element, D) 1->2 3.992698
FMLS (by element, D) 1->3 3.994065
FMLS (by element, D) throughput 1.003612
FMLS (by element, D) throughput 0.995715
FMLS (by element, 2S) 1->1 4.312570
FMLS (by element, 2S) 1->2 3.991247
FMLS (by element, 2S) 1->3 3.963227
FMLS (by element, 2S) throughput 1.009872
FMLS (by element, 2S) throughput 1.005650
FMLS (by element, 4S) 1->1 4.026918
FMLS (by element, 4S) 1->2 3.963492
FMLS (by element, 4S) 1->3 4.041039
FMLS (by element, 4S) throughput 1.001368
FMLS (by element, 4S) throughput 0.997427
FMLS (by element, 2D) 1->1 3.992732
FMLS (by element, 2D) 1->2 4.022931
FMLS (by element, 2D) 1->3 4.010768
FMLS (by element, 2D) throughput 1.015805
FMLS (by element, 2D) throughput 1.009735
FMLS (vector, 2S) 1->1 4.037870
FMLS (vector, 2S) 1->2 4.060799
FMLS (vector, 2S) 1->3 4.012879
FMLS (vector, 2S) throughput 1.008993
FMLS (vector, 2S) throughput 1.012918
FMLS (vector, 4S) 1->1 4.016206
FMLS (vector, 4S) 1->2 4.003382
FMLS (vector, 4S) 1->3 4.106987
FMLS (vector, 4S) throughput 1.018002
FMLS (vector, 4S) throughput 1.012029
FMLS (vector, 2D) 1->1 4.001770
FMLS (vector, 2D) 1->2 4.009655
FMLS (vector, 2D) 1->3 4.003818
FMLS (vector, 2D) throughput 1.009772
FMLS (vector, 2D) throughput 1.003348
FMOV (S to 32-bit GPR) 1->2 roundtrip 7.030600
FMOV (S to 32-bit GPR) throughput 1.006643
FMOV (D to 64-bit GPR) 1->2 roundtrip 7.084774
FMOV (D to 64-bit GPR) throughput 0.997339
FMOV (D[1] to 64-bit GPR) 1->2 roundtrip 7.048773
FMOV (D[1] to 64-bit GPR) throughput 1.010402
FMOV (S from 32-bit GPR) 1->2 roundtrip 7.396586
FMOV (S from 32-bit GPR) throughput 1.001215
FMOV (D from 64-bit GPR) 1->2 roundtrip 6.888423
FMOV (D from 64-bit GPR) throughput 1.012986
FMOV (D[1] from 64-bit GPR) 1->2 roundtrip 9.024353
FMOV (D[1] from 64-bit GPR) throughput 1.014697
FMSUB (scalar, S) 1->2 4.082884
FMSUB (scalar, S) 1->3 3.978226
FMSUB (scalar, S) 1->4 3.994261
FMSUB (scalar, S) throughput 1.006920
FMSUB (scalar, D) 1->2 4.038553
FMSUB (scalar, D) 1->3 4.129726
FMSUB (scalar, D) 1->4 4.038282
FMSUB (scalar, D) throughput 1.038617
FMUL (by element, 2S) 1->2 4.018129
FMUL (by element, 2S) 1->3 4.048136
FMUL (by element, 2S) throughput 1.009439
FMUL (by element, 4S) 1->2 4.031579
FMUL (by element, 4S) 1->3 4.048261
FMUL (by element, 4S) throughput 1.007131
FMUL (by element, 2D) 1->2 4.027147
FMUL (by element, 2D) 1->3 4.026787
FMUL (by element, 2D) throughput 1.004884
FMUL (vector, 2S) 1->2 4.064166
FMUL (vector, 2S) 1->3 3.959252
FMUL (vector, 2S) throughput 1.009615
FMUL (vector, 4S) 1->2 3.982433
FMUL (vector, 4S) 1->3 4.002582
FMUL (vector, 4S) throughput 1.085657
FMUL (vector, 2D) 1->2 3.999198
FMUL (vector, 2D) 1->3 3.865610
FMUL (vector, 2D) throughput 1.019729
FMUL (scalar, S) 1->2 4.055596
FMUL (scalar, S) 1->3 4.038593
FMUL (scalar, S) throughput 1.009986
FMUL (scalar, D) 1->2 3.956153
FMUL (scalar, D) 1->3 4.018195
FMUL (scalar, D) throughput 1.007575
FMULX (vector, 2S) 1->2 3.988963
FMULX (vector, 2S) 1->3 4.043888
FMULX (vector, 2S) throughput 0.999541
FMULX (vector, 4S) 1->2 3.991698
FMULX (vector, 4S) 1->3 4.046189
FMULX (vector, 4S) throughput 0.994239
FMULX (vector, 2D) 1->2 4.040918
FMULX (vector, 2D) 1->3 3.994809
FMULX (vector, 2D) throughput 1.019513
FMULX (scalar, S) 1->2 3.989386
FMULX (scalar, S) 1->3 4.005870
FMULX (scalar, S) throughput 1.015183
FMULX (scalar, D) 1->2 3.999210
FMULX (scalar, D) 1->3 4.042303
FMULX (scalar, D) throughput 1.006547
FMULX (by element, 2S) 1->2 4.011035
FMULX (by element, 2S) 1->3 4.024376
FMULX (by element, 2S) throughput 1.011603
FMULX (by element, 4S) 1->2 4.006432
FMULX (by element, 4S) 1->3 4.246269
FMULX (by element, 4S) throughput 0.997508
FMULX (by element, 2D) 1->2 4.080346
FMULX (by element, 2D) 1->3 4.030396
FMULX (by element, 2D) throughput 1.000696
FNEG (scalar, S) 1->2 2.006492
FNEG (scalar, S) throughput 1.004427
FNEG (scalar, D) 1->2 2.023274
FNEG (scalar, D) throughput 1.010066
FNEG (vector, 2S) 1->2 2.004948
FNEG (vector, 2S) throughput 0.994966
FNEG (vector, 4S) 1->2 2.007939
FNEG (vector, 4S) throughput 0.998960
FNEG (vector, 2D) 1->2 2.003487
FNEG (vector, 2D) throughput 0.984485
FNMADD (scalar, S) 1->2 4.233963
FNMADD (scalar, S) 1->3 3.927177
FNMADD (scalar, S) 1->4 4.043547
FNMADD (scalar, S) throughput 0.992799
FNMADD (scalar, D) 1->2 4.027616
FNMADD (scalar, D) 1->3 4.014648
FNMADD (scalar, D) 1->4 4.055288
FNMADD (scalar, D) throughput 1.014207
FNMSUB (scalar, S) 1->2 3.433737
FNMSUB (scalar, S) 1->3 4.060290
FNMSUB (scalar, S) 1->4 3.996146
FNMSUB (scalar, S) throughput 1.009507
FNMSUB (scalar, D) 1->2 4.016078
FNMSUB (scalar, D) 1->3 4.114792
FNMSUB (scalar, D) 1->4 3.925991
FNMSUB (scalar, D) throughput 1.034054
FNMUL (scalar, S) 1->2 4.166939
FNMUL (scalar, S) 1->3 4.342061
FNMUL (scalar, S) throughput 0.996482
FNMUL (scalar, D) 1->2 4.013942
FNMUL (scalar, D) 1->3 4.003179
FNMUL (scalar, D) throughput 0.998093
FRECPE (scalar, S) 1->2 3.997818
FRECPE (scalar, S) throughput 1.009859
FRECPE (scalar, D) 1->2 3.950194
FRECPE (scalar, D) throughput 1.011271
FRECPE (vector, 2S) 1->2 4.041366
FRECPE (vector, 2S) throughput 0.988755
FRECPE (vector, 4S) 1->2 3.990729
FRECPE (vector, 4S) throughput 1.035258
FRECPE (vector, 2D) 1->2 3.997526
FRECPE (vector, 2D) throughput 1.008873
FRECPS (vector, 2S) 1->2 4.037676
FRECPS (vector, 2S) 1->3 4.011194
FRECPS (vector, 2S) throughput 1.002013
FRECPS (vector, 4S) 1->2 3.998033
FRECPS (vector, 4S) 1->3 3.998490
FRECPS (vector, 4S) throughput 1.000093
FRECPS (vector, 2D) 1->2 4.068930
FRECPS (vector, 2D) 1->3 4.032468
FRECPS (vector, 2D) throughput 1.004159
FRECPS (scalar, S) 1->2 4.061499
FRECPS (scalar, S) 1->3 4.015724
FRECPS (scalar, S) throughput 1.014283
FRECPS (scalar, D) 1->2 4.014294
FRECPS (scalar, D) 1->3 4.060255
FRECPS (scalar, D) throughput 1.006518
FRECPX (scalar, S) 1->2 4.011629
FRECPX (scalar, S) throughput 1.006801
FRECPX (scalar, D) 1->2 4.016711
FRECPX (scalar, D) throughput 1.005877
FRINTA (scalar, S) 1->2 2.992317
FRINTA (scalar, S) throughput 1.003425
FRINTA (scalar, D) 1->2 3.019803
FRINTA (scalar, D) throughput 1.016749
FRINTA (vector, 2S) 1->2 3.009917
FRINTA (vector, 2S) throughput 1.002605
FRINTA (vector, 4S) 1->2 3.009200
FRINTA (vector, 4S) throughput 1.014306
FRINTA (vector, 2D) 1->2 2.971849
FRINTA (vector, 2D) throughput 1.011259
FRINTI (scalar, S) 1->2 3.016120
FRINTI (scalar, S) throughput 0.993472
FRINTI (scalar, D) 1->2 2.988140
FRINTI (scalar, D) throughput 0.996758
FRINTI (vector, 2S) 1->2 3.032104
FRINTI (vector, 2S) throughput 0.997301
FRINTI (vector, 4S) 1->2 2.984252
FRINTI (vector, 4S) throughput 1.010048
FRINTI (vector, 2D) 1->2 3.022348
FRINTI (vector, 2D) throughput 1.004951
FRINTM (scalar, S) 1->2 2.961350
FRINTM (scalar, S) throughput 1.014867
FRINTM (scalar, D) 1->2 2.986126
FRINTM (scalar, D) throughput 1.002337
FRINTM (vector, 2S) 1->2 3.030927
FRINTM (vector, 2S) throughput 1.014066
FRINTM (vector, 4S) 1->2 3.001897
FRINTM (vector, 4S) throughput 0.989276
FRINTM (vector, 2D) 1->2 3.034166
FRINTM (vector, 2D) throughput 1.012167
FRINTN (scalar, S) 1->2 2.994305
FRINTN (scalar, S) throughput 1.009897
FRINTN (scalar, D) 1->2 2.991695
FRINTN (scalar, D) throughput 1.118566
FRINTN (vector, 2S) 1->2 3.027700
FRINTN (vector, 2S) throughput 1.073678
FRINTN (vector, 4S) 1->2 3.076270
FRINTN (vector, 4S) throughput 1.079775
FRINTN (vector, 2D) 1->2 2.250889
FRINTN (vector, 2D) throughput 1.020851
FRINTP (scalar, S) 1->2 3.095631
FRINTP (scalar, S) throughput 1.034992
FRINTP (scalar, D) 1->2 3.042085
FRINTP (scalar, D) throughput 1.011978
FRINTP (vector, 2S) 1->2 2.998376
FRINTP (vector, 2S) throughput 0.999987
FRINTP (vector, 4S) 1->2 2.939942
FRINTP (vector, 4S) throughput 1.006159
FRINTP (vector, 2D) 1->2 3.027959
FRINTP (vector, 2D) throughput 1.014666
FRINTX (scalar, S) 1->2 2.971286
FRINTX (scalar, S) throughput 1.017913
FRINTX (scalar, D) 1->2 3.008794
FRINTX (scalar, D) throughput 1.009592
FRINTX (vector, 2S) 1->2 3.048071
FRINTX (vector, 2S) throughput 1.003441
FRINTX (vector, 4S) 1->2 2.990690
FRINTX (vector, 4S) throughput 0.985103
FRINTX (vector, 2D) 1->2 3.028110
FRINTX (vector, 2D) throughput 1.052700
FRINTZ (scalar, S) 1->2 3.018930
FRINTZ (scalar, S) throughput 0.995798
FRINTZ (scalar, D) 1->2 3.027587
FRINTZ (scalar, D) throughput 1.007381
FRINTZ (vector, 2S) 1->2 2.994327
FRINTZ (vector, 2S) throughput 1.030017
FRINTZ (vector, 4S) 1->2 2.994568
FRINTZ (vector, 4S) throughput 1.031143
FRINTZ (vector, 2D) 1->2 3.036996
FRINTZ (vector, 2D) throughput 1.013634
FRSQRTE (scalar, S) 1->2 3.995433
FRSQRTE (scalar, S) throughput 0.989990
FRSQRTE (scalar, D) 1->2 4.024469
FRSQRTE (scalar, D) throughput 1.003917
FRSQRTE (vector, 2S) 1->2 4.039535
FRSQRTE (vector, 2S) throughput 1.017098
FRSQRTE (vector, 4S) 1->2 3.996559
FRSQRTE (vector, 4S) throughput 0.996325
FRSQRTE (vector, 2D) 1->2 4.023388
FRSQRTE (vector, 2D) throughput 1.010604
FRSQRTS (vector, 2S) 1->2 4.063842
FRSQRTS (vector, 2S) 1->3 4.025462
FRSQRTS (vector, 2S) throughput 1.008591
FRSQRTS (vector, 4S) 1->2 3.823533
FRSQRTS (vector, 4S) 1->3 4.019797
FRSQRTS (vector, 4S) throughput 1.013359
FRSQRTS (vector, 2D) 1->2 4.074096
FRSQRTS (vector, 2D) 1->3 3.998948
FRSQRTS (vector, 2D) throughput 1.002740
FRSQRTS (scalar, S) 1->2 3.998574
FRSQRTS (scalar, S) 1->3 4.052372
FRSQRTS (scalar, S) throughput 1.017829
FRSQRTS (scalar, D) 1->2 4.011187
FRSQRTS (scalar, D) 1->3 4.056374
FRSQRTS (scalar, D) throughput 1.009142
FSQRT (scalar, S) 1->2 17.139374
FSQRT (scalar, S) throughput 16.229360
FSQRT (scalar, D) 1->2 31.141787
FSQRT (scalar, D) throughput 28.601205
FSQRT (vector, 2S) 1->2 16.866213
FSQRT (vector, 2S) throughput 16.106713
FSQRT (vector, 4S) 1->2 32.078250
FSQRT (vector, 4S) throughput 30.735117
FSQRT (vector, 2D) 1->2 60.562890
FSQRT (vector, 2D) throughput 59.485221
FSUB (vector, 2S) 1->2 4.011366
FSUB (vector, 2S) 1->3 4.024436
FSUB (vector, 2S) throughput 1.009158
FSUB (vector, 4S) 1->2 4.056793
FSUB (vector, 4S) 1->3 4.031407
FSUB (vector, 4S) throughput 1.007814
FSUB (vector, 2D) 1->2 4.015526
FSUB (vector, 2D) 1->3 4.043335
FSUB (vector, 2D) throughput 1.003129
FSUB (scalar, S) 1->2 4.013624
FSUB (scalar, S) 1->3 3.991329
FSUB (scalar, S) throughput 0.992512
FSUB (scalar, D) 1->2 4.046151
FSUB (scalar, D) 1->3 3.991030
FSUB (scalar, D) throughput 1.004966
INS (element, B) 1->1 2.006290
INS (element, B) 1->2 1.994043
INS (element, B) throughput 1.010848
INS (element, B) throughput 0.997193
INS (element, H) 1->1 1.999562
INS (element, H) 1->2 2.027488
INS (element, H) throughput 1.009209
INS (element, H) throughput 1.006922
INS (element, S) 1->1 1.999934
INS (element, S) 1->2 1.995473
INS (element, S) throughput 1.007201
INS (element, S) throughput 1.001724
INS (element, D) 1->1 2.018831
INS (element, D) 1->2 2.022761
INS (element, D) throughput 1.008592
INS (element, D) throughput 0.992961
INS (general, B) 1->1 1.967488
INS (general, B) 1->2 roundtrip 9.053829
INS (general, B) throughput 0.996189
INS (general, H) 1->1 1.995948
INS (general, H) 1->2 roundtrip 9.839381
INS (general, H) throughput 1.009186
INS (general, S) 1->1 1.998664
INS (general, S) 1->2 roundtrip 9.044913
INS (general, S) throughput 0.992849
INS (general, D) 1->1 1.999210
INS (general, D) 1->2 roundtrip 9.086861
INS (general, D) throughput 1.010689
LD1 (multiple, one register, 8B) 1->2 roundtrip 7.094598
LD1 (multiple, one register, 8B) throughput 1.007628
LD1 (multiple, one register, 16B) 1->2 roundtrip 7.106324
LD1 (multiple, one register, 16B) throughput 1.018383
LD1 (multiple, one register, 4H) 1->2 roundtrip 7.088227
LD1 (multiple, one register, 4H) throughput 1.011977
LD1 (multiple, one register, 8H) 1->2 roundtrip 7.069806
LD1 (multiple, one register, 8H) throughput 1.003262
LD1 (multiple, one register, 2S) 1->2 roundtrip 7.114671
LD1 (multiple, one register, 2S) throughput 1.018454
LD1 (multiple, one register, 4S) 1->2 roundtrip 7.097158
LD1 (multiple, one register, 4S) throughput 1.001019
LD1 (multiple, one register, 1D) 1->2 roundtrip 7.076132
LD1 (multiple, one register, 1D) throughput 0.979773
LD1 (multiple, one register, 2D) 1->2 roundtrip 7.150492
LD1 (multiple, one register, 2D) throughput 0.986771
LD1 (multiple, two registers, 8B) 1->3 roundtrip 6.996186
LD1 (multiple, two registers, 8B) 2->3 roundtrip 7.978079
LD1 (multiple, two registers, 8B) throughput 2.017906
LD1 (multiple, two registers, 16B) 1->3 roundtrip 6.975963
LD1 (multiple, two registers, 16B) 2->3 roundtrip 8.001197
LD1 (multiple, two registers, 16B) throughput 1.991158
LD1 (multiple, two registers, 4H) 1->3 roundtrip 7.082772
LD1 (multiple, two registers, 4H) 2->3 roundtrip 7.865790
LD1 (multiple, two registers, 4H) throughput 2.016546
LD1 (multiple, two registers, 8H) 1->3 roundtrip 7.054840
LD1 (multiple, two registers, 8H) 2->3 roundtrip 8.092806
LD1 (multiple, two registers, 8H) throughput 2.031060
LD1 (multiple, two registers, 2S) 1->3 roundtrip 7.190780
LD1 (multiple, two registers, 2S) 2->3 roundtrip 8.197140
LD1 (multiple, two registers, 2S) throughput 2.014596
LD1 (multiple, two registers, 4S) 1->3 roundtrip 7.078812
LD1 (multiple, two registers, 4S) 2->3 roundtrip 8.041236
LD1 (multiple, two registers, 4S) throughput 2.025199
LD1 (multiple, two registers, 1D) 1->3 roundtrip 7.034100
LD1 (multiple, two registers, 1D) 2->3 roundtrip 8.088614
LD1 (multiple, two registers, 1D) throughput 2.070803
LD1 (multiple, two registers, 2D) 1->3 roundtrip 7.150959
LD1 (multiple, two registers, 2D) 2->3 roundtrip 8.071093
LD1 (multiple, two registers, 2D) throughput 2.023146
LD1 (multiple, three registers, 8B) throughput 3.026971
LD1 (multiple, three registers, 16B) throughput 2.999183
LD1 (multiple, three registers, 4H) throughput 2.960124
LD1 (multiple, three registers, 8H) throughput 2.955356
LD1 (multiple, three registers, 2S) throughput 2.993971
LD1 (multiple, three registers, 4S) throughput 3.017486
LD1 (multiple, three registers, 1D) throughput 3.033198
LD1 (multiple, three registers, 2D) throughput 2.945340
LD1 (multiple, four registers, 8B) throughput 4.015912
LD1 (multiple, four registers, 16B) throughput 3.987346
LD1 (multiple, four registers, 4H) throughput 4.033705
LD1 (multiple, four registers, 8H) throughput 4.038455
LD1 (multiple, four registers, 2S) throughput 3.949162
LD1 (multiple, four registers, 4S) throughput 3.914522
LD1 (multiple, four registers, 1D) throughput 4.017619
LD1 (multiple, four registers, 2D) throughput 3.971328
LD1 (single structure, B) 1->2 roundtrip 9.071130
LD1 (single structure, B) throughput 2.009144
LD1 (single structure, H) 1->2 roundtrip 9.009524
LD1 (single structure, H) throughput 2.021958
LD1 (single structure, S) 1->2 roundtrip 9.099139
LD1 (single structure, S) throughput 1.999768
LD1 (single structure, D) 1->2 roundtrip 9.411967
LD1 (single structure, D) throughput 1.982592
LD1R (8B) 1->2 roundtrip 8.523228
LD1R (8B) throughput 1.006268
LD1R (16B) 1->2 roundtrip 8.948784
LD1R (16B) throughput 1.000139
LD1R (4H) 1->2 roundtrip 9.131817
LD1R (4H) throughput 1.008283
LD1R (8H) 1->2 roundtrip 9.104459
LD1R (8H) throughput 1.004553
LD1R (2S) 1->2 roundtrip 9.039798
LD1R (2S) throughput 1.166126
LD1R (4S) 1->2 roundtrip 10.799477
LD1R (4S) throughput 0.973315
LD1R (1D) 1->2 roundtrip 8.873077
LD1R (1D) throughput 0.997192
LD1R (2D) 1->2 roundtrip 9.148959
LD1R (2D) throughput 0.998921
LD2 (multiple structures, 8B) 1->3 roundtrip 9.121930
LD2 (multiple structures, 8B) 2->3 roundtrip 8.994152
LD2 (multiple structures, 8B) throughput 2.012176
LD2 (multiple structures, 16B) 1->3 roundtrip 10.076148
LD2 (multiple structures, 16B) 2->3 roundtrip 11.123065
LD2 (multiple structures, 16B) throughput 1.962271
LD2 (multiple structures, 4H) 1->3 roundtrip 9.090301
LD2 (multiple structures, 4H) 2->3 roundtrip 9.875551
LD2 (multiple structures, 4H) throughput 1.991854
LD2 (multiple structures, 8H) 1->3 roundtrip 10.048960
LD2 (multiple structures, 8H) 2->3 roundtrip 11.112125
LD2 (multiple structures, 8H) throughput 1.891502
LD2 (multiple structures, 2S) 1->3 roundtrip 9.100471
LD2 (multiple structures, 2S) 2->3 roundtrip 9.901684
LD2 (multiple structures, 2S) throughput 2.033195
LD2 (multiple structures, 4S) 1->3 roundtrip 10.164552
LD2 (multiple structures, 4S) 2->3 roundtrip 11.193533
LD2 (multiple structures, 4S) throughput 2.013163
LD2 (multiple structures, 2D) 1->3 roundtrip 10.082453
LD2 (multiple structures, 2D) 2->3 roundtrip 11.165924
LD2 (multiple structures, 2D) throughput 2.014126
LD2 (single structure, B) 1->3 roundtrip 9.013802
LD2 (single structure, B) 2->3 roundtrip 10.203858
LD2 (single structure, B) throughput 1.996736
LD2 (single structure, H) 1->3 roundtrip 9.172477
LD2 (single structure, H) 2->3 roundtrip 10.246846
LD2 (single structure, H) throughput 2.036272
LD2 (single structure, S) 1->3 roundtrip 9.340898
LD2 (single structure, S) 2->3 roundtrip 9.672291
LD2 (single structure, S) throughput 2.011843
LD2 (single structure, D) 1->3 roundtrip 9.107647
LD2 (single structure, D) 2->3 roundtrip 10.040064
LD2 (single structure, D) throughput 2.014421
LD2R (8B) 1->3 roundtrip 8.864812
LD2R (8B) 2->3 roundtrip 10.217025
LD2R (8B) throughput 1.949422
LD2R (16B) 1->3 roundtrip 9.100755
LD2R (16B) 2->3 roundtrip 10.039998
LD2R (16B) throughput 2.026419
LD2R (4H) 1->3 roundtrip 9.057277
LD2R (4H) 2->3 roundtrip 10.046369
LD2R (4H) throughput 2.028164
LD2R (8H) 1->3 roundtrip 9.120667
LD2R (8H) 2->3 roundtrip 10.151903
LD2R (8H) throughput 2.023252
LD2R (2S) 1->3 roundtrip 9.155496
LD2R (2S) 2->3 roundtrip 10.078405
LD2R (2S) throughput 2.008488
LD2R (4S) 1->3 roundtrip 8.331569
LD2R (4S) 2->3 roundtrip 10.163671
LD2R (4S) throughput 2.009089
LD2R (1D) 1->3 roundtrip 7.358907
LD2R (1D) 2->3 roundtrip 10.789044
LD2R (1D) throughput 2.029207
LD2R (2D) 1->3 roundtrip 9.012012
LD2R (2D) 2->3 roundtrip 10.171791
LD2R (2D) throughput 2.030970
LD3 (multiple structures, 8B) throughput 2.993885
LD3 (multiple structures, 16B) throughput 3.132193
LD3 (multiple structures, 4H) throughput 3.029481
LD3 (multiple structures, 8H) throughput 3.184917
LD3 (multiple structures, 2S) throughput 3.025650
LD3 (multiple structures, 4S) throughput 3.169154
LD3 (multiple structures, 2D) throughput 3.157428
LD3 (single structure, B) throughput 3.040378
LD3 (single structure, H) throughput 3.042053
LD3 (single structure, S) throughput 3.020677
LD3 (single structure, D) throughput 3.026556
LD3R (8B) throughput 3.048117
LD3R (16B) throughput 3.003918
LD3R (4H) throughput 2.973801
LD3R (8H) throughput 3.008481
LD3R (2S) throughput 3.174389
LD3R (4S) throughput 3.057643
LD3R (1D) throughput 3.035130
LD3R (2D) throughput 3.026668
LD4 (multiple structures, 8B) throughput 3.956560
LD4 (multiple structures, 16B) throughput 8.062444
LD4 (multiple structures, 4H) throughput 4.069654
LD4 (multiple structures, 8H) throughput 8.066979
LD4 (multiple structures, 2S) throughput 3.999731
LD4 (multiple structures, 4S) throughput 7.948301
LD4 (multiple structures, 2D) throughput 8.060163
LD4 (multiple, 8B, post-index) throughput 4.010692
LD4 (multiple, 16B, post-index) throughput 7.981206
LD4 (multiple, 4H, post-index) throughput 3.962781
LD4 (multiple, 8H, post-index) throughput 8.093958
LD4 (multiple, 2S, post-index) throughput 3.957269
LD4 (multiple, 4S, post-index) throughput 8.088370
LD4 (multiple, 2D, post-index) throughput 8.020004
LD4 (single structure, B) throughput 3.825810
LD4 (single structure, H) throughput 4.008274
LD4 (single structure, S) throughput 4.000988
LD4 (single structure, D) throughput 4.027281
LD4 (single, B, post-index) throughput 3.956026
LD4 (single, H, post-index) throughput 4.041723
LD4 (single, S, post-index) throughput 3.989958
LD4 (single, D, post-index) throughput 4.122687
LD4R (8B) throughput 4.053561
LD4R (16B) throughput 4.037118
LD4R (4H) throughput 4.039982
LD4R (8H) throughput 4.041912
LD4R (2S) throughput 4.023787
LD4R (4S) throughput 4.034545
LD4R (1D) throughput 3.986303
LD4R (2D) throughput 3.977720
LDNP (S) 1->3 roundtrip 7.000762
LDNP (S) 2->3 roundtrip 8.119773
LDNP (S) throughput 1.997603
LDNP (D) 1->3 roundtrip 6.992467
LDNP (D) 2->3 roundtrip 7.865003
LDNP (D) throughput 2.009390
LDNP (Q) 1->3 roundtrip 7.140608
LDNP (Q) 2->3 roundtrip 8.078744
LDNP (Q) throughput 1.970361
LDNP (signed offset, S) 1->3 roundtrip 6.980402
LDNP (signed offset, S) 2->3 roundtrip 8.000654
LDNP (signed offset, S) throughput 2.000679
LDNP (signed offset, D) 1->3 roundtrip 6.897616
LDNP (signed offset, D) 2->3 roundtrip 7.811365
LDNP (signed offset, D) throughput 1.997081
LDNP (signed offset, Q) 1->3 roundtrip 7.085803
LDNP (signed offset, Q) 2->3 roundtrip 11.023365
LDNP (signed offset, Q) throughput 1.947348
LDP (S) 1->3 roundtrip 7.003742
LDP (S) 2->3 roundtrip 8.083244
LDP (S) throughput 2.022976
LDP (D) 1->3 roundtrip 7.130645
LDP (D) 2->3 roundtrip 8.163688
LDP (D) throughput 2.043748
LDP (Q) 1->3 roundtrip 7.064457
LDP (Q) 2->3 roundtrip 8.172987
LDP (Q) throughput 2.018059
LDP (post-index, S) 1->3 roundtrip 7.297353
LDP (post-index, S) 2->3 roundtrip 7.994153
LDP (post-index, S) throughput 2.015806
LDP (post-index, D) 1->3 roundtrip 7.202317
LDP (post-index, D) 2->3 roundtrip 8.219693
LDP (post-index, D) throughput 2.104758
LDP (post-index, Q) 1->3 roundtrip 7.153011
LDP (post-index, Q) 2->3 roundtrip 8.082561
LDP (post-index, Q) throughput 2.079079
LDP (pre-index, S) 1->3 roundtrip 7.059594
LDP (pre-index, S) 2->3 roundtrip 8.324455
LDP (pre-index, S) throughput 2.101429
LDP (pre-index, D) 1->3 roundtrip 7.325863
LDP (pre-index, D) 2->3 roundtrip 8.219350
LDP (pre-index, D) throughput 2.070732
LDP (pre-index, Q) 1->3 roundtrip 7.081176
LDP (pre-index, Q) 2->3 roundtrip 7.872452
LDP (pre-index, Q) throughput 2.113459
LDP (signed offset, S) 1->3 roundtrip 8.130788
LDP (signed offset, S) 2->3 roundtrip 7.815204
LDP (signed offset, S) throughput 2.021555
LDP (signed offset, D) 1->3 roundtrip 7.051319
LDP (signed offset, D) 2->3 roundtrip 8.174056
LDP (signed offset, D) throughput 1.791227
LDP (signed offset, Q) 1->3 roundtrip 6.905685
LDP (signed offset, Q) 2->3 roundtrip 8.148309
LDP (signed offset, Q) throughput 1.998897
LDR (S) 1->2 roundtrip 7.149626
LDR (S) throughput 1.014067
LDR (D) 1->2 roundtrip 7.037952
LDR (D) throughput 1.003665
LDR (Q) 1->2 roundtrip 6.968110
LDR (Q) throughput 1.022553
LDR (post-index, S) 1->2 roundtrip 7.103144
LDR (post-index, S) throughput 1.070907
LDR (post-index, D) 1->2 roundtrip 6.889222
LDR (post-index, D) throughput 1.057968
LDR (post-index, Q) 1->2 roundtrip 14.647482
LDR (post-index, Q) throughput 1.096726
LDR (pre-index, S) 1->2 roundtrip 7.332851
LDR (pre-index, S) throughput 1.098441
LDR (pre-index, D) 1->2 roundtrip 7.304321
LDR (pre-index, D) throughput 1.066280
LDR (pre-index, Q) 1->2 roundtrip 15.798588
LDR (pre-index, Q) throughput 1.091894
LDR (unsigned offset, S) 1->2 roundtrip 7.195555
LDR (unsigned offset, S) throughput 1.010922
LDR (unsigned offset, D) 1->2 roundtrip 7.020637
LDR (unsigned offset, D) throughput 1.006117
LDR (unsigned offset, Q) 1->2 roundtrip 7.126580
LDR (unsigned offset, Q) throughput 1.017005
LDR (literal, S) throughput 1.012070
LDR (literal, D) throughput 1.013506
LDR (literal, Q) throughput 0.977463
LDR (register, S) 1->2 roundtrip 6.999976
LDR (register, S) 1->3 roundtrip 7.081642
LDR (register, S) throughput 1.000024
LDR (register, D) 1->2 roundtrip 7.153651
LDR (register, D) 1->3 roundtrip 7.025644
LDR (register, D) throughput 1.008530
LDR (register, Q) 1->2 roundtrip 7.096594
LDR (register, Q) 1->3 roundtrip 7.137113
LDR (register, Q) throughput 1.007492
LDR (register, uxtw, S) 1->2 roundtrip 7.040530
LDR (register, uxtw, S) 1->3 roundtrip 7.149083
LDR (register, uxtw, S) throughput 1.005010
LDR (register, uxtw, D) 1->2 roundtrip 7.069544
LDR (register, uxtw, D) 1->3 roundtrip 6.927661
LDR (register, uxtw, D) throughput 1.010564
LDR (register, uxtw, Q) 1->2 roundtrip 7.034146
LDR (register, uxtw, Q) 1->3 roundtrip 7.044710
LDR (register, uxtw, Q) throughput 1.015642
LDR (register, sxtw, S) 1->2 roundtrip 7.204588
LDR (register, sxtw, S) 1->3 roundtrip 7.086510
LDR (register, sxtw, S) throughput 1.020610
LDR (register, sxtw, D) 1->2 roundtrip 6.894803
LDR (register, sxtw, D) 1->3 roundtrip 7.061228
LDR (register, sxtw, D) throughput 1.011801
LDR (register, sxtw, Q) 1->2 roundtrip 7.050658
LDR (register, sxtw, Q) 1->3 roundtrip 7.432550
LDR (register, sxtw, Q) throughput 1.017093
LDR (register, lsl, S) 1->2 roundtrip 7.061313
LDR (register, lsl, S) 1->3 roundtrip 6.983034
LDR (register, lsl, S) throughput 1.006468
LDR (register, lsl, D) 1->2 roundtrip 7.291773
LDR (register, lsl, D) 1->3 roundtrip 6.832418
LDR (register, lsl, D) throughput 1.006824
LDR (register, lsl, Q) 1->2 roundtrip 7.240369
LDR (register, lsl, Q) 1->3 roundtrip 8.062310
LDR (register, lsl, Q) throughput 1.031528
LDUR (S) 1->2 roundtrip 7.010993
LDUR (S) throughput 1.004433
LDUR (D) 1->2 roundtrip 7.080629
LDUR (D) throughput 1.002511
LDUR (Q) 1->2 roundtrip 7.027892
LDUR (Q) throughput 1.013174
MLA (by element, 4H) 1->1 1.994123
MLA (by element, 4H) 1->2 3.040918
MLA (by element, 4H) 1->3 3.002875
MLA (by element, 4H) throughput 1.008161
MLA (by element, 4H) throughput 1.001151
MLA (by element, 8H) 1->1 2.060267
MLA (by element, 8H) 1->2 3.038995
MLA (by element, 8H) 1->3 3.024616
MLA (by element, 8H) throughput 1.001055
MLA (by element, 8H) throughput 1.001348
MLA (by element, 2S) 1->1 1.969948
MLA (by element, 2S) 1->2 3.008906
MLA (by element, 2S) 1->3 3.014796
MLA (by element, 2S) throughput 1.029021
MLA (by element, 2S) throughput 1.024570
MLA (by element, 4S) 1->1 2.066809
MLA (by element, 4S) 1->2 3.008056
MLA (by element, 4S) 1->3 3.010495
MLA (by element, 4S) throughput 1.012663
MLA (by element, 4S) throughput 1.021976
MLA (vector, 8B) 1->1 2.010361
MLA (vector, 8B) 1->2 3.007129
MLA (vector, 8B) 1->3 3.015417
MLA (vector, 8B) throughput 0.993066
MLA (vector, 8B) throughput 1.009545
MLA (vector, 16B) 1->1 1.998514
MLA (vector, 16B) 1->2 3.043332
MLA (vector, 16B) 1->3 3.005717
MLA (vector, 16B) throughput 1.009101
MLA (vector, 16B) throughput 0.994515
MLA (vector, 4H) 1->1 2.017646
MLA (vector, 4H) 1->2 3.022631
MLA (vector, 4H) 1->3 3.052392
MLA (vector, 4H) throughput 1.010587
MLA (vector, 4H) throughput 1.008197
MLA (vector, 8H) 1->1 2.007711
MLA (vector, 8H) 1->2 3.063713
MLA (vector, 8H) 1->3 3.020140
MLA (vector, 8H) throughput 1.006693
MLA (vector, 8H) throughput 1.016858
MLA (vector, 2S) 1->1 1.954744
MLA (vector, 2S) 1->2 3.049163
MLA (vector, 2S) 1->3 3.059576
MLA (vector, 2S) throughput 1.012389
MLA (vector, 2S) throughput 1.007875
MLA (vector, 4S) 1->1 1.978865
MLA (vector, 4S) 1->2 3.017502
MLA (vector, 4S) 1->3 3.009832
MLA (vector, 4S) throughput 1.001708
MLA (vector, 4S) throughput 1.007259
MLS (by element, 4H) 1->1 2.030876
MLS (by element, 4H) 1->2 2.997982
MLS (by element, 4H) 1->3 3.020230
MLS (by element, 4H) throughput 1.005687
MLS (by element, 4H) throughput 1.107040
MLS (by element, 8H) 1->1 1.959786
MLS (by element, 8H) 1->2 3.025610
MLS (by element, 8H) 1->3 3.132079
MLS (by element, 8H) throughput 0.844740
MLS (by element, 8H) throughput 1.013551
MLS (by element, 2S) 1->1 2.034451
MLS (by element, 2S) 1->2 3.018182
MLS (by element, 2S) 1->3 3.019767
MLS (by element, 2S) throughput 1.007092
MLS (by element, 2S) throughput 1.004681
MLS (by element, 4S) 1->1 2.027798
MLS (by element, 4S) 1->2 3.014254
MLS (by element, 4S) 1->3 2.994567
MLS (by element, 4S) throughput 1.007058
MLS (by element, 4S) throughput 1.013739
MLS (vector, 8B) 1->1 1.997500
MLS (vector, 8B) 1->2 3.077964
MLS (vector, 8B) 1->3 2.974763
MLS (vector, 8B) throughput 1.006896
MLS (vector, 8B) throughput 0.998205
MLS (vector, 16B) 1->1 2.014013
MLS (vector, 16B) 1->2 2.978360
MLS (vector, 16B) 1->3 3.011215
MLS (vector, 16B) throughput 1.015056
MLS (vector, 16B) throughput 1.008860
MLS (vector, 4H) 1->1 2.007282
MLS (vector, 4H) 1->2 3.056088
MLS (vector, 4H) 1->3 3.171090
MLS (vector, 4H) throughput 1.037902
MLS (vector, 4H) throughput 0.968117
MLS (vector, 8H) 1->1 2.017309
MLS (vector, 8H) 1->2 3.031487
MLS (vector, 8H) 1->3 2.998027
MLS (vector, 8H) throughput 1.008959
MLS (vector, 8H) throughput 0.999855
MLS (vector, 2S) 1->1 2.034893
MLS (vector, 2S) 1->2 3.044404
MLS (vector, 2S) 1->3 3.028880
MLS (vector, 2S) throughput 1.014938
MLS (vector, 2S) throughput 1.013332
MLS (vector, 4S) 1->1 2.024152
MLS (vector, 4S) 1->2 3.048691
MLS (vector, 4S) 1->3 3.021361
MLS (vector, 4S) throughput 0.985544
MLS (vector, 4S) throughput 1.004311
MOV (vector, 8B) 1->2 2.001052
MOV (vector, 8B) throughput 1.008864
MOV (vector, 16B) 1->2 1.515897
MOV (vector, 16B) throughput 0.674997
MOVI (vector, 4H, zero) throughput 0.335973
MOVI (vector, 8H, zero) throughput 0.334946
MOVI (vector, 2S, zero) throughput 0.336656
MOVI (vector, 4S, zero) throughput 0.334022
MOVI (vector, 4H) throughput 1.006349
MOVI (vector, 8H) throughput 1.132170
MOVI (vector, 2S) throughput 1.028326
MOVI (vector, 4S) throughput 0.994822
MOVI (vector, 4H, lsl) throughput 1.006999
MOVI (vector, 8H, lsl) throughput 1.013197
MOVI (vector, 2S, lsl) throughput 1.010297
MOVI (vector, 4S, lsl) throughput 1.003825
MOVI (vector, 2S, msl) throughput 1.001414
MOVI (vector, 4S, msl) throughput 0.990220
MOVI (vector, 64-bit, 2D) throughput 1.005672
MOVI (vector, 64-bit, D) throughput 1.015231
MUL (by element, 4H) 1->2 3.399468
MUL (by element, 4H) 1->3 3.001490
MUL (by element, 4H) throughput 0.998619
MUL (by element, 8H) 1->2 3.033811
MUL (by element, 8H) 1->3 2.968722
MUL (by element, 8H) throughput 1.012007
MUL (by element, 2S) 1->2 3.091594
MUL (by element, 2S) 1->3 3.000657
MUL (by element, 2S) throughput 1.005459
MUL (by element, 4S) 1->2 2.987555
MUL (by element, 4S) 1->3 3.001425
MUL (by element, 4S) throughput 1.007751
MUL (vector, 8B) 1->2 3.021184
MUL (vector, 8B) 1->3 3.040622
MUL (vector, 8B) throughput 1.002308
MUL (vector, 16B) 1->2 3.000043
MUL (vector, 16B) 1->3 3.030257
MUL (vector, 16B) throughput 1.014655
MUL (vector, 4H) 1->2 3.015437
MUL (vector, 4H) 1->3 3.082729
MUL (vector, 4H) throughput 1.015050
MUL (vector, 8H) 1->2 3.030969
MUL (vector, 8H) 1->3 3.011160
MUL (vector, 8H) throughput 0.996488
MUL (vector, 2S) 1->2 2.992596
MUL (vector, 2S) 1->3 3.008077
MUL (vector, 2S) throughput 1.007121
MUL (vector, 4S) 1->2 3.048376
MUL (vector, 4S) 1->3 3.013681
MUL (vector, 4S) throughput 1.013426
MVNI (vector, 4H) throughput 1.004394
MVNI (vector, 8H) throughput 1.022595
MVNI (vector, 2S) throughput 1.007236
MVNI (vector, 4S) throughput 1.009667
MVNI (vector, 4H, lsl) throughput 1.010625
MVNI (vector, 8H, lsl) throughput 0.975818
MVNI (vector, 2S, lsl) throughput 1.008869
MVNI (vector, 4S, lsl) throughput 1.001930
MVNI (vector, 2S, msl) throughput 0.995008
MVNI (vector, 4S, msl) throughput 1.008937
NEG (vector, 8B) 1->2 2.062360
NEG (vector, 8B) throughput 1.008300
NEG (vector, 16B) 1->2 2.006906
NEG (vector, 16B) throughput 1.029551
NEG (vector, 4H) 1->2 2.013239
NEG (vector, 4H) throughput 1.008943
NEG (vector, 8H) 1->2 2.005176
NEG (vector, 8H) throughput 0.933436
NEG (vector, 2S) 1->2 2.035211
NEG (vector, 2S) throughput 1.007089
NEG (vector, 4S) 1->2 2.001447
NEG (vector, 4S) throughput 1.012813
NEG (vector, 2D) 1->2 2.006949
NEG (vector, 2D) throughput 1.004561
NOT (vector, 8B) 1->2 2.013655
NOT (vector, 8B) throughput 0.996764
NOT (vector, 16B) 1->2 1.946171
NOT (vector, 16B) throughput 1.000896
ORN (vector, 8B) 1->2 2.051137
ORN (vector, 8B) 1->3 2.043809
ORN (vector, 8B) throughput 1.005183
ORN (vector, 16B) 1->2 2.009745
ORN (vector, 16B) 1->3 2.011862
ORN (vector, 16B) throughput 1.009445
ORR (vector, immediate, 4H) 1->1 2.014141
ORR (vector, immediate, 4H) throughput 1.001582
ORR (vector, immediate, 4H) throughput 1.004559
ORR (vector, immediate, 8H) 1->1 2.005124
ORR (vector, immediate, 8H) throughput 1.012243
ORR (vector, immediate, 8H) throughput 1.000840
ORR (vector, immediate, 2S) 1->1 2.023244
ORR (vector, immediate, 2S) throughput 1.007483
ORR (vector, immediate, 2S) throughput 1.056901
ORR (vector, immediate, 4S) 1->1 1.928399
ORR (vector, immediate, 4S) throughput 1.039218
ORR (vector, immediate, 4S) throughput 1.035488
ORR (vector, 8B) 1->2 2.040337
ORR (vector, 8B) 1->3 2.112669
ORR (vector, 8B) throughput 1.039529
ORR (vector, 16B) 1->2 2.014870
ORR (vector, 16B) 1->3 2.105945
ORR (vector, 16B) throughput 1.008895
PMUL (8B) 1->2 3.017152
PMUL (8B) 1->3 3.006557
PMUL (8B) throughput 1.023880
PMUL (16B) 1->2 3.001964
PMUL (16B) 1->3 3.020032
PMUL (16B) throughput 1.006121
PMULL (1Q) 1->2 3.009735
PMULL (1Q) 1->3 3.014877
PMULL (1Q) throughput 1.036657
PMULL (8H) 1->2 3.004653
PMULL (8H) 1->3 3.028874
PMULL (8H) throughput 1.007143
PMULL2 (1Q) 1->2 3.012720
PMULL2 (1Q) 1->3 3.023533
PMULL2 (1Q) throughput 0.998412
PMULL2 (8H) 1->2 3.008883
PMULL2 (8H) 1->3 3.003038
PMULL2 (8H) throughput 1.003411
PMULL + EOR (1Q) 1->2 4.560702
PMULL + EOR (1Q) 1->3 4.429133
PMULL + EOR (1Q) 1->4 4.519806
PMULL + EOR (1Q) throughput 1.007774
PMULL + EOR (8H) 1->2 4.462245
PMULL + EOR (8H) 1->3 4.395711
PMULL + EOR (8H) 1->4 4.494503
PMULL + EOR (8H) throughput 1.013351
RADDHN (8H) 1->2 4.052868
RADDHN (8H) 1->3 4.036968
RADDHN (8H) throughput 1.007992
RADDHN (4S) 1->2 3.954327
RADDHN (4S) 1->3 4.012892
RADDHN (4S) throughput 1.008545
RADDHN (2D) 1->2 4.050395
RADDHN (2D) 1->3 5.055486
RADDHN (2D) throughput 1.004871
RADDHN2 (8H) 1->1 4.013042
RADDHN2 (8H) 1->2 4.013946
RADDHN2 (8H) 1->3 4.018853
RADDHN2 (8H) throughput 1.010915
RADDHN2 (8H) throughput 1.004368
RADDHN2 (4S) 1->1 3.979254
RADDHN2 (4S) 1->2 4.066697
RADDHN2 (4S) 1->3 4.011824
RADDHN2 (4S) throughput 0.981022
RADDHN2 (4S) throughput 1.004288
RADDHN2 (2D) 1->1 4.170248
RADDHN2 (2D) 1->2 3.992816
RADDHN2 (2D) 1->3 3.987506
RADDHN2 (2D) throughput 0.997334
RADDHN2 (2D) throughput 1.006608
RBIT (vector, 8B) 1->2 2.010620
RBIT (vector, 8B) throughput 1.005727
RBIT (vector, 16B) 1->2 1.937261
RBIT (vector, 16B) throughput 0.999526
REV16 (vector, 8B) 1->2 2.015333
REV16 (vector, 8B) throughput 1.016656
REV16 (vector, 16B) 1->2 2.040223
REV16 (vector, 16B) throughput 0.987701
REV32 (vector, 8B) 1->2 2.010075
REV32 (vector, 8B) throughput 1.025881
REV32 (vector, 16B) 1->2 2.015111
REV32 (vector, 16B) throughput 0.988869
REV32 (vector, 4H) 1->2 1.891187
REV32 (vector, 4H) throughput 1.011735
REV32 (vector, 8H) 1->2 1.989317
REV32 (vector, 8H) throughput 0.994255
REV64 (vector, 8B) 1->2 1.980373
REV64 (vector, 8B) throughput 1.013282
REV64 (vector, 16B) 1->2 2.020988
REV64 (vector, 16B) throughput 1.005975
REV64 (vector, 4H) 1->2 2.026497
REV64 (vector, 4H) throughput 1.012111
REV64 (vector, 8H) 1->2 2.009837
REV64 (vector, 8H) throughput 1.006337
REV64 (vector, 2S) 1->2 2.008773
REV64 (vector, 2S) throughput 0.991170
REV64 (vector, 4S) 1->2 2.017196
REV64 (vector, 4S) throughput 1.010071
RSHRN (8H) 1->2 4.021408
RSHRN (8H) throughput 0.984314
RSHRN (4S) 1->2 4.059482
RSHRN (4S) throughput 1.011370
RSHRN (2D) 1->2 3.988616
RSHRN (2D) throughput 1.003271
RSHRN2 (8H) 1->1 4.035963
RSHRN2 (8H) 1->2 4.016628
RSHRN2 (8H) throughput 0.999328
RSHRN2 (8H) throughput 0.997781
RSHRN2 (4S) 1->1 4.082512
RSHRN2 (4S) 1->2 3.924463
RSHRN2 (4S) throughput 1.014018
RSHRN2 (4S) throughput 1.001584
RSHRN2 (2D) 1->1 4.005322
RSHRN2 (2D) 1->2 4.014959
RSHRN2 (2D) throughput 1.004425
RSHRN2 (2D) throughput 1.004984
RSUBHN (8H) 1->2 4.006977
RSUBHN (8H) 1->3 4.043029
RSUBHN (8H) throughput 1.012152
RSUBHN (4S) 1->2 4.001765
RSUBHN (4S) 1->3 4.151385
RSUBHN (4S) throughput 0.996860
RSUBHN (2D) 1->2 4.042812
RSUBHN (2D) 1->3 4.009043
RSUBHN (2D) throughput 1.009465
RSUBHN2 (8H) 1->1 4.022807
RSUBHN2 (8H) 1->2 4.035021
RSUBHN2 (8H) 1->3 4.020748
RSUBHN2 (8H) throughput 0.997086
RSUBHN2 (8H) throughput 1.010594
RSUBHN2 (4S) 1->1 4.034768
RSUBHN2 (4S) 1->2 4.004540
RSUBHN2 (4S) 1->3 4.237777
RSUBHN2 (4S) throughput 1.010794
RSUBHN2 (4S) throughput 1.001294
RSUBHN2 (2D) 1->1 3.984003
RSUBHN2 (2D) 1->2 4.048976
RSUBHN2 (2D) 1->3 4.031761
RSUBHN2 (2D) throughput 1.008312
RSUBHN2 (2D) throughput 1.002700
SABA (vector, 8B) 1->1 1.994392
SABA (vector, 8B) 1->2 3.047644
SABA (vector, 8B) 1->3 3.028680
SABA (vector, 8B) throughput 1.005149
SABA (vector, 8B) throughput 0.862255
SABA (vector, 16B) 1->1 1.998728
SABA (vector, 16B) 1->2 3.054772
SABA (vector, 16B) 1->3 3.201108
SABA (vector, 16B) throughput 1.022395
SABA (vector, 16B) throughput 1.002724
SABA (vector, 4H) 1->1 1.955581
SABA (vector, 4H) 1->2 3.097438
SABA (vector, 4H) 1->3 3.143701
SABA (vector, 4H) throughput 1.016649
SABA (vector, 4H) throughput 1.014533
SABA (vector, 8H) 1->1 2.001360
SABA (vector, 8H) 1->2 3.004486
SABA (vector, 8H) 1->3 2.864721
SABA (vector, 8H) throughput 1.003880
SABA (vector, 8H) throughput 1.008136
SABA (vector, 2S) 1->1 2.043830
SABA (vector, 2S) 1->2 3.014002
SABA (vector, 2S) 1->3 3.067800
SABA (vector, 2S) throughput 1.004516
SABA (vector, 2S) throughput 1.011657
SABA (vector, 4S) 1->1 1.994240
SABA (vector, 4S) 1->2 3.034458
SABA (vector, 4S) 1->3 3.040355
SABA (vector, 4S) throughput 1.012051
SABA (vector, 4S) throughput 1.023626
SABAL (vector, 8H) 1->1 2.000457
SABAL (vector, 8H) 1->2 2.997173
SABAL (vector, 8H) 1->3 3.046583
SABAL (vector, 8H) throughput 1.012400
SABAL (vector, 8H) throughput 1.014495
SABAL (vector, 4S) 1->1 2.000153
SABAL (vector, 4S) 1->2 3.023993
SABAL (vector, 4S) 1->3 2.975585
SABAL (vector, 4S) throughput 1.021654
SABAL (vector, 4S) throughput 1.000362
SABAL (vector, 2D) 1->1 2.001028
SABAL (vector, 2D) 1->2 3.027742
SABAL (vector, 2D) 1->3 2.990141
SABAL (vector, 2D) throughput 0.987691
SABAL (vector, 2D) throughput 0.986973
SABAL2 (vector, 8H) 1->1 2.027271
SABAL2 (vector, 8H) 1->2 3.021909
SABAL2 (vector, 8H) 1->3 3.000709
SABAL2 (vector, 8H) throughput 1.002900
SABAL2 (vector, 8H) throughput 1.003088
SABAL2 (vector, 4S) 1->1 1.997083
SABAL2 (vector, 4S) 1->2 3.004890
SABAL2 (vector, 4S) 1->3 2.996424
SABAL2 (vector, 4S) throughput 1.016295
SABAL2 (vector, 4S) throughput 1.008308
SABAL2 (vector, 2D) 1->1 1.996212
SABAL2 (vector, 2D) 1->2 3.008902
SABAL2 (vector, 2D) 1->3 3.046473
SABAL2 (vector, 2D) throughput 1.007528
SABAL2 (vector, 2D) throughput 1.010111
SABD (vector, 8B) 1->2 3.028707
SABD (vector, 8B) 1->3 3.024048
SABD (vector, 8B) throughput 1.005396
SABD (vector, 16B) 1->2 2.959471
SABD (vector, 16B) 1->3 2.967510
SABD (vector, 16B) throughput 0.990578
SABD (vector, 4H) 1->2 3.009212
SABD (vector, 4H) 1->3 2.967480
SABD (vector, 4H) throughput 1.009673
SABD (vector, 8H) 1->2 3.047693
SABD (vector, 8H) 1->3 3.002095
SABD (vector, 8H) throughput 0.970576
SABD (vector, 2S) 1->2 3.011112
SABD (vector, 2S) 1->3 2.995547
SABD (vector, 2S) throughput 1.132258
SABD (vector, 4S) 1->2 2.974151
SABD (vector, 4S) 1->3 3.009981
SABD (vector, 4S) throughput 1.048315
SABDL (vector, 8H) 1->2 3.118561
SABDL (vector, 8H) 1->3 3.007303
SABDL (vector, 8H) throughput 0.984402
SABDL (vector, 4S) 1->2 3.022789
SABDL (vector, 4S) 1->3 2.962503
SABDL (vector, 4S) throughput 1.005068
SABDL (vector, 2D) 1->2 3.000389
SABDL (vector, 2D) 1->3 3.021050
SABDL (vector, 2D) throughput 1.019036
SABDL2 (vector, 8H) 1->2 2.972069
SABDL2 (vector, 8H) 1->3 3.016052
SABDL2 (vector, 8H) throughput 1.004494
SABDL2 (vector, 4S) 1->2 3.022530
SABDL2 (vector, 4S) 1->3 3.028053
SABDL2 (vector, 4S) throughput 1.015183
SABDL2 (vector, 2D) 1->2 3.004781
SABDL2 (vector, 2D) 1->3 3.006666
SABDL2 (vector, 2D) throughput 1.007565
SADALP (vector, 4H) 1->1 2.008577
SADALP (vector, 4H) 1->2 2.952439
SADALP (vector, 4H) throughput 0.989935
SADALP (vector, 4H) throughput 1.015091
SADALP (vector, 8H) 1->1 2.004141
SADALP (vector, 8H) 1->2 3.846702
SADALP (vector, 8H) throughput 1.012482
SADALP (vector, 8H) throughput 0.995170
SADALP (vector, 2S) 1->1 1.985839
SADALP (vector, 2S) 1->2 3.027417
SADALP (vector, 2S) throughput 1.006622
SADALP (vector, 2S) throughput 1.006187
SADALP (vector, 4S) 1->1 1.999058
SADALP (vector, 4S) 1->2 2.977011
SADALP (vector, 4S) throughput 1.009454
SADALP (vector, 4S) throughput 1.044498
SADALP (vector, 1D) 1->1 1.993797
SADALP (vector, 1D) 1->2 3.034943
SADALP (vector, 1D) throughput 1.008253
SADALP (vector, 1D) throughput 1.006941
SADALP (vector, 2D) 1->1 2.046205
SADALP (vector, 2D) 1->2 2.999386
SADALP (vector, 2D) throughput 1.010839
SADALP (vector, 2D) throughput 0.989324
SADDL (vector, 8H) 1->2 2.005225
SADDL (vector, 8H) 1->3 2.012942
SADDL (vector, 8H) throughput 0.987288
SADDL (vector, 4S) 1->2 2.003268
SADDL (vector, 4S) 1->3 2.005789
SADDL (vector, 4S) throughput 1.002487
SADDL (vector, 2D) 1->2 1.965871
SADDL (vector, 2D) 1->3 1.974976
SADDL (vector, 2D) throughput 1.008132
SADDL2 (vector, 8H) 1->2 2.030874
SADDL2 (vector, 8H) 1->3 2.007753
SADDL2 (vector, 8H) throughput 1.001471
SADDL2 (vector, 4S) 1->2 2.004473
SADDL2 (vector, 4S) 1->3 1.964516
SADDL2 (vector, 4S) throughput 1.000653
SADDL2 (vector, 2D) 1->2 2.057561
SADDL2 (vector, 2D) 1->3 1.997192
SADDL2 (vector, 2D) throughput 1.005264
SADDLP (vector, 4H) 1->2 2.033428
SADDLP (vector, 4H) throughput 0.975096
SADDLP (vector, 8H) 1->2 2.025358
SADDLP (vector, 8H) throughput 1.017959
SADDLP (vector, 2S) 1->2 1.994348
SADDLP (vector, 2S) throughput 0.994529
SADDLP (vector, 4S) 1->2 2.003772
SADDLP (vector, 4S) throughput 1.000782
SADDLP (vector, 1D) 1->2 2.001098
SADDLP (vector, 1D) throughput 0.991205
SADDLP (vector, 2D) 1->2 1.987527
SADDLP (vector, 2D) throughput 0.982868
SADDLV (vector, 8B) 1->2 2.876083
SADDLV (vector, 8B) throughput 1.027311
SADDLV (vector, 16B) 1->2 3.026380
SADDLV (vector, 16B) throughput 1.036184
SADDLV (vector, 4H) 1->2 3.029262
SADDLV (vector, 4H) throughput 1.019246
SADDLV (vector, 8H) 1->2 3.037059
SADDLV (vector, 8H) throughput 1.004186
SADDLV (vector, 4S) 1->2 2.936999
SADDLV (vector, 4S) throughput 1.015364
SADDW (vector, 8H) 1->2 2.022099
SADDW (vector, 8H) 1->3 2.028720
SADDW (vector, 8H) throughput 1.007651
SADDW (vector, 4S) 1->2 2.062081
SADDW (vector, 4S) 1->3 2.003254
SADDW (vector, 4S) throughput 1.011508
SADDW (vector, 2D) 1->2 2.046820
SADDW (vector, 2D) 1->3 1.989716
SADDW (vector, 2D) throughput 1.005253
SADDW2 (vector, 8H) 1->2 2.016670
SADDW2 (vector, 8H) 1->3 2.046380
SADDW2 (vector, 8H) throughput 0.986631
SADDW2 (vector, 4S) 1->2 2.006650
SADDW2 (vector, 4S) 1->3 2.024316
SADDW2 (vector, 4S) throughput 1.022784
SADDW2 (vector, 2D) 1->2 2.010169
SADDW2 (vector, 2D) 1->3 1.994109
SADDW2 (vector, 2D) throughput 1.017668
SCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 9.918009
SCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 1.009686
SCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 10.074537
SCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 1.007089
SCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 10.100717
SCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 1.007194
SCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 10.044557
SCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 1.012941
SCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 9.935330
SCVTF (scalar, integer, S from 32-bit GPR) throughput 1.006438
SCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.027651
SCVTF (scalar, integer, D from 32-bit GPR) throughput 1.015946
SCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 10.006854
SCVTF (scalar, integer, S from 64-bit GPR) throughput 1.006996
SCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 10.039849
SCVTF (scalar, integer, D from 64-bit GPR) throughput 1.004608
SCVTF (scalar, fixed-point, S from S) 1->2 2.999579
SCVTF (scalar, fixed-point, S from S) throughput 1.012036
SCVTF (scalar, fixed-point, D from D) 1->2 3.011327
SCVTF (scalar, fixed-point, D from D) throughput 0.999926
SCVTF (vector, fixed-point, 2S from 2S) 1->2 3.024322
SCVTF (vector, fixed-point, 2S from 2S) throughput 1.012638
SCVTF (vector, fixed-point, 4S from 4S) 1->2 2.986756
SCVTF (vector, fixed-point, 4S from 4S) throughput 1.005748
SCVTF (vector, fixed-point, 2D from 2D) 1->2 3.023377
SCVTF (vector, fixed-point, 2D from 2D) throughput 0.974618
SCVTF (scalar, integer, S from S) 1->2 3.066739
SCVTF (scalar, integer, S from S) throughput 1.009499
SCVTF (scalar, integer, D from D) 1->2 3.029985
SCVTF (scalar, integer, D from D) throughput 1.001533
SCVTF (vector, integer, 2S from 2S) 1->2 3.036120
SCVTF (vector, integer, 2S from 2S) throughput 1.008807
SCVTF (vector, integer, 4S from 4S) 1->2 2.884876
SCVTF (vector, integer, 4S from 4S) throughput 0.971467
SCVTF (vector, integer, 2D from 2D) 1->2 3.038833
SCVTF (vector, integer, 2D from 2D) throughput 1.014511
SHA1C 1->1 5.024894
SHA1C 1->2 5.060340
SHA1C 1->3 5.028331
SHA1C throughput 4.036938
SHA1C throughput 3.908395
SHA1H 1->2 1.955366
SHA1H throughput 1.005860
SHA1M 1->1 5.073646
SHA1M 1->2 4.985275
SHA1M 1->3 5.038137
SHA1M throughput 4.028662
SHA1M throughput 4.035788
SHA1P 1->1 5.012074
SHA1P 1->2 4.998680
SHA1P 1->3 5.025577
SHA1P throughput 3.996057
SHA1P throughput 4.022321
SHA1SU0 1->1 2.005135
SHA1SU0 1->2 1.999934
SHA1SU0 1->3 2.047283
SHA1SU0 throughput 0.994366
SHA1SU0 throughput 1.015655
SHA1SU1 1->1 1.994633
SHA1SU1 1->2 1.998092
SHA1SU1 throughput 1.003034
SHA1SU1 throughput 1.007335
SHA256H 1->1 5.081018
SHA256H 1->2 5.041674
SHA256H 1->3 4.990472
SHA256H throughput 4.060842
SHA256H throughput 4.023415
SHA256H2 1->1 5.027326
SHA256H2 1->2 5.018499
SHA256H2 1->3 5.026376
SHA256H2 throughput 3.985535
SHA256H2 throughput 3.992722
SHA256SU0 1->1 1.994962
SHA256SU0 1->2 2.011590
SHA256SU0 throughput 1.016746
SHA256SU0 throughput 1.020090
SHA256SU1 1->1 3.037013
SHA256SU1 1->2 2.952537
SHA256SU1 1->3 3.006110
SHA256SU1 throughput 1.004076
SHA256SU1 throughput 1.000225
SHADD (vector, 8B) 1->2 2.014433
SHADD (vector, 8B) 1->3 2.000811
SHADD (vector, 8B) throughput 1.005955
SHADD (vector, 16B) 1->2 1.975699
SHADD (vector, 16B) 1->3 2.009890
SHADD (vector, 16B) throughput 1.014230
SHADD (vector, 4H) 1->2 1.981427
SHADD (vector, 4H) 1->3 2.018562
SHADD (vector, 4H) throughput 0.988409
SHADD (vector, 8H) 1->2 2.015649
SHADD (vector, 8H) 1->3 1.996123
SHADD (vector, 8H) throughput 1.008086
SHADD (vector, 2S) 1->2 2.007543
SHADD (vector, 2S) 1->3 2.037379
SHADD (vector, 2S) throughput 1.010055
SHADD (vector, 4S) 1->2 2.018575
SHADD (vector, 4S) 1->3 1.983412
SHADD (vector, 4S) throughput 1.009104
SHL (vector, 8B) 1->2 2.007887
SHL (vector, 8B) throughput 1.005609
SHL (vector, 16B) 1->2 1.994632
SHL (vector, 16B) throughput 1.013253
SHL (vector, 4H) 1->2 2.019109
SHL (vector, 4H) throughput 1.009391
SHL (vector, 8H) 1->2 1.998490
SHL (vector, 8H) throughput 1.024142
SHL (vector, 2S) 1->2 2.012197
SHL (vector, 2S) throughput 0.998608
SHL (vector, 4S) 1->2 2.025752
SHL (vector, 4S) throughput 1.004070
SHL (vector, 2D) 1->2 1.999145
SHL (vector, 2D) throughput 1.004371
SHL (scalar, D) 1->2 2.008872
SHL (scalar, D) throughput 0.992533
SHLL (vector, 8H) 1->2 2.048405
SHLL (vector, 8H) throughput 1.021637
SHLL (vector, 4S) 1->2 2.024587
SHLL (vector, 4S) throughput 1.006487
SHLL (vector, 2D) 1->2 2.018170
SHLL (vector, 2D) throughput 1.006623
SHLL2 (vector, 8H) 1->2 1.958272
SHLL2 (vector, 8H) throughput 0.997561
SHLL2 (vector, 4S) 1->2 2.022873
SHLL2 (vector, 4S) throughput 1.007682
SHLL2 (vector, 2D) 1->2 2.011115
SHLL2 (vector, 2D) throughput 0.996469
SHRN (8H) 1->2 4.362977
SHRN (8H) throughput 0.964550
SHRN (4S) 1->2 3.081054
SHRN (4S) throughput 1.130533
SHRN (2D) 1->2 3.165798
SHRN (2D) throughput 0.864060
SHRN2 (8H) 1->1 4.047584
SHRN2 (8H) 1->2 3.972984
SHRN2 (8H) throughput 1.000829
SHRN2 (8H) throughput 1.012462
SHRN2 (4S) 1->1 3.856936
SHRN2 (4S) 1->2 4.028558
SHRN2 (4S) throughput 1.008792
SHRN2 (4S) throughput 0.999261
SHRN2 (2D) 1->1 4.066367
SHRN2 (2D) 1->2 4.048712
SHRN2 (2D) throughput 1.004621
SHRN2 (2D) throughput 1.010732
SHSUB (vector, 8B) 1->2 1.996891
SHSUB (vector, 8B) 1->3 2.005031
SHSUB (vector, 8B) throughput 1.000687
SHSUB (vector, 16B) 1->2 2.037747
SHSUB (vector, 16B) 1->3 2.268977
SHSUB (vector, 16B) throughput 1.013036
SHSUB (vector, 4H) 1->2 2.005428
SHSUB (vector, 4H) 1->3 2.028858
SHSUB (vector, 4H) throughput 1.007361
SHSUB (vector, 8H) 1->2 1.933147
SHSUB (vector, 8H) 1->3 2.011587
SHSUB (vector, 8H) throughput 1.006291
SHSUB (vector, 2S) 1->2 2.009723
SHSUB (vector, 2S) 1->3 2.003049
SHSUB (vector, 2S) throughput 1.007588
SHSUB (vector, 4S) 1->2 2.000811
SHSUB (vector, 4S) 1->3 2.006818
SHSUB (vector, 4S) throughput 1.013420
SLI (vector, 8B) 1->1 1.994415
SLI (vector, 8B) 1->2 2.018824
SLI (vector, 8B) throughput 0.995906
SLI (vector, 8B) throughput 1.008599
SLI (vector, 16B) 1->1 1.999433
SLI (vector, 16B) 1->2 2.002454
SLI (vector, 16B) throughput 0.999471
SLI (vector, 16B) throughput 1.017845
SLI (vector, 4H) 1->1 2.017881
SLI (vector, 4H) 1->2 2.013723
SLI (vector, 4H) throughput 0.993576
SLI (vector, 4H) throughput 1.011954
SLI (vector, 8H) 1->1 2.009149
SLI (vector, 8H) 1->2 2.012060
SLI (vector, 8H) throughput 0.978534
SLI (vector, 8H) throughput 0.945842
SLI (vector, 2S) 1->1 1.996716
SLI (vector, 2S) 1->2 2.001229
SLI (vector, 2S) throughput 1.005853
SLI (vector, 2S) throughput 0.987195
SLI (vector, 4S) 1->1 2.052204
SLI (vector, 4S) 1->2 2.004190
SLI (vector, 4S) throughput 1.000006
SLI (vector, 4S) throughput 1.006689
SLI (vector, 2D) 1->1 1.998370
SLI (vector, 2D) 1->2 2.031189
SLI (vector, 2D) throughput 1.004385
SLI (vector, 2D) throughput 1.008674
SLI (scalar, D) 1->1 2.005833
SLI (scalar, D) 1->2 1.990370
SLI (scalar, D) throughput 1.005812
SLI (scalar, D) throughput 1.015759
SMAX (vector, 8B) 1->2 3.014671
SMAX (vector, 8B) 1->3 3.019427
SMAX (vector, 8B) throughput 1.012925
SMAX (vector, 16B) 1->2 2.996147
SMAX (vector, 16B) 1->3 3.001992
SMAX (vector, 16B) throughput 1.061324
SMAX (vector, 4H) 1->2 3.009434
SMAX (vector, 4H) 1->3 2.922535
SMAX (vector, 4H) throughput 1.034461
SMAX (vector, 8H) 1->2 3.000109
SMAX (vector, 8H) 1->3 3.031359
SMAX (vector, 8H) throughput 1.010554
SMAX (vector, 2S) 1->2 3.026101
SMAX (vector, 2S) 1->3 3.007740
SMAX (vector, 2S) throughput 0.998060
SMAX (vector, 4S) 1->2 2.999604
SMAX (vector, 4S) 1->3 2.993899
SMAX (vector, 4S) throughput 1.009445
SMAXP (vector, 8B) 1->2 3.048076
SMAXP (vector, 8B) 1->3 2.996582
SMAXP (vector, 8B) throughput 1.009835
SMAXP (vector, 16B) 1->2 2.998057
SMAXP (vector, 16B) 1->3 3.039413
SMAXP (vector, 16B) throughput 0.997744
SMAXP (vector, 4H) 1->2 2.996512
SMAXP (vector, 4H) 1->3 3.102324
SMAXP (vector, 4H) throughput 1.431876
SMAXP (vector, 8H) 1->2 3.079655
SMAXP (vector, 8H) 1->3 3.011058
SMAXP (vector, 8H) throughput 1.007411
SMAXP (vector, 2S) 1->2 3.027074
SMAXP (vector, 2S) 1->3 3.013224
SMAXP (vector, 2S) throughput 1.011341
SMAXP (vector, 4S) 1->2 3.011237
SMAXP (vector, 4S) 1->3 3.043309
SMAXP (vector, 4S) throughput 0.992858
SMAXV (vector, 8B) 1->2 3.165228
SMAXV (vector, 8B) throughput 0.985150
SMAXV (vector, 16B) 1->2 3.038462
SMAXV (vector, 16B) throughput 1.007722
SMAXV (vector, 4H) 1->2 3.003219
SMAXV (vector, 4H) throughput 1.081817
SMAXV (vector, 8H) 1->2 3.002671
SMAXV (vector, 8H) throughput 1.005036
SMAXV (vector, 4S) 1->2 3.010799
SMAXV (vector, 4S) throughput 1.009986
SMIN (vector, 8B) 1->2 3.016535
SMIN (vector, 8B) 1->3 3.014031
SMIN (vector, 8B) throughput 1.016578
SMIN (vector, 16B) 1->2 3.005854
SMIN (vector, 16B) 1->3 3.020804
SMIN (vector, 16B) throughput 1.007599
SMIN (vector, 4H) 1->2 3.025290
SMIN (vector, 4H) 1->3 3.019607
SMIN (vector, 4H) throughput 1.012207
SMIN (vector, 8H) 1->2 2.996149
SMIN (vector, 8H) 1->3 3.296800
SMIN (vector, 8H) throughput 0.989174
SMIN (vector, 2S) 1->2 3.023693
SMIN (vector, 2S) 1->3 3.039746
SMIN (vector, 2S) throughput 1.008986
SMIN (vector, 4S) 1->2 2.991694
SMIN (vector, 4S) 1->3 3.018861
SMIN (vector, 4S) throughput 1.003750
SMINP (vector, 8B) 1->2 3.030001
SMINP (vector, 8B) 1->3 3.028025
SMINP (vector, 8B) throughput 1.008092
SMINP (vector, 16B) 1->2 3.018684
SMINP (vector, 16B) 1->3 3.003794
SMINP (vector, 16B) throughput 1.008781
SMINP (vector, 4H) 1->2 3.045878
SMINP (vector, 4H) 1->3 3.049751
SMINP (vector, 4H) throughput 1.000417
SMINP (vector, 8H) 1->2 2.949443
SMINP (vector, 8H) 1->3 3.033535
SMINP (vector, 8H) throughput 0.965571
SMINP (vector, 2S) 1->2 3.013553
SMINP (vector, 2S) 1->3 3.515670
SMINP (vector, 2S) throughput 1.007884
SMINP (vector, 4S) 1->2 3.090913
SMINP (vector, 4S) 1->3 3.040656
SMINP (vector, 4S) throughput 1.023803
SMINV (vector, 8B) 1->2 2.997982
SMINV (vector, 8B) throughput 1.008893
SMINV (vector, 16B) 1->2 2.997939
SMINV (vector, 16B) throughput 1.015140
SMINV (vector, 4H) 1->2 3.008753
SMINV (vector, 4H) throughput 1.000623
SMINV (vector, 8H) 1->2 3.011327
SMINV (vector, 8H) throughput 1.009928
SMINV (vector, 4S) 1->2 3.001358
SMINV (vector, 4S) throughput 1.011418
SMLAL (by element, 4S) 1->1 2.018334
SMLAL (by element, 4S) 1->2 3.018965
SMLAL (by element, 4S) 1->3 3.043051
SMLAL (by element, 4S) throughput 1.007378
SMLAL (by element, 4S) throughput 1.010860
SMLAL (by element, 2D) 1->1 2.036607
SMLAL (by element, 2D) 1->2 2.998465
SMLAL (by element, 2D) 1->3 2.998466
SMLAL (by element, 2D) throughput 1.006621
SMLAL (by element, 2D) throughput 1.016241
SMLAL2 (by element, 4S) 1->1 1.982368
SMLAL2 (by element, 4S) 1->2 3.009029
SMLAL2 (by element, 4S) 1->3 3.013799
SMLAL2 (by element, 4S) throughput 1.001305
SMLAL2 (by element, 4S) throughput 0.987019
SMLAL2 (by element, 2D) 1->1 2.036859
SMLAL2 (by element, 2D) 1->2 3.003571
SMLAL2 (by element, 2D) 1->3 3.023052
SMLAL2 (by element, 2D) throughput 1.007320
SMLAL2 (by element, 2D) throughput 1.023619
SMLAL (vector, 8H) 1->1 1.991925
SMLAL (vector, 8H) 1->2 2.965061
SMLAL (vector, 8H) 1->3 3.031171
SMLAL (vector, 8H) throughput 1.006054
SMLAL (vector, 8H) throughput 1.006391
SMLAL (vector, 4S) 1->1 2.022681
SMLAL (vector, 4S) 1->2 2.968041
SMLAL (vector, 4S) 1->3 2.982587
SMLAL (vector, 4S) throughput 1.008911
SMLAL (vector, 4S) throughput 0.988664
SMLAL (vector, 2D) 1->1 2.016841
SMLAL (vector, 2D) 1->2 3.021547
SMLAL (vector, 2D) 1->3 3.013640
SMLAL (vector, 2D) throughput 1.008892
SMLAL (vector, 2D) throughput 1.007548
SMLAL2 (vector, 8H) 1->1 2.020276
SMLAL2 (vector, 8H) 1->2 2.999020
SMLAL2 (vector, 8H) 1->3 3.048935
SMLAL2 (vector, 8H) throughput 1.006715
SMLAL2 (vector, 8H) throughput 1.000328
SMLAL2 (vector, 4S) 1->1 1.964508
SMLAL2 (vector, 4S) 1->2 3.030549
SMLAL2 (vector, 4S) 1->3 3.024920
SMLAL2 (vector, 4S) throughput 0.991948
SMLAL2 (vector, 4S) throughput 0.995149
SMLAL2 (vector, 2D) 1->1 1.998314
SMLAL2 (vector, 2D) 1->2 2.899380
SMLAL2 (vector, 2D) 1->3 3.026814
SMLAL2 (vector, 2D) throughput 1.007750
SMLAL2 (vector, 2D) throughput 1.039592
SMLSL (by element, 4S) 1->1 2.009166
SMLSL (by element, 4S) 1->2 3.137904
SMLSL (by element, 4S) 1->3 2.987486
SMLSL (by element, 4S) throughput 1.015109
SMLSL (by element, 4S) throughput 1.012478
SMLSL (by element, 2D) 1->1 2.005500
SMLSL (by element, 2D) 1->2 3.011338
SMLSL (by element, 2D) 1->3 2.970615
SMLSL (by element, 2D) throughput 1.011992
SMLSL (by element, 2D) throughput 1.004947
SMLSL2 (by element, 4S) 1->1 1.989763
SMLSL2 (by element, 4S) 1->2 3.047848
SMLSL2 (by element, 4S) 1->3 2.951297
SMLSL2 (by element, 4S) throughput 1.001514
SMLSL2 (by element, 4S) throughput 1.026799
SMLSL2 (by element, 2D) 1->1 2.032012
SMLSL2 (by element, 2D) 1->2 2.992554
SMLSL2 (by element, 2D) 1->3 3.045947
SMLSL2 (by element, 2D) throughput 1.008740
SMLSL2 (by element, 2D) throughput 1.007644
SMLSL (vector, 8H) 1->1 1.996996
SMLSL (vector, 8H) 1->2 3.065685
SMLSL (vector, 8H) 1->3 3.135816
SMLSL (vector, 8H) throughput 1.006559
SMLSL (vector, 8H) throughput 1.007429
SMLSL (vector, 4S) 1->1 1.994349
SMLSL (vector, 4S) 1->2 2.819862
SMLSL (vector, 4S) 1->3 3.024500
SMLSL (vector, 4S) throughput 1.010604
SMLSL (vector, 4S) throughput 0.978929
SMLSL (vector, 2D) 1->1 1.932752
SMLSL (vector, 2D) 1->2 3.046364
SMLSL (vector, 2D) 1->3 3.002125
SMLSL (vector, 2D) throughput 1.010234
SMLSL (vector, 2D) throughput 1.014273
SMLSL2 (vector, 8H) 1->1 2.002562
SMLSL2 (vector, 8H) 1->2 2.738021
SMLSL2 (vector, 8H) 1->3 3.145145
SMLSL2 (vector, 8H) throughput 1.167923
SMLSL2 (vector, 8H) throughput 1.011521
SMLSL2 (vector, 4S) 1->1 1.998554
SMLSL2 (vector, 4S) 1->2 3.027955
SMLSL2 (vector, 4S) 1->3 3.034694
SMLSL2 (vector, 4S) throughput 1.035149
SMLSL2 (vector, 4S) throughput 1.005616
SMLSL2 (vector, 2D) 1->1 1.971424
SMLSL2 (vector, 2D) 1->2 2.998181
SMLSL2 (vector, 2D) 1->3 3.005257
SMLSL2 (vector, 2D) throughput 0.993260
SMLSL2 (vector, 2D) throughput 1.013377
SMULL (by element, 4S) 1->2 3.028554
SMULL (by element, 4S) 1->3 3.002851
SMULL (by element, 4S) throughput 1.017259
SMULL (by element, 2D) 1->2 3.028333
SMULL (by element, 2D) 1->3 2.994239
SMULL (by element, 2D) throughput 1.007325
SMULL2 (by element, 4S) 1->2 2.888566
SMULL2 (by element, 4S) 1->3 2.939609
SMULL2 (by element, 4S) throughput 1.017117
SMULL2 (by element, 2D) 1->2 3.025217
SMULL2 (by element, 2D) 1->3 3.050674
SMULL2 (by element, 2D) throughput 1.004841
SMULL (vector, 8H) 1->2 2.997894
SMULL (vector, 8H) 1->3 2.962427
SMULL (vector, 8H) throughput 1.012578
SMULL (vector, 4S) 1->2 3.021884
SMULL (vector, 4S) 1->3 3.038248
SMULL (vector, 4S) throughput 1.011746
SMULL (vector, 2D) 1->2 2.996088
SMULL (vector, 2D) 1->3 2.996722
SMULL (vector, 2D) throughput 1.010501
SMULL2 (vector, 8H) 1->2 2.997676
SMULL2 (vector, 8H) 1->3 3.038463
SMULL2 (vector, 8H) throughput 0.973446
SMULL2 (vector, 4S) 1->2 3.026666
SMULL2 (vector, 4S) 1->3 3.076801
SMULL2 (vector, 4S) throughput 0.997702
SMULL2 (vector, 2D) 1->2 2.994197
SMULL2 (vector, 2D) 1->3 3.029254
SMULL2 (vector, 2D) throughput 0.987966
SQABS (scalar, B) 1->2 3.030896
SQABS (scalar, B) throughput 1.008203
SQABS (scalar, H) 1->2 2.981377
SQABS (scalar, H) throughput 1.009071
SQABS (scalar, S) 1->2 3.028390
SQABS (scalar, S) throughput 1.009327
SQABS (scalar, D) 1->2 3.052132
SQABS (scalar, D) throughput 0.972796
SQABS (vector, 8B) 1->2 3.040826
SQABS (vector, 8B) throughput 1.006626
SQABS (vector, 16B) 1->2 2.997793
SQABS (vector, 16B) throughput 0.988591
SQABS (vector, 4H) 1->2 2.998026
SQABS (vector, 4H) throughput 1.003934
SQABS (vector, 8H) 1->2 2.997916
SQABS (vector, 8H) throughput 1.004963
SQABS (vector, 2S) 1->2 3.021002
SQABS (vector, 2S) throughput 1.007884
SQABS (vector, 4S) 1->2 3.005438
SQABS (vector, 4S) throughput 1.005262
SQABS (vector, 2D) 1->2 3.037397
SQABS (vector, 2D) throughput 1.026496
SQADD (scalar, B) 1->2 3.059032
SQADD (scalar, B) 1->3 3.012455
SQADD (scalar, B) throughput 0.988183
SQADD (scalar, H) 1->2 3.036362
SQADD (scalar, H) 1->3 3.054491
SQADD (scalar, H) throughput 1.012725
SQADD (scalar, S) 1->2 2.940832
SQADD (scalar, S) 1->3 3.030529
SQADD (scalar, S) throughput 1.011217
SQADD (scalar, D) 1->2 3.048315
SQADD (scalar, D) 1->3 2.972213
SQADD (scalar, D) throughput 0.999130
SQADD (vector, 8B) 1->2 3.011260
SQADD (vector, 8B) 1->3 3.037475
SQADD (vector, 8B) throughput 1.015563
SQADD (vector, 16B) 1->2 3.029073
SQADD (vector, 16B) 1->3 3.022583
SQADD (vector, 16B) throughput 1.007246
SQADD (vector, 4H) 1->2 3.036450
SQADD (vector, 4H) 1->3 3.007324
SQADD (vector, 4H) throughput 1.013047
SQADD (vector, 8H) 1->2 3.024412
SQADD (vector, 8H) 1->3 3.021251
SQADD (vector, 8H) throughput 1.009738
SQADD (vector, 2S) 1->2 2.995616
SQADD (vector, 2S) 1->3 3.040906
SQADD (vector, 2S) throughput 1.005874
SQADD (vector, 4S) 1->2 3.022564
SQADD (vector, 4S) 1->3 3.025945
SQADD (vector, 4S) throughput 1.000129
SQADD (vector, 2D) 1->2 3.001302
SQADD (vector, 2D) 1->3 3.000317
SQADD (vector, 2D) throughput 1.005721
SQDMLAL (by element, S) 1->1 2.063067
SQDMLAL (by element, S) 1->2 3.031570
SQDMLAL (by element, S) 1->3 3.020869
SQDMLAL (by element, S) throughput 1.011596
SQDMLAL (by element, S) throughput 1.009479
SQDMLAL (by element, D) 1->1 2.001521
SQDMLAL (by element, D) 1->2 3.070309
SQDMLAL (by element, D) 1->3 2.965584
SQDMLAL (by element, D) throughput 1.006835
SQDMLAL (by element, D) throughput 1.000458
SQDMLAL (by element, 4S) 1->1 1.996842
SQDMLAL (by element, 4S) 1->2 2.995337
SQDMLAL (by element, 4S) 1->3 2.997131
SQDMLAL (by element, 4S) throughput 1.063145
SQDMLAL (by element, 4S) throughput 1.008735
SQDMLAL (by element, 2D) 1->1 2.001141
SQDMLAL (by element, 2D) 1->2 3.010072
SQDMLAL (by element, 2D) 1->3 2.984179
SQDMLAL (by element, 2D) throughput 1.013829
SQDMLAL (by element, 2D) throughput 1.003164
SQDMLAL2 (by element, 4S) 1->1 1.992423
SQDMLAL2 (by element, 4S) 1->2 2.978450
SQDMLAL2 (by element, 4S) 1->3 2.994436
SQDMLAL2 (by element, 4S) throughput 1.007557
SQDMLAL2 (by element, 4S) throughput 1.007814
SQDMLAL2 (by element, 2D) 1->1 2.058875
SQDMLAL2 (by element, 2D) 1->2 2.996821
SQDMLAL2 (by element, 2D) 1->3 3.011843
SQDMLAL2 (by element, 2D) throughput 1.010135
SQDMLAL2 (by element, 2D) throughput 1.008760
SQDMLAL (scalar, S) 1->1 2.020211
SQDMLAL (scalar, S) 1->2 2.950131
SQDMLAL (scalar, S) 1->3 3.000282
SQDMLAL (scalar, S) throughput 1.008622
SQDMLAL (scalar, S) throughput 1.010580
SQDMLAL (scalar, D) 1->1 2.035017
SQDMLAL (scalar, D) 1->2 3.015484
SQDMLAL (scalar, D) 1->3 3.082880
SQDMLAL (scalar, D) throughput 1.011757
SQDMLAL (scalar, D) throughput 1.019818
SQDMLAL (vector, 4S) 1->1 2.050013
SQDMLAL (vector, 4S) 1->2 3.016384
SQDMLAL (vector, 4S) 1->3 3.035496
SQDMLAL (vector, 4S) throughput 1.010305
SQDMLAL (vector, 4S) throughput 0.991992
SQDMLAL (vector, 2D) 1->1 1.988543
SQDMLAL (vector, 2D) 1->2 3.002507
SQDMLAL (vector, 2D) 1->3 2.994396
SQDMLAL (vector, 2D) throughput 1.007203
SQDMLAL (vector, 2D) throughput 1.000294
SQDMLAL2 (vector, 4S) 1->1 2.016075
SQDMLAL2 (vector, 4S) 1->2 2.960048
SQDMLAL2 (vector, 4S) 1->3 2.990349
SQDMLAL2 (vector, 4S) throughput 1.012169
SQDMLAL2 (vector, 4S) throughput 0.978944
SQDMLAL2 (vector, 2D) 1->1 2.008731
SQDMLAL2 (vector, 2D) 1->2 2.946622
SQDMLAL2 (vector, 2D) 1->3 2.956194
SQDMLAL2 (vector, 2D) throughput 1.005384
SQDMLAL2 (vector, 2D) throughput 0.999726
SQDMLSL (by element, S) 1->1 2.008769
SQDMLSL (by element, S) 1->2 2.997732
SQDMLSL (by element, S) 1->3 3.029536
SQDMLSL (by element, S) throughput 1.000715
SQDMLSL (by element, S) throughput 1.033963
SQDMLSL (by element, D) 1->1 3.077649
SQDMLSL (by element, D) 1->2 3.019847
SQDMLSL (by element, D) 1->3 3.006729
SQDMLSL (by element, D) throughput 1.003004
SQDMLSL (by element, D) throughput 1.006773
SQDMLSL (by element, 4S) 1->1 2.006883
SQDMLSL (by element, 4S) 1->2 2.988174
SQDMLSL (by element, 4S) 1->3 3.011317
SQDMLSL (by element, 4S) throughput 1.014966
SQDMLSL (by element, 4S) throughput 1.005343
SQDMLSL (by element, 2D) 1->1 1.980101
SQDMLSL (by element, 2D) 1->2 2.963060
SQDMLSL (by element, 2D) 1->3 3.024235
SQDMLSL (by element, 2D) throughput 1.004592
SQDMLSL (by element, 2D) throughput 1.003652
SQDMLSL2 (by element, 4S) 1->1 2.009494
SQDMLSL2 (by element, 4S) 1->2 3.008662
SQDMLSL2 (by element, 4S) 1->3 3.017479
SQDMLSL2 (by element, 4S) throughput 1.012346
SQDMLSL2 (by element, 4S) throughput 1.010358
SQDMLSL2 (by element, 2D) 1->1 2.047914
SQDMLSL2 (by element, 2D) 1->2 3.003910
SQDMLSL2 (by element, 2D) 1->3 3.008754
SQDMLSL2 (by element, 2D) throughput 0.999300
SQDMLSL2 (by element, 2D) throughput 1.010722
SQDMLSL (scalar, S) 1->1 2.012161
SQDMLSL (scalar, S) 1->2 3.006063
SQDMLSL (scalar, S) 1->3 3.041839
SQDMLSL (scalar, S) throughput 1.003093
SQDMLSL (scalar, S) throughput 0.944744
SQDMLSL (scalar, D) 1->1 1.952814
SQDMLSL (scalar, D) 1->2 3.001665
SQDMLSL (scalar, D) 1->3 2.981992
SQDMLSL (scalar, D) throughput 1.005814
SQDMLSL (scalar, D) throughput 1.008966
SQDMLSL (vector, 4S) 1->1 2.033534
SQDMLSL (vector, 4S) 1->2 3.017303
SQDMLSL (vector, 4S) 1->3 3.036988
SQDMLSL (vector, 4S) throughput 1.011478
SQDMLSL (vector, 4S) throughput 0.990710
SQDMLSL (vector, 2D) 1->1 1.976681
SQDMLSL (vector, 2D) 1->2 3.097330
SQDMLSL (vector, 2D) 1->3 3.050067
SQDMLSL (vector, 2D) throughput 0.978688
SQDMLSL (vector, 2D) throughput 1.002714
SQDMLSL2 (vector, 4S) 1->1 2.046299
SQDMLSL2 (vector, 4S) 1->2 3.007451
SQDMLSL2 (vector, 4S) 1->3 3.046911
SQDMLSL2 (vector, 4S) throughput 1.012941
SQDMLSL2 (vector, 4S) throughput 1.005545
SQDMLSL2 (vector, 2D) 1->1 2.001096
SQDMLSL2 (vector, 2D) 1->2 3.030086
SQDMLSL2 (vector, 2D) 1->3 2.998171
SQDMLSL2 (vector, 2D) throughput 1.006908
SQDMLSL2 (vector, 2D) throughput 1.023380
SQDMULH (by element, H) 1->2 3.018221
SQDMULH (by element, H) 1->3 2.994414
SQDMULH (by element, H) throughput 1.010454
SQDMULH (by element, S) 1->2 3.017036
SQDMULH (by element, S) 1->3 3.003224
SQDMULH (by element, S) throughput 0.994935
SQDMULH (by element, 4H) 1->2 2.999781
SQDMULH (by element, 4H) 1->3 3.008819
SQDMULH (by element, 4H) throughput 1.008895
SQDMULH (by element, 8H) 1->2 2.997152
SQDMULH (by element, 8H) 1->3 3.051179
SQDMULH (by element, 8H) throughput 1.010914
SQDMULH (by element, 2S) 1->2 3.001971
SQDMULH (by element, 2S) 1->3 3.011320
SQDMULH (by element, 2S) throughput 1.026674
SQDMULH (by element, 4S) 1->2 2.994590
SQDMULH (by element, 4S) 1->3 3.073794
SQDMULH (by element, 4S) throughput 1.010619
SQDMULH (scalar, H) 1->2 3.003027
SQDMULH (scalar, H) 1->3 3.027681
SQDMULH (scalar, H) throughput 1.008266
SQDMULH (scalar, S) 1->2 2.957018
SQDMULH (scalar, S) 1->3 3.019689
SQDMULH (scalar, S) throughput 1.012640
SQDMULH (vector, 4H) 1->2 3.029588
SQDMULH (vector, 4H) 1->3 3.052536
SQDMULH (vector, 4H) throughput 0.995427
SQDMULH (vector, 8H) 1->2 3.007565
SQDMULH (vector, 8H) 1->3 3.043273
SQDMULH (vector, 8H) throughput 1.015234
SQDMULH (vector, 2S) 1->2 3.009883
SQDMULH (vector, 2S) 1->3 2.991053
SQDMULH (vector, 2S) throughput 1.010461
SQDMULH (vector, 4S) 1->2 3.016951
SQDMULH (vector, 4S) 1->3 3.027263
SQDMULH (vector, 4S) throughput 1.006139
SQDMULL (by element, S) 1->2 3.021293
SQDMULL (by element, S) 1->3 3.023933
SQDMULL (by element, S) throughput 1.009658
SQDMULL (by element, D) 1->2 2.996364
SQDMULL (by element, D) 1->3 3.091142
SQDMULL (by element, D) throughput 0.985792
SQDMULL (by element, 4S) 1->2 3.005023
SQDMULL (by element, 4S) 1->3 3.032151
SQDMULL (by element, 4S) throughput 1.009910
SQDMULL (by element, 2D) 1->2 3.010271
SQDMULL (by element, 2D) 1->3 3.027407
SQDMULL (by element, 2D) throughput 1.156127
SQDMULL2 (by element, 4S) 1->2 2.939879
SQDMULL2 (by element, 4S) 1->3 3.010096
SQDMULL2 (by element, 4S) throughput 1.012005
SQDMULL2 (by element, 2D) 1->2 3.041270
SQDMULL2 (by element, 2D) 1->3 2.994283
SQDMULL2 (by element, 2D) throughput 1.010870
SQDMULL (scalar, S) 1->2 3.024021
SQDMULL (scalar, S) 1->3 2.999913
SQDMULL (scalar, S) throughput 0.990838
SQDMULL (scalar, D) 1->2 3.043764
SQDMULL (scalar, D) 1->3 3.005856
SQDMULL (scalar, D) throughput 1.006343
SQDMULL (vector, 4S) 1->2 2.998201
SQDMULL (vector, 4S) 1->3 3.006724
SQDMULL (vector, 4S) throughput 1.020151
SQDMULL (vector, 2D) 1->2 2.930049
SQDMULL (vector, 2D) 1->3 3.048686
SQDMULL (vector, 2D) throughput 1.006360
SQDMULL2 (vector, 4S) 1->2 3.013088
SQDMULL2 (vector, 4S) 1->3 3.026165
SQDMULL2 (vector, 4S) throughput 1.008745
SQDMULL2 (vector, 2D) 1->2 2.960073
SQDMULL2 (vector, 2D) 1->3 3.006176
SQDMULL2 (vector, 2D) throughput 1.022609
SQNEG (scalar, B) 1->2 3.022970
SQNEG (scalar, B) throughput 0.991153
SQNEG (scalar, H) 1->2 2.994067
SQNEG (scalar, H) throughput 0.999937
SQNEG (scalar, S) 1->2 3.007601
SQNEG (scalar, S) throughput 1.014851
SQNEG (scalar, D) 1->2 3.008448
SQNEG (scalar, D) throughput 1.013411
SQNEG (vector, 8B) 1->2 3.015064
SQNEG (vector, 8B) throughput 0.992666
SQNEG (vector, 16B) 1->2 3.023612
SQNEG (vector, 16B) throughput 0.992200
SQNEG (vector, 4H) 1->2 3.458001
SQNEG (vector, 4H) throughput 0.982479
SQNEG (vector, 8H) 1->2 3.012727
SQNEG (vector, 8H) throughput 1.012153
SQNEG (vector, 2S) 1->2 3.000043
SQNEG (vector, 2S) throughput 1.010406
SQNEG (vector, 4S) 1->2 3.024931
SQNEG (vector, 4S) throughput 0.989045
SQNEG (vector, 2D) 1->2 3.080291
SQNEG (vector, 2D) throughput 0.997592
SQRDMULH (by element, H) 1->2 2.988678
SQRDMULH (by element, H) 1->3 3.018002
SQRDMULH (by element, H) throughput 0.997913
SQRDMULH (by element, S) 1->2 3.062211
SQRDMULH (by element, S) 1->3 3.045826
SQRDMULH (by element, S) throughput 1.004968
SQRDMULH (by element, 4H) 1->2 2.992526
SQRDMULH (by element, 4H) 1->3 3.094218
SQRDMULH (by element, 4H) throughput 1.035839
SQRDMULH (by element, 8H) 1->2 3.012952
SQRDMULH (by element, 8H) 1->3 2.996358
SQRDMULH (by element, 8H) throughput 1.005388
SQRDMULH (by element, 2S) 1->2 3.033996
SQRDMULH (by element, 2S) 1->3 2.980525
SQRDMULH (by element, 2S) throughput 1.010288
SQRDMULH (by element, 4S) 1->2 3.018518
SQRDMULH (by element, 4S) 1->3 2.964894
SQRDMULH (by element, 4S) throughput 1.004546
SQRDMULH (scalar, H) 1->2 3.033961
SQRDMULH (scalar, H) 1->3 3.028367
SQRDMULH (scalar, H) throughput 0.994179
SQRDMULH (scalar, S) 1->2 3.027590
SQRDMULH (scalar, S) 1->3 2.991406
SQRDMULH (scalar, S) throughput 1.006067
SQRDMULH (vector, 4H) 1->2 3.032329
SQRDMULH (vector, 4H) 1->3 3.035980
SQRDMULH (vector, 4H) throughput 1.001773
SQRDMULH (vector, 8H) 1->2 3.024292
SQRDMULH (vector, 8H) 1->3 3.047216
SQRDMULH (vector, 8H) throughput 1.008829
SQRDMULH (vector, 2S) 1->2 3.111107
SQRDMULH (vector, 2S) 1->3 3.002881
SQRDMULH (vector, 2S) throughput 0.991666
SQRDMULH (vector, 4S) 1->2 3.022658
SQRDMULH (vector, 4S) 1->3 3.034676
SQRDMULH (vector, 4S) throughput 1.009273
SQRSHL (scalar, B) 1->2 2.941774
SQRSHL (scalar, B) 1->3 2.961367
SQRSHL (scalar, B) throughput 1.015579
SQRSHL (scalar, H) 1->2 3.023225
SQRSHL (scalar, H) 1->3 3.061363
SQRSHL (scalar, H) throughput 1.003523
SQRSHL (scalar, S) 1->2 3.024466
SQRSHL (scalar, S) 1->3 3.011196
SQRSHL (scalar, S) throughput 1.006383
SQRSHL (scalar, D) 1->2 3.029044
SQRSHL (scalar, D) 1->3 2.995321
SQRSHL (scalar, D) throughput 0.969640
SQRSHL (vector, 8B) 1->2 3.021308
SQRSHL (vector, 8B) 1->3 3.028128
SQRSHL (vector, 8B) throughput 0.993309
SQRSHL (vector, 16B) 1->2 2.947511
SQRSHL (vector, 16B) 1->3 3.027136
SQRSHL (vector, 16B) throughput 1.006448
SQRSHL (vector, 4H) 1->2 3.013844
SQRSHL (vector, 4H) 1->3 3.009637
SQRSHL (vector, 4H) throughput 0.997915
SQRSHL (vector, 8H) 1->2 3.033295
SQRSHL (vector, 8H) 1->3 3.017171
SQRSHL (vector, 8H) throughput 1.005028
SQRSHL (vector, 2S) 1->2 2.986493
SQRSHL (vector, 2S) 1->3 3.028522
SQRSHL (vector, 2S) throughput 1.041037
SQRSHL (vector, 4S) 1->2 2.999002
SQRSHL (vector, 4S) 1->3 3.012777
SQRSHL (vector, 4S) throughput 1.020321
SQRSHL (vector, 2D) 1->2 3.022022
SQRSHL (vector, 2D) 1->3 2.992322
SQRSHL (vector, 2D) throughput 1.011341
SQRSHRN (H) 1->2 4.044909
SQRSHRN (H) throughput 1.008622
SQRSHRN (S) 1->2 3.990626
SQRSHRN (S) throughput 1.007941
SQRSHRN (D) 1->2 4.016668
SQRSHRN (D) throughput 1.007334
SQRSHRN (8H) 1->2 4.001891
SQRSHRN (8H) throughput 0.997130
SQRSHRN (4S) 1->2 4.075342
SQRSHRN (4S) throughput 1.005955
SQRSHRN (2D) 1->2 4.060986
SQRSHRN (2D) throughput 1.012811
SQRSHRN2 (8H) 1->1 3.945436
SQRSHRN2 (8H) 1->2 4.043918
SQRSHRN2 (8H) throughput 1.007572
SQRSHRN2 (8H) throughput 1.012020
SQRSHRN2 (4S) 1->1 4.010794
SQRSHRN2 (4S) 1->2 4.062548
SQRSHRN2 (4S) throughput 0.993664
SQRSHRN2 (4S) throughput 1.023827
SQRSHRN2 (2D) 1->1 4.041180
SQRSHRN2 (2D) 1->2 4.047548
SQRSHRN2 (2D) throughput 1.012155
SQRSHRN2 (2D) throughput 1.005845
SQRSHRUN (H) 1->2 4.046964
SQRSHRUN (H) throughput 0.993446
SQRSHRUN (S) 1->2 4.020155
SQRSHRUN (S) throughput 1.015091
SQRSHRUN (D) 1->2 3.996869
SQRSHRUN (D) throughput 1.007691
SQRSHRUN (8H) 1->2 4.020304
SQRSHRUN (8H) throughput 1.002897
SQRSHRUN (4S) 1->2 4.050080
SQRSHRUN (4S) throughput 1.005837
SQRSHRUN (2D) 1->2 3.995072
SQRSHRUN (2D) throughput 0.975688
SQRSHRUN2 (8H) 1->1 3.992269
SQRSHRUN2 (8H) 1->2 4.036976
SQRSHRUN2 (8H) throughput 1.011830
SQRSHRUN2 (8H) throughput 1.019330
SQRSHRUN2 (4S) 1->1 4.040445
SQRSHRUN2 (4S) 1->2 4.026185
SQRSHRUN2 (4S) throughput 1.003689
SQRSHRUN2 (4S) throughput 1.024804
SQRSHRUN2 (2D) 1->1 4.008378
SQRSHRUN2 (2D) 1->2 3.992595
SQRSHRUN2 (2D) throughput 1.006839
SQRSHRUN2 (2D) throughput 1.005814
SQSHL (immediate, scalar, B) 1->2 2.002807
SQSHL (immediate, scalar, B) throughput 1.004761
SQSHL (immediate, scalar, H) 1->2 2.001141
SQSHL (immediate, scalar, H) throughput 1.022186
SQSHL (immediate, scalar, S) 1->2 1.998204
SQSHL (immediate, scalar, S) throughput 1.022394
SQSHL (immediate, scalar, D) 1->2 2.093115
SQSHL (immediate, scalar, D) throughput 0.992772
SQSHL (immediate, vector, 8B) 1->2 1.990732
SQSHL (immediate, vector, 8B) throughput 0.997495
SQSHL (immediate, vector, 16B) 1->2 1.998796
SQSHL (immediate, vector, 16B) throughput 1.011386
SQSHL (immediate, vector, 4H) 1->2 2.043820
SQSHL (immediate, vector, 4H) throughput 1.016537
SQSHL (immediate, vector, 8H) 1->2 2.042974
SQSHL (immediate, vector, 8H) throughput 0.985141
SQSHL (immediate, vector, 2S) 1->2 2.004539
SQSHL (immediate, vector, 2S) throughput 1.012469
SQSHL (immediate, vector, 4S) 1->2 2.019520
SQSHL (immediate, vector, 4S) throughput 1.006394
SQSHL (immediate, vector, 2D) 1->2 2.054789
SQSHL (immediate, vector, 2D) throughput 1.007273
SQSHL (scalar, B) 1->2 2.029979
SQSHL (scalar, B) 1->3 2.005345
SQSHL (scalar, B) throughput 0.976494
SQSHL (scalar, H) 1->2 2.008876
SQSHL (scalar, H) 1->3 2.024345
SQSHL (scalar, H) throughput 1.006258
SQSHL (scalar, S) 1->2 1.984060
SQSHL (scalar, S) 1->3 2.024626
SQSHL (scalar, S) throughput 0.990092
SQSHL (scalar, D) 1->2 2.008170
SQSHL (scalar, D) 1->3 2.045979
SQSHL (scalar, D) throughput 1.005439
SQSHL (vector, 8B) 1->2 1.957525
SQSHL (vector, 8B) 1->3 2.007276
SQSHL (vector, 8B) throughput 1.000290
SQSHL (vector, 16B) 1->2 1.998577
SQSHL (vector, 16B) 1->3 2.021055
SQSHL (vector, 16B) throughput 1.011017
SQSHL (vector, 4H) 1->2 1.996001
SQSHL (vector, 4H) 1->3 2.031004
SQSHL (vector, 4H) throughput 1.026959
SQSHL (vector, 8H) 1->2 2.022018
SQSHL (vector, 8H) 1->3 1.997215
SQSHL (vector, 8H) throughput 1.013342
SQSHL (vector, 2S) 1->2 1.980555
SQSHL (vector, 2S) 1->3 1.998180
SQSHL (vector, 2S) throughput 1.010001
SQSHL (vector, 4S) 1->2 2.014343
SQSHL (vector, 4S) 1->3 2.042580
SQSHL (vector, 4S) throughput 0.991113
SQSHL (vector, 2D) 1->2 2.050667
SQSHL (vector, 2D) 1->3 1.992473
SQSHL (vector, 2D) throughput 0.991783
SQSHLU (scalar, B) 1->2 1.997987
SQSHLU (scalar, B) throughput 1.010819
SQSHLU (scalar, H) 1->2 2.015922
SQSHLU (scalar, H) throughput 1.009796
SQSHLU (scalar, S) 1->2 2.040741
SQSHLU (scalar, S) throughput 1.009644
SQSHLU (scalar, D) 1->2 2.028307
SQSHLU (scalar, D) throughput 0.998967
SQSHLU (vector, 8B) 1->2 2.024871
SQSHLU (vector, 8B) throughput 1.011381
SQSHLU (vector, 16B) 1->2 2.006400
SQSHLU (vector, 16B) throughput 1.010938
SQSHLU (vector, 4H) 1->2 2.005169
SQSHLU (vector, 4H) throughput 1.007738
SQSHLU (vector, 8H) 1->2 1.956433
SQSHLU (vector, 8H) throughput 1.007000
SQSHLU (vector, 2S) 1->2 2.014839
SQSHLU (vector, 2S) throughput 0.995066
SQSHLU (vector, 4S) 1->2 2.053489
SQSHLU (vector, 4S) throughput 1.016482
SQSHLU (vector, 2D) 1->2 1.975397
SQSHLU (vector, 2D) throughput 0.997049
SQSHRN (H) 1->2 4.074271
SQSHRN (H) throughput 1.004941
SQSHRN (S) 1->2 4.024059
SQSHRN (S) throughput 1.001946
SQSHRN (D) 1->2 4.011118
SQSHRN (D) throughput 1.005894
SQSHRN (8H) 1->2 4.061210
SQSHRN (8H) throughput 1.011185
SQSHRN (4S) 1->2 4.048537
SQSHRN (4S) throughput 1.006635
SQSHRN (2D) 1->2 4.050717
SQSHRN (2D) throughput 1.009690
SQSHRN2 (8H) 1->1 4.048465
SQSHRN2 (8H) 1->2 4.020062
SQSHRN2 (8H) throughput 1.015373
SQSHRN2 (8H) throughput 0.999781
SQSHRN2 (4S) 1->1 4.022341
SQSHRN2 (4S) 1->2 4.019364
SQSHRN2 (4S) throughput 1.015449
SQSHRN2 (4S) throughput 1.010501
SQSHRN2 (2D) 1->1 4.070353
SQSHRN2 (2D) 1->2 3.913278
SQSHRN2 (2D) throughput 1.015200
SQSHRN2 (2D) throughput 0.994711
SQSHRUN (H) 1->2 4.019676
SQSHRUN (H) throughput 1.007204
SQSHRUN (S) 1->2 4.055879
SQSHRUN (S) throughput 1.004375
SQSHRUN (D) 1->2 4.016405
SQSHRUN (D) throughput 1.006677
SQSHRUN (8H) 1->2 4.067435
SQSHRUN (8H) throughput 1.005798
SQSHRUN (4S) 1->2 4.014255
SQSHRUN (4S) throughput 1.007143
SQSHRUN (2D) 1->2 4.004206
SQSHRUN (2D) throughput 1.070318
SQSHRUN2 (8H) 1->1 4.017183
SQSHRUN2 (8H) 1->2 3.989229
SQSHRUN2 (8H) throughput 1.001304
SQSHRUN2 (8H) throughput 1.004631
SQSHRUN2 (4S) 1->1 4.037875
SQSHRUN2 (4S) 1->2 4.033689
SQSHRUN2 (4S) throughput 1.006090
SQSHRUN2 (4S) throughput 0.994905
SQSHRUN2 (2D) 1->1 3.917931
SQSHRUN2 (2D) 1->2 4.054396
SQSHRUN2 (2D) throughput 0.998261
SQSHRUN2 (2D) throughput 1.007131
SQSUB (scalar, B) 1->2 2.981465
SQSUB (scalar, B) 1->3 3.008732
SQSUB (scalar, B) throughput 0.985692
SQSUB (scalar, H) 1->2 3.005388
SQSUB (scalar, H) 1->3 2.999611
SQSUB (scalar, H) throughput 0.993227
SQSUB (scalar, S) 1->2 3.050701
SQSUB (scalar, S) 1->3 2.942637
SQSUB (scalar, S) throughput 1.007241
SQSUB (scalar, D) 1->2 3.017470
SQSUB (scalar, D) 1->3 3.031626
SQSUB (scalar, D) throughput 1.005404
SQSUB (vector, 8B) 1->2 3.535745
SQSUB (vector, 8B) 1->3 3.002093
SQSUB (vector, 8B) throughput 1.005792
SQSUB (vector, 16B) 1->2 2.992246
SQSUB (vector, 16B) 1->3 3.019290
SQSUB (vector, 16B) throughput 1.004578
SQSUB (vector, 4H) 1->2 2.992596
SQSUB (vector, 4H) 1->3 3.011118
SQSUB (vector, 4H) throughput 0.996988
SQSUB (vector, 8H) 1->2 3.030460
SQSUB (vector, 8H) 1->3 3.004093
SQSUB (vector, 8H) throughput 1.008856
SQSUB (vector, 2S) 1->2 3.056952
SQSUB (vector, 2S) 1->3 3.002063
SQSUB (vector, 2S) throughput 1.009047
SQSUB (vector, 4S) 1->2 3.010814
SQSUB (vector, 4S) 1->3 2.894054
SQSUB (vector, 4S) throughput 1.039455
SQSUB (vector, 2D) 1->2 3.002015
SQSUB (vector, 2D) 1->3 3.007554
SQSUB (vector, 2D) throughput 0.987977
SQXTN (H) 1->2 3.999101
SQXTN (H) throughput 1.009455
SQXTN (S) 1->2 4.027434
SQXTN (S) throughput 1.006020
SQXTN (D) 1->2 4.020745
SQXTN (D) throughput 0.992342
SQXTN (8H) 1->2 3.944556
SQXTN (8H) throughput 1.004991
SQXTN (4S) 1->2 4.061731
SQXTN (4S) throughput 1.110145
SQXTN (2D) 1->2 4.017352
SQXTN (2D) throughput 0.989306
SQXTN2 (8H) 1->1 4.120973
SQXTN2 (8H) 1->2 4.002409
SQXTN2 (8H) throughput 1.006755
SQXTN2 (8H) throughput 0.997769
SQXTN2 (4S) 1->1 4.001798
SQXTN2 (4S) 1->2 3.996399
SQXTN2 (4S) throughput 1.007424
SQXTN2 (4S) throughput 1.006744
SQXTN2 (2D) 1->1 4.025506
SQXTN2 (2D) 1->2 4.001699
SQXTN2 (2D) throughput 1.003951
SQXTN2 (2D) throughput 1.008574
SQXTUN (H) 1->2 3.995158
SQXTUN (H) throughput 0.956994
SQXTUN (S) 1->2 4.501508
SQXTUN (S) throughput 1.305454
SQXTUN (D) 1->2 4.002290
SQXTUN (D) throughput 1.065169
SQXTUN (8H) 1->2 4.145597
SQXTUN (8H) throughput 0.857118
SQXTUN (4S) 1->2 5.614371
SQXTUN (4S) throughput 0.992712
SQXTUN (2D) 1->2 4.043530
SQXTUN (2D) throughput 0.999016
SQXTUN2 (8H) 1->1 4.068083
SQXTUN2 (8H) 1->2 4.009159
SQXTUN2 (8H) throughput 1.001524
SQXTUN2 (8H) throughput 1.000270
SQXTUN2 (4S) 1->1 4.042815
SQXTUN2 (4S) 1->2 4.025439
SQXTUN2 (4S) throughput 1.015190
SQXTUN2 (4S) throughput 1.014095
SQXTUN2 (2D) 1->1 3.964880
SQXTUN2 (2D) 1->2 4.016418
SQXTUN2 (2D) throughput 0.866530
SQXTUN2 (2D) throughput 1.003073
SRHADD (vector, 8B) 1->2 1.971217
SRHADD (vector, 8B) 1->3 1.994021
SRHADD (vector, 8B) throughput 1.007886
SRHADD (vector, 16B) 1->2 1.986813
SRHADD (vector, 16B) 1->3 2.014738
SRHADD (vector, 16B) throughput 1.003170
SRHADD (vector, 4H) 1->2 2.008650
SRHADD (vector, 4H) 1->3 2.016537
SRHADD (vector, 4H) throughput 0.979590
SRHADD (vector, 8H) 1->2 2.015288
SRHADD (vector, 8H) 1->3 2.006422
SRHADD (vector, 8H) throughput 1.006593
SRHADD (vector, 2S) 1->2 2.035875
SRHADD (vector, 2S) 1->3 1.990089
SRHADD (vector, 2S) throughput 0.979099
SRHADD (vector, 4S) 1->2 2.006707
SRHADD (vector, 4S) 1->3 1.986371
SRHADD (vector, 4S) throughput 1.013202
SRI (vector, 8B) 1->1 1.981956
SRI (vector, 8B) 1->2 2.002940
SRI (vector, 8B) throughput 0.999081
SRI (vector, 8B) throughput 0.996341
SRI (vector, 16B) 1->1 1.998596
SRI (vector, 16B) 1->2 2.015045
SRI (vector, 16B) throughput 1.014571
SRI (vector, 16B) throughput 0.985127
SRI (vector, 4H) 1->1 2.053954
SRI (vector, 4H) 1->2 1.999979
SRI (vector, 4H) throughput 1.041140
SRI (vector, 4H) throughput 0.949864
SRI (vector, 8H) 1->1 2.038325
SRI (vector, 8H) 1->2 2.032219
SRI (vector, 8H) throughput 1.028275
SRI (vector, 8H) throughput 0.977902
SRI (vector, 2S) 1->1 1.957389
SRI (vector, 2S) 1->2 2.016164
SRI (vector, 2S) throughput 1.003177
SRI (vector, 2S) throughput 1.010862
SRI (vector, 4S) 1->1 1.945017
SRI (vector, 4S) 1->2 2.010639
SRI (vector, 4S) throughput 1.036512
SRI (vector, 4S) throughput 1.005793
SRI (vector, 2D) 1->1 2.012017
SRI (vector, 2D) 1->2 1.986241
SRI (vector, 2D) throughput 1.012600
SRI (vector, 2D) throughput 0.976597
SRI (vector, D) 1->1 2.015145
SRI (vector, D) 1->2 2.032360
SRI (vector, D) throughput 1.005016
SRI (vector, D) throughput 0.753173
SRSHL (vector, 8B) 1->2 2.783539
SRSHL (vector, 8B) 1->3 3.007711
SRSHL (vector, 8B) throughput 1.027569
SRSHL (vector, 16B) 1->2 3.071892
SRSHL (vector, 16B) 1->3 3.010542
SRSHL (vector, 16B) throughput 0.994178
SRSHL (vector, 4H) 1->2 3.011301
SRSHL (vector, 4H) 1->3 3.065755
SRSHL (vector, 4H) throughput 1.010668
SRSHL (vector, 8H) 1->2 3.013410
SRSHL (vector, 8H) 1->3 2.944867
SRSHL (vector, 8H) throughput 1.007963
SRSHL (vector, 2S) 1->2 3.053339
SRSHL (vector, 2S) 1->3 3.005630
SRSHL (vector, 2S) throughput 0.987873
SRSHL (vector, 4S) 1->2 3.026103
SRSHL (vector, 4S) 1->3 3.027968
SRSHL (vector, 4S) throughput 0.978106
SRSHL (vector, 2D) 1->2 2.994525
SRSHL (vector, 2D) 1->3 3.064982
SRSHL (vector, 2D) throughput 0.980113
SRSHL (vector, D) 1->2 3.009105
SRSHL (vector, D) 1->3 3.025024
SRSHL (vector, D) throughput 1.017968
SRSHR (vector, 8B) 1->2 2.950108
SRSHR (vector, 8B) throughput 1.002073
SRSHR (vector, 16B) 1->2 3.068673
SRSHR (vector, 16B) throughput 1.005291
SRSHR (vector, 4H) 1->2 2.998245
SRSHR (vector, 4H) throughput 1.006767
SRSHR (vector, 8H) 1->2 2.998387
SRSHR (vector, 8H) throughput 1.004037
SRSHR (vector, 2S) 1->2 2.967227
SRSHR (vector, 2S) throughput 0.998034
SRSHR (vector, 4S) 1->2 3.248131
SRSHR (vector, 4S) throughput 0.990468
SRSHR (vector, 2D) 1->2 3.080760
SRSHR (vector, 2D) throughput 1.007021
SRSHR (vector, D) 1->2 3.048300
SRSHR (vector, D) throughput 0.994890
SRSRA (vector, 8B) 1->1 2.035251
SRSRA (vector, 8B) 1->2 3.027963
SRSRA (vector, 8B) throughput 0.978757
SRSRA (vector, 8B) throughput 1.010549
SRSRA (vector, 16B) 1->1 2.023224
SRSRA (vector, 16B) 1->2 3.022920
SRSRA (vector, 16B) throughput 1.020323
SRSRA (vector, 16B) throughput 1.008015
SRSRA (vector, 4H) 1->1 1.974387
SRSRA (vector, 4H) 1->2 3.023729
SRSRA (vector, 4H) throughput 1.011265
SRSRA (vector, 4H) throughput 0.958811
SRSRA (vector, 8H) 1->1 2.011432
SRSRA (vector, 8H) 1->2 3.003690
SRSRA (vector, 8H) throughput 1.004595
SRSRA (vector, 8H) throughput 0.991288
SRSRA (vector, 2S) 1->1 2.036990
SRSRA (vector, 2S) 1->2 2.993069
SRSRA (vector, 2S) throughput 0.998871
SRSRA (vector, 2S) throughput 0.924663
SRSRA (vector, 4S) 1->1 1.994502
SRSRA (vector, 4S) 1->2 2.994567
SRSRA (vector, 4S) throughput 1.009606
SRSRA (vector, 4S) throughput 1.030082
SRSRA (vector, 2D) 1->1 2.009475
SRSRA (vector, 2D) 1->2 3.008994
SRSRA (vector, 2D) throughput 0.995329
SRSRA (vector, 2D) throughput 1.002546
SRSRA (vector, D) 1->1 2.006443
SRSRA (vector, D) 1->2 3.016337
SRSRA (vector, D) throughput 1.008227
SRSRA (vector, D) throughput 1.008709
SSHL (vector, 8B) 1->2 2.014519
SSHL (vector, 8B) 1->3 1.968412
SSHL (vector, 8B) throughput 1.009319
SSHL (vector, 16B) 1->2 1.997039
SSHL (vector, 16B) 1->3 2.012074
SSHL (vector, 16B) throughput 1.007759
SSHL (vector, 4H) 1->2 1.973273
SSHL (vector, 4H) 1->3 2.006620
SSHL (vector, 4H) throughput 0.992478
SSHL (vector, 8H) 1->2 1.994042
SSHL (vector, 8H) 1->3 1.997039
SSHL (vector, 8H) throughput 1.006698
SSHL (vector, 2S) 1->2 2.000042
SSHL (vector, 2S) 1->3 2.016488
SSHL (vector, 2S) throughput 1.009561
SSHL (vector, 4S) 1->2 2.056980
SSHL (vector, 4S) 1->3 1.994125
SSHL (vector, 4S) throughput 0.980274
SSHL (vector, 2D) 1->2 2.033418
SSHL (vector, 2D) 1->3 2.006444
SSHL (vector, 2D) throughput 1.015772
SSHL (vector, D) 1->2 2.006799
SSHL (vector, D) 1->3 2.023207
SSHL (vector, D) throughput 1.012886
SSHLL (8H) 1->2 1.998752
SSHLL (8H) throughput 1.004640
SSHLL (4S) 1->2 2.006532
SSHLL (4S) throughput 1.004575
SSHLL (2D) 1->2 1.994462
SSHLL (2D) throughput 0.997030
SSHLL2 (8H) 1->2 2.006465
SSHLL2 (8H) throughput 1.013222
SSHLL2 (4S) 1->2 2.023415
SSHLL2 (4S) throughput 1.002942
SSHLL2 (2D) 1->2 1.977051
SSHLL2 (2D) throughput 1.009587
SSHR (vector, 8B) 1->2 2.006707
SSHR (vector, 8B) throughput 1.001741
SSHR (vector, 16B) 1->2 2.006642
SSHR (vector, 16B) throughput 1.000596
SSHR (vector, 4H) 1->2 2.025920
SSHR (vector, 4H) throughput 1.010956
SSHR (vector, 8H) 1->2 2.028874
SSHR (vector, 8H) throughput 1.017312
SSHR (vector, 2S) 1->2 1.992076
SSHR (vector, 2S) throughput 0.981985
SSHR (vector, 4S) 1->2 2.023256
SSHR (vector, 4S) throughput 1.023756
SSHR (vector, 2D) 1->2 1.985794
SSHR (vector, 2D) throughput 0.993969
SSHR (vector, D) 1->2 1.998558
SSHR (vector, D) throughput 1.013726
SSRA (vector, 8B) 1->1 2.027862
SSRA (vector, 8B) 1->2 3.019492
SSRA (vector, 8B) throughput 1.007081
SSRA (vector, 8B) throughput 1.007624
SSRA (vector, 16B) 1->1 2.008972
SSRA (vector, 16B) 1->2 3.027808
SSRA (vector, 16B) throughput 1.017795
SSRA (vector, 16B) throughput 1.019948
SSRA (vector, 4H) 1->1 2.007990
SSRA (vector, 4H) 1->2 3.036741
SSRA (vector, 4H) throughput 0.992146
SSRA (vector, 4H) throughput 1.010888
SSRA (vector, 8H) 1->1 2.028587
SSRA (vector, 8H) 1->2 3.056328
SSRA (vector, 8H) throughput 0.997658
SSRA (vector, 8H) throughput 1.021438
SSRA (vector, 2S) 1->1 1.767569
SSRA (vector, 2S) 1->2 3.395435
SSRA (vector, 2S) throughput 1.025766
SSRA (vector, 2S) throughput 1.026062
SSRA (vector, 4S) 1->1 1.969858
SSRA (vector, 4S) 1->2 3.022897
SSRA (vector, 4S) throughput 1.011108
SSRA (vector, 4S) throughput 0.993433
SSRA (vector, 2D) 1->1 2.041933
SSRA (vector, 2D) 1->2 2.994046
SSRA (vector, 2D) throughput 0.999225
SSRA (vector, 2D) throughput 1.007019
SSRA (vector, D) 1->1 2.040981
SSRA (vector, D) 1->2 3.056104
SSRA (vector, D) throughput 1.025781
SSRA (vector, D) throughput 1.001879
SSUBL (vector, 8H) 1->2 1.992582
SSUBL (vector, 8H) 1->3 1.992373
SSUBL (vector, 8H) throughput 0.996047
SSUBL (vector, 4S) 1->2 1.993531
SSUBL (vector, 4S) 1->3 2.006908
SSUBL (vector, 4S) throughput 0.999679
SSUBL (vector, 2D) 1->2 1.998838
SSUBL (vector, 2D) 1->3 1.993977
SSUBL (vector, 2D) throughput 1.019006
SSUBL2 (vector, 8H) 1->2 1.993868
SSUBL2 (vector, 8H) 1->3 2.021144
SSUBL2 (vector, 8H) throughput 1.001925
SSUBL2 (vector, 4S) 1->2 2.047370
SSUBL2 (vector, 4S) 1->3 2.010686
SSUBL2 (vector, 4S) throughput 1.011077
SSUBL2 (vector, 2D) 1->2 2.035030
SSUBL2 (vector, 2D) 1->3 2.126175
SSUBL2 (vector, 2D) throughput 0.819675
SSUBW (vector, 8H) 1->2 2.022038
SSUBW (vector, 8H) 1->3 2.000394
SSUBW (vector, 8H) throughput 1.005196
SSUBW (vector, 4S) 1->2 2.014389
SSUBW (vector, 4S) 1->3 2.009567
SSUBW (vector, 4S) throughput 1.015396
SSUBW (vector, 2D) 1->2 2.123727
SSUBW (vector, 2D) 1->3 2.125596
SSUBW (vector, 2D) throughput 1.017504
SSUBW2 (vector, 8H) 1->2 2.011207
SSUBW2 (vector, 8H) 1->3 2.018348
SSUBW2 (vector, 8H) throughput 1.003725
SSUBW2 (vector, 4S) 1->2 2.001141
SSUBW2 (vector, 4S) 1->3 2.036051
SSUBW2 (vector, 4S) throughput 1.007235
SSUBW2 (vector, 2D) 1->2 2.011189
SSUBW2 (vector, 2D) 1->3 2.057219
SSUBW2 (vector, 2D) throughput 1.008526
STNP (S) throughput 1.605318
STNP (D) throughput 1.584840
STNP (Q) throughput 3.220081
STNP (signed offset, S) throughput 1.608980
STNP (signed offset, D) throughput 1.607139
STNP (signed offset, Q) throughput 3.179019
STP (S) throughput 1.305135
STP (D) throughput 1.301307
STP (Q) throughput 2.617626
STP (post-index, S) 3->3 1.856977
STP (post-index, S) throughput 1.386138
STP (post-index, D) 3->3 2.271303
STP (post-index, D) throughput 1.288280
STP (post-index, Q) 3->3 2.557226
STP (post-index, Q) throughput 2.665786
STP (pre-index, S) 3->3 2.032385
STP (pre-index, S) throughput 1.319163
STP (pre-index, D) 3->3 2.160882
STP (pre-index, D) throughput 1.278568
STP (pre-index, Q) 3->3 2.581071
STP (pre-index, Q) throughput 2.516391
STP (signed offset, S) throughput 1.319398
STP (signed offset, D) throughput 1.307083
STP (signed offset, Q) throughput 2.514538
STR (S) throughput 1.308694
STR (D) throughput 1.309687
STR (Q) throughput 1.308230
STR (post-index, S) 3->3 1.967066
STR (post-index, S) throughput 1.312436
STR (post-index, D) 3->3 2.135685
STR (post-index, D) throughput 1.292746
STR (post-index, Q) 3->3 2.170121
STR (post-index, Q) throughput 1.288302
STR (pre-index, S) 3->3 3.461508
STR (pre-index, S) throughput 1.326403
STR (pre-index, D) 3->3 2.078380
STR (pre-index, D) throughput 1.310177
STR (pre-index, Q) 3->3 2.197522
STR (pre-index, Q) throughput 1.310133
STR (signed offset, S) throughput 1.290264
STR (signed offset, D) throughput 1.306027
STR (signed offset, Q) throughput 1.315530
STR (register, S) throughput 1.309290
STR (register, D) throughput 1.316808
STR (register, Q) throughput 1.315633
STR (register, uxtw, S) throughput 1.315043
STR (register, uxtw, D) throughput 1.322165
STR (register, uxtw, Q) throughput 1.279477
STR (register, sxtw, S) throughput 1.309526
STR (register, sxtw, D) throughput 1.335142
STR (register, sxtw, Q) throughput 1.389897
STR (register, lsl, S) throughput 1.286922
STR (register, lsl, D) throughput 1.123577
STR (register, lsl, Q) throughput 1.295475
STUR (S) throughput 1.341832
STUR (D) throughput 1.324120
STUR (Q) throughput 1.287557
SUB (vector, 8B) 1->2 3.787274
SUB (vector, 8B) 1->3 2.112790
SUB (vector, 8B) throughput 0.936980
SUB (vector, 16B) 1->2 1.997610
SUB (vector, 16B) 1->3 2.183081
SUB (vector, 16B) throughput 1.058447
SUB (vector, 4H) 1->2 2.177791
SUB (vector, 4H) 1->3 1.793947
SUB (vector, 4H) throughput 1.010547
SUB (vector, 8H) 1->2 2.042766
SUB (vector, 8H) 1->3 1.984387
SUB (vector, 8H) throughput 1.011733
SUB (vector, 2S) 1->2 2.002896
SUB (vector, 2S) 1->3 2.030517
SUB (vector, 2S) throughput 1.007648
SUB (vector, 4S) 1->2 2.030794
SUB (vector, 4S) 1->3 2.004010
SUB (vector, 4S) throughput 1.010720
SUB (vector, 2D) 1->2 2.009429
SUB (vector, 2D) 1->3 2.005740
SUB (vector, 2D) throughput 1.012890
SUB (vector, D) 1->2 2.031210
SUB (vector, D) 1->3 2.032180
SUB (vector, D) throughput 1.007506
SUBHN (vector, 8H) 1->2 4.096382
SUBHN (vector, 8H) 1->3 3.986949
SUBHN (vector, 8H) throughput 1.009415
SUBHN (vector, 4S) 1->2 4.005146
SUBHN (vector, 4S) 1->3 4.035747
SUBHN (vector, 4S) throughput 0.986602
SUBHN (vector, 2D) 1->2 4.031215
SUBHN (vector, 2D) 1->3 4.025931
SUBHN (vector, 2D) throughput 1.010565
SUBHN2 (vector, 8H) 1->1 4.038957
SUBHN2 (vector, 8H) 1->2 3.971453
SUBHN2 (vector, 8H) 1->3 4.015193
SUBHN2 (vector, 8H) throughput 1.002936
SUBHN2 (vector, 8H) throughput 1.177842
SUBHN2 (vector, 4S) 1->1 3.979886
SUBHN2 (vector, 4S) 1->2 4.004277
SUBHN2 (vector, 4S) 1->3 4.031952
SUBHN2 (vector, 4S) throughput 1.017204
SUBHN2 (vector, 4S) throughput 1.004404
SUBHN2 (vector, 2D) 1->1 4.040523
SUBHN2 (vector, 2D) 1->2 4.035214
SUBHN2 (vector, 2D) 1->3 4.198219
SUBHN2 (vector, 2D) throughput 1.007856
SUBHN2 (vector, 2D) throughput 1.016118
SUQADD (scalar, B) 1->1 3.021792
SUQADD (scalar, B) 1->2 3.676615
SUQADD (scalar, B) throughput 1.004278
SUQADD (scalar, B) throughput 1.009586
SUQADD (scalar, H) 1->1 3.003772
SUQADD (scalar, H) 1->2 3.010316
SUQADD (scalar, H) throughput 0.990544
SUQADD (scalar, H) throughput 1.011364
SUQADD (scalar, S) 1->1 3.005067
SUQADD (scalar, S) 1->2 3.048904
SUQADD (scalar, S) throughput 0.976357
SUQADD (scalar, S) throughput 1.008679
SUQADD (scalar, D) 1->1 3.111536
SUQADD (scalar, D) 1->2 3.056644
SUQADD (scalar, D) throughput 1.002018
SUQADD (scalar, D) throughput 0.997049
SUQADD (vector, 8B) 1->1 3.003197
SUQADD (vector, 8B) 1->2 2.955460
SUQADD (vector, 8B) throughput 1.006053
SUQADD (vector, 8B) throughput 0.992366
SUQADD (vector, 16B) 1->1 2.986354
SUQADD (vector, 16B) 1->2 3.001346
SUQADD (vector, 16B) throughput 1.003372
SUQADD (vector, 16B) throughput 1.022567
SUQADD (vector, 4H) 1->1 3.027812
SUQADD (vector, 4H) 1->2 2.980867
SUQADD (vector, 4H) throughput 1.012929
SUQADD (vector, 4H) throughput 1.004507
SUQADD (vector, 8H) 1->1 2.921773
SUQADD (vector, 8H) 1->2 3.042960
SUQADD (vector, 8H) throughput 1.006304
SUQADD (vector, 8H) throughput 1.009438
SUQADD (vector, 2S) 1->1 3.025636
SUQADD (vector, 2S) 1->2 3.020711
SUQADD (vector, 2S) throughput 0.998366
SUQADD (vector, 2S) throughput 1.009839
SUQADD (vector, 4S) 1->1 2.997149
SUQADD (vector, 4S) 1->2 3.013688
SUQADD (vector, 4S) throughput 1.009714
SUQADD (vector, 4S) throughput 1.010211
SUQADD (vector, 2D) 1->1 3.021993
SUQADD (vector, 2D) 1->2 3.014762
SUQADD (vector, 2D) throughput 0.996278
SUQADD (vector, 2D) throughput 1.009553
SXTL (8H) 1->2 2.036239
SXTL (8H) throughput 1.012692
SXTL (4S) 1->2 2.005374
SXTL (4S) throughput 1.028773
SXTL (2D) 1->2 1.958011
SXTL (2D) throughput 0.973716
SXTL2 (8H) 1->2 1.993839
SXTL2 (8H) throughput 1.005548
SXTL2 (4S) 1->2 2.008258
SXTL2 (4S) throughput 0.997502
SXTL2 (2D) 1->2 2.018356
SXTL2 (2D) throughput 1.009324
TBL (single register table, 8B) 1->2 2.014520
TBL (single register table, 8B) 1->3 2.007906
TBL (single register table, 8B) throughput 0.999135
TBL (single register table, 16B) 1->2 2.030085
TBL (single register table, 16B) 1->3 2.001338
TBL (single register table, 16B) throughput 1.008466
TBL (two register table, 8B) 1->2 2.087254
TBL (two register table, 8B) 1->3 1.980670
TBL (two register table, 8B) 1->4 2.019703
TBL (two register table, 8B) throughput 0.956559
TBL (two register table, 16B) 1->2 2.006641
TBL (two register table, 16B) 1->3 1.975868
TBL (two register table, 16B) 1->4 2.035648
TBL (two register table, 16B) throughput 1.005680
TBL (three register table, 8B) 1->2 3.973592
TBL (three register table, 8B) 1->3 3.988427
TBL (three register table, 8B) 1->4 2.121615
TBL (three register table, 8B) 1->5 4.006453
TBL (three register table, 8B) throughput 1.986089
TBL (three register table, 16B) 1->2 4.057108
TBL (three register table, 16B) 1->3 4.019171
TBL (three register table, 16B) 1->4 2.015637
TBL (three register table, 16B) 1->5 4.004942
TBL (three register table, 16B) throughput 2.037553
TBL (four register table, 8B) 1->2 4.045370
TBL (four register table, 8B) 1->3 5.233667
TBL (four register table, 8B) 1->4 3.505042
TBL (four register table, 8B) 1->5 4.105238
TBL (four register table, 8B) 1->6 4.829315
TBL (four register table, 8B) throughput 3.029285
TBL (four register table, 16B) 1->2 4.064715
TBL (four register table, 16B) 1->3 4.091787
TBL (four register table, 16B) 1->4 4.076072
TBL (four register table, 16B) 1->5 4.128892
TBL (four register table, 16B) 1->6 5.045918
TBL (four register table, 16B) throughput 3.038301
TBX (single register table, 8B) 1->1 1.988499
TBX (single register table, 8B) 1->2 2.031637
TBX (single register table, 8B) 1->3 1.984515
TBX (single register table, 8B) throughput 1.009464
TBX (single register table, 8B) throughput 0.983699
TBX (single register table, 16B) 1->1 1.998443
TBX (single register table, 16B) 1->2 2.084051
TBX (single register table, 16B) 1->3 2.004628
TBX (single register table, 16B) throughput 1.003466
TBX (single register table, 16B) throughput 1.004976
TBX (two register table, 8B) 1->1 3.962639
TBX (two register table, 8B) 1->2 4.003470
TBX (two register table, 8B) 1->3 2.023147
TBX (two register table, 8B) 1->4 4.047642
TBX (two register table, 8B) throughput 2.024486
TBX (two register table, 16B) 1->1 4.012691
TBX (two register table, 16B) 1->2 4.010335
TBX (two register table, 16B) 1->3 2.037535
TBX (two register table, 16B) 1->4 3.746288
TBX (two register table, 16B) throughput 1.963504
TBX (three register table, 8B) 1->1 6.013150
TBX (three register table, 8B) 1->2 6.090373
TBX (three register table, 8B) 1->3 4.041663
TBX (three register table, 8B) 1->4 2.022061
TBX (three register table, 8B) 1->5 6.082229
TBX (three register table, 8B) throughput 3.007549
TBX (three register table, 16B) 1->1 6.055121
TBX (three register table, 16B) 1->2 6.042619
TBX (three register table, 16B) 1->3 4.046945
TBX (three register table, 16B) 1->4 2.071180
TBX (three register table, 16B) 1->5 6.148657
TBX (three register table, 16B) throughput 2.402872
TBX (four register table, 8B) 1->1 8.209048
TBX (four register table, 8B) 1->2 8.033494
TBX (four register table, 8B) 1->3 6.774442
TBX (four register table, 8B) 1->4 4.078132
TBX (four register table, 8B) 1->5 3.038819
TBX (four register table, 8B) 1->6 8.188945
TBX (four register table, 8B) throughput 3.997747
TBX (four register table, 16B) 1->1 8.103817
TBX (four register table, 16B) 1->2 8.247256
TBX (four register table, 16B) 1->3 6.067567
TBX (four register table, 16B) 1->4 3.993432
TBX (four register table, 16B) 1->5 3.019235
TBX (four register table, 16B) 1->6 8.079493
TBX (four register table, 16B) throughput 4.040168
TRN1 (vector, 8B) 1->2 2.007544
TRN1 (vector, 8B) 1->3 1.930856
TRN1 (vector, 8B) throughput 1.043681
TRN1 (vector, 16B) 1->2 2.053645
TRN1 (vector, 16B) 1->3 2.002106
TRN1 (vector, 16B) throughput 1.011553
TRN1 (vector, 4H) 1->2 1.924824
TRN1 (vector, 4H) 1->3 1.998574
TRN1 (vector, 4H) throughput 1.011740
TRN1 (vector, 8H) 1->2 2.006664
TRN1 (vector, 8H) 1->3 2.012696
TRN1 (vector, 8H) throughput 1.011721
TRN1 (vector, 2S) 1->2 2.063406
TRN1 (vector, 2S) 1->3 1.994130
TRN1 (vector, 2S) throughput 1.007461
TRN1 (vector, 4S) 1->2 2.016518
TRN1 (vector, 4S) 1->3 1.986830
TRN1 (vector, 4S) throughput 1.001483
TRN1 (vector, 2D) 1->2 2.028750
TRN1 (vector, 2D) 1->3 1.990383
TRN1 (vector, 2D) throughput 1.008522
TRN2 (vector, 8B) 1->2 1.996841
TRN2 (vector, 8B) 1->3 2.007698
TRN2 (vector, 8B) throughput 1.000368
TRN2 (vector, 16B) 1->2 2.487689
TRN2 (vector, 16B) 1->3 1.994349
TRN2 (vector, 16B) throughput 1.011857
TRN2 (vector, 4H) 1->2 2.020175
TRN2 (vector, 4H) 1->3 2.012585
TRN2 (vector, 4H) throughput 1.015541
TRN2 (vector, 8H) 1->2 2.025428
TRN2 (vector, 8H) 1->3 2.009167
TRN2 (vector, 8H) throughput 0.991419
TRN2 (vector, 2S) 1->2 2.049184
TRN2 (vector, 2S) 1->3 1.992182
TRN2 (vector, 2S) throughput 1.011293
TRN2 (vector, 4S) 1->2 2.015944
TRN2 (vector, 4S) 1->3 1.992119
TRN2 (vector, 4S) throughput 1.010913
TRN2 (vector, 2D) 1->2 2.006443
TRN2 (vector, 2D) 1->3 2.003992
TRN2 (vector, 2D) throughput 1.011887
UABA (vector, 8B) 1->1 2.025334
UABA (vector, 8B) 1->2 3.028766
UABA (vector, 8B) 1->3 3.000193
UABA (vector, 8B) throughput 1.013294
UABA (vector, 8B) throughput 0.996226
UABA (vector, 16B) 1->1 2.077354
UABA (vector, 16B) 1->2 3.008709
UABA (vector, 16B) 1->3 2.955273
UABA (vector, 16B) throughput 1.005379
UABA (vector, 16B) throughput 1.009153
UABA (vector, 4H) 1->1 1.849448
UABA (vector, 4H) 1->2 3.031629
UABA (vector, 4H) 1->3 3.028632
UABA (vector, 4H) throughput 1.004494
UABA (vector, 4H) throughput 0.969859
UABA (vector, 8H) 1->1 1.999311
UABA (vector, 8H) 1->2 2.994042
UABA (vector, 8H) 1->3 3.039247
UABA (vector, 8H) throughput 1.004616
UABA (vector, 8H) throughput 1.007720
UABA (vector, 2S) 1->1 2.029559
UABA (vector, 2S) 1->2 3.000395
UABA (vector, 2S) 1->3 3.034126
UABA (vector, 2S) throughput 0.995009
UABA (vector, 2S) throughput 1.017373
UABA (vector, 4S) 1->1 2.025555
UABA (vector, 4S) 1->2 3.031482
UABA (vector, 4S) 1->3 3.030172
UABA (vector, 4S) throughput 1.010970
UABA (vector, 4S) throughput 1.000708
UABAL (vector, 8H) 1->1 2.001469
UABAL (vector, 8H) 1->2 2.990790
UABAL (vector, 8H) 1->3 3.023720
UABAL (vector, 8H) throughput 0.998667
UABAL (vector, 8H) throughput 1.257772
UABAL (vector, 4S) 1->1 1.978897
UABAL (vector, 4S) 1->2 3.118654
UABAL (vector, 4S) 1->3 3.038871
UABAL (vector, 4S) throughput 1.015919
UABAL (vector, 4S) throughput 0.994891
UABAL (vector, 2D) 1->1 1.998245
UABAL (vector, 2D) 1->2 3.034318
UABAL (vector, 2D) 1->3 3.050697
UABAL (vector, 2D) throughput 1.007248
UABAL (vector, 2D) throughput 0.993672
UABAL2 (vector, 8H) 1->1 1.980325
UABAL2 (vector, 8H) 1->2 2.982690
UABAL2 (vector, 8H) 1->3 3.040837
UABAL2 (vector, 8H) throughput 1.008014
UABAL2 (vector, 8H) throughput 0.997037
UABAL2 (vector, 4S) 1->1 2.102074
UABAL2 (vector, 4S) 1->2 3.008445
UABAL2 (vector, 4S) 1->3 3.002530
UABAL2 (vector, 4S) throughput 0.991396
UABAL2 (vector, 4S) throughput 1.016143
UABAL2 (vector, 2D) 1->1 1.988253
UABAL2 (vector, 2D) 1->2 2.991552
UABAL2 (vector, 2D) 1->3 2.993824
UABAL2 (vector, 2D) throughput 1.003021
UABAL2 (vector, 2D) throughput 1.012586
UABD (vector, 8B) 1->2 2.991651
UABD (vector, 8B) 1->3 3.030113
UABD (vector, 8B) throughput 1.009928
UABD (vector, 16B) 1->2 3.043518
UABD (vector, 16B) 1->3 3.504196
UABD (vector, 16B) throughput 0.985546
UABD (vector, 4H) 1->2 2.996623
UABD (vector, 4H) 1->3 3.005343
UABD (vector, 4H) throughput 1.008001
UABD (vector, 8H) 1->2 3.022037
UABD (vector, 8H) 1->3 2.978581
UABD (vector, 8H) throughput 1.007682
UABD (vector, 2S) 1->2 2.972213
UABD (vector, 2S) 1->3 3.015398
UABD (vector, 2S) throughput 1.007016
UABD (vector, 4S) 1->2 3.072031
UABD (vector, 4S) 1->3 3.033995
UABD (vector, 4S) throughput 1.011398
UABDL (vector, 8H) 1->2 3.000525
UABDL (vector, 8H) 1->3 2.989370
UABDL (vector, 8H) throughput 0.964277
UABDL (vector, 4S) 1->2 3.030226
UABDL (vector, 4S) 1->3 3.009964
UABDL (vector, 4S) throughput 1.006557
UABDL (vector, 2D) 1->2 3.026273
UABDL (vector, 2D) 1->3 2.976180
UABDL (vector, 2D) throughput 1.010979
UABDL2 (vector, 8H) 1->2 3.021883
UABDL2 (vector, 8H) 1->3 3.008730
UABDL2 (vector, 8H) throughput 1.005069
UABDL2 (vector, 4S) 1->2 3.023556
UABDL2 (vector, 4S) 1->3 2.995619
UABDL2 (vector, 4S) throughput 1.006814
UABDL2 (vector, 2D) 1->2 2.994151
UABDL2 (vector, 2D) 1->3 3.079710
UABDL2 (vector, 2D) throughput 1.009071
UADALP (vector, 4H) 1->1 1.994255
UADALP (vector, 4H) 1->2 2.992615
UADALP (vector, 4H) throughput 0.992759
UADALP (vector, 4H) throughput 1.009698
UADALP (vector, 8H) 1->1 2.021764
UADALP (vector, 8H) 1->2 3.030940
UADALP (vector, 8H) throughput 1.003931
UADALP (vector, 8H) throughput 1.006304
UADALP (vector, 2S) 1->1 2.002831
UADALP (vector, 2S) 1->2 3.130221
UADALP (vector, 2S) throughput 1.006881
UADALP (vector, 2S) throughput 1.031648
UADALP (vector, 4S) 1->1 1.963902
UADALP (vector, 4S) 1->2 3.071909
UADALP (vector, 4S) throughput 0.982427
UADALP (vector, 4S) throughput 0.994983
UADALP (vector, 1D) 1->1 1.998574
UADALP (vector, 1D) 1->2 3.006449
UADALP (vector, 1D) throughput 1.002862
UADALP (vector, 1D) throughput 1.004361
UADALP (vector, 2D) 1->1 2.005539
UADALP (vector, 2D) 1->2 3.008710
UADALP (vector, 2D) throughput 1.031721
UADALP (vector, 2D) throughput 1.000400
UADDL (vector, 8H) 1->2 1.994502
UADDL (vector, 8H) 1->3 2.001535
UADDL (vector, 8H) throughput 1.011830
UADDL (vector, 4S) 1->2 2.013393
UADDL (vector, 4S) 1->3 1.988295
UADDL (vector, 4S) throughput 1.003737
UADDL (vector, 2D) 1->2 1.993236
UADDL (vector, 2D) 1->3 2.001229
UADDL (vector, 2D) throughput 1.011995
UADDL2 (vector, 8H) 1->2 1.936694
UADDL2 (vector, 8H) 1->3 2.006554
UADDL2 (vector, 8H) throughput 1.010324
UADDL2 (vector, 4S) 1->2 1.987523
UADDL2 (vector, 4S) 1->3 2.016340
UADDL2 (vector, 4S) throughput 0.997261
UADDL2 (vector, 2D) 1->2 2.009808
UADDL2 (vector, 2D) 1->3 1.819076
UADDL2 (vector, 2D) throughput 1.010621
UADDLP (vector, 4H) 1->2 1.998728
UADDLP (vector, 4H) throughput 0.984378
UADDLP (vector, 8H) 1->2 2.017955
UADDLP (vector, 8H) throughput 1.010852
UADDLP (vector, 2S) 1->2 2.053370
UADDLP (vector, 2S) throughput 1.006622
UADDLP (vector, 4S) 1->2 2.022849
UADDLP (vector, 4S) throughput 1.006401
UADDLP (vector, 1D) 1->2 2.000695
UADDLP (vector, 1D) throughput 0.996908
UADDLP (vector, 2D) 1->2 1.998723
UADDLP (vector, 2D) throughput 1.007794
UADDLV (vector, 8B) 1->2 2.991642
UADDLV (vector, 8B) throughput 1.011196
UADDLV (vector, 16B) 1->2 3.114601
UADDLV (vector, 16B) throughput 0.986153
UADDLV (vector, 4H) 1->2 2.961399
UADDLV (vector, 4H) throughput 1.022958
UADDLV (vector, 8H) 1->2 3.037784
UADDLV (vector, 8H) throughput 1.003171
UADDLV (vector, 4S) 1->2 2.994788
UADDLV (vector, 4S) throughput 1.012952
UADDW (vector, 8H) 1->2 2.000942
UADDW (vector, 8H) 1->3 2.033173
UADDW (vector, 8H) throughput 1.016678
UADDW (vector, 4S) 1->2 2.016507
UADDW (vector, 4S) 1->3 2.024363
UADDW (vector, 4S) throughput 1.000960
UADDW (vector, 2D) 1->2 2.024504
UADDW (vector, 2D) 1->3 2.083938
UADDW (vector, 2D) throughput 1.008246
UADDW2 (vector, 8H) 1->2 2.025464
UADDW2 (vector, 8H) 1->3 1.993823
UADDW2 (vector, 8H) throughput 1.006465
UADDW2 (vector, 4S) 1->2 2.012603
UADDW2 (vector, 4S) 1->3 2.019359
UADDW2 (vector, 4S) throughput 1.009338
UADDW2 (vector, 2D) 1->2 1.997301
UADDW2 (vector, 2D) 1->3 2.023038
UADDW2 (vector, 2D) throughput 1.012700
UCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 10.167072
UCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 1.056138
UCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 10.126884
UCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 1.016231
UCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 11.843969
UCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 1.015234
UCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 10.087749
UCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 1.008776
UCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 10.122001
UCVTF (scalar, integer, S from 32-bit GPR) throughput 1.161307
UCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.032361
UCVTF (scalar, integer, D from 32-bit GPR) throughput 1.001224
UCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 10.067420
UCVTF (scalar, integer, S from 64-bit GPR) throughput 1.022808
UCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 9.668760
UCVTF (scalar, integer, D from 64-bit GPR) throughput 1.012275
UCVTF (scalar, fixed-point, S from S) 1->2 3.014423
UCVTF (scalar, fixed-point, S from S) throughput 1.012312
UCVTF (scalar, fixed-point, D from D) 1->2 3.025641
UCVTF (scalar, fixed-point, D from D) throughput 0.988661
UCVTF (vector, fixed-point, 2S from 2S) 1->2 2.998245
UCVTF (vector, fixed-point, 2S from 2S) throughput 1.006363
UCVTF (vector, fixed-point, 4S from 4S) 1->2 3.080194
UCVTF (vector, fixed-point, 4S from 4S) throughput 1.018183
UCVTF (vector, fixed-point, 2D from 2D) 1->2 3.057292
UCVTF (vector, fixed-point, 2D from 2D) throughput 1.010477
UCVTF (scalar, integer, S from S) 1->2 3.006262
UCVTF (scalar, integer, S from S) throughput 1.032024
UCVTF (scalar, integer, D from D) 1->2 3.008126
UCVTF (scalar, integer, D from D) throughput 1.003946
UCVTF (vector, integer, 2S from 2S) 1->2 2.962326
UCVTF (vector, integer, 2S from 2S) throughput 1.003804
UCVTF (vector, integer, 4S from 4S) 1->2 3.054815
UCVTF (vector, integer, 4S from 4S) throughput 1.000136
UCVTF (vector, integer, 2D from 2D) 1->2 3.028542
UCVTF (vector, integer, 2D from 2D) throughput 1.016484
UHADD (vector, 8B) 1->2 2.009030
UHADD (vector, 8B) 1->3 2.033149
UHADD (vector, 8B) throughput 1.004222
UHADD (vector, 16B) 1->2 2.033261
UHADD (vector, 16B) 1->3 2.001116
UHADD (vector, 16B) throughput 0.999159
UHADD (vector, 4H) 1->2 2.068931
UHADD (vector, 4H) 1->3 2.025008
UHADD (vector, 4H) throughput 1.013447
UHADD (vector, 8H) 1->2 1.970305
UHADD (vector, 8H) 1->3 2.008202
UHADD (vector, 8H) throughput 1.013495
UHADD (vector, 2S) 1->2 1.974647
UHADD (vector, 2S) 1->3 2.009359
UHADD (vector, 2S) throughput 0.990893
UHADD (vector, 4S) 1->2 1.960995
UHADD (vector, 4S) 1->3 2.002873
UHADD (vector, 4S) throughput 0.988823
UHSUB (vector, 8B) 1->2 1.996995
UHSUB (vector, 8B) 1->3 1.922087
UHSUB (vector, 8B) throughput 1.017265
UHSUB (vector, 16B) 1->2 2.012182
UHSUB (vector, 16B) 1->3 1.932348
UHSUB (vector, 16B) throughput 1.002147
UHSUB (vector, 4H) 1->2 2.019808
UHSUB (vector, 4H) 1->3 2.016630
UHSUB (vector, 4H) throughput 0.983520
UHSUB (vector, 8H) 1->2 1.997193
UHSUB (vector, 8H) 1->3 2.016865
UHSUB (vector, 8H) throughput 0.995683
UHSUB (vector, 2S) 1->2 2.039859
UHSUB (vector, 2S) 1->3 1.974754
UHSUB (vector, 2S) throughput 1.015508
UHSUB (vector, 4S) 1->2 2.003210
UHSUB (vector, 4S) 1->3 2.002828
UHSUB (vector, 4S) throughput 0.996355
UMAX (vector, 8B) 1->2 3.031515
UMAX (vector, 8B) 1->3 3.032825
UMAX (vector, 8B) throughput 1.024084
UMAX (vector, 16B) 1->2 3.037696
UMAX (vector, 16B) 1->3 3.072905
UMAX (vector, 16B) throughput 1.043218
UMAX (vector, 4H) 1->2 3.012145
UMAX (vector, 4H) 1->3 3.025105
UMAX (vector, 4H) throughput 0.999206
UMAX (vector, 8H) 1->2 2.905015
UMAX (vector, 8H) 1->3 3.030832
UMAX (vector, 8H) throughput 1.005483
UMAX (vector, 2S) 1->2 3.003422
UMAX (vector, 2S) 1->3 3.039803
UMAX (vector, 2S) throughput 1.012589
UMAX (vector, 4S) 1->2 3.041508
UMAX (vector, 4S) 1->3 2.887797
UMAX (vector, 4S) throughput 1.002913
UMAXP (vector, 8B) 1->2 3.067574
UMAXP (vector, 8B) 1->3 3.008709
UMAXP (vector, 8B) throughput 1.010730
UMAXP (vector, 16B) 1->2 3.030111
UMAXP (vector, 16B) 1->3 3.336696
UMAXP (vector, 16B) throughput 1.000513
UMAXP (vector, 4H) 1->2 3.007833
UMAXP (vector, 4H) 1->3 3.027140
UMAXP (vector, 4H) throughput 1.020824
UMAXP (vector, 8H) 1->2 3.020410
UMAXP (vector, 8H) 1->3 3.011385
UMAXP (vector, 8H) throughput 1.012852
UMAXP (vector, 2S) 1->2 3.022850
UMAXP (vector, 2S) 1->3 2.963282
UMAXP (vector, 2S) throughput 1.004999
UMAXP (vector, 4S) 1->2 3.034874
UMAXP (vector, 4S) 1->3 3.023905
UMAXP (vector, 4S) throughput 0.995294
UMAXV (vector, 8B) 1->2 2.997126
UMAXV (vector, 8B) throughput 1.011659
UMAXV (vector, 16B) 1->2 3.091381
UMAXV (vector, 16B) throughput 1.019404
UMAXV (vector, 4H) 1->2 3.089994
UMAXV (vector, 4H) throughput 1.000755
UMAXV (vector, 8H) 1->2 2.998202
UMAXV (vector, 8H) throughput 0.994510
UMAXV (vector, 4S) 1->2 3.007655
UMAXV (vector, 4S) throughput 1.007664
UMIN (vector, 8B) 1->2 3.001753
UMIN (vector, 8B) 1->3 2.996605
UMIN (vector, 8B) throughput 0.991748
UMIN (vector, 16B) 1->2 2.998005
UMIN (vector, 16B) 1->3 3.055716
UMIN (vector, 16B) throughput 1.005181
UMIN (vector, 4H) 1->2 2.899232
UMIN (vector, 4H) 1->3 3.004140
UMIN (vector, 4H) throughput 1.006132
UMIN (vector, 8H) 1->2 3.026972
UMIN (vector, 8H) 1->3 2.998158
UMIN (vector, 8H) throughput 1.006189
UMIN (vector, 2S) 1->2 3.088714
UMIN (vector, 2S) 1->3 3.034049
UMIN (vector, 2S) throughput 1.010415
UMIN (vector, 4S) 1->2 3.085099
UMIN (vector, 4S) 1->3 2.993523
UMIN (vector, 4S) throughput 0.988339
UMINP (vector, 8B) 1->2 3.097436
UMINP (vector, 8B) 1->3 3.040114
UMINP (vector, 8B) throughput 1.003416
UMINP (vector, 16B) 1->2 3.019396
UMINP (vector, 16B) 1->3 3.013622
UMINP (vector, 16B) throughput 1.026317
UMINP (vector, 4H) 1->2 3.086303
UMINP (vector, 4H) 1->3 3.086346
UMINP (vector, 4H) throughput 1.014763
UMINP (vector, 8H) 1->2 3.014796
UMINP (vector, 8H) 1->3 2.997784
UMINP (vector, 8H) throughput 1.010398
UMINP (vector, 2S) 1->2 3.037185
UMINP (vector, 2S) 1->3 2.987227
UMINP (vector, 2S) throughput 1.006856
UMINP (vector, 4S) 1->2 3.046842
UMINP (vector, 4S) 1->3 3.048700
UMINP (vector, 4S) throughput 1.017885
UMINV (vector, 8B) 1->2 2.967196
UMINV (vector, 8B) throughput 1.050506
UMINV (vector, 16B) 1->2 3.008446
UMINV (vector, 16B) throughput 0.949252
UMINV (vector, 4H) 1->2 2.995852
UMINV (vector, 4H) throughput 0.983359
UMINV (vector, 8H) 1->2 3.039547
UMINV (vector, 8H) throughput 1.010157
UMINV (vector, 4S) 1->2 3.050322
UMINV (vector, 4S) throughput 1.007656
UMLAL (by element, 4S) 1->1 1.998467
UMLAL (by element, 4S) 1->2 3.003487
UMLAL (by element, 4S) 1->3 2.974830
UMLAL (by element, 4S) throughput 1.007085
UMLAL (by element, 4S) throughput 1.009129
UMLAL (by element, 2D) 1->1 1.928287
UMLAL (by element, 2D) 1->2 3.105260
UMLAL (by element, 2D) 1->3 3.007597
UMLAL (by element, 2D) throughput 1.010491
UMLAL (by element, 2D) throughput 1.008750
UMLAL2 (by element, 4S) 1->1 2.029532
UMLAL2 (by element, 4S) 1->2 3.024084
UMLAL2 (by element, 4S) 1->3 2.999294
UMLAL2 (by element, 4S) throughput 1.010988
UMLAL2 (by element, 4S) throughput 1.004247
UMLAL2 (by element, 2D) 1->1 2.038134
UMLAL2 (by element, 2D) 1->2 3.014036
UMLAL2 (by element, 2D) 1->3 3.012720
UMLAL2 (by element, 2D) throughput 1.004774
UMLAL2 (by element, 2D) throughput 1.005051
UMLAL (vector, 8H) 1->1 2.020636
UMLAL (vector, 8H) 1->2 3.017248
UMLAL (vector, 8H) 1->3 3.008361
UMLAL (vector, 8H) throughput 1.007186
UMLAL (vector, 8H) throughput 1.010548
UMLAL (vector, 4S) 1->1 1.988178
UMLAL (vector, 4S) 1->2 3.057062
UMLAL (vector, 4S) 1->3 3.010645
UMLAL (vector, 4S) throughput 1.003670
UMLAL (vector, 4S) throughput 1.006535
UMLAL (vector, 2D) 1->1 2.017056
UMLAL (vector, 2D) 1->2 3.014316
UMLAL (vector, 2D) 1->3 3.048868
UMLAL (vector, 2D) throughput 1.006027
UMLAL (vector, 2D) throughput 0.984009
UMLAL2 (vector, 8H) 1->1 2.045406
UMLAL2 (vector, 8H) 1->2 3.027867
UMLAL2 (vector, 8H) 1->3 3.076053
UMLAL2 (vector, 8H) throughput 0.926665
UMLAL2 (vector, 8H) throughput 1.009435
UMLAL2 (vector, 4S) 1->1 2.049499
UMLAL2 (vector, 4S) 1->2 3.059788
UMLAL2 (vector, 4S) 1->3 3.766172
UMLAL2 (vector, 4S) throughput 1.006591
UMLAL2 (vector, 4S) throughput 1.010946
UMLAL2 (vector, 2D) 1->1 1.940643
UMLAL2 (vector, 2D) 1->2 3.010820
UMLAL2 (vector, 2D) 1->3 3.010540
UMLAL2 (vector, 2D) throughput 1.002830
UMLAL2 (vector, 2D) throughput 0.996555
UMLSL (by element, 4S) 1->1 2.063145
UMLSL (by element, 4S) 1->2 3.039866
UMLSL (by element, 4S) 1->3 3.019272
UMLSL (by element, 4S) throughput 0.998752
UMLSL (by element, 4S) throughput 1.009123
UMLSL (by element, 2D) 1->1 2.017440
UMLSL (by element, 2D) 1->2 3.091270
UMLSL (by element, 2D) 1->3 3.015936
UMLSL (by element, 2D) throughput 1.007633
UMLSL (by element, 2D) throughput 1.000212
UMLSL2 (by element, 4S) 1->1 2.008010
UMLSL2 (by element, 4S) 1->2 2.996190
UMLSL2 (by element, 4S) 1->3 3.069508
UMLSL2 (by element, 4S) throughput 1.010648
UMLSL2 (by element, 4S) throughput 1.010992
UMLSL2 (by element, 2D) 1->1 2.052582
UMLSL2 (by element, 2D) 1->2 2.999543
UMLSL2 (by element, 2D) 1->3 3.031053
UMLSL2 (by element, 2D) throughput 0.972927
UMLSL2 (by element, 2D) throughput 1.008893
UMLSL (vector, 8H) 1->1 1.998399
UMLSL (vector, 8H) 1->2 2.967244
UMLSL (vector, 8H) 1->3 3.019974
UMLSL (vector, 8H) throughput 1.022772
UMLSL (vector, 8H) throughput 0.998327
UMLSL (vector, 4S) 1->1 2.010600
UMLSL (vector, 4S) 1->2 3.001144
UMLSL (vector, 4S) 1->3 3.028859
UMLSL (vector, 4S) throughput 1.008636
UMLSL (vector, 4S) throughput 1.005847
UMLSL (vector, 2D) 1->1 2.012623
UMLSL (vector, 2D) 1->2 3.033399
UMLSL (vector, 2D) 1->3 3.028052
UMLSL (vector, 2D) throughput 1.015179
UMLSL (vector, 2D) throughput 1.007337
UMLSL2 (vector, 8H) 1->1 2.008621
UMLSL2 (vector, 8H) 1->2 3.027545
UMLSL2 (vector, 8H) 1->3 3.075542
UMLSL2 (vector, 8H) throughput 0.985468
UMLSL2 (vector, 8H) throughput 1.012031
UMLSL2 (vector, 4S) 1->1 2.020032
UMLSL2 (vector, 4S) 1->2 2.967901
UMLSL2 (vector, 4S) 1->3 3.013099
UMLSL2 (vector, 4S) throughput 1.008094
UMLSL2 (vector, 4S) throughput 0.954831
UMLSL2 (vector, 2D) 1->1 1.994480
UMLSL2 (vector, 2D) 1->2 3.013142
UMLSL2 (vector, 2D) 1->3 2.985072
UMLSL2 (vector, 2D) throughput 1.009912
UMLSL2 (vector, 2D) throughput 1.022621
UMULL (by element, 4S) 1->2 3.027131
UMULL (by element, 4S) 1->3 2.998051
UMULL (by element, 4S) throughput 1.005675
UMULL (by element, 2D) 1->2 3.022762
UMULL (by element, 2D) 1->3 3.015626
UMULL (by element, 2D) throughput 1.005252
UMULL2 (by element, 4S) 1->2 3.009984
UMULL2 (by element, 4S) 1->3 3.001293
UMULL2 (by element, 4S) throughput 1.007478
UMULL2 (by element, 2D) 1->2 3.010865
UMULL2 (by element, 2D) 1->3 3.069726
UMULL2 (by element, 2D) throughput 1.013972
UMULL (vector, 8H) 1->2 3.074791
UMULL (vector, 8H) 1->3 3.008818
UMULL (vector, 8H) throughput 0.985439
UMULL (vector, 4S) 1->2 3.006264
UMULL (vector, 4S) 1->3 3.038752
UMULL (vector, 4S) throughput 0.991156
UMULL (vector, 2D) 1->2 3.028496
UMULL (vector, 2D) 1->3 3.032567
UMULL (vector, 2D) throughput 1.003612
UMULL2 (vector, 8H) 1->2 3.024235
UMULL2 (vector, 8H) 1->3 3.002347
UMULL2 (vector, 8H) throughput 1.004558
UMULL2 (vector, 4S) 1->2 3.040400
UMULL2 (vector, 4S) 1->3 2.965595
UMULL2 (vector, 4S) throughput 1.010069
UMULL2 (vector, 2D) 1->2 3.042244
UMULL2 (vector, 2D) 1->3 2.997939
UMULL2 (vector, 2D) throughput 1.003202
UQADD (scalar, B) 1->2 3.017723
UQADD (scalar, B) 1->3 3.510872
UQADD (scalar, B) throughput 0.988537
UQADD (scalar, H) 1->2 3.064452
UQADD (scalar, H) 1->3 3.032481
UQADD (scalar, H) throughput 1.000646
UQADD (scalar, S) 1->2 3.010050
UQADD (scalar, S) 1->3 3.021750
UQADD (scalar, S) throughput 1.012607
UQADD (scalar, D) 1->2 2.984389
UQADD (scalar, D) 1->3 2.992490
UQADD (scalar, D) throughput 1.011660
UQADD (vector, 8B) 1->2 3.019990
UQADD (vector, 8B) 1->3 3.007503
UQADD (vector, 8B) throughput 1.004246
UQADD (vector, 16B) 1->2 3.037617
UQADD (vector, 16B) 1->3 3.032984
UQADD (vector, 16B) throughput 1.000465
UQADD (vector, 4H) 1->2 3.070741
UQADD (vector, 4H) 1->3 3.060261
UQADD (vector, 4H) throughput 1.007271
UQADD (vector, 8H) 1->2 3.008059
UQADD (vector, 8H) 1->3 3.022693
UQADD (vector, 8H) throughput 1.009694
UQADD (vector, 2S) 1->2 2.988833
UQADD (vector, 2S) 1->3 3.004143
UQADD (vector, 2S) throughput 1.011703
UQADD (vector, 4S) 1->2 3.011646
UQADD (vector, 4S) 1->3 2.967322
UQADD (vector, 4S) throughput 1.011497
UQADD (vector, 2D) 1->2 3.018596
UQADD (vector, 2D) 1->3 3.040144
UQADD (vector, 2D) throughput 0.990765
UQRSHL (scalar, B) 1->2 3.010950
UQRSHL (scalar, B) 1->3 3.074329
UQRSHL (scalar, B) throughput 0.991977
UQRSHL (scalar, H) 1->2 3.018930
UQRSHL (scalar, H) 1->3 3.077714
UQRSHL (scalar, H) throughput 1.008894
UQRSHL (scalar, S) 1->2 2.983085
UQRSHL (scalar, S) 1->3 3.016295
UQRSHL (scalar, S) throughput 1.011751
UQRSHL (scalar, D) 1->2 3.030985
UQRSHL (scalar, D) 1->3 2.992553
UQRSHL (scalar, D) throughput 1.007919
UQRSHL (vector, 8B) 1->2 3.053187
UQRSHL (vector, 8B) 1->3 3.008179
UQRSHL (vector, 8B) throughput 1.014104
UQRSHL (vector, 16B) 1->2 3.068276
UQRSHL (vector, 16B) 1->3 3.072716
UQRSHL (vector, 16B) throughput 0.989206
UQRSHL (vector, 4H) 1->2 3.009084
UQRSHL (vector, 4H) 1->3 3.026347
UQRSHL (vector, 4H) throughput 1.012484
UQRSHL (vector, 8H) 1->2 3.004299
UQRSHL (vector, 8H) 1->3 2.997744
UQRSHL (vector, 8H) throughput 1.000080
UQRSHL (vector, 2S) 1->2 3.132499
UQRSHL (vector, 2S) 1->3 3.396190
UQRSHL (vector, 2S) throughput 1.031019
UQRSHL (vector, 4S) 1->2 3.009965
UQRSHL (vector, 4S) 1->3 3.020303
UQRSHL (vector, 4S) throughput 1.005361
UQRSHL (vector, 2D) 1->2 3.011075
UQRSHL (vector, 2D) 1->3 3.023453
UQRSHL (vector, 2D) throughput 1.000492
UQRSHRN (H) 1->2 3.983331
UQRSHRN (H) throughput 1.096999
UQRSHRN (S) 1->2 4.336671
UQRSHRN (S) throughput 1.009975
UQRSHRN (D) 1->2 4.033624
UQRSHRN (D) throughput 1.007755
UQRSHRN (8H) 1->2 4.078490
UQRSHRN (8H) throughput 1.016285
UQRSHRN (4S) 1->2 3.994370
UQRSHRN (4S) throughput 0.998247
UQRSHRN (2D) 1->2 4.055310
UQRSHRN (2D) throughput 1.051889
UQRSHRN2 (8H) 1->1 5.284969
UQRSHRN2 (8H) 1->2 3.946257
UQRSHRN2 (8H) throughput 0.775218
UQRSHRN2 (8H) throughput 1.037457
UQRSHRN2 (4S) 1->1 4.018465
UQRSHRN2 (4S) 1->2 4.001314
UQRSHRN2 (4S) throughput 1.008083
UQRSHRN2 (4S) throughput 0.987702
UQRSHRN2 (2D) 1->1 4.067674
UQRSHRN2 (2D) 1->2 4.034710
UQRSHRN2 (2D) throughput 1.006705
UQRSHRN2 (2D) throughput 1.011347
UQSHL (immediate, scalar, B) 1->2 1.973461
UQSHL (immediate, scalar, B) throughput 1.002886
UQSHL (immediate, scalar, H) 1->2 2.152295
UQSHL (immediate, scalar, H) throughput 1.017848
UQSHL (immediate, scalar, S) 1->2 2.060653
UQSHL (immediate, scalar, S) throughput 0.989302
UQSHL (immediate, scalar, D) 1->2 1.999935
UQSHL (immediate, scalar, D) throughput 1.002383
UQSHL (immediate, vector, 8B) 1->2 1.981970
UQSHL (immediate, vector, 8B) throughput 1.004521
UQSHL (immediate, vector, 16B) 1->2 2.008682
UQSHL (immediate, vector, 16B) throughput 0.994756
UQSHL (immediate, vector, 4H) 1->2 1.998398
UQSHL (immediate, vector, 4H) throughput 1.006707
UQSHL (immediate, vector, 8H) 1->2 2.012395
UQSHL (immediate, vector, 8H) throughput 1.008146
UQSHL (immediate, vector, 2S) 1->2 1.999430
UQSHL (immediate, vector, 2S) throughput 1.019712
UQSHL (immediate, vector, 4S) 1->2 1.966046
UQSHL (immediate, vector, 4S) throughput 1.005506
UQSHL (immediate, vector, 2D) 1->2 1.996747
UQSHL (immediate, vector, 2D) throughput 1.009917
UQSHL (scalar, B) 1->2 1.994056
UQSHL (scalar, B) 1->3 2.039280
UQSHL (scalar, B) throughput 0.994195
UQSHL (scalar, H) 1->2 2.037171
UQSHL (scalar, H) 1->3 2.006466
UQSHL (scalar, H) throughput 1.010734
UQSHL (scalar, S) 1->2 1.999614
UQSHL (scalar, S) 1->3 1.998333
UQSHL (scalar, S) throughput 1.016805
UQSHL (scalar, D) 1->2 2.044420
UQSHL (scalar, D) 1->3 1.992309
UQSHL (scalar, D) throughput 1.013328
UQSHL (vector, 8B) 1->2 1.993776
UQSHL (vector, 8B) 1->3 2.002620
UQSHL (vector, 8B) throughput 0.989713
UQSHL (vector, 16B) 1->2 2.004972
UQSHL (vector, 16B) 1->3 2.036153
UQSHL (vector, 16B) throughput 1.008922
UQSHL (vector, 4H) 1->2 2.026565
UQSHL (vector, 4H) 1->3 1.997033
UQSHL (vector, 4H) throughput 1.007798
UQSHL (vector, 8H) 1->2 2.017110
UQSHL (vector, 8H) 1->3 1.964820
UQSHL (vector, 8H) throughput 1.010416
UQSHL (vector, 2S) 1->2 2.065557
UQSHL (vector, 2S) 1->3 2.030695
UQSHL (vector, 2S) throughput 1.005824
UQSHL (vector, 4S) 1->2 2.048863
UQSHL (vector, 4S) 1->3 2.001206
UQSHL (vector, 4S) throughput 1.004400
UQSHL (vector, 2D) 1->2 2.031710
UQSHL (vector, 2D) 1->3 1.999736
UQSHL (vector, 2D) throughput 1.006574
UQSHRN (H) 1->2 4.006342
UQSHRN (H) throughput 1.009384
UQSHRN (S) 1->2 4.056936
UQSHRN (S) throughput 1.011226
UQSHRN (D) 1->2 4.040596
UQSHRN (D) throughput 0.991970
UQSHRN (8H) 1->2 4.055417
UQSHRN (8H) throughput 1.007980
UQSHRN (4S) 1->2 4.010813
UQSHRN (4S) throughput 1.013092
UQSHRN (2D) 1->2 4.110466
UQSHRN (2D) throughput 1.025580
UQSHRN2 (8H) 1->1 4.318087
UQSHRN2 (8H) 1->2 3.705922
UQSHRN2 (8H) throughput 1.011685
UQSHRN2 (8H) throughput 1.006353
UQSHRN2 (4S) 1->1 4.037083
UQSHRN2 (4S) 1->2 4.015565
UQSHRN2 (4S) throughput 0.988904
UQSHRN2 (4S) throughput 1.006362
UQSHRN2 (2D) 1->1 4.069686
UQSHRN2 (2D) 1->2 4.028105
UQSHRN2 (2D) throughput 1.008035
UQSHRN2 (2D) throughput 0.993281
UQSUB (scalar, B) 1->2 2.994239
UQSUB (scalar, B) 1->3 3.036126
UQSUB (scalar, B) throughput 0.994878
UQSUB (scalar, H) 1->2 3.001974
UQSUB (scalar, H) 1->3 3.055673
UQSUB (scalar, H) throughput 1.010550
UQSUB (scalar, S) 1->2 2.967294
UQSUB (scalar, S) 1->3 3.024720
UQSUB (scalar, S) throughput 0.993717
UQSUB (scalar, D) 1->2 3.018015
UQSUB (scalar, D) 1->3 3.010519
UQSUB (scalar, D) throughput 0.986013
UQSUB (vector, 8B) 1->2 3.033175
UQSUB (vector, 8B) 1->3 3.034664
UQSUB (vector, 8B) throughput 1.031823
UQSUB (vector, 16B) 1->2 2.742839
UQSUB (vector, 16B) 1->3 3.020584
UQSUB (vector, 16B) throughput 1.006164
UQSUB (vector, 4H) 1->2 3.038129
UQSUB (vector, 4H) 1->3 2.939930
UQSUB (vector, 4H) throughput 1.008952
UQSUB (vector, 8H) 1->2 3.032636
UQSUB (vector, 8H) 1->3 3.011427
UQSUB (vector, 8H) throughput 0.998015
UQSUB (vector, 2S) 1->2 3.014097
UQSUB (vector, 2S) 1->3 3.034707
UQSUB (vector, 2S) throughput 1.009144
UQSUB (vector, 4S) 1->2 3.000702
UQSUB (vector, 4S) 1->3 2.995871
UQSUB (vector, 4S) throughput 1.009187
UQSUB (vector, 2D) 1->2 2.979049
UQSUB (vector, 2D) 1->3 3.010512
UQSUB (vector, 2D) throughput 1.010357
UQXTN (H) 1->2 4.016167
UQXTN (H) throughput 1.015597
UQXTN (S) 1->2 4.068655
UQXTN (S) throughput 0.997725
UQXTN (D) 1->2 3.961083
UQXTN (D) throughput 1.018110
UQXTN (8H) 1->2 4.009096
UQXTN (8H) throughput 0.999611
UQXTN (4S) 1->2 4.026732
UQXTN (4S) throughput 1.008459
UQXTN (2D) 1->2 4.055073
UQXTN (2D) throughput 1.019639
UQXTN2 (8H) 1->1 3.978407
UQXTN2 (8H) 1->2 4.030115
UQXTN2 (8H) throughput 1.014675
UQXTN2 (8H) throughput 1.005044
UQXTN2 (4S) 1->1 4.020703
UQXTN2 (4S) 1->2 4.014321
UQXTN2 (4S) throughput 1.010416
UQXTN2 (4S) throughput 1.007642
UQXTN2 (2D) 1->1 4.074411
UQXTN2 (2D) 1->2 4.077278
UQXTN2 (2D) throughput 1.004530
UQXTN2 (2D) throughput 0.989893
URECPE (vector, 2S) 1->2 4.057679
URECPE (vector, 2S) throughput 1.006313
URECPE (vector, 4S) 1->2 4.021441
URECPE (vector, 4S) throughput 1.006110
URHADD (vector, 8B) 1->2 1.998509
URHADD (vector, 8B) 1->3 2.027202
URHADD (vector, 8B) throughput 1.006082
URHADD (vector, 16B) 1->2 2.001536
URHADD (vector, 16B) 1->3 2.017913
URHADD (vector, 16B) throughput 0.989461
URHADD (vector, 4H) 1->2 2.002751
URHADD (vector, 4H) 1->3 2.012138
URHADD (vector, 4H) throughput 1.002063
URHADD (vector, 8H) 1->2 2.000466
URHADD (vector, 8H) 1->3 2.013224
URHADD (vector, 8H) throughput 0.979901
URHADD (vector, 2S) 1->2 2.016209
URHADD (vector, 2S) 1->3 2.006204
URHADD (vector, 2S) throughput 1.007315
URHADD (vector, 4S) 1->2 2.013993
URHADD (vector, 4S) 1->3 1.977044
URHADD (vector, 4S) throughput 1.010831
URSHL (vector, 8B) 1->2 3.004358
URSHL (vector, 8B) 1->3 3.026894
URSHL (vector, 8B) throughput 1.008933
URSHL (vector, 16B) 1->2 2.999457
URSHL (vector, 16B) 1->3 2.991505
URSHL (vector, 16B) throughput 1.011705
URSHL (vector, 4H) 1->2 3.042107
URSHL (vector, 4H) 1->3 2.956567
URSHL (vector, 4H) throughput 1.015617
URSHL (vector, 8H) 1->2 2.972683
URSHL (vector, 8H) 1->3 3.054060
URSHL (vector, 8H) throughput 1.040882
URSHL (vector, 2S) 1->2 3.050813
URSHL (vector, 2S) 1->3 3.006094
URSHL (vector, 2S) throughput 1.011961
URSHL (vector, 4S) 1->2 2.999438
URSHL (vector, 4S) 1->3 3.010005
URSHL (vector, 4S) throughput 0.989823
URSHL (vector, 2D) 1->2 2.994172
URSHL (vector, 2D) 1->3 3.023405
URSHL (vector, 2D) throughput 1.014368
URSHL (vector, D) 1->2 3.082364
URSHL (vector, D) 1->3 3.020212
URSHL (vector, D) throughput 1.006835
URSHR (vector, 8B) 1->2 3.030027
URSHR (vector, 8B) throughput 0.901813
URSHR (vector, 16B) 1->2 3.005746
URSHR (vector, 16B) throughput 1.010552
URSHR (vector, 4H) 1->2 3.019908
URSHR (vector, 4H) throughput 1.009390
URSHR (vector, 8H) 1->2 2.993977
URSHR (vector, 8H) throughput 1.009997
URSHR (vector, 2S) 1->2 3.009017
URSHR (vector, 2S) throughput 1.011055
URSHR (vector, 4S) 1->2 3.002940
URSHR (vector, 4S) throughput 1.011984
URSHR (vector, 2D) 1->2 3.001294
URSHR (vector, 2D) throughput 0.995331
URSHR (vector, D) 1->2 3.001898
URSHR (vector, D) throughput 0.954483
URSQRTE (vector, 2S) 1->2 4.046286
URSQRTE (vector, 2S) throughput 1.012579
URSQRTE (vector, 4S) 1->2 4.036071
URSQRTE (vector, 4S) throughput 1.007617
URSRA (vector, 8B) 1->1 1.970056
URSRA (vector, 8B) 1->2 3.013414
URSRA (vector, 8B) throughput 1.015592
URSRA (vector, 8B) throughput 1.012971
URSRA (vector, 16B) 1->1 2.049146
URSRA (vector, 16B) 1->2 3.006316
URSRA (vector, 16B) throughput 0.989434
URSRA (vector, 16B) throughput 1.021480
URSRA (vector, 4H) 1->1 2.011957
URSRA (vector, 4H) 1->2 3.049123
URSRA (vector, 4H) throughput 1.003221
URSRA (vector, 4H) throughput 1.003317
URSRA (vector, 8H) 1->1 2.012375
URSRA (vector, 8H) 1->2 3.037451
URSRA (vector, 8H) throughput 1.003205
URSRA (vector, 8H) throughput 0.998360
URSRA (vector, 2S) 1->1 1.995227
URSRA (vector, 2S) 1->2 3.017663
URSRA (vector, 2S) throughput 0.998065
URSRA (vector, 2S) throughput 0.973124
URSRA (vector, 4S) 1->1 2.021207
URSRA (vector, 4S) 1->2 2.970649
URSRA (vector, 4S) throughput 1.002866
URSRA (vector, 4S) throughput 1.010577
URSRA (vector, 2D) 1->1 2.029386
URSRA (vector, 2D) 1->2 3.008742
URSRA (vector, 2D) throughput 1.009473
URSRA (vector, 2D) throughput 1.016716
URSRA (vector, D) 1->1 1.990011
URSRA (vector, D) 1->2 3.030444
URSRA (vector, D) throughput 1.018320
URSRA (vector, D) throughput 0.998967
USHL (vector, 8B) 1->2 1.982262
USHL (vector, 8B) 1->3 2.044721
USHL (vector, 8B) throughput 1.004979
USHL (vector, 16B) 1->2 1.994720
USHL (vector, 16B) 1->3 2.001096
USHL (vector, 16B) throughput 1.009338
USHL (vector, 4H) 1->2 2.020958
USHL (vector, 4H) 1->3 1.966442
USHL (vector, 4H) throughput 1.011940
USHL (vector, 8H) 1->2 2.006465
USHL (vector, 8H) 1->3 2.007446
USHL (vector, 8H) throughput 0.997243
USHL (vector, 2S) 1->2 1.971311
USHL (vector, 2S) 1->3 2.014619
USHL (vector, 2S) throughput 1.023315
USHL (vector, 4S) 1->2 2.007411
USHL (vector, 4S) 1->3 1.966042
USHL (vector, 4S) throughput 1.014678
USHL (vector, 2D) 1->2 2.000679
USHL (vector, 2D) 1->3 2.010819
USHL (vector, 2D) throughput 1.007399
USHL (vector, D) 1->2 2.007406
USHL (vector, D) 1->3 2.026892
USHL (vector, D) throughput 1.019861
USHLL (8H) 1->2 2.074269
USHLL (8H) throughput 1.006221
USHLL (4S) 1->2 1.964782
USHLL (4S) throughput 1.013665
USHLL (2D) 1->2 1.964603
USHLL (2D) throughput 1.003747
USHLL2 (8H) 1->2 2.059353
USHLL2 (8H) throughput 1.010832
USHLL2 (4S) 1->2 1.998271
USHLL2 (4S) throughput 1.005987
USHLL2 (2D) 1->2 2.000964
USHLL2 (2D) throughput 1.011923
USHR (vector, 8B) 1->2 2.049367
USHR (vector, 8B) throughput 0.997892
USHR (vector, 16B) 1->2 2.036010
USHR (vector, 16B) throughput 0.998202
USHR (vector, 4H) 1->2 2.005965
USHR (vector, 4H) throughput 1.009395
USHR (vector, 8H) 1->2 2.018181
USHR (vector, 8H) throughput 1.006311
USHR (vector, 2S) 1->2 2.125660
USHR (vector, 2S) throughput 1.032366
USHR (vector, 4S) 1->2 2.105794
USHR (vector, 4S) throughput 1.001931
USHR (vector, 2D) 1->2 2.037103
USHR (vector, 2D) throughput 1.010747
USHR (vector, D) 1->2 2.009817
USHR (vector, D) throughput 1.007375
USQADD (scalar, B) 1->1 3.070526
USQADD (scalar, B) 1->2 2.998311
USQADD (scalar, B) throughput 1.015586
USQADD (scalar, B) throughput 1.007416
USQADD (scalar, H) 1->1 3.024895
USQADD (scalar, H) 1->2 3.026139
USQADD (scalar, H) throughput 1.007012
USQADD (scalar, H) throughput 1.019739
USQADD (scalar, S) 1->1 2.898774
USQADD (scalar, S) 1->2 3.013050
USQADD (scalar, S) throughput 0.997145
USQADD (scalar, S) throughput 1.013852
USQADD (scalar, D) 1->1 3.014540
USQADD (scalar, D) 1->2 3.008864
USQADD (scalar, D) throughput 1.011575
USQADD (scalar, D) throughput 1.008720
USQADD (vector, 8B) 1->1 2.964971
USQADD (vector, 8B) 1->2 2.994524
USQADD (vector, 8B) throughput 1.013431
USQADD (vector, 8B) throughput 1.004135
USQADD (vector, 16B) 1->1 3.023662
USQADD (vector, 16B) 1->2 2.964636
USQADD (vector, 16B) throughput 0.990499
USQADD (vector, 16B) throughput 1.006895
USQADD (vector, 4H) 1->1 2.994415
USQADD (vector, 4H) 1->2 3.030149
USQADD (vector, 4H) throughput 1.005547
USQADD (vector, 4H) throughput 1.008261
USQADD (vector, 8H) 1->1 3.042432
USQADD (vector, 8H) 1->2 3.039167
USQADD (vector, 8H) throughput 0.996934
USQADD (vector, 8H) throughput 0.992876
USQADD (vector, 2S) 1->1 3.025645
USQADD (vector, 2S) 1->2 3.036349
USQADD (vector, 2S) throughput 1.012227
USQADD (vector, 2S) throughput 0.972064
USQADD (vector, 4S) 1->1 2.998899
USQADD (vector, 4S) 1->2 3.028727
USQADD (vector, 4S) throughput 1.006914
USQADD (vector, 4S) throughput 1.009266
USQADD (vector, 2D) 1->1 2.916537
USQADD (vector, 2D) 1->2 3.002162
USQADD (vector, 2D) throughput 0.992278
USQADD (vector, 2D) throughput 1.016504
USRA (vector, 8B) 1->1 2.008662
USRA (vector, 8B) 1->2 3.026691
USRA (vector, 8B) throughput 0.995876
USRA (vector, 8B) throughput 1.007489
USRA (vector, 16B) 1->1 1.998401
USRA (vector, 16B) 1->2 3.019364
USRA (vector, 16B) throughput 1.009325
USRA (vector, 16B) throughput 1.004239
USRA (vector, 4H) 1->1 2.009497
USRA (vector, 4H) 1->2 2.970091
USRA (vector, 4H) throughput 1.011410
USRA (vector, 4H) throughput 1.012972
USRA (vector, 8H) 1->1 2.010711
USRA (vector, 8H) 1->2 3.028507
USRA (vector, 8H) throughput 1.010049
USRA (vector, 8H) throughput 0.988095
USRA (vector, 2S) 1->1 1.968524
USRA (vector, 2S) 1->2 3.010405
USRA (vector, 2S) throughput 0.993129
USRA (vector, 2S) throughput 1.013219
USRA (vector, 4S) 1->1 2.043365
USRA (vector, 4S) 1->2 3.019625
USRA (vector, 4S) throughput 1.008218
USRA (vector, 4S) throughput 0.997444
USRA (vector, 2D) 1->1 2.036521
USRA (vector, 2D) 1->2 3.025509
USRA (vector, 2D) throughput 1.015594
USRA (vector, 2D) throughput 1.008616
USRA (vector, D) 1->1 1.997215
USRA (vector, D) 1->2 3.026202
USRA (vector, D) throughput 1.009821
USRA (vector, D) throughput 0.897408
USUBL (vector, 8H) 1->2 2.015614
USUBL (vector, 8H) 1->3 2.013863
USUBL (vector, 8H) throughput 0.991812
USUBL (vector, 4S) 1->2 1.997192
USUBL (vector, 4S) 1->3 2.023092
USUBL (vector, 4S) throughput 1.003124
USUBL (vector, 2D) 1->2 1.997841
USUBL (vector, 2D) 1->3 2.014361
USUBL (vector, 2D) throughput 1.010381
USUBL2 (vector, 8H) 1->2 2.021050
USUBL2 (vector, 8H) 1->3 2.006532
USUBL2 (vector, 8H) throughput 0.995777
USUBL2 (vector, 4S) 1->2 2.019819
USUBL2 (vector, 4S) 1->3 1.973316
USUBL2 (vector, 4S) throughput 1.011425
USUBL2 (vector, 2D) 1->2 2.004993
USUBL2 (vector, 2D) 1->3 1.993156
USUBL2 (vector, 2D) throughput 1.005047
USUBW (vector, 8H) 1->2 2.041016
USUBW (vector, 8H) 1->3 2.006377
USUBW (vector, 8H) throughput 1.014379
USUBW (vector, 4S) 1->2 2.006329
USUBW (vector, 4S) 1->3 2.006708
USUBW (vector, 4S) throughput 0.985698
USUBW (vector, 2D) 1->2 1.998554
USUBW (vector, 2D) 1->3 2.053920
USUBW (vector, 2D) throughput 1.014242
USUBW2 (vector, 8H) 1->2 2.000614
USUBW2 (vector, 8H) 1->3 2.007719
USUBW2 (vector, 8H) throughput 1.007974
USUBW2 (vector, 4S) 1->2 2.015023
USUBW2 (vector, 4S) 1->3 1.946863
USUBW2 (vector, 4S) throughput 0.987427
USUBW2 (vector, 2D) 1->2 2.046619
USUBW2 (vector, 2D) 1->3 2.434172
USUBW2 (vector, 2D) throughput 1.002191
UXTL (8H) 1->2 1.996426
UXTL (8H) throughput 1.022438
UXTL (4S) 1->2 2.027412
UXTL (4S) throughput 1.013041
UXTL (2D) 1->2 1.997022
UXTL (2D) throughput 0.993712
UXTL2 (8H) 1->2 1.983242
UXTL2 (8H) throughput 1.018955
UXTL2 (4S) 1->2 2.004753
UXTL2 (4S) throughput 0.995269
UXTL2 (2D) 1->2 2.022418
UXTL2 (2D) throughput 1.009490
UZP1 (vector, 8B) 1->2 2.037658
UZP1 (vector, 8B) 1->3 1.996821
UZP1 (vector, 8B) throughput 1.128574
UZP1 (vector, 16B) 1->2 2.028158
UZP1 (vector, 16B) 1->3 1.996201
UZP1 (vector, 16B) throughput 1.000340
UZP1 (vector, 4H) 1->2 2.006850
UZP1 (vector, 4H) 1->3 1.997983
UZP1 (vector, 4H) throughput 1.004215
UZP1 (vector, 8H) 1->2 1.998969
UZP1 (vector, 8H) 1->3 2.034084
UZP1 (vector, 8H) throughput 1.012748
UZP1 (vector, 2S) 1->2 2.021062
UZP1 (vector, 2S) 1->3 1.979898
UZP1 (vector, 2S) throughput 0.987572
UZP1 (vector, 4S) 1->2 1.998533
UZP1 (vector, 4S) 1->3 1.978211
UZP1 (vector, 4S) throughput 1.003713
UZP1 (vector, 2D) 1->2 2.009481
UZP1 (vector, 2D) 1->3 2.022678
UZP1 (vector, 2D) throughput 0.994420
UZP2 (vector, 8B) 1->2 2.006969
UZP2 (vector, 8B) 1->3 2.063862
UZP2 (vector, 8B) throughput 1.019601
UZP2 (vector, 16B) 1->2 2.003443
UZP2 (vector, 16B) 1->3 2.011363
UZP2 (vector, 16B) throughput 1.009820
UZP2 (vector, 4H) 1->2 2.005262
UZP2 (vector, 4H) 1->3 2.008599
UZP2 (vector, 4H) throughput 1.012967
UZP2 (vector, 8H) 1->2 1.993956
UZP2 (vector, 8H) 1->3 2.008664
UZP2 (vector, 8H) throughput 1.007638
UZP2 (vector, 2S) 1->2 1.967523
UZP2 (vector, 2S) 1->3 2.023072
UZP2 (vector, 2S) throughput 1.014149
UZP2 (vector, 4S) 1->2 2.021310
UZP2 (vector, 4S) 1->3 2.021648
UZP2 (vector, 4S) throughput 0.989008
UZP2 (vector, 2D) 1->2 2.046774
UZP2 (vector, 2D) 1->3 2.011616
UZP2 (vector, 2D) throughput 1.006382
XTN (8H) 1->2 2.045587
XTN (8H) throughput 1.012241
XTN (4S) 1->2 1.993432
XTN (4S) throughput 1.010385
XTN (2D) 1->2 1.997324
XTN (2D) throughput 0.993312
XTN2 (8H) 1->1 1.958505
XTN2 (8H) 1->2 1.998959
XTN2 (8H) throughput 1.012375
XTN2 (8H) throughput 1.002959
XTN2 (4S) 1->1 1.999672
XTN2 (4S) 1->2 2.011361
XTN2 (4S) throughput 0.993897
XTN2 (4S) throughput 1.001464
XTN2 (2D) 1->1 2.001335
XTN2 (2D) 1->2 1.969894
XTN2 (2D) throughput 0.995561
XTN2 (2D) throughput 1.014650
ZIP1 (vector, 8B) 1->2 1.981046
ZIP1 (vector, 8B) 1->3 2.033533
ZIP1 (vector, 8B) throughput 1.010053
ZIP1 (vector, 16B) 1->2 2.032022
ZIP1 (vector, 16B) 1->3 2.047964
ZIP1 (vector, 16B) throughput 1.018360
ZIP1 (vector, 4H) 1->2 1.991134
ZIP1 (vector, 4H) 1->3 2.040148
ZIP1 (vector, 4H) throughput 1.019063
ZIP1 (vector, 8H) 1->2 2.004231
ZIP1 (vector, 8H) 1->3 2.010182
ZIP1 (vector, 8H) throughput 0.994850
ZIP1 (vector, 2S) 1->2 2.059960
ZIP1 (vector, 2S) 1->3 2.010262
ZIP1 (vector, 2S) throughput 1.005501
ZIP1 (vector, 4S) 1->2 1.998114
ZIP1 (vector, 4S) 1->3 2.039771
ZIP1 (vector, 4S) throughput 1.001410
ZIP1 (vector, 2D) 1->2 1.994941
ZIP1 (vector, 2D) 1->3 2.046762
ZIP1 (vector, 2D) throughput 1.007975
ZIP2 (vector, 8B) 1->2 1.985348
ZIP2 (vector, 8B) 1->3 2.044116
ZIP2 (vector, 8B) throughput 1.021643
ZIP2 (vector, 16B) 1->2 1.999058
ZIP2 (vector, 16B) 1->3 1.943940
ZIP2 (vector, 16B) throughput 1.004225
ZIP2 (vector, 4H) 1->2 2.022657
ZIP2 (vector, 4H) 1->3 2.001246
ZIP2 (vector, 4H) throughput 1.007186
ZIP2 (vector, 8H) 1->2 2.017150
ZIP2 (vector, 8H) 1->3 2.000989
ZIP2 (vector, 8H) throughput 1.003736
ZIP2 (vector, 2S) 1->2 1.998356
ZIP2 (vector, 2S) 1->3 2.024065
ZIP2 (vector, 2S) throughput 1.004968
ZIP2 (vector, 4S) 1->2 2.020453
ZIP2 (vector, 4S) 1->3 2.018754
ZIP2 (vector, 4S) throughput 1.004913
ZIP2 (vector, 2D) 1->2 1.997973
ZIP2 (vector, 2D) 1->3 2.011876
ZIP2 (vector, 2D) throughput 1.034460
ADC (32-bit) 1->2 1.001903
ADC (32-bit) 1->3 1.001863
ADC (32-bit) 1->4 1.003920
ADC (32-bit) throughput 0.333057
ADC (64-bit) 1->2 1.002713
ADC (64-bit) 1->3 0.990760
ADC (64-bit) 1->4 0.993828
ADC (64-bit) throughput 0.354186
ADCS (32-bit) 1->2 1.088066
ADCS (32-bit) 1->3 1.039229
ADCS (32-bit) 1->4 1.016314
ADCS (32-bit) 4->2 0.996122
ADCS (32-bit) 4->3 1.001621
ADCS (32-bit) 4->4 1.003887
ADCS (32-bit) throughput 0.859593
ADCS (32-bit) throughput 0.623696
ADCS (64-bit) 1->2 0.997536
ADCS (64-bit) 1->3 1.006640
ADCS (64-bit) 1->4 1.017710
ADCS (64-bit) 4->2 1.007450
ADCS (64-bit) 4->3 1.038165
ADCS (64-bit) 4->4 1.005506
ADCS (64-bit) throughput 0.796344
ADCS (64-bit) throughput 0.620338
ADD (sxtb, 32-bit) 1->2 1.995841
ADD (sxtb, 32-bit) 1->3 1.991248
ADD (sxtb, 32-bit) throughput 0.667155
ADD (sxtb, 64-bit) 1->2 2.004213
ADD (sxtb, 64-bit) 1->3 2.005386
ADD (sxtb, 64-bit) throughput 0.668243
ADD (uxtb, 32-bit) 1->2 2.002470
ADD (uxtb, 32-bit) 1->3 1.999839
ADD (uxtb, 32-bit) throughput 0.692735
ADD (uxtb, 64-bit) 1->2 2.003438
ADD (uxtb, 64-bit) 1->3 1.993274
ADD (uxtb, 64-bit) throughput 0.672232
ADD (sxth, 32-bit) 1->2 1.999515
ADD (sxth, 32-bit) 1->3 2.822200
ADD (sxth, 32-bit) throughput 0.684144
ADD (sxth, 64-bit) 1->2 2.241766
ADD (sxth, 64-bit) 1->3 2.006549
ADD (sxth, 64-bit) throughput 0.629518
ADD (uxth, 32-bit) 1->2 1.992725
ADD (uxth, 32-bit) 1->3 2.000277
ADD (uxth, 32-bit) throughput 0.673479
ADD (uxth, 64-bit) 1->2 2.003079
ADD (uxth, 64-bit) 1->3 2.006722
ADD (uxth, 64-bit) throughput 0.750779
ADD (sxtw, 32-bit) 1->2 1.001660
ADD (sxtw, 32-bit) 1->3 1.002632
ADD (sxtw, 32-bit) throughput 0.333937
ADD (sxtw, 64-bit) 1->2 2.001741
ADD (sxtw, 64-bit) 1->3 2.001538
ADD (sxtw, 64-bit) throughput 0.666080
ADD (uxtw, 32-bit) 1->2 1.000258
ADD (uxtw, 32-bit) 1->3 1.004697
ADD (uxtw, 32-bit) throughput 0.334053
ADD (uxtw, 64-bit) 1->2 2.000728
ADD (uxtw, 64-bit) 1->3 1.978352
ADD (uxtw, 64-bit) throughput 0.668252
ADD (uxtx, 64-bit) 1->2 0.998141
ADD (uxtx, 64-bit) 1->3 1.001660
ADD (uxtx, 64-bit) throughput 0.333881
ADD (sxtx, 64-bit) 1->2 1.002510
ADD (sxtx, 64-bit) 1->3 1.003240
ADD (sxtx, 64-bit) throughput 0.334626
ADD (immediate, 32-bit) 1->2 1.000322
ADD (immediate, 32-bit) throughput 0.333512
ADD (immediate, 64-bit) 1->2 1.001538
ADD (immediate, 64-bit) throughput 0.357442
ADD (shifted immediate, 32-bit) 1->2 1.002101
ADD (shifted immediate, 32-bit) throughput 0.334842
ADD (shifted immediate, 64-bit) 1->2 0.999151
ADD (shifted immediate, 64-bit) throughput 0.332331
ADD (register, 32-bit) 1->2 0.999273
ADD (register, 32-bit) 1->3 0.997214
ADD (register, 32-bit) throughput 0.333728
ADD (register, 64-bit) 1->2 1.000566
ADD (register, 64-bit) 1->3 0.997938
ADD (register, 64-bit) throughput 0.334417
ADD (register, lsl, 32-bit) 1->2 2.024686
ADD (register, lsl, 32-bit) 1->3 2.000444
ADD (register, lsl, 32-bit) throughput 0.643954
ADD (register, lsl, 64-bit) 1->2 1.342847
ADD (register, lsl, 64-bit) 1->3 1.809081
ADD (register, lsl, 64-bit) throughput 0.804207
ADD (register, lsr, 32-bit) 1->2 2.000342
ADD (register, lsr, 32-bit) 1->3 1.999929
ADD (register, lsr, 32-bit) throughput 0.662794
ADD (register, lsr, 64-bit) 1->2 2.006398
ADD (register, lsr, 64-bit) 1->3 2.284870
ADD (register, lsr, 64-bit) throughput 0.666871
ADD (register, asr, 32-bit) 1->2 2.133751
ADD (register, asr, 32-bit) 1->3 2.001721
ADD (register, asr, 32-bit) throughput 0.668116
ADD (register, asr, 64-bit) 1->2 2.003726
ADD (register, asr, 64-bit) 1->3 1.999637
ADD (register, asr, 64-bit) throughput 0.694452
ADDS (sxtb, 32-bit) 1->2 2.031534
ADDS (sxtb, 32-bit) 1->3 1.984801
ADDS (sxtb, 32-bit) 4->2 2.024389
ADDS (sxtb, 32-bit) 4->3 1.997249
ADDS (sxtb, 32-bit) throughput 0.666822
ADDS (sxtb, 64-bit) 1->2 1.998144
ADDS (sxtb, 64-bit) 1->3 1.995963
ADDS (sxtb, 64-bit) 4->2 1.993183
ADDS (sxtb, 64-bit) 4->3 1.998624
ADDS (sxtb, 64-bit) throughput 0.676349
ADDS (uxtb, 32-bit) 1->2 2.001621
ADDS (uxtb, 32-bit) 1->3 2.004855
ADDS (uxtb, 32-bit) 4->2 2.004900
ADDS (uxtb, 32-bit) 4->3 2.008262
ADDS (uxtb, 32-bit) throughput 0.668861
ADDS (uxtb, 64-bit) 1->2 1.991500
ADDS (uxtb, 64-bit) 1->3 1.974317
ADDS (uxtb, 64-bit) 4->2 2.043171
ADDS (uxtb, 64-bit) 4->3 2.004292
ADDS (uxtb, 64-bit) throughput 0.668924
ADDS (sxth, 32-bit) 1->2 2.003840
ADDS (sxth, 32-bit) 1->3 2.002063
ADDS (sxth, 32-bit) 4->2 1.997857
ADDS (sxth, 32-bit) 4->3 1.995239
ADDS (sxth, 32-bit) throughput 0.666426
ADDS (sxth, 64-bit) 1->2 1.979793
ADDS (sxth, 64-bit) 1->3 1.999111
ADDS (sxth, 64-bit) 4->2 2.003928
ADDS (sxth, 64-bit) 4->3 1.991963
ADDS (sxth, 64-bit) throughput 0.702496
ADDS (uxth, 32-bit) 1->2 2.000687
ADDS (uxth, 32-bit) 1->3 1.970247
ADDS (uxth, 32-bit) 4->2 2.054057
ADDS (uxth, 32-bit) 4->3 2.003314
ADDS (uxth, 32-bit) throughput 0.711197
ADDS (uxth, 64-bit) 1->2 2.139254
ADDS (uxth, 64-bit) 1->3 1.933987
ADDS (uxth, 64-bit) 4->2 2.041828
ADDS (uxth, 64-bit) 4->3 2.148362
ADDS (uxth, 64-bit) throughput 0.700568
ADDS (sxtw, 32-bit) 1->2 1.002915
ADDS (sxtw, 32-bit) 1->3 1.001701
ADDS (sxtw, 32-bit) 4->2 1.002911
ADDS (sxtw, 32-bit) 4->3 0.990223
ADDS (sxtw, 32-bit) throughput 0.500420
ADDS (sxtw, 64-bit) 1->2 2.005507
ADDS (sxtw, 64-bit) 1->3 2.002511
ADDS (sxtw, 64-bit) 4->2 2.002626
ADDS (sxtw, 64-bit) 4->3 2.007413
ADDS (sxtw, 64-bit) throughput 0.674444
ADDS (uxtw, 32-bit) 1->2 0.999274
ADDS (uxtw, 32-bit) 1->3 1.001701
ADDS (uxtw, 32-bit) 4->2 1.004850
ADDS (uxtw, 32-bit) 4->3 1.019735
ADDS (uxtw, 32-bit) throughput 0.500784
ADDS (uxtw, 64-bit) 1->2 1.987062
ADDS (uxtw, 64-bit) 1->3 1.999313
ADDS (uxtw, 64-bit) 4->2 2.002387
ADDS (uxtw, 64-bit) 4->3 2.002871
ADDS (uxtw, 64-bit) throughput 0.670504
ADDS (uxtx, 64-bit) 1->2 0.999031
ADDS (uxtx, 64-bit) 1->3 1.001862
ADDS (uxtx, 64-bit) 4->2 1.035277
ADDS (uxtx, 64-bit) 4->3 1.007976
ADDS (uxtx, 64-bit) throughput 0.711329
ADDS (sxtx, 64-bit) 1->2 0.998111
ADDS (sxtx, 64-bit) 1->3 1.000688
ADDS (sxtx, 64-bit) 4->2 0.996325
ADDS (sxtx, 64-bit) 4->3 1.005022
ADDS (sxtx, 64-bit) throughput 0.500774
ADDS (immediate, 32-bit) 1->2 1.003280
ADDS (immediate, 32-bit) 3->2 0.999031
ADDS (immediate, 32-bit) throughput 0.502610
ADDS (immediate, 64-bit) 1->2 0.998180
ADDS (immediate, 64-bit) 3->2 1.004170
ADDS (immediate, 64-bit) throughput 0.500971
ADDS (shifted immediate, 32-bit) 1->2 1.072839
ADDS (shifted immediate, 32-bit) 3->2 1.011981
ADDS (shifted immediate, 32-bit) throughput 0.501484
ADDS (shifted immediate, 64-bit) 1->2 0.998383
ADDS (shifted immediate, 64-bit) 3->2 1.023486
ADDS (shifted immediate, 64-bit) throughput 0.500106
ADDS (register, 32-bit) 1->2 1.000000
ADDS (register, 32-bit) 1->3 1.025081
ADDS (register, 32-bit) 4->2 1.005262
ADDS (register, 32-bit) 4->3 1.003354
ADDS (register, 32-bit) throughput 0.500232
ADDS (register, 64-bit) 1->2 1.000887
ADDS (register, 64-bit) 1->3 0.772217
ADDS (register, 64-bit) 4->2 1.005539
ADDS (register, 64-bit) 4->3 0.749781
ADDS (register, 64-bit) throughput 1.016991
ADDS (register, lsl, 32-bit) 1->2 3.589544
ADDS (register, lsl, 32-bit) 1->3 1.153530
ADDS (register, lsl, 32-bit) 4->2 3.351728
ADDS (register, lsl, 32-bit) 4->3 2.030090
ADDS (register, lsl, 32-bit) throughput 0.667407
ADDS (register, lsl, 64-bit) 1->2 1.907195
ADDS (register, lsl, 64-bit) 1->3 2.008221
ADDS (register, lsl, 64-bit) 4->2 2.005304
ADDS (register, lsl, 64-bit) 4->3 2.000323
ADDS (register, lsl, 64-bit) throughput 0.672743
ADDS (register, lsr, 32-bit) 1->2 1.999514
ADDS (register, lsr, 32-bit) 1->3 2.007331
ADDS (register, lsr, 32-bit) 4->2 2.007451
ADDS (register, lsr, 32-bit) 4->3 2.000807
ADDS (register, lsr, 32-bit) throughput 0.668367
ADDS (register, lsr, 64-bit) 1->2 2.001454
ADDS (register, lsr, 64-bit) 1->3 1.999273
ADDS (register, lsr, 64-bit) 4->2 2.008180
ADDS (register, lsr, 64-bit) 4->3 2.292598
ADDS (register, lsr, 64-bit) throughput 0.671835
ADDS (register, asr, 32-bit) 1->2 2.001940
ADDS (register, asr, 32-bit) 1->3 2.002349
ADDS (register, asr, 32-bit) 4->2 2.003929
ADDS (register, asr, 32-bit) 4->3 2.003807
ADDS (register, asr, 32-bit) throughput 0.669334
ADDS (register, asr, 64-bit) 1->2 1.996407
ADDS (register, asr, 64-bit) 1->3 2.006953
ADDS (register, asr, 64-bit) 4->2 2.054001
ADDS (register, asr, 64-bit) 4->3 1.994355
ADDS (register, asr, 64-bit) throughput 0.672331
ADR throughput 0.261793
ADRP throughput 0.249558
AND (immediate, 32-bit) 1->2 1.002658
AND (immediate, 32-bit) throughput 0.335666
AND (immediate, 64-bit) 1->2 0.976435
AND (immediate, 64-bit) throughput 0.333986
AND (register, 32-bit) 1->2 1.001863
AND (register, 32-bit) 1->3 1.001660
AND (register, 32-bit) throughput 0.334299
AND (register, 64-bit) 1->2 1.000243
AND (register, 64-bit) 1->3 1.001782
AND (register, 64-bit) throughput 0.334081
AND (register, lsl, 32-bit) 1->2 2.003927
AND (register, lsl, 32-bit) 1->3 2.001699
AND (register, lsl, 32-bit) throughput 0.651652
AND (register, lsl, 64-bit) 1->2 1.998019
AND (register, lsl, 64-bit) 1->3 2.004050
AND (register, lsl, 64-bit) throughput 0.674301
AND (register, lsr, 32-bit) 1->2 2.002871
AND (register, lsr, 32-bit) 1->3 1.999071
AND (register, lsr, 32-bit) throughput 0.667530
AND (register, lsr, 64-bit) 1->2 2.006964
AND (register, lsr, 64-bit) 1->3 2.002389
AND (register, lsr, 64-bit) throughput 0.666111
AND (register, asr, 32-bit) 1->2 2.004258
AND (register, asr, 32-bit) 1->3 2.001666
AND (register, asr, 32-bit) throughput 0.666394
AND (register, asr, 64-bit) 1->2 2.003887
AND (register, asr, 64-bit) 1->3 2.063459
AND (register, asr, 64-bit) throughput 0.668464
AND (register, ror, 32-bit) 1->2 2.004980
AND (register, ror, 32-bit) 1->3 1.991184
AND (register, ror, 32-bit) throughput 0.667424
AND (register, ror, 64-bit) 1->2 2.002633
AND (register, ror, 64-bit) 1->3 1.996365
AND (register, ror, 64-bit) throughput 0.668071
ANDS (immediate, 32-bit) 1->2 0.998282
ANDS (immediate, 32-bit) 3->2 0.999636
ANDS (immediate, 32-bit) throughput 0.501762
ANDS (immediate, 64-bit) 1->2 1.002062
ANDS (immediate, 64-bit) 3->2 1.000566
ANDS (immediate, 64-bit) throughput 0.499869
ANDS (register, 32-bit) 1->2 1.332210
ANDS (register, 32-bit) 1->3 1.000171
ANDS (register, 32-bit) 4->2 0.969679
ANDS (register, 32-bit) 4->3 0.891175
ANDS (register, 32-bit) throughput 0.507547
ANDS (register, 64-bit) 1->2 1.056775
ANDS (register, 64-bit) 1->3 1.008949
ANDS (register, 64-bit) 4->2 1.001171
ANDS (register, 64-bit) 4->3 1.007882
ANDS (register, 64-bit) throughput 0.501807
ANDS (register, lsl, 32-bit) 1->2 2.004252
ANDS (register, lsl, 32-bit) 1->3 2.002389
ANDS (register, lsl, 32-bit) 4->2 2.002954
ANDS (register, lsl, 32-bit) 4->3 1.997171
ANDS (register, lsl, 32-bit) throughput 0.668558
ANDS (register, lsl, 64-bit) 1->2 2.003362
ANDS (register, lsl, 64-bit) 1->3 2.043286
ANDS (register, lsl, 64-bit) 4->2 2.002628
ANDS (register, lsl, 64-bit) 4->3 1.997778
ANDS (register, lsl, 64-bit) throughput 0.667960
ANDS (register, lsr, 32-bit) 1->2 2.002631
ANDS (register, lsr, 32-bit) 1->3 2.002343
ANDS (register, lsr, 32-bit) 4->2 2.008707
ANDS (register, lsr, 32-bit) 4->3 2.004291
ANDS (register, lsr, 32-bit) throughput 0.668623
ANDS (register, lsr, 64-bit) 1->2 1.998425
ANDS (register, lsr, 64-bit) 1->3 1.996082
ANDS (register, lsr, 64-bit) 4->2 1.983005
ANDS (register, lsr, 64-bit) 4->3 1.706663
ANDS (register, lsr, 64-bit) throughput 0.671978
ANDS (register, asr, 32-bit) 1->2 2.002512
ANDS (register, asr, 32-bit) 1->3 2.002469
ANDS (register, asr, 32-bit) 4->2 2.004566
ANDS (register, asr, 32-bit) 4->3 2.004656
ANDS (register, asr, 32-bit) throughput 0.667100
ANDS (register, asr, 64-bit) 1->2 2.001578
ANDS (register, asr, 64-bit) 1->3 2.002834
ANDS (register, asr, 64-bit) 4->2 2.114553
ANDS (register, asr, 64-bit) 4->3 2.011183
ANDS (register, asr, 64-bit) throughput 0.675008
ANDS (register, ror, 32-bit) 1->2 2.007975
ANDS (register, ror, 32-bit) 1->3 2.594237
ANDS (register, ror, 32-bit) 4->2 2.001386
ANDS (register, ror, 32-bit) 4->3 1.957118
ANDS (register, ror, 32-bit) throughput 0.670391
ANDS (register, ror, 64-bit) 1->2 2.003685
ANDS (register, ror, 64-bit) 1->3 2.003848
ANDS (register, ror, 64-bit) 4->2 2.000405
ANDS (register, ror, 64-bit) 4->3 2.004898
ANDS (register, ror, 64-bit) throughput 0.668192
ASR (immediate, 32-bit) 1->2 0.998910
ASR (immediate, 32-bit) throughput 0.333391
ASR (immediate, 64-bit) 1->2 1.002713
ASR (immediate, 64-bit) throughput 0.335044
ASR (register, 32-bit) 1->2 1.001701
ASR (register, 32-bit) 1->3 0.999071
ASR (register, 32-bit) throughput 0.334031
ASR (register, 64-bit) 1->2 0.998342
ASR (register, 64-bit) 1->3 1.001741
ASR (register, 64-bit) throughput 0.334268
B throughput 1.027178
B.cc (not taken) throughput 1.018246
B.cc (taken) throughput 0.614678
BFC (32-bit) 1->1 0.992769
BFC (32-bit) throughput 1.203510
BFC (64-bit) 1->1 0.997577
BFC (64-bit) throughput 1.004566
BFI (32-bit) 1->1 1.000162
BFI (32-bit) 1->2 0.997620
BFI (32-bit) throughput 1.019555
BFI (64-bit) 1->1 0.997415
BFI (64-bit) 1->2 0.840565
BFI (64-bit) throughput 1.001483
BFXIL (32-bit) 1->1 0.936552
BFXIL (32-bit) 1->2 1.415581
BFXIL (32-bit) throughput 1.050595
BFXIL (64-bit) 1->1 1.007046
BFXIL (64-bit) 1->2 1.004738
BFXIL (64-bit) throughput 1.010355
BIC (register, 32-bit) 1->2 1.001822
BIC (register, 32-bit) 1->3 0.998423
BIC (register, 32-bit) throughput 0.334734
BIC (register, 64-bit) 1->2 1.001819
BIC (register, 64-bit) 1->3 1.003927
BIC (register, 64-bit) throughput 0.333488
BIC (register, lsl, 32-bit) 1->2 2.002491
BIC (register, lsl, 32-bit) 1->3 2.006006
BIC (register, lsl, 32-bit) throughput 0.667773
BIC (register, lsl, 64-bit) 1->2 2.006173
BIC (register, lsl, 64-bit) 1->3 2.022625
BIC (register, lsl, 64-bit) throughput 0.670981
BIC (register, lsr, 32-bit) 1->2 2.004739
BIC (register, lsr, 32-bit) 1->3 2.003239
BIC (register, lsr, 32-bit) throughput 0.669672
BIC (register, lsr, 64-bit) 1->2 2.001580
BIC (register, lsr, 64-bit) 1->3 2.001564
BIC (register, lsr, 64-bit) throughput 0.677387
BIC (register, asr, 32-bit) 1->2 2.004899
BIC (register, asr, 32-bit) 1->3 2.000203
BIC (register, asr, 32-bit) throughput 0.668210
BIC (register, asr, 64-bit) 1->2 2.001658
BIC (register, asr, 64-bit) 1->3 2.088013
BIC (register, asr, 64-bit) throughput 0.665782
BIC (register, ror, 32-bit) 1->2 1.981316
BIC (register, ror, 32-bit) 1->3 2.001661
BIC (register, ror, 32-bit) throughput 0.669165
BIC (register, ror, 64-bit) 1->2 2.004050
BIC (register, ror, 64-bit) 1->3 2.001253
BIC (register, ror, 64-bit) throughput 0.688740
BICS (register, 32-bit) 1->2 1.046807
BICS (register, 32-bit) 1->3 1.001822
BICS (register, 32-bit) 4->2 0.996446
BICS (register, 32-bit) 4->3 1.040572
BICS (register, 32-bit) throughput 0.500860
BICS (register, 64-bit) 1->2 1.000162
BICS (register, 64-bit) 1->3 0.998383
BICS (register, 64-bit) 4->2 1.027822
BICS (register, 64-bit) 4->3 1.101789
BICS (register, 64-bit) throughput 0.502201
BICS (register, lsl, 32-bit) 1->2 2.006761
BICS (register, lsl, 32-bit) 1->3 1.968792
BICS (register, lsl, 32-bit) 4->2 1.947245
BICS (register, lsl, 32-bit) 4->3 2.003887
BICS (register, lsl, 32-bit) throughput 0.684048
BICS (register, lsl, 64-bit) 1->2 2.002551
BICS (register, lsl, 64-bit) 1->3 1.992461
BICS (register, lsl, 64-bit) 4->2 2.007167
BICS (register, lsl, 64-bit) 4->3 2.006277
BICS (register, lsl, 64-bit) throughput 0.668038
BICS (register, lsr, 32-bit) 1->2 2.003441
BICS (register, lsr, 32-bit) 1->3 2.006681
BICS (register, lsr, 32-bit) 4->2 2.003478
BICS (register, lsr, 32-bit) 4->3 2.004004
BICS (register, lsr, 32-bit) throughput 0.668066
BICS (register, lsr, 64-bit) 1->2 2.007289
BICS (register, lsr, 64-bit) 1->3 2.005225
BICS (register, lsr, 64-bit) 4->2 1.997013
BICS (register, lsr, 64-bit) 4->3 2.023081
BICS (register, lsr, 64-bit) throughput 0.667725
BICS (register, asr, 32-bit) 1->2 1.814106
BICS (register, asr, 32-bit) 1->3 2.005629
BICS (register, asr, 32-bit) 4->2 2.036902
BICS (register, asr, 32-bit) 4->3 2.058713
BICS (register, asr, 32-bit) throughput 0.668840
BICS (register, asr, 64-bit) 1->2 2.001538
BICS (register, asr, 64-bit) 1->3 2.002633
BICS (register, asr, 64-bit) 4->2 1.998181
BICS (register, asr, 64-bit) 4->3 2.009800
BICS (register, asr, 64-bit) throughput 0.667197
BICS (register, ror, 32-bit) 1->2 2.265385
BICS (register, ror, 32-bit) 1->3 2.559332
BICS (register, ror, 32-bit) 4->2 2.001334
BICS (register, ror, 32-bit) 4->3 2.016278
BICS (register, ror, 32-bit) throughput 0.666107
BICS (register, ror, 64-bit) 1->2 2.032618
BICS (register, ror, 64-bit) 1->3 2.427435
BICS (register, ror, 64-bit) 4->2 1.868111
BICS (register, ror, 64-bit) 4->3 2.006479
BICS (register, ror, 64-bit) throughput 0.668118
BL throughput 1.037941
CBNZ (not taken) throughput 0.590057
CBNZ (taken) throughput 1.047260
CBZ (not taken) throughput 0.584017
CBZ (taken) throughput 1.275696
CCMN (immediate, 32-bit) 2->1 1.013080
CCMN (immediate, 32-bit) 2->2 0.998221
CCMN (immediate, 32-bit) throughput 0.785009
CCMN (immediate, 32-bit) throughput 0.597447
CCMN (immediate, 64-bit) 2->1 1.002628
CCMN (immediate, 64-bit) 2->2 0.998383
CCMN (immediate, 64-bit) throughput 0.787317
CCMN (immediate, 64-bit) throughput 0.599692
CCMN (register, 32-bit) 3->1 1.003482
CCMN (register, 32-bit) 3->2 1.001821
CCMN (register, 32-bit) 3->3 1.001620
CCMN (register, 32-bit) throughput 0.956969
CCMN (register, 32-bit) throughput 0.617040
CCMN (register, 64-bit) 3->1 1.072848
CCMN (register, 64-bit) 3->2 0.981262
CCMN (register, 64-bit) 3->3 1.000000
CCMN (register, 64-bit) throughput 0.721782
CCMN (register, 64-bit) throughput 0.597850
CCMP (immediate, 32-bit) 2->1 0.991903
CCMP (immediate, 32-bit) 2->2 1.183874
CCMP (immediate, 32-bit) throughput 0.791766
CCMP (immediate, 32-bit) throughput 0.599388
CCMP (immediate, 64-bit) 2->1 1.002789
CCMP (immediate, 64-bit) 2->2 0.998343
CCMP (immediate, 64-bit) throughput 0.854149
CCMP (immediate, 64-bit) throughput 0.557530
CCMP (register, 32-bit) 3->1 0.948494
CCMP (register, 32-bit) 3->2 1.033404
CCMP (register, 32-bit) 3->3 0.995646
CCMP (register, 32-bit) throughput 0.786003
CCMP (register, 32-bit) throughput 0.598332
CCMP (register, 64-bit) 3->1 1.003801
CCMP (register, 64-bit) 3->2 1.002715
CCMP (register, 64-bit) 3->3 1.002186
CCMP (register, 64-bit) throughput 0.785822
CCMP (register, 64-bit) throughput 0.599190
CINC (32-bit) 1->2 1.001620
CINC (32-bit) 1->3 0.999919
CINC (32-bit) throughput 0.334717
CINC (64-bit) 1->2 0.998628
CINC (64-bit) 1->3 1.005061
CINC (64-bit) throughput 0.334305
CINV (32-bit) 1->2 1.002430
CINV (32-bit) 1->3 1.002708
CINV (32-bit) throughput 0.333917
CINV (64-bit) 1->2 1.001618
CINV (64-bit) 1->3 0.999757
CINV (64-bit) throughput 0.334271
CLREX throughput 5.042392
CLREX throughput 5.040897
CLS (32-bit) 1->2 0.998384
CLS (32-bit) throughput 0.334799
CLS (64-bit) 1->2 1.052502
CLS (64-bit) throughput 0.369045
CLZ (32-bit) 1->2 0.997415
CLZ (32-bit) throughput 0.333446
CLZ (64-bit) 1->2 0.999072
CLZ (64-bit) throughput 0.334386
CMN (sxtb, 32-bit) 3->1 2.001008
CMN (sxtb, 32-bit) 3->2 2.066975
CMN (sxtb, 32-bit) throughput 0.695930
CMN (sxtb, 64-bit) 3->1 2.007411
CMN (sxtb, 64-bit) 3->2 1.896898
CMN (sxtb, 64-bit) throughput 0.673156
CMN (uxtb, 32-bit) 3->1 1.998749
CMN (uxtb, 32-bit) 3->2 2.073315
CMN (uxtb, 32-bit) throughput 0.696378
CMN (uxtb, 64-bit) 3->1 2.011985
CMN (uxtb, 64-bit) 3->2 2.007530
CMN (uxtb, 64-bit) throughput 0.668089
CMN (sxth, 32-bit) 3->1 1.998423
CMN (sxth, 32-bit) 3->2 2.000687
CMN (sxth, 32-bit) throughput 0.730747
CMN (sxth, 64-bit) 3->1 2.002425
CMN (sxth, 64-bit) 3->2 2.025975
CMN (sxth, 64-bit) throughput 0.671030
CMN (uxth, 32-bit) 3->1 1.997376
CMN (uxth, 32-bit) 3->2 2.003928
CMN (uxth, 32-bit) throughput 0.674851
CMN (uxth, 64-bit) 3->1 1.998869
CMN (uxth, 64-bit) 3->2 2.005540
CMN (uxth, 64-bit) throughput 0.671287
CMN (sxtw, 32-bit) 3->1 1.002589
CMN (sxtw, 32-bit) 3->2 1.006277
CMN (sxtw, 32-bit) throughput 0.362176
CMN (sxtw, 64-bit) 3->1 2.003727
CMN (sxtw, 64-bit) 3->2 2.009111
CMN (sxtw, 64-bit) throughput 0.667497
CMN (uxtw, 32-bit) 3->1 0.998385
CMN (uxtw, 32-bit) 3->2 1.005143
CMN (uxtw, 32-bit) throughput 0.362898
CMN (uxtw, 64-bit) 3->1 2.004010
CMN (uxtw, 64-bit) 3->2 2.010285
CMN (uxtw, 64-bit) throughput 0.720575
CMN (uxtx, 64-bit) 3->1 1.020367
CMN (uxtx, 64-bit) 3->2 1.226029
CMN (uxtx, 64-bit) throughput 0.371948
CMN (sxtx, 64-bit) 3->1 0.998667
CMN (sxtx, 64-bit) 3->2 0.994804
CMN (sxtx, 64-bit) throughput 0.362751
CMN (immediate, 32-bit) 2->1 1.078844
CMN (immediate, 32-bit) throughput 0.368025
CMN (immediate, 64-bit) 2->1 1.005965
CMN (immediate, 64-bit) throughput 0.363733
CMN (shifted immediate, 32-bit) 2->1 0.957338
CMN (shifted immediate, 32-bit) throughput 0.362752
CMN (shifted immediate, 64-bit) 2->1 0.996808
CMN (shifted immediate, 64-bit) throughput 0.362332
CMN (register, 32-bit) 3->1 0.996688
CMN (register, 32-bit) 3->2 1.002593
CMN (register, 32-bit) throughput 0.364062
CMN (register, 64-bit) 3->1 1.003563
CMN (register, 64-bit) 3->2 1.009620
CMN (register, 64-bit) throughput 0.362005
CMN (register, lsl, 32-bit) 3->1 2.004697
CMN (register, lsl, 32-bit) 3->2 2.004861
CMN (register, lsl, 32-bit) throughput 0.667960
CMN (register, lsl, 64-bit) 3->1 2.000848
CMN (register, lsl, 64-bit) 3->2 2.004277
CMN (register, lsl, 64-bit) throughput 0.668306
CMN (register, lsr, 32-bit) 3->1 2.005134
CMN (register, lsr, 32-bit) 3->2 2.001980
CMN (register, lsr, 32-bit) throughput 0.666564
CMN (register, lsr, 64-bit) 3->1 1.995033
CMN (register, lsr, 64-bit) 3->2 2.004931
CMN (register, lsr, 64-bit) throughput 0.663833
CMN (register, asr, 32-bit) 3->1 2.002425
CMN (register, asr, 32-bit) 3->2 1.996159
CMN (register, asr, 32-bit) throughput 0.678795
CMN (register, asr, 64-bit) 3->1 2.014695
CMN (register, asr, 64-bit) 3->2 1.932653
CMN (register, asr, 64-bit) throughput 0.725067
CMP (sxtb, 32-bit) 3->1 1.782066
CMP (sxtb, 32-bit) 3->2 1.934255
CMP (sxtb, 32-bit) throughput 0.677646
CMP (sxtb, 64-bit) 3->1 1.997577
CMP (sxtb, 64-bit) 3->2 2.004009
CMP (sxtb, 64-bit) throughput 0.667853
CMP (uxtb, 32-bit) 3->1 2.129626
CMP (uxtb, 32-bit) 3->2 1.804826
CMP (uxtb, 32-bit) throughput 0.669218
CMP (uxtb, 64-bit) 3->1 2.003684
CMP (uxtb, 64-bit) 3->2 2.041748
CMP (uxtb, 64-bit) throughput 0.667110
CMP (sxth, 32-bit) 3->1 2.017980
CMP (sxth, 32-bit) 3->2 1.997291
CMP (sxth, 32-bit) throughput 0.669837
CMP (sxth, 64-bit) 3->1 2.002998
CMP (sxth, 64-bit) 3->2 1.993540
CMP (sxth, 64-bit) throughput 0.668360
CMP (uxth, 32-bit) 3->1 1.999111
CMP (uxth, 32-bit) 3->2 2.004253
CMP (uxth, 32-bit) throughput 0.666936
CMP (uxth, 64-bit) 3->1 2.006924
CMP (uxth, 64-bit) 3->2 2.009259
CMP (uxth, 64-bit) throughput 0.667296
CMP (sxtw, 32-bit) 3->1 1.001539
CMP (sxtw, 32-bit) 3->2 0.998302
CMP (sxtw, 32-bit) throughput 0.363393
CMP (sxtw, 64-bit) 3->1 2.019436
CMP (sxtw, 64-bit) 3->2 2.272226
CMP (sxtw, 64-bit) throughput 0.667147
CMP (uxtw, 32-bit) 3->1 1.213414
CMP (uxtw, 32-bit) 3->2 1.007384
CMP (uxtw, 32-bit) throughput 0.363343
CMP (uxtw, 64-bit) 3->1 2.000568
CMP (uxtw, 64-bit) 3->2 1.996850
CMP (uxtw, 64-bit) throughput 0.694523
CMP (uxtx, 64-bit) 3->1 1.092283
CMP (uxtx, 64-bit) 3->2 0.997899
CMP (uxtx, 64-bit) throughput 0.363333
CMP (sxtx, 64-bit) 3->1 1.002710
CMP (sxtx, 64-bit) 3->2 1.002991
CMP (sxtx, 64-bit) throughput 0.366922
CMP (immediate, 32-bit) 2->1 1.002389
CMP (immediate, 32-bit) throughput 0.330037
CMP (immediate, 64-bit) 2->1 1.006937
CMP (immediate, 64-bit) throughput 0.381874
CMP (shifted immediate, 32-bit) 2->1 1.034162
CMP (shifted immediate, 32-bit) throughput 0.363702
CMP (shifted immediate, 64-bit) 2->1 1.003887
CMP (shifted immediate, 64-bit) throughput 0.366069
CMP (register, 32-bit) 3->1 0.994874
CMP (register, 32-bit) 3->2 1.001418
CMP (register, 32-bit) throughput 0.368467
CMP (register, 64-bit) 3->1 1.005102
CMP (register, 64-bit) 3->2 1.023646
CMP (register, 64-bit) throughput 0.363029
CMP (register, lsl, 32-bit) 3->1 2.003926
CMP (register, lsl, 32-bit) 3->2 1.997943
CMP (register, lsl, 32-bit) throughput 0.674608
CMP (register, lsl, 64-bit) 3->1 2.007573
CMP (register, lsl, 64-bit) 3->2 2.000806
CMP (register, lsl, 64-bit) throughput 0.667873
CMP (register, lsr, 32-bit) 3->1 2.003768
CMP (register, lsr, 32-bit) 3->2 1.993827
CMP (register, lsr, 32-bit) throughput 0.668364
CMP (register, lsr, 64-bit) 3->1 2.003199
CMP (register, lsr, 64-bit) 3->2 2.007922
CMP (register, lsr, 64-bit) throughput 0.667000
CMP (register, asr, 32-bit) 3->1 2.004089
CMP (register, asr, 32-bit) 3->2 2.000363
CMP (register, asr, 32-bit) throughput 0.667257
CMP (register, asr, 64-bit) 3->1 2.000605
CMP (register, asr, 64-bit) 3->2 2.007289
CMP (register, asr, 64-bit) throughput 0.687485
CNEG (32-bit) 1->2 0.998181
CNEG (32-bit) 1->3 1.001661
CNEG (32-bit) throughput 0.335098
CNEG (64-bit) 1->2 1.002592
CNEG (64-bit) 1->3 1.004374
CNEG (64-bit) throughput 0.334435
CSDB throughput 0.254710
CSEL (32-bit) 1->2 0.998504
CSEL (32-bit) 1->3 0.978022
CSEL (32-bit) 1->4 1.005963
CSEL (32-bit) throughput 0.337953
CSEL (64-bit) 1->2 0.998707
CSEL (64-bit) 1->3 1.008017
CSEL (64-bit) 1->4 1.001661
CSEL (64-bit) throughput 0.334278
CSET (32-bit) 1->2 1.005427
CSET (32-bit) throughput 0.334071
CSET (64-bit) 1->2 1.002951
CSET (64-bit) throughput 0.333640
CSETM (32-bit) 1->2 0.999030
CSETM (32-bit) throughput 0.334416
CSETM (64-bit) 1->2 1.002835
CSETM (64-bit) throughput 0.333673
CSINC (32-bit) 1->2 1.004292
CSINC (32-bit) 1->3 1.002390
CSINC (32-bit) 1->4 0.998019
CSINC (32-bit) throughput 0.334250
CSINC (64-bit) 1->2 1.003239
CSINC (64-bit) 1->3 0.998262
CSINC (64-bit) 1->4 1.003279
CSINC (64-bit) throughput 0.334273
CSINV (32-bit) 1->2 0.996369
CSINV (32-bit) 1->3 1.000040
CSINV (32-bit) 1->4 1.002430
CSINV (32-bit) throughput 0.334417
CSINV (64-bit) 1->2 1.001781
CSINV (64-bit) 1->3 1.001660
CSINV (64-bit) 1->4 1.003720
CSINV (64-bit) throughput 0.334479
CSNEG (32-bit) 1->2 0.998302
CSNEG (32-bit) 1->3 1.042200
CSNEG (32-bit) 1->4 0.998787
CSNEG (32-bit) throughput 0.333498
CSNEG (64-bit) 1->2 0.941456
CSNEG (64-bit) 1->3 1.001862
CSNEG (64-bit) 1->4 0.999636
CSNEG (64-bit) throughput 0.333817
DMB (SY) throughput 4.006277
DMB (ST) throughput 3.994141
DMB (LD) throughput 3.738653
DMB (ISH) throughput 4.100051
DMB (ISHST) throughput 4.022104
DMB (ISHLD) throughput 3.998259
DMB (NSH) throughput 3.995758
DMB (NSHST) throughput 4.476901
DMB (NSHLD) throughput 3.992562
DMB (OSH) throughput 4.060819
DMB (OSHST) throughput 4.039963
DMB (OSHLD) throughput 4.036362
DSB (SY) throughput 16.687661
DSB (ST) throughput 16.149862
DSB (LD) throughput 16.039126
DSB (ISH) throughput 16.123987
DSB (ISHST) throughput 16.055434
DSB (ISHLD) throughput 16.166586
DSB (NSH) throughput 16.124712
DSB (NSHST) throughput 16.026861
DSB (NSHLD) throughput 16.112095
DSB (OSH) throughput 16.772379
DSB (OSHST) throughput 16.135574
DSB (OSHLD) throughput 16.123735
EON (register, 32-bit) 1->2 1.001741
EON (register, 32-bit) 1->3 1.000444
EON (register, 32-bit) throughput 0.334200
EON (register, 64-bit) 1->2 0.998343
EON (register, 64-bit) 1->3 1.001579
EON (register, 64-bit) throughput 0.334223
EON (register, lsl, 32-bit) 1->2 2.001617
EON (register, lsl, 32-bit) 1->3 2.003604
EON (register, lsl, 32-bit) throughput 0.670625
EON (register, lsl, 64-bit) 1->2 1.999152
EON (register, lsl, 64-bit) 1->3 1.973580
EON (register, lsl, 64-bit) throughput 0.668622
EON (register, lsr, 32-bit) 1->2 2.018586
EON (register, lsr, 32-bit) 1->3 2.002430
EON (register, lsr, 32-bit) throughput 0.668155
EON (register, lsr, 64-bit) 1->2 1.817771
EON (register, lsr, 64-bit) 1->3 2.005466
EON (register, lsr, 64-bit) throughput 0.704457
EON (register, asr, 32-bit) 1->2 1.996405
EON (register, asr, 32-bit) 1->3 2.002948
EON (register, asr, 32-bit) throughput 0.667892
EON (register, asr, 64-bit) 1->2 2.005872
EON (register, asr, 64-bit) 1->3 2.007882
EON (register, asr, 64-bit) throughput 0.668209
EON (register, ror, 32-bit) 1->2 2.003401
EON (register, ror, 32-bit) 1->3 2.003760
EON (register, ror, 32-bit) throughput 0.667914
EON (register, ror, 64-bit) 1->2 2.729193
EON (register, ror, 64-bit) 1->3 2.024914
EON (register, ror, 64-bit) throughput 0.673839
EOR (immediate, 32-bit) 1->2 1.002672
EOR (immediate, 32-bit) throughput 0.333061
EOR (immediate, 64-bit) 1->2 0.999233
EOR (immediate, 64-bit) throughput 0.336303
EOR (register, 32-bit) 1->2 0.998065
EOR (register, 32-bit) 1->3 1.001620
EOR (register, 32-bit) throughput 0.334175
EOR (register, 64-bit) 1->2 1.001701
EOR (register, 64-bit) 1->3 1.003604
EOR (register, 64-bit) throughput 0.332755
EOR (register, lsl, 32-bit) 1->2 1.720575
EOR (register, lsl, 32-bit) 1->3 2.000945
EOR (register, lsl, 32-bit) throughput 0.668329
EOR (register, lsl, 64-bit) 1->2 2.018316
EOR (register, lsl, 64-bit) 1->3 2.004492
EOR (register, lsl, 64-bit) throughput 0.671717
EOR (register, lsr, 32-bit) 1->2 2.001658
EOR (register, lsr, 32-bit) 1->3 2.006235
EOR (register, lsr, 32-bit) throughput 0.668018
EOR (register, lsr, 64-bit) 1->2 2.004738
EOR (register, lsr, 64-bit) 1->3 2.001376
EOR (register, lsr, 64-bit) throughput 0.667064
EOR (register, asr, 32-bit) 1->2 1.998950
EOR (register, asr, 32-bit) 1->3 1.999193
EOR (register, asr, 32-bit) throughput 0.668187
EOR (register, asr, 64-bit) 1->2 1.998463
EOR (register, asr, 64-bit) 1->3 1.998800
EOR (register, asr, 64-bit) throughput 0.666610
EOR (register, ror, 32-bit) 1->2 2.009069
EOR (register, ror, 32-bit) 1->3 1.998788
EOR (register, ror, 32-bit) throughput 0.668621
EOR (register, ror, 64-bit) 1->2 1.993436
EOR (register, ror, 64-bit) 1->3 2.002350
EOR (register, ror, 64-bit) throughput 0.859259
EXTR (register, 32-bit) 1->2 1.004043
EXTR (register, 32-bit) 1->3 2.004244
EXTR (register, 32-bit) throughput 1.007592
EXTR (register, 64-bit) 1->2 0.998423
EXTR (register, 64-bit) 1->3 2.002425
EXTR (register, 64-bit) throughput 1.073627
ISB (SY) throughput 26.152276
LDNP (32-bit) 1->3 (with chain penalty) 4.011216
LDNP (32-bit) 2->3 (with chain penalty) 4.067778
LDNP (32-bit) throughput 0.500773
LDNP (64-bit) 1->3 (with chain penalty) 4.013928
LDNP (64-bit) 2->3 (with chain penalty) 4.029279
LDNP (64-bit) throughput 0.501316
LDP (32-bit) 1->3 (with chain penalty) 3.962296
LDP (32-bit) 2->3 (with chain penalty) 4.993629
LDP (32-bit) throughput 0.422988
LDP (64-bit) 1->3 (with chain penalty) 4.818747
LDP (64-bit) 2->3 (with chain penalty) 5.571466
LDP (64-bit) throughput 0.606076
LDP (post-index, 32-bit) 1->3 (with chain penalty) 4.834397
LDP (post-index, 32-bit) 2->3 (with chain penalty) 4.158612
LDP (post-index, 32-bit) throughput 0.833657
LDP (post-index, 64-bit) 1->3 (with chain penalty) 27.169619
LDP (post-index, 64-bit) 2->3 (with chain penalty) 20.001005
LDP (post-index, 64-bit) throughput 0.854760
LDP (pre-index, 32-bit) 1->3 (with chain penalty) 7.875741
LDP (pre-index, 32-bit) 2->3 (with chain penalty) 5.425403
LDP (pre-index, 32-bit) throughput 0.801130
LDP (pre-index, 64-bit) 1->3 (with chain penalty) 29.096643
LDP (pre-index, 64-bit) 2->3 (with chain penalty) 20.744503
LDP (pre-index, 64-bit) throughput 0.829911
LDP (signed offset, 32-bit) 1->3 (with chain penalty) 4.479390
LDP (signed offset, 32-bit) 2->3 (with chain penalty) 4.421192
LDP (signed offset, 32-bit) throughput 0.553061
LDP (signed offset, 64-bit) 1->3 (with chain penalty) 4.445606
LDP (signed offset, 64-bit) 2->3 (with chain penalty) 4.520398
LDP (signed offset, 64-bit) throughput 0.614190
LDPSW (post-index) 1->3 (with chain penalty) 4.531448
LDPSW (post-index) 2->3 (with chain penalty) 2.807169
LDPSW (post-index) throughput 0.886689
LDPSW (pre-index) 1->3 (with chain penalty) 1.880322
LDPSW (pre-index) 2->3 (with chain penalty) 4.373434
LDPSW (pre-index) throughput 0.829710
LDPSW (signed offset) 1->3 (with chain penalty) 4.470898
LDPSW (signed offset) 2->3 (with chain penalty) 3.879052
LDPSW (signed offset) throughput 0.503425
LDR (32-bit) 1->2 (with chain penalty) 3.971645
LDR (32-bit) throughput 0.512350
LDR (64-bit) 1->2 (with chain penalty) 7.773384
LDR (64-bit) throughput 0.485995
LDR (post-index, 32-bit) 1->2 (with chain penalty) 4.752876
LDR (post-index, 32-bit) throughput 0.413392
LDR (post-index, 64-bit) 1->2 (with chain penalty) 4.852585
LDR (post-index, 64-bit) throughput 0.652391
LDR (pre-index, 32-bit) 1->2 (with chain penalty) 3.134445
LDR (pre-index, 32-bit) throughput 0.613696
LDR (pre-index, 64-bit) 1->2 (with chain penalty) 3.927234
LDR (pre-index, 64-bit) throughput 0.501048
LDR (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.297109
LDR (unsigned offset, 32-bit) throughput 0.513772
LDR (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.029887
LDR (unsigned offset, 64-bit) throughput 0.503639
LDR (literal, 32-bit) throughput 0.501545
LDR (literal, 64-bit) throughput 0.541524
LDR (register, 32-bit) 1->2 (with chain penalty) 2.986009
LDR (register, 32-bit) 1->3 (with chain penalty) 1.779754
LDR (register, 32-bit) throughput 0.802882
LDR (register, 64-bit) 1->2 (with chain penalty) 3.205355
LDR (register, 64-bit) 1->3 (with chain penalty) 4.677191
LDR (register, 64-bit) throughput 0.532140
LDR (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.019748
LDR (register, uxtw, 32-bit) 1->3 (with chain penalty) 5.475215
LDR (register, uxtw, 32-bit) throughput 0.472974
LDR (register, uxtw, 64-bit) 1->2 (with chain penalty) 3.816836
LDR (register, uxtw, 64-bit) 1->3 (with chain penalty) 2.874984
LDR (register, uxtw, 64-bit) throughput 0.547399
LDR (register, sxtw, 32-bit) 1->2 (with chain penalty) 2.974502
LDR (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.922296
LDR (register, sxtw, 32-bit) throughput 0.610794
LDR (register, sxtw, 64-bit) 1->2 (with chain penalty) 11.515596
LDR (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.959931
LDR (register, sxtw, 64-bit) throughput 0.518055
LDR (register, lsl, 32-bit) 1->2 (with chain penalty) 4.280039
LDR (register, lsl, 32-bit) 1->3 (with chain penalty) 5.023660
LDR (register, lsl, 32-bit) throughput 0.510059
LDR (register, lsl, 64-bit) 1->2 (with chain penalty) 4.748237
LDR (register, lsl, 64-bit) 1->3 (with chain penalty) 5.069458
LDR (register, lsl, 64-bit) throughput 0.486183
LDRB 1->2 (with chain penalty) 4.388431
LDRB throughput 0.502891
LDRB (post-index) 1->2 (with chain penalty) 5.046648
LDRB (post-index) throughput 0.536790
LDRB (pre-index) 1->2 (with chain penalty) 5.823114
LDRB (pre-index) throughput 0.557724
LDRB (unsigned offset) 1->2 (with chain penalty) 5.454965
LDRB (unsigned offset) throughput 0.535845
LDRB (register) 1->2 (with chain penalty) 8.835895
LDRB (register) 1->3 (with chain penalty) 5.253677
LDRB (register) throughput 0.501086
LDRB (register, uxtw) 1->2 (with chain penalty) 4.131632
LDRB (register, uxtw) 1->3 (with chain penalty) 3.917433
LDRB (register, uxtw) throughput 0.476065
LDRB (register, sxtw) 1->2 (with chain penalty) 4.304624
LDRB (register, sxtw) 1->3 (with chain penalty) 5.237230
LDRB (register, sxtw) throughput 0.504031
LDRH 1->2 (with chain penalty) 4.374333
LDRH throughput 0.604542
LDRH (post-index) 1->2 (with chain penalty) 6.336118
LDRH (post-index) throughput 0.384406
LDRH (pre-index) 1->2 (with chain penalty) 5.237825
LDRH (pre-index) throughput 0.546486
LDRH (unsigned offset) 1->2 (with chain penalty) 5.005581
LDRH (unsigned offset) throughput 0.560254
LDRH (register) 1->2 (with chain penalty) 5.544064
LDRH (register) 1->3 (with chain penalty) 5.557906
LDRH (register) throughput 0.505116
LDRH (register, uxtw) 1->2 (with chain penalty) 4.830699
LDRH (register, uxtw) 1->3 (with chain penalty) 4.132452
LDRH (register, uxtw) throughput 0.500236
LDRH (register, sxtw) 1->2 (with chain penalty) 5.597845
LDRH (register, sxtw) 1->3 (with chain penalty) 4.202584
LDRH (register, sxtw) throughput 0.505879
LDRH (register, lsl) 1->2 (with chain penalty) 4.340692
LDRH (register, lsl) 1->3 (with chain penalty) 4.054429
LDRH (register, lsl) throughput 0.500633
LDRSB (32-bit) 1->2 (with chain penalty) 4.121659
LDRSB (32-bit) throughput 0.523786
LDRSB (64-bit) 1->2 (with chain penalty) 4.143839
LDRSB (64-bit) throughput 0.528274
LDRSB (post-index, 32-bit) 1->2 (with chain penalty) 7.419567
LDRSB (post-index, 32-bit) throughput 0.563418
LDRSB (post-index, 64-bit) 1->2 (with chain penalty) 5.373216
LDRSB (post-index, 64-bit) throughput 0.595227
LDRSB (pre-index, 32-bit) 1->2 (with chain penalty) 4.227090
LDRSB (pre-index, 32-bit) throughput 0.606379
LDRSB (pre-index, 64-bit) 1->2 (with chain penalty) 2.787223
LDRSB (pre-index, 64-bit) throughput 0.641231
LDRSB (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.518528
LDRSB (unsigned offset, 32-bit) throughput 0.583465
LDRSB (unsigned offset, 64-bit) 1->2 (with chain penalty) 5.918985
LDRSB (unsigned offset, 64-bit) throughput 0.527817
LDRSB (register, 32-bit) 1->2 (with chain penalty) 4.073166
LDRSB (register, 32-bit) 1->3 (with chain penalty) 6.104889
LDRSB (register, 32-bit) throughput 0.600752
LDRSB (register, 64-bit) 1->2 (with chain penalty) 4.183647
LDRSB (register, 64-bit) 1->3 (with chain penalty) 4.353711
LDRSB (register, 64-bit) throughput 0.529125
LDRSB (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.133659
LDRSB (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.051926
LDRSB (register, uxtw, 32-bit) throughput 0.511480
LDRSB (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.035595
LDRSB (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.073428
LDRSB (register, uxtw, 64-bit) throughput 0.503281
LDRSB (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.043933
LDRSB (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.355496
LDRSB (register, sxtw, 32-bit) throughput 0.513134
LDRSB (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.343835
LDRSB (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.168091
LDRSB (register, sxtw, 64-bit) throughput 0.520422
LDRSH (32-bit) 1->2 (with chain penalty) 4.394305
LDRSH (32-bit) throughput 0.512486
LDRSH (64-bit) 1->2 (with chain penalty) 4.028591
LDRSH (64-bit) throughput 0.500825
LDRSH (post-index, 32-bit) 1->2 (with chain penalty) 4.001970
LDRSH (post-index, 32-bit) throughput 0.560562
LDRSH (post-index, 64-bit) 1->2 (with chain penalty) 4.232196
LDRSH (post-index, 64-bit) throughput 0.642203
LDRSH (pre-index, 32-bit) 1->2 (with chain penalty) 3.929703
LDRSH (pre-index, 32-bit) throughput 0.387418
LDRSH (pre-index, 64-bit) 1->2 (with chain penalty) 4.364699
LDRSH (pre-index, 64-bit) throughput 0.543606
LDRSH (unsigned offset, 32-bit) 1->2 (with chain penalty) 5.008532
LDRSH (unsigned offset, 32-bit) throughput 0.627982
LDRSH (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.635835
LDRSH (unsigned offset, 64-bit) throughput 0.501159
LDRSH (register, 32-bit) 1->2 (with chain penalty) 2.920338
LDRSH (register, 32-bit) 1->3 (with chain penalty) 4.382840
LDRSH (register, 32-bit) throughput 0.485965
LDRSH (register, 64-bit) 1->2 (with chain penalty) 4.343374
LDRSH (register, 64-bit) 1->3 (with chain penalty) 4.378748
LDRSH (register, 64-bit) throughput 0.556687
LDRSH (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.567619
LDRSH (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.846350
LDRSH (register, uxtw, 32-bit) throughput 0.499067
LDRSH (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.752022
LDRSH (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.173441
LDRSH (register, uxtw, 64-bit) throughput 0.381087
LDRSH (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.445727
LDRSH (register, sxtw, 32-bit) 1->3 (with chain penalty) 4.475037
LDRSH (register, sxtw, 32-bit) throughput 0.534104
LDRSH (register, sxtw, 64-bit) 1->2 (with chain penalty) 5.571131
LDRSH (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.402152
LDRSH (register, sxtw, 64-bit) throughput 0.515916
LDRSH (register, lsl, 32-bit) 1->2 (with chain penalty) 4.381762
LDRSH (register, lsl, 32-bit) 1->3 (with chain penalty) 4.556984
LDRSH (register, lsl, 32-bit) throughput 0.539430
LDRSH (register, lsl, 64-bit) 1->2 (with chain penalty) 4.290519
LDRSH (register, lsl, 64-bit) 1->3 (with chain penalty) 4.291225
LDRSH (register, lsl, 64-bit) throughput 0.572051
LDRSW 1->2 (with chain penalty) 3.624541
LDRSW throughput 0.664266
LDRSW (post-index) 1->2 (with chain penalty) 4.458256
LDRSW (post-index) throughput 0.572662
LDRSW (pre-index) 1->2 (with chain penalty) 4.321967
LDRSW (pre-index) throughput 0.500068
LDRSW (unsigned offset) 1->2 (with chain penalty) 4.686736
LDRSW (unsigned offset) throughput 0.312305
LDRSW (literal) throughput 0.533538
LDRSW (register) 1->2 (with chain penalty) 4.086919
LDRSW (register) 1->3 (with chain penalty) 4.538716
LDRSW (register) throughput 0.576869
LDRSW (register, uxtw) 1->2 (with chain penalty) 4.522640
LDRSW (register, uxtw) 1->3 (with chain penalty) 4.548308
LDRSW (register, uxtw) throughput 0.575413
LDRSW (register, sxtw) 1->2 (with chain penalty) 4.219498
LDRSW (register, sxtw) 1->3 (with chain penalty) 4.346348
LDRSW (register, sxtw) throughput 0.541591
LDRSW (register, lsl) 1->2 (with chain penalty) 4.630674
LDRSW (register, lsl) 1->3 (with chain penalty) 4.193037
LDRSW (register, lsl) throughput 0.425122
LDUR (32-bit) 1->2 (with chain penalty) 4.321447
LDUR (32-bit) throughput 0.488910
LDUR (64-bit) 1->2 (with chain penalty) 4.434077
LDUR (64-bit) throughput 0.655802
LDURB 1->2 (with chain penalty) 4.712086
LDURB throughput 0.520942
LDURH 1->2 (with chain penalty) 4.392170
LDURH throughput 0.481354
LDURSB (32-bit) 1->2 (with chain penalty) 4.473807
LDURSB (32-bit) throughput 0.571954
LDURSB (64-bit) 1->2 (with chain penalty) 4.490645
LDURSB (64-bit) throughput 0.546561
LDURSH (32-bit) 1->2 (with chain penalty) 5.691255
LDURSH (32-bit) throughput 0.513398
LDURSH (64-bit) 1->2 (with chain penalty) 4.418924
LDURSH (64-bit) throughput 0.591257
LDURSW 1->2 (with chain penalty) 4.959228
LDURSW throughput 0.503837
LSL (immediate, 32-bit) 1->2 1.110558
LSL (immediate, 32-bit) throughput 0.344373
LSL (immediate, 64-bit) 1->2 1.001450
LSL (immediate, 64-bit) throughput 0.339696
LSL (register, 32-bit) 1->2 0.943788
LSL (register, 32-bit) 1->3 1.011176
LSL (register, 32-bit) throughput 0.358331
LSL (register, 64-bit) 1->2 1.002224
LSL (register, 64-bit) 1->3 0.995526
LSL (register, 64-bit) throughput 0.340318
LSR (immediate, 32-bit) 1->2 0.999998
LSR (immediate, 32-bit) throughput 0.334506
LSR (immediate, 64-bit) 1->2 0.974744
LSR (immediate, 64-bit) throughput 0.334901
LSR (register, 32-bit) 1->2 0.759411
LSR (register, 32-bit) 1->3 1.017538
LSR (register, 32-bit) throughput 0.324230
LSR (register, 64-bit) 1->2 1.006537
LSR (register, 64-bit) 1->3 1.002235
LSR (register, 64-bit) throughput 0.420014
MADD (32-bit) 1->2 3.138926
MADD (32-bit) 1->3 2.553080
MADD (32-bit) 1->4 1.196828
MADD (32-bit) throughput 1.056894
MADD (64-bit) 1->2 3.257690
MADD (64-bit) 1->3 2.933732
MADD (64-bit) 1->4 0.961741
MADD (64-bit) throughput 0.906788
MNEG (32-bit) 1->2 3.001942
MNEG (32-bit) 1->3 3.316489
MNEG (32-bit) throughput 1.105070
MNEG (64-bit) 1->2 3.482142
MNEG (64-bit) 1->3 2.991488
MNEG (64-bit) throughput 1.139201
MOV (bitmask immediate, 32-bit) throughput 0.304318
MOV (bitmask immediate, 64-bit) throughput 0.251238
MOV (from sp, 32-bit) throughput 0.343093
MOV (from sp, 64-bit) throughput 0.270773
MOVK (32-bit) 1->1 1.278714
MOVK (32-bit) throughput 0.259996
MOVK (64-bit) 1->1 0.999928
MOVK (64-bit) throughput 0.322862
MOVN (32-bit) throughput 0.212576
MOVN (64-bit) throughput 0.251179
MOVZ (32-bit) throughput 0.251117
MOVZ (64-bit) throughput 0.250952
MRS (CNTFRQ_EL0) throughput 19.621549
MRS (CNTPCT_EL0) throughput 1.285935
MRS (CNTVCT_EL0) throughput 1.452263
MRS (DCZID_EL0) throughput 1.324984
MRS (FPCR) throughput 1.305691
MRS (FPSR) throughput 7.396338
MRS (NZCV) throughput 0.331167
MRS (TPIDRRO_EL0) throughput 1.242838
MRS (TPIDR_EL0) throughput 1.507344
MSR (FPCR) throughput 10.534700
MSR (FPSR) throughput 11.260972
MSR (TPIDR_EL0) throughput 10.333137
MSR (NZCV) throughput 0.378226
MSUB (32-bit) 1->2 2.987768
MSUB (32-bit) 1->3 3.016360
MSUB (32-bit) 1->4 1.010691
MSUB (32-bit) throughput 1.073758
MSUB (64-bit) 1->2 3.016238
MSUB (64-bit) 1->3 3.545916
MSUB (64-bit) 1->4 0.993373
MSUB (64-bit) throughput 0.990174
MUL (32-bit) 1->2 3.835368
MUL (32-bit) 1->3 1.896296
MUL (32-bit) throughput 1.084675
MUL (64-bit) 1->2 2.656042
MUL (64-bit) 1->3 2.900016
MUL (64-bit) throughput 1.142993
MVN (register, 32-bit) 1->2 1.007733
MVN (register, 32-bit) throughput 0.340498
MVN (register, 64-bit) 1->2 0.980389
MVN (register, 64-bit) throughput 0.335059
MVN (register, lsl, 32-bit) 1->2 2.441820
MVN (register, lsl, 32-bit) throughput 0.579095
MVN (register, lsl, 64-bit) 1->2 1.982837
MVN (register, lsl, 64-bit) throughput 0.702864
MVN (register, lsr, 32-bit) 1->2 1.966313
MVN (register, lsr, 32-bit) throughput 0.698766
MVN (register, lsr, 64-bit) 1->2 1.996533
MVN (register, lsr, 64-bit) throughput 0.737580
MVN (register, asr, 32-bit) 1->2 1.998020
MVN (register, asr, 32-bit) throughput 0.539739
MVN (register, asr, 64-bit) 1->2 1.968144
MVN (register, asr, 64-bit) throughput 0.721580
MVN (register, ror, 32-bit) 1->2 2.002996
MVN (register, ror, 32-bit) throughput 0.707091
MVN (register, ror, 64-bit) 1->2 2.005102
MVN (register, ror, 64-bit) throughput 0.522015
NEG (register, 32-bit) 1->2 0.998181
NEG (register, 32-bit) throughput 0.332776
NEG (register, 64-bit) 1->2 1.001579
NEG (register, 64-bit) throughput 0.371654
NEG (register, lsl, 32-bit) 1->2 2.004817
NEG (register, lsl, 32-bit) throughput 0.352660
NEG (register, lsl, 64-bit) 1->2 1.999029
NEG (register, lsl, 64-bit) throughput 0.642792
NEG (register, lsr, 32-bit) 1->2 2.017135
NEG (register, lsr, 32-bit) throughput 0.751452
NEG (register, lsr, 64-bit) 1->2 2.001334
NEG (register, lsr, 64-bit) throughput 0.774564
NEG (register, asr, 32-bit) 1->2 1.827614
NEG (register, asr, 32-bit) throughput 0.772062
NEG (register, asr, 64-bit) 1->2 2.038299
NEG (register, asr, 64-bit) throughput 0.654993
NEGS (register, 32-bit) 1->2 1.003690
NEGS (register, 32-bit) 3->2 1.014235
NEGS (register, 32-bit) throughput 0.691212
NEGS (register, 64-bit) 1->2 1.029062
NEGS (register, 64-bit) 3->2 1.264785
NEGS (register, 64-bit) throughput 0.488020
NEGS (register, lsl, 32-bit) 1->2 2.118354
NEGS (register, lsl, 32-bit) 3->2 2.121212
NEGS (register, lsl, 32-bit) throughput 1.037693
NEGS (register, lsl, 64-bit) 1->2 2.052487
NEGS (register, lsl, 64-bit) 3->2 2.403352
NEGS (register, lsl, 64-bit) throughput 0.711306
NEGS (register, lsr, 32-bit) 1->2 1.994714
NEGS (register, lsr, 32-bit) 3->2 2.554842
NEGS (register, lsr, 32-bit) throughput 0.508404
NEGS (register, lsr, 64-bit) 1->2 1.994434
NEGS (register, lsr, 64-bit) 3->2 2.005952
NEGS (register, lsr, 64-bit) throughput 0.816902
NEGS (register, asr, 32-bit) 1->2 1.987701
NEGS (register, asr, 32-bit) 3->2 2.346507
NEGS (register, asr, 32-bit) throughput 0.742936
NEGS (register, asr, 64-bit) 1->2 2.024417
NEGS (register, asr, 64-bit) 3->2 2.615057
NEGS (register, asr, 64-bit) throughput 0.482044
NGC (register, 32-bit) 1->2 1.007653
NGC (register, 32-bit) 1->3 1.009880
NGC (register, 32-bit) throughput 0.337560
NGC (register, 64-bit) 1->2 0.993044
NGC (register, 64-bit) 1->3 1.383560
NGC (register, 64-bit) throughput 0.344979
NGCS (register, 32-bit) 1->2 1.005012
NGCS (register, 32-bit) 1->3 1.034039
NGCS (register, 32-bit) 3->2 1.016602
NGCS (register, 32-bit) 3->3 0.997427
NGCS (register, 32-bit) throughput 0.823515
NGCS (register, 32-bit) throughput 0.622982
NGCS (register, 64-bit) 1->2 0.995766
NGCS (register, 64-bit) 1->3 0.994647
NGCS (register, 64-bit) 3->2 1.520986
NGCS (register, 64-bit) 3->3 0.994531
NGCS (register, 64-bit) throughput 0.873412
NGCS (register, 64-bit) throughput 0.640679
NOP throughput 0.240462
ORN (register, 32-bit) 1->2 1.002183
ORN (register, 32-bit) 1->3 1.008706
ORN (register, 32-bit) throughput 0.309213
ORN (register, 64-bit) 1->2 1.001701
ORN (register, 64-bit) 1->3 1.335583
ORN (register, 64-bit) throughput 0.330946
ORN (register, lsl, 32-bit) 1->2 1.922455
ORN (register, lsl, 32-bit) 1->3 1.700878
ORN (register, lsl, 32-bit) throughput 0.717911
ORN (register, lsl, 64-bit) 1->2 1.996530
ORN (register, lsl, 64-bit) 1->3 2.011210
ORN (register, lsl, 64-bit) throughput 0.710283
ORN (register, lsr, 32-bit) 1->2 2.003401
ORN (register, lsr, 32-bit) 1->3 2.222689
ORN (register, lsr, 32-bit) throughput 0.601278
ORN (register, lsr, 64-bit) 1->2 1.979435
ORN (register, lsr, 64-bit) 1->3 1.990490
ORN (register, lsr, 64-bit) throughput 0.787249
ORN (register, asr, 32-bit) 1->2 2.362830
ORN (register, asr, 32-bit) 1->3 2.108351
ORN (register, asr, 32-bit) throughput 0.709955
ORN (register, asr, 64-bit) 1->2 2.178064
ORN (register, asr, 64-bit) 1->3 2.159680
ORN (register, asr, 64-bit) throughput 0.737944
ORN (register, ror, 32-bit) 1->2 2.004898
ORN (register, ror, 32-bit) 1->3 1.994786
ORN (register, ror, 32-bit) throughput 0.669571
ORN (register, ror, 64-bit) 1->2 1.818422
ORN (register, ror, 64-bit) 1->3 2.219273
ORN (register, ror, 64-bit) throughput 0.623367
ORR (immediate, 32-bit) 1->2 0.997734
ORR (immediate, 32-bit) throughput 0.339660
ORR (immediate, 64-bit) 1->2 0.998866
ORR (immediate, 64-bit) throughput 0.332465
ORR (register, 32-bit) 1->2 0.998302
ORR (register, 32-bit) 1->3 0.996370
ORR (register, 32-bit) throughput 0.339626
ORR (register, 64-bit) 1->2 1.001660
ORR (register, 64-bit) 1->3 0.995164
ORR (register, 64-bit) throughput 0.424834
ORR (register, lsl, 32-bit) 1->2 2.002062
ORR (register, lsl, 32-bit) 1->3 1.994905
ORR (register, lsl, 32-bit) throughput 0.703444
ORR (register, lsl, 64-bit) 1->2 1.912905
ORR (register, lsl, 64-bit) 1->3 2.003516
ORR (register, lsl, 64-bit) throughput 0.672556
ORR (register, lsr, 32-bit) 1->2 2.002593
ORR (register, lsr, 32-bit) 1->3 2.003603
ORR (register, lsr, 32-bit) throughput 0.668405
ORR (register, lsr, 64-bit) 1->2 2.001620
ORR (register, lsr, 64-bit) 1->3 1.998141
ORR (register, lsr, 64-bit) throughput 0.667745
ORR (register, asr, 32-bit) 1->2 2.031339
ORR (register, asr, 32-bit) 1->3 1.998748
ORR (register, asr, 32-bit) throughput 0.669550
ORR (register, asr, 64-bit) 1->2 2.001553
ORR (register, asr, 64-bit) 1->3 1.923998
ORR (register, asr, 64-bit) throughput 0.898453
ORR (register, ror, 32-bit) 1->2 1.359639
ORR (register, ror, 32-bit) 1->3 2.015131
ORR (register, ror, 32-bit) throughput 0.856870
ORR (register, ror, 64-bit) 1->2 2.339577
ORR (register, ror, 64-bit) 1->3 2.057684
ORR (register, ror, 64-bit) throughput 0.737398
PRFM (register, PLDL1KEEP) throughput 2.366658
PRFM (register, PLDL1KEEP) throughput 0.448491
PRFM (register, PLDL1STRM) throughput 3.432769
PRFM (register, PLDL1STRM) throughput 0.500840
PRFM (register, PLDL2KEEP) throughput 2.183492
PRFM (register, PLDL2KEEP) throughput 2.044444
PRFM (register, PLDL2STRM) throughput 2.474707
PRFM (register, PLDL2STRM) throughput 2.337495
PRFM (register, PLDL3KEEP) throughput 1.903101
PRFM (register, PLDL3KEEP) throughput 2.383121
PRFM (register, PLDL3STRM) throughput 1.649719
PRFM (register, PLDL3STRM) throughput 2.293085
PRFM (register, PLIL1KEEP) throughput 2.304429
PRFM (register, PLIL1KEEP) throughput 1.972693
PRFM (register, PLIL1STRM) throughput 2.067304
PRFM (register, PLIL1STRM) throughput 2.090284
PRFM (register, PLIL2KEEP) throughput 2.319946
PRFM (register, PLIL2KEEP) throughput 2.078469
PRFM (register, PLIL2STRM) throughput 2.096154
PRFM (register, PLIL2STRM) throughput 2.053392
PRFM (register, PLIL3KEEP) throughput 1.966426
PRFM (register, PLIL3KEEP) throughput 2.210865
PRFM (register, PLIL3STRM) throughput 1.958898
PRFM (register, PLIL3STRM) throughput 2.223792
PRFM (register, PSTL1KEEP) throughput 2.796336
PRFM (register, PSTL1KEEP) throughput 0.497483
PRFM (register, PSTL1STRM) throughput 3.631398
PRFM (register, PSTL1STRM) throughput 0.498949
PRFM (register, PSTL2KEEP) throughput 2.415593
PRFM (register, PSTL2KEEP) throughput 2.245132
PRFM (register, PSTL2STRM) throughput 2.311644
PRFM (register, PSTL2STRM) throughput 2.134797
PRFM (register, PSTL3KEEP) throughput 1.930098
PRFM (register, PSTL3KEEP) throughput 1.654864
PRFM (register, PSTL3STRM) throughput 2.009697
PRFM (register, PSTL3STRM) throughput 1.521765
PSSBB throughput 29.430606
RBIT (32-bit) 1->2 1.007912
RBIT (32-bit) throughput 0.334038
RBIT (64-bit) 1->2 0.988039
RBIT (64-bit) throughput 0.351251
REV (32-bit) 1->2 0.993001
REV (32-bit) throughput 0.343402
REV (64-bit) 1->2 0.993762
REV (64-bit) throughput 0.420122
REV16 (32-bit) 1->2 1.249206
REV16 (32-bit) throughput 0.350687
REV16 (64-bit) 1->2 1.000258
REV16 (64-bit) throughput 0.339909
REV32 1->2 1.008991
REV32 throughput 0.366831
ROR (immediate, 32-bit) 1->2 0.784685
ROR (immediate, 32-bit) throughput 0.345924
ROR (immediate, 64-bit) 1->2 0.931590
ROR (immediate, 64-bit) throughput 0.338662
ROR (register, 32-bit) 1->2 0.928486
ROR (register, 32-bit) 1->3 0.973620
ROR (register, 32-bit) throughput 0.334887
ROR (register, 64-bit) 1->2 1.165951
ROR (register, 64-bit) 1->3 0.996057
ROR (register, 64-bit) throughput 0.333370
SBC (32-bit) 1->2 0.933983
SBC (32-bit) 1->3 0.933785
SBC (32-bit) 1->4 0.938271
SBC (32-bit) throughput 0.289599
SBC (64-bit) 1->2 0.999797
SBC (64-bit) 1->3 1.001092
SBC (64-bit) 1->4 1.003517
SBC (64-bit) throughput 0.375187
SBCS (32-bit) 1->2 0.995463
SBCS (32-bit) 1->3 1.049725
SBCS (32-bit) 1->4 1.224936
SBCS (32-bit) 4->2 1.005587
SBCS (32-bit) 4->3 0.988058
SBCS (32-bit) 4->4 0.998342
SBCS (32-bit) throughput 1.084498
SBCS (32-bit) throughput 0.687237
SBCS (64-bit) 1->2 0.936380
SBCS (64-bit) 1->3 0.987722
SBCS (64-bit) 1->4 1.242600
SBCS (64-bit) 4->2 1.007963
SBCS (64-bit) 4->3 1.006514
SBCS (64-bit) 4->4 1.014612
SBCS (64-bit) throughput 0.816001
SBCS (64-bit) throughput 0.607842
SBFIZ (32-bit) 1->2 1.000362
SBFIZ (32-bit) throughput 0.344130
SBFIZ (64-bit) 1->2 1.046099
SBFIZ (64-bit) throughput 0.334362
SBFX (32-bit) 1->2 1.005992
SBFX (32-bit) throughput 0.368015
SBFX (64-bit) 1->2 1.139500
SBFX (64-bit) throughput 0.330851
SDIV (fast, 32-bit) 1->2 9.518495
SDIV (fast, 32-bit) 1->3 7.695666
SDIV (fast, 32-bit) throughput 8.582758
SDIV (slow, 32-bit) 1->2 9.345429
SDIV (slow, 32-bit) 1->3 13.881416
SDIV (slow, 32-bit) throughput 15.943366
SDIV (slow, 32-bit) 1->2 18.561055
SDIV (slow, 32-bit) 1->3 10.478884
SDIV (slow, 32-bit) throughput 16.808209
SDIV (fast, 64-bit) 1->2 5.942182
SDIV (fast, 64-bit) 1->3 7.637275
SDIV (fast, 64-bit) throughput 10.470722
SDIV (medium, 64-bit) 1->2 14.345385
SDIV (medium, 64-bit) 1->3 15.115875
SDIV (medium, 64-bit) throughput 16.932808
SDIV (medium, 64-bit) 1->2 12.990643
SDIV (medium, 64-bit) 1->3 15.933564
SDIV (medium, 64-bit) throughput 19.733054
SDIV (slow, 64-bit) 1->2 21.932172
SDIV (slow, 64-bit) 1->3 23.227049
SDIV (slow, 64-bit) throughput 33.841319
SDIV (slow, 64-bit) 1->2 23.680575
SDIV (slow, 64-bit) 1->3 33.495257
SDIV (slow, 64-bit) throughput 28.844340
SMADDL 1->2 2.860401
SMADDL 1->3 3.059803
SMADDL 1->4 0.997017
SMADDL throughput 1.206069
SMNEGL 1->2 2.949267
SMNEGL 1->3 2.522393
SMNEGL throughput 0.803665
SMSUBL 1->2 3.174978
SMSUBL 1->3 2.726028
SMSUBL 1->4 0.996155
SMSUBL throughput 0.855693
SMULH 1->2 3.036645
SMULH 1->3 3.071626
SMULH throughput 1.114094
SMULL 1->2 2.923614
SMULL 1->3 2.500358
SMULL throughput 0.769158
SSBB throughput 23.477772
STLRB throughput 4.702660
STLRH throughput 6.038870
STLXP (32-bit) throughput 3.168703
STLXP (32-bit) throughput 2.996401
STLXP (64-bit) throughput 3.293522
STLXP (64-bit) throughput 3.034821
STLXR (32-bit) throughput 3.155222
STLXR (32-bit) throughput 3.139267
STLXR (64-bit) throughput 3.247223
STLXR (64-bit) throughput 3.252623
STLXRB throughput 3.039395
STLXRB throughput 3.004130
STLXRH throughput 3.036415
STLXRH throughput 3.013581
STNP (32-bit) throughput 1.006236
STNP (32-bit) throughput 1.040422
STNP (64-bit) throughput 1.017249
STNP (64-bit) throughput 1.033935
STP (32-bit) throughput 1.000935
STP (64-bit) throughput 0.999757
STP (post-index, 32-bit) 3->3 0.910829
STP (post-index, 32-bit) throughput 1.012611
STP (post-index, 64-bit) 3->3 1.065837
STP (post-index, 64-bit) throughput 1.026523
STP (pre-index, 32-bit) 3->3 1.023981
STP (pre-index, 32-bit) throughput 1.288093
STP (pre-index, 64-bit) 3->3 1.010690
STP (pre-index, 64-bit) throughput 1.804051
STP (signed offset, 32-bit) throughput 1.776866
STP (signed offset, 64-bit) throughput 1.631674
STR (32-bit) throughput 1.037049
STR (64-bit) throughput 1.066534
STR (post-index, 32-bit) 2->2 0.992272
STR (post-index, 32-bit) throughput 1.122895
STR (post-index, 64-bit) 2->2 0.959052
STR (post-index, 64-bit) throughput 1.291874
STR (pre-index, 32-bit) 2->2 0.980873
STR (pre-index, 32-bit) throughput 1.192080
STR (pre-index, 64-bit) 2->2 1.011426
STR (pre-index, 64-bit) throughput 1.158453
STR (unsigned offset, 32-bit) throughput 1.061188
STR (unsigned offset, 64-bit) throughput 1.324361
STR (register, 32-bit) throughput 1.046200
STR (register, 64-bit) throughput 1.072395
STR (register, uxtw, 32-bit) throughput 1.089956
STR (register, uxtw, 64-bit) throughput 1.018048
STR (register, sxtw, 32-bit) throughput 0.887770
STR (register, sxtw, 64-bit) throughput 1.062124
STR (register, lsl, 32-bit) throughput 1.107123
STR (register, lsl, 64-bit) throughput 1.038845
STRB throughput 0.869107
STRB (post-index) 2->2 1.001037
STRB (post-index) throughput 1.080036
STRB (pre-index) 2->2 0.838151
STRB (pre-index) throughput 1.041129
STRB (unsigned offset) throughput 1.067273
STRB (register) throughput 1.056076
STRB (register, uxtw) throughput 1.171605
STRB (register, sxtw) throughput 1.065470
STRH throughput 1.085214
STRH (post-index) 2->2 1.001321
STRH (post-index) throughput 1.001626
STRH (pre-index) 2->2 0.982245
STRH (pre-index) throughput 1.087158
STRH (unsigned offset) throughput 1.071208
STRH (register) throughput 1.097764
STRH (register, uxtw) throughput 0.917595
STRH (register, sxtw) throughput 1.044746
STUR (32-bit) throughput 1.130743
STUR (64-bit) throughput 1.049623
STURB throughput 0.964056
STURH throughput 0.914970
STXP (64-bit) throughput 3.281451
STXP (64-bit) throughput 2.546651
STXR (32-bit) throughput 2.267968
STXR (32-bit) throughput 3.296121
STXR (64-bit) throughput 2.382106
STXR (64-bit) throughput 3.009963
STXRB throughput 1.939569
STXRB throughput 2.892976
STXRH throughput 2.085682
STXRH throughput 3.540716
SUB (sxtb, 32-bit) 1->2 2.004211
SUB (sxtb, 32-bit) 1->3 2.043246
SUB (sxtb, 32-bit) throughput 0.736602
SUB (sxtb, 64-bit) 1->2 2.005738
SUB (sxtb, 64-bit) 1->3 2.002146
SUB (sxtb, 64-bit) throughput 0.576874
SUB (uxtb, 32-bit) 1->2 2.014217
SUB (uxtb, 32-bit) 1->3 2.044544
SUB (uxtb, 32-bit) throughput 0.696961
SUB (uxtb, 64-bit) 1->2 1.974298
SUB (uxtb, 64-bit) 1->3 2.010651
SUB (uxtb, 64-bit) throughput 0.684330
SUB (sxth, 32-bit) 1->2 2.000087
SUB (sxth, 32-bit) 1->3 2.121735
SUB (sxth, 32-bit) throughput 0.695633
SUB (sxth, 64-bit) 1->2 1.997632
SUB (sxth, 64-bit) 1->3 1.975452
SUB (sxth, 64-bit) throughput 0.710938
SUB (uxth, 32-bit) 1->2 2.051740
SUB (uxth, 32-bit) 1->3 1.573538
SUB (uxth, 32-bit) throughput 0.516153
SUB (uxth, 64-bit) 1->2 1.974093
SUB (uxth, 64-bit) 1->3 2.027425
SUB (uxth, 64-bit) throughput 0.802003
SUB (sxtw, 32-bit) 1->2 0.973586
SUB (sxtw, 32-bit) 1->3 0.987429
SUB (sxtw, 32-bit) throughput 0.300871
SUB (sxtw, 64-bit) 1->2 2.096520
SUB (sxtw, 64-bit) 1->3 2.015650
SUB (sxtw, 64-bit) throughput 0.655648
SUB (uxtw, 32-bit) 1->2 1.001863
SUB (uxtw, 32-bit) 1->3 0.998423
SUB (uxtw, 32-bit) throughput 0.334941
SUB (uxtw, 64-bit) 1->2 1.993801
SUB (uxtw, 64-bit) 1->3 2.001698
SUB (uxtw, 64-bit) throughput 0.750693
SUB (uxtx, 64-bit) 1->2 0.995164
SUB (uxtx, 64-bit) 1->3 1.003968
SUB (uxtx, 64-bit) throughput 0.334201
SUB (sxtx, 64-bit) 1->2 0.991052
SUB (sxtx, 64-bit) 1->3 1.003441
SUB (sxtx, 64-bit) throughput 0.334432
SUB (immediate, 32-bit) 1->2 1.002390
SUB (immediate, 32-bit) throughput 0.392191
SUB (immediate, 64-bit) 1->2 0.996047
SUB (immediate, 64-bit) throughput 0.338321
SUB (shifted immediate, 32-bit) 1->2 1.003848
SUB (shifted immediate, 32-bit) throughput 0.342450
SUB (shifted immediate, 64-bit) 1->2 1.003274
SUB (shifted immediate, 64-bit) throughput 0.334743
SUB (register, 32-bit) 1->2 1.018484
SUB (register, 32-bit) 1->3 0.993164
SUB (register, 32-bit) throughput 0.348630
SUB (register, 64-bit) 1->2 1.008058
SUB (register, 64-bit) 1->3 1.023849
SUB (register, 64-bit) throughput 0.332068
SUB (register, lsl, 32-bit) 1->2 2.234117
SUB (register, lsl, 32-bit) 1->3 1.994112
SUB (register, lsl, 32-bit) throughput 0.733360
SUB (register, lsl, 64-bit) 1->2 1.973130
SUB (register, lsl, 64-bit) 1->3 2.020804
SUB (register, lsl, 64-bit) throughput 0.601726
SUB (register, lsr, 32-bit) 1->2 2.336680
SUB (register, lsr, 32-bit) 1->3 2.189090
SUB (register, lsr, 32-bit) throughput 0.670024
SUB (register, lsr, 64-bit) 1->2 1.959530
SUB (register, lsr, 64-bit) 1->3 2.005255
SUB (register, lsr, 64-bit) throughput 0.525386
SUB (register, asr, 32-bit) 1->2 2.056567
SUB (register, asr, 32-bit) 1->3 2.012673
SUB (register, asr, 32-bit) throughput 0.733580
SUB (register, asr, 64-bit) 1->2 2.034259
SUB (register, asr, 64-bit) 1->3 2.025747
SUB (register, asr, 64-bit) throughput 0.661467
SUBS (sxtb, 32-bit) 1->2 1.716725
SUBS (sxtb, 32-bit) 1->3 2.000566
SUBS (sxtb, 32-bit) 4->2 2.478841
SUBS (sxtb, 32-bit) 4->3 2.000123
SUBS (sxtb, 32-bit) throughput 0.581329
SUBS (sxtb, 64-bit) 1->2 2.040935
SUBS (sxtb, 64-bit) 1->3 1.962865
SUBS (sxtb, 64-bit) 4->2 2.436956
SUBS (sxtb, 64-bit) 4->3 2.535168
SUBS (sxtb, 64-bit) throughput 0.753128
SUBS (uxtb, 32-bit) 1->2 1.993831
SUBS (uxtb, 32-bit) 1->3 2.003435
SUBS (uxtb, 32-bit) 4->2 1.998302
SUBS (uxtb, 32-bit) 4->3 2.013407
SUBS (uxtb, 32-bit) throughput 0.555973
SUBS (uxtb, 64-bit) 1->2 1.927597
SUBS (uxtb, 64-bit) 1->3 1.953495
SUBS (uxtb, 64-bit) 4->2 2.254119
SUBS (uxtb, 64-bit) 4->3 1.953660
SUBS (uxtb, 64-bit) throughput 0.670060
SUBS (sxth, 32-bit) 1->2 2.478737
SUBS (sxth, 32-bit) 1->3 2.004324
SUBS (sxth, 32-bit) 4->2 2.686343
SUBS (sxth, 32-bit) 4->3 1.983611
SUBS (sxth, 32-bit) throughput 0.713921
SUBS (sxth, 64-bit) 1->2 1.985050
SUBS (sxth, 64-bit) 1->3 2.011864
SUBS (sxth, 64-bit) 4->2 2.576234
SUBS (sxth, 64-bit) 4->3 5.129637
SUBS (sxth, 64-bit) throughput 0.523050
SUBS (uxth, 32-bit) 1->2 2.018709
SUBS (uxth, 32-bit) 1->3 2.009619
SUBS (uxth, 32-bit) 4->2 2.008260
SUBS (uxth, 32-bit) 4->3 1.950511
SUBS (uxth, 32-bit) throughput 0.713970
SUBS (uxth, 64-bit) 1->2 1.964434
SUBS (uxth, 64-bit) 1->3 1.744140
SUBS (uxth, 64-bit) 4->2 2.394311
SUBS (uxth, 64-bit) 4->3 2.007369
SUBS (uxth, 64-bit) throughput 0.774438
SUBS (sxtw, 32-bit) 1->2 0.999394
SUBS (sxtw, 32-bit) 1->3 0.998140
SUBS (sxtw, 32-bit) 4->2 1.005628
SUBS (sxtw, 32-bit) 4->3 1.633143
SUBS (sxtw, 32-bit) throughput 0.564462
SUBS (sxtw, 64-bit) 1->2 2.007989
SUBS (sxtw, 64-bit) 1->3 1.997505
SUBS (sxtw, 64-bit) 4->2 2.671290
SUBS (sxtw, 64-bit) 4->3 2.962775
SUBS (sxtw, 64-bit) throughput 0.738461
SUBS (uxtw, 32-bit) 1->2 0.998222
SUBS (uxtw, 32-bit) 1->3 0.911758
SUBS (uxtw, 32-bit) 4->2 1.007395
SUBS (uxtw, 32-bit) 4->3 0.998224
SUBS (uxtw, 32-bit) throughput 0.511613
SUBS (uxtw, 64-bit) 1->2 2.031998
SUBS (uxtw, 64-bit) 1->3 2.005174
SUBS (uxtw, 64-bit) 4->2 1.998384
SUBS (uxtw, 64-bit) 4->3 1.972726
SUBS (uxtw, 64-bit) throughput 0.803577
SUBS (uxtx, 64-bit) 1->2 0.983823
SUBS (uxtx, 64-bit) 1->3 0.997179
SUBS (uxtx, 64-bit) 4->2 0.999915
SUBS (uxtx, 64-bit) 4->3 1.019736
SUBS (uxtx, 64-bit) throughput 0.538506
SUBS (sxtx, 64-bit) 1->2 1.000040
SUBS (sxtx, 64-bit) 1->3 1.007167
SUBS (sxtx, 64-bit) 4->2 0.885515
SUBS (sxtx, 64-bit) 4->3 0.993622
SUBS (sxtx, 64-bit) throughput 0.543007
SUBS (immediate, 32-bit) 1->2 0.957980
SUBS (immediate, 32-bit) 3->2 1.329272
SUBS (immediate, 32-bit) throughput 0.544901
SUBS (immediate, 64-bit) 1->2 0.929336
SUBS (immediate, 64-bit) 3->2 1.101514
SUBS (immediate, 64-bit) throughput 0.506838
SUBS (shifted immediate, 32-bit) 1->2 0.962757
SUBS (shifted immediate, 32-bit) 3->2 1.031988
SUBS (shifted immediate, 32-bit) throughput 0.534780
SUBS (shifted immediate, 64-bit) 1->2 1.008543
SUBS (shifted immediate, 64-bit) 3->2 1.012385
SUBS (shifted immediate, 64-bit) throughput 0.565785
SUBS (register, 32-bit) 1->2 1.000203
SUBS (register, 32-bit) 1->3 1.001863
SUBS (register, 32-bit) 4->2 1.015914
SUBS (register, 32-bit) 4->3 0.996084
SUBS (register, 32-bit) throughput 0.486281
SUBS (register, 64-bit) 1->2 1.001741
SUBS (register, 64-bit) 1->3 1.002632
SUBS (register, 64-bit) 4->2 1.002789
SUBS (register, 64-bit) 4->3 0.998545
SUBS (register, 64-bit) throughput 0.581190
SUBS (register, lsl, 32-bit) 1->2 2.041629
SUBS (register, lsl, 32-bit) 1->3 1.771928
SUBS (register, lsl, 32-bit) 4->2 1.971644
SUBS (register, lsl, 32-bit) 4->3 2.018543
SUBS (register, lsl, 32-bit) throughput 0.747687
SUBS (register, lsl, 64-bit) 1->2 2.001804
SUBS (register, lsl, 64-bit) 1->3 2.005041
SUBS (register, lsl, 64-bit) 4->2 2.162206
SUBS (register, lsl, 64-bit) 4->3 2.009880
SUBS (register, lsl, 64-bit) throughput 0.719842
SUBS (register, lsr, 32-bit) 1->2 1.932946
SUBS (register, lsr, 32-bit) 1->3 2.281339
SUBS (register, lsr, 32-bit) 4->2 1.783663
SUBS (register, lsr, 32-bit) 4->3 2.220550
SUBS (register, lsr, 32-bit) throughput 0.698893
SUBS (register, lsr, 64-bit) 1->2 2.002951
SUBS (register, lsr, 64-bit) 1->3 1.996205
SUBS (register, lsr, 64-bit) 4->2 2.005103
SUBS (register, lsr, 64-bit) 4->3 2.203084
SUBS (register, lsr, 64-bit) throughput 0.674770
SUBS (register, asr, 32-bit) 1->2 2.004216
SUBS (register, asr, 32-bit) 1->3 2.002840
SUBS (register, asr, 32-bit) 4->2 2.276786
SUBS (register, asr, 32-bit) 4->3 2.004126
SUBS (register, asr, 32-bit) throughput 0.668377
SUBS (register, asr, 64-bit) 1->2 2.080564
SUBS (register, asr, 64-bit) 1->3 2.287363
SUBS (register, asr, 64-bit) 4->2 2.066296
SUBS (register, asr, 64-bit) 4->3 2.001721
SUBS (register, asr, 64-bit) throughput 0.671191
SXTB (32-bit) 1->2 0.999913
SXTB (32-bit) throughput 0.334218
SXTB (64-bit) 1->2 1.061738
SXTB (64-bit) throughput 0.339234
SXTH (32-bit) 1->2 0.999913
SXTH (32-bit) throughput 0.338622
SXTH (64-bit) 1->2 1.000000
SXTH (64-bit) throughput 0.335595
SXTW 1->2 1.000002
SXTW throughput 0.333986
TBNZ (not taken) throughput 0.587511
TBNZ (taken) throughput 1.411866
TBZ (not taken) throughput 0.588943
TBZ (taken) throughput 1.337055
TST (immediate, 32-bit) 2->1 0.999998
TST (immediate, 32-bit) throughput 0.362335
TST (immediate, 64-bit) 2->1 0.996204
TST (immediate, 64-bit) throughput 0.366183
TST (register, 32-bit) 3->1 0.998949
TST (register, 32-bit) 3->2 1.002551
TST (register, 32-bit) throughput 0.366800
TST (register, 64-bit) 3->1 1.004296
TST (register, 64-bit) 3->2 1.585223
TST (register, 64-bit) throughput 0.428843
TST (register, lsl, 32-bit) 3->1 2.005090
TST (register, lsl, 32-bit) 3->2 2.019315
TST (register, lsl, 32-bit) throughput 0.668674
TST (register, lsl, 64-bit) 3->1 2.003076
TST (register, lsl, 64-bit) 3->2 2.005181
TST (register, lsl, 64-bit) throughput 0.659913
TST (register, lsr, 32-bit) 3->1 2.004331
TST (register, lsr, 32-bit) 3->2 2.004090
TST (register, lsr, 32-bit) throughput 0.668040
TST (register, lsr, 64-bit) 3->1 2.001214
TST (register, lsr, 64-bit) 3->2 2.103861
TST (register, lsr, 64-bit) throughput 0.693147
TST (register, asr, 32-bit) 3->1 2.028614
TST (register, asr, 32-bit) 3->2 2.000968
TST (register, asr, 32-bit) throughput 0.669397
TST (register, asr, 64-bit) 3->1 1.955793
TST (register, asr, 64-bit) 3->2 2.022348
TST (register, asr, 64-bit) throughput 0.669822
UBFIZ (32-bit) 1->2 0.916562
UBFIZ (32-bit) throughput 0.346592
UBFIZ (64-bit) 1->2 1.028164
UBFIZ (64-bit) throughput 0.331585
UBFX (32-bit) 1->2 0.995410
UBFX (32-bit) throughput 0.335061
UBFX (64-bit) 1->2 1.002672
UBFX (64-bit) throughput 0.334148
UDIV (fast, 32-bit) 1->2 6.995709
UDIV (fast, 32-bit) 1->3 7.035936
UDIV (fast, 32-bit) throughput 7.239954
UDIV (slow, 32-bit) 1->2 13.583309
UDIV (slow, 32-bit) 1->3 13.090824
UDIV (slow, 32-bit) throughput 13.173184
UDIV (fast, 64-bit) 1->2 7.029979
UDIV (fast, 64-bit) 1->3 7.078598
UDIV (fast, 64-bit) throughput 7.045103
UDIV (medium, 64-bit) 1->2 13.018100
UDIV (medium, 64-bit) 1->3 12.219357
UDIV (medium, 64-bit) throughput 13.337779
UDIV (slow, 64-bit) 1->2 21.171416
UDIV (slow, 64-bit) 1->3 21.291294
UDIV (slow, 64-bit) throughput 21.235447
UMADDL 1->2 3.000649
UMADDL 1->3 3.009396
UMADDL 1->4 1.107719
UMADDL throughput 1.002065
UMNEGL 1->2 3.305753
UMNEGL 1->3 3.545043
UMNEGL throughput 1.023279
UMSUBL 1->2 3.009543
UMSUBL 1->3 3.009273
UMSUBL 1->4 0.994410
UMSUBL throughput 1.005992
UMULH 1->2 3.007371
UMULH 1->3 3.040855
UMULH throughput 0.999651
UMULL 1->2 3.052099
UMULL 1->3 3.002870
UMULL throughput 1.000560
UXTB 1->2 1.001051
UXTB throughput 0.334248
UXTH 1->2 1.001660
UXTH throughput 0.332955
YIELD throughput 0.254358
ABS (vector, 8B) 1->2 3.007129
ABS (vector, 8B) throughput 0.500126
ABS (vector, 16B) 1->2 3.001739
ABS (vector, 16B) throughput 0.501514
ABS (vector, 4H) 1->2 3.006235
ABS (vector, 4H) throughput 0.500707
ABS (vector, 8H) 1->2 3.046647
ABS (vector, 8H) throughput 0.499743
ABS (vector, 2S) 1->2 2.999596
ABS (vector, 2S) throughput 0.506094
ABS (vector, 4S) 1->2 2.998949
ABS (vector, 4S) throughput 0.501265
ABS (vector, 2D) 1->2 3.000807
ABS (vector, 2D) throughput 0.502202
ABS (vector, D) 1->2 3.132653
ABS (vector, D) throughput 0.504906
ADD (vector, 8B) 1->2 1.998828
ADD (vector, 8B) 1->3 2.010204
ADD (vector, 8B) throughput 0.500486
ADD (vector, 16B) 1->2 1.998020
ADD (vector, 16B) 1->3 1.956177
ADD (vector, 16B) throughput 0.513662
ADD (vector, 4H) 1->2 2.003894
ADD (vector, 4H) 1->3 2.004860
ADD (vector, 4H) throughput 0.501321
ADD (vector, 8H) 1->2 1.999109
ADD (vector, 8H) 1->3 2.004777
ADD (vector, 8H) throughput 0.500596
ADD (vector, 2S) 1->2 1.997091
ADD (vector, 2S) 1->3 1.992823
ADD (vector, 2S) throughput 0.500380
ADD (vector, 4S) 1->2 1.996446
ADD (vector, 4S) 1->3 2.003119
ADD (vector, 4S) throughput 0.501766
ADD (vector, 2D) 1->2 2.009273
ADD (vector, 2D) 1->3 2.002915
ADD (vector, 2D) throughput 0.501035
ADD (vector, D) 1->2 1.999031
ADD (vector, D) 1->3 1.995198
ADD (vector, D) throughput 0.500850
ADDHN (vector, 8H) 1->2 3.050794
ADDHN (vector, 8H) 1->3 3.004698
ADDHN (vector, 8H) throughput 0.499985
ADDHN (vector, 4S) 1->2 3.005184
ADDHN (vector, 4S) 1->3 3.003886
ADDHN (vector, 4S) throughput 0.502014
ADDHN (vector, 2D) 1->2 3.006722
ADDHN (vector, 2D) 1->3 3.114175
ADDHN (vector, 2D) throughput 0.483302
ADDHN2 (vector, 8H) 1->1 3.003695
ADDHN2 (vector, 8H) 1->2 3.553686
ADDHN2 (vector, 8H) 1->3 2.895089
ADDHN2 (vector, 8H) throughput 0.503806
ADDHN2 (vector, 8H) throughput 0.516070
ADDHN2 (vector, 4S) 1->1 3.003928
ADDHN2 (vector, 4S) 1->2 3.027697
ADDHN2 (vector, 4S) 1->3 3.003928
ADDHN2 (vector, 4S) throughput 0.505403
ADDHN2 (vector, 4S) throughput 0.567695
ADDHN2 (vector, 2D) 1->1 3.008586
ADDHN2 (vector, 2D) 1->2 3.015104
ADDHN2 (vector, 2D) 1->3 2.997130
ADDHN2 (vector, 2D) throughput 0.500000
ADDHN2 (vector, 2D) throughput 0.501769
ADDP (scalar) 1->2 1.999740
ADDP (scalar) throughput 0.485770
ADDP (vector, 8B) 1->2 1.945793
ADDP (vector, 8B) 1->3 2.073932
ADDP (vector, 8B) throughput 0.506383
ADDP (vector, 16B) 1->2 2.002389
ADDP (vector, 16B) 1->3 2.004050
ADDP (vector, 16B) throughput 0.499657
ADDP (vector, 4H) 1->2 1.998949
ADDP (vector, 4H) 1->3 1.911593
ADDP (vector, 4H) throughput 0.469217
ADDP (vector, 8H) 1->2 1.928630
ADDP (vector, 8H) 1->3 2.001374
ADDP (vector, 8H) throughput 0.508104
ADDP (vector, 2S) 1->2 2.041583
ADDP (vector, 2S) 1->3 1.995039
ADDP (vector, 2S) throughput 0.500121
ADDP (vector, 4S) 1->2 1.998425
ADDP (vector, 4S) 1->3 2.002755
ADDP (vector, 4S) throughput 0.500258
ADDP (vector, 2D) 1->2 2.006924
ADDP (vector, 2D) 1->3 2.006438
ADDP (vector, 2D) throughput 0.501918
ADDV (8B) 1->2 2.999030
ADDV (8B) throughput 0.501711
ADDV (16B) 1->2 3.138402
ADDV (16B) throughput 0.564285
ADDV (4H) 1->2 3.041059
ADDV (4H) throughput 0.504261
ADDV (8H) 1->2 3.002345
ADDV (8H) throughput 0.499465
ADDV (4S) 1->2 2.267006
ADDV (4S) throughput 0.502526
AESD 1->1 3.002176
AESD 1->2 2.980445
AESD throughput 0.501584
AESD throughput 0.503072
AESD + AESIMC 1->1 3.005629
AESD + AESIMC 1->2 3.150999
AESD + AESIMC throughput 1.003892
AESD + AESIMC throughput 0.503682
AESE 1->1 2.997372
AESE 1->2 2.999030
AESE throughput 0.500672
AESE throughput 0.503077
AESE + AESMC 1->1 2.985516
AESE + AESMC 1->2 3.008607
AESE + AESMC throughput 1.010763
AESE + AESMC throughput 0.503375
AESIMC 1->2 2.005255
AESIMC throughput 0.490820
AESMC 1->2 1.999232
AESMC throughput 0.500673
AND (vector, 8B) 1->2 2.003888
AND (vector, 8B) 1->3 2.002384
AND (vector, 8B) throughput 0.499328
AND (vector, 16B) 1->2 2.001388
AND (vector, 16B) 1->3 1.992553
AND (vector, 16B) throughput 0.500997
BIC (vector, immediate, 4H) 1->1 1.998060
BIC (vector, immediate, 4H) throughput 0.509014
BIC (vector, immediate, 4H) throughput 0.511099
BIC (vector, immediate, 8H) 1->1 2.020976
BIC (vector, immediate, 8H) throughput 0.502404
BIC (vector, immediate, 8H) throughput 0.483333
BIC (vector, immediate, 2S) 1->1 1.998547
BIC (vector, immediate, 2S) throughput 0.500066
BIC (vector, immediate, 2S) throughput 0.502164
BIC (vector, immediate, 4S) 1->1 2.049391
BIC (vector, immediate, 4S) throughput 0.338233
BIC (vector, immediate, 4S) throughput 0.500222
BIC (vector, register, 8B) 1->2 2.001376
BIC (vector, register, 8B) 1->3 2.051676
BIC (vector, register, 8B) throughput 0.500863
BIC (vector, register, 16B) 1->2 2.003846
BIC (vector, register, 16B) 1->3 1.999111
BIC (vector, register, 16B) throughput 0.502505
BIF (vector, 8B) 1->1 2.005589
BIF (vector, 8B) 1->2 2.003887
BIF (vector, 8B) 1->3 1.996849
BIF (vector, 8B) throughput 0.502516
BIF (vector, 8B) throughput 0.504366
BIF (vector, 16B) 1->1 2.004334
BIF (vector, 16B) 1->2 2.001499
BIF (vector, 16B) 1->3 2.015893
BIF (vector, 16B) throughput 0.501276
BIF (vector, 16B) throughput 0.557655
BIT (vector, 8B) 1->1 1.999396
BIT (vector, 8B) 1->2 2.004328
BIT (vector, 8B) 1->3 2.008941
BIT (vector, 8B) throughput 0.503053
BIT (vector, 8B) throughput 0.514536
BIT (vector, 16B) 1->1 2.015329
BIT (vector, 16B) 1->2 2.001538
BIT (vector, 16B) 1->3 2.002713
BIT (vector, 16B) throughput 0.502566
BIT (vector, 16B) throughput 0.523688
BSL (vector, 8B) 1->1 2.001636
BSL (vector, 8B) 1->2 2.007250
BSL (vector, 8B) 1->3 2.008503
BSL (vector, 8B) throughput 0.521760
BSL (vector, 8B) throughput 0.502481
BSL (vector, 16B) 1->1 2.006034
BSL (vector, 16B) 1->2 2.025592
BSL (vector, 16B) 1->3 2.002505
BSL (vector, 16B) throughput 0.498998
BSL (vector, 16B) throughput 0.505919
CLS (vector, 8B) 1->2 1.996446
CLS (vector, 8B) throughput 0.500612
CLS (vector, 16B) 1->2 2.000647
CLS (vector, 16B) throughput 0.500795
CLS (vector, 4H) 1->2 2.000201
CLS (vector, 4H) throughput 0.500506
CLS (vector, 8H) 1->2 1.996122
CLS (vector, 8H) throughput 0.525698
CLS (vector, 2S) 1->2 2.006266
CLS (vector, 2S) throughput 0.499495
CLS (vector, 4S) 1->2 1.999597
CLS (vector, 4S) throughput 0.501263
CLZ (vector, 8B) 1->2 1.998302
CLZ (vector, 8B) throughput 0.500269
CLZ (vector, 16B) 1->2 1.971159
CLZ (vector, 16B) throughput 0.503386
CLZ (vector, 4H) 1->2 2.005548
CLZ (vector, 4H) throughput 0.499646
CLZ (vector, 8H) 1->2 2.000445
CLZ (vector, 8H) throughput 0.501068
CLZ (vector, 2S) 1->2 1.999434
CLZ (vector, 2S) throughput 0.553521
CLZ (vector, 4S) 1->2 2.042433
CLZ (vector, 4S) throughput 0.505118
CMEQ (register, 8B) 1->2 1.997214
CMEQ (register, 8B) 1->3 2.002388
CMEQ (register, 8B) throughput 0.499884
CMEQ (register, 16B) 1->2 2.173807
CMEQ (register, 16B) 1->3 2.114626
CMEQ (register, 16B) throughput 0.489782
CMEQ (register, 4H) 1->2 1.935435
CMEQ (register, 4H) 1->3 1.999433
CMEQ (register, 4H) throughput 0.499959
CMEQ (register, 8H) 1->2 2.000121
CMEQ (register, 8H) 1->3 2.005658
CMEQ (register, 8H) throughput 0.500838
CMEQ (register, 2S) 1->2 2.006908
CMEQ (register, 2S) 1->3 2.004293
CMEQ (register, 2S) throughput 0.505572
CMEQ (register, 4S) 1->2 1.999675
CMEQ (register, 4S) 1->3 2.003320
CMEQ (register, 4S) throughput 0.500628
CMEQ (register, 2D) 1->2 2.001580
CMEQ (register, 2D) 1->3 2.001498
CMEQ (register, 2D) throughput 0.499859
CMEQ (register, D) 1->2 2.001456
CMEQ (register, D) 1->3 1.987589
CMEQ (register, D) throughput 0.501445
CMEQ (zero, 8B) 1->2 2.032940
CMEQ (zero, 8B) throughput 0.501969
CMEQ (zero, 16B) 1->2 1.990272
CMEQ (zero, 16B) throughput 0.500658
CMEQ (zero, 4H) 1->2 2.006763
CMEQ (zero, 4H) throughput 0.500522
CMEQ (zero, 8H) 1->2 2.002307
CMEQ (zero, 8H) throughput 0.499334
CMEQ (zero, 2S) 1->2 1.995399
CMEQ (zero, 2S) throughput 0.499455
CMEQ (zero, 4S) 1->2 2.001982
CMEQ (zero, 4S) throughput 0.501007
CMEQ (zero, 2D) 1->2 1.937706
CMEQ (zero, 2D) throughput 0.532663
CMEQ (zero, D) 1->2 2.001661
CMEQ (zero, D) throughput 0.501109
CMGE (register, 8B) 1->2 2.000605
CMGE (register, 8B) 1->3 2.004128
CMGE (register, 8B) throughput 0.501564
CMGE (register, 16B) 1->2 1.994195
CMGE (register, 16B) 1->3 2.013586
CMGE (register, 16B) throughput 0.420625
CMGE (register, 4H) 1->2 2.035997
CMGE (register, 4H) 1->3 1.999272
CMGE (register, 4H) throughput 0.501260
CMGE (register, 8H) 1->2 2.001499
CMGE (register, 8H) 1->3 2.007896
CMGE (register, 8H) throughput 0.502019
CMGE (register, 2S) 1->2 2.037051
CMGE (register, 2S) 1->3 2.004732
CMGE (register, 2S) throughput 0.487067
CMGE (register, 4S) 1->2 2.018829
CMGE (register, 4S) 1->3 1.996811
CMGE (register, 4S) throughput 0.501660
CMGE (register, 2D) 1->2 1.989375
CMGE (register, 2D) 1->3 2.001091
CMGE (register, 2D) throughput 0.505736
CMGE (register, D) 1->2 2.005376
CMGE (register, D) 1->3 2.001990
CMGE (register, D) throughput 0.499728
CMGE (zero, 8B) 1->2 2.001660
CMGE (zero, 8B) throughput 0.499207
CMGE (zero, 16B) 1->2 2.002143
CMGE (zero, 16B) throughput 0.500010
CMGE (zero, 4H) 1->2 2.003604
CMGE (zero, 4H) throughput 0.503835
CMGE (zero, 8H) 1->2 2.006360
CMGE (zero, 8H) throughput 0.500096
CMGE (zero, 2S) 1->2 1.988250
CMGE (zero, 2S) throughput 0.500926
CMGE (zero, 4S) 1->2 2.004698
CMGE (zero, 4S) throughput 0.504475
CMGE (zero, 2D) 1->2 1.999798
CMGE (zero, 2D) throughput 0.480170
CMGE (zero, D) 1->2 2.001814
CMGE (zero, D) throughput 0.500951
CMGT (register, 8B) 1->2 1.996122
CMGT (register, 8B) 1->3 1.994071
CMGT (register, 8B) throughput 0.501203
CMGT (register, 16B) 1->2 2.001702
CMGT (register, 16B) 1->3 2.001537
CMGT (register, 16B) throughput 0.591036
CMGT (register, 4H) 1->2 2.013325
CMGT (register, 4H) 1->3 1.987754
CMGT (register, 4H) throughput 0.512092
CMGT (register, 8H) 1->2 2.005629
CMGT (register, 8H) 1->3 2.007126
CMGT (register, 8H) throughput 0.501545
CMGT (register, 2S) 1->2 1.998345
CMGT (register, 2S) 1->3 1.994602
CMGT (register, 2S) throughput 0.551390
CMGT (register, 4S) 1->2 2.001172
CMGT (register, 4S) 1->3 2.000444
CMGT (register, 4S) throughput 0.579435
CMGT (register, 2D) 1->2 1.990171
CMGT (register, 2D) 1->3 2.004938
CMGT (register, 2D) throughput 0.512744
CMGT (register, D) 1->2 1.998949
CMGT (register, D) 1->3 2.001499
CMGT (register, D) throughput 0.500116
CMGT (zero, 8B) 1->2 2.001499
CMGT (zero, 8B) throughput 0.500333
CMGT (zero, 16B) 1->2 2.003922
CMGT (zero, 16B) throughput 0.502181
CMGT (zero, 4H) 1->2 2.003968
CMGT (zero, 4H) throughput 0.499843
CMGT (zero, 8H) 1->2 1.996044
CMGT (zero, 8H) throughput 0.501601
CMGT (zero, 2S) 1->2 2.005231
CMGT (zero, 2S) throughput 0.502294
CMGT (zero, 4S) 1->2 2.003645
CMGT (zero, 4S) throughput 0.505179
CMGT (zero, 2D) 1->2 2.002471
CMGT (zero, 2D) throughput 0.499990
CMGT (zero, D) 1->2 2.003808
CMGT (zero, D) throughput 0.447960
CMHI (register, 8B) 1->2 2.000343
CMHI (register, 8B) 1->3 1.999473
CMHI (register, 8B) throughput 0.506225
CMHI (register, 16B) 1->2 1.999718
CMHI (register, 16B) 1->3 1.998629
CMHI (register, 16B) throughput 0.500648
CMHI (register, 4H) 1->2 1.989102
CMHI (register, 4H) 1->3 2.035119
CMHI (register, 4H) throughput 0.506180
CMHI (register, 8H) 1->2 2.006965
CMHI (register, 8H) 1->3 1.999231
CMHI (register, 8H) throughput 0.500946
CMHI (register, 2S) 1->2 2.001378
CMHI (register, 2S) 1->3 2.035977
CMHI (register, 2S) throughput 0.500461
CMHI (register, 4S) 1->2 2.008586
CMHI (register, 4S) 1->3 1.996220
CMHI (register, 4S) throughput 0.504428
CMHI (register, 2D) 1->2 2.003078
CMHI (register, 2D) 1->3 1.992025
CMHI (register, 2D) throughput 0.502781
CMHI (register, D) 1->2 2.004010
CMHI (register, D) 1->3 2.000485
CMHI (register, D) throughput 0.499646
CMHS (register, 8B) 1->2 1.991622
CMHS (register, 8B) 1->3 2.002019
CMHS (register, 8B) throughput 0.496626
CMHS (register, 16B) 1->2 1.999516
CMHS (register, 16B) 1->3 2.002226
CMHS (register, 16B) throughput 0.501696
CMHS (register, 4H) 1->2 2.005998
CMHS (register, 4H) 1->3 2.001783
CMHS (register, 4H) throughput 0.690763
CMHS (register, 8H) 1->2 2.065523
CMHS (register, 8H) 1->3 2.454696
CMHS (register, 8H) throughput 0.499153
CMHS (register, 2S) 1->2 1.995297
CMHS (register, 2S) 1->3 2.228693
CMHS (register, 2S) throughput 0.511217
CMHS (register, 4S) 1->2 1.995689
CMHS (register, 4S) 1->3 2.005425
CMHS (register, 4S) throughput 0.501531
CMHS (register, 2D) 1->2 2.041059
CMHS (register, 2D) 1->3 2.001739
CMHS (register, 2D) throughput 0.499582
CMHS (register, D) 1->2 2.011261
CMHS (register, D) 1->3 2.012988
CMHS (register, D) throughput 0.498172
CMLE (zero, 8B) 1->2 1.970814
CMLE (zero, 8B) throughput 0.500550
CMLE (zero, 16B) 1->2 2.073166
CMLE (zero, 16B) throughput 0.521689
CMLE (zero, 4H) 1->2 2.291393
CMLE (zero, 4H) throughput 0.445284
CMLE (zero, 8H) 1->2 1.997589
CMLE (zero, 8H) throughput 0.484635
CMLE (zero, 2S) 1->2 2.007736
CMLE (zero, 2S) throughput 0.503221
CMLE (zero, 4S) 1->2 1.873106
CMLE (zero, 4S) throughput 0.507329
CMLE (zero, 2D) 1->2 2.284267
CMLE (zero, 2D) throughput 0.504268
CMLE (zero, D) 1->2 2.001492
CMLE (zero, D) throughput 0.501570
CMLT (zero, 8B) 1->2 2.312165
CMLT (zero, 8B) throughput 0.500992
CMLT (zero, 16B) 1->2 1.999069
CMLT (zero, 16B) throughput 0.500911
CMLT (zero, 4H) 1->2 1.986973
CMLT (zero, 4H) throughput 0.503219
CMLT (zero, 8H) 1->2 2.078910
CMLT (zero, 8H) throughput 0.503502
CMLT (zero, 2S) 1->2 2.004264
CMLT (zero, 2S) throughput 0.482685
CMLT (zero, 4S) 1->2 1.997697
CMLT (zero, 4S) throughput 0.506626
CMLT (zero, 2D) 1->2 1.924074
CMLT (zero, 2D) throughput 0.500628
CMLT (zero, D) 1->2 2.002547
CMLT (zero, D) throughput 0.500795
CMTST (register, 8B) 1->2 1.956177
CMTST (register, 8B) 1->3 2.266587
CMTST (register, 8B) throughput 0.501472
CMTST (register, 16B) 1->2 2.003147
CMTST (register, 16B) 1->3 1.994916
CMTST (register, 16B) throughput 0.507358
CMTST (register, 4H) 1->2 1.997420
CMTST (register, 4H) 1->3 1.998788
CMTST (register, 4H) throughput 0.548552
CMTST (register, 8H) 1->2 2.056292
CMTST (register, 8H) 1->3 2.007087
CMTST (register, 8H) throughput 0.499939
CMTST (register, 2S) 1->2 2.001334
CMTST (register, 2S) 1->3 2.007207
CMTST (register, 2S) throughput 0.500491
CMTST (register, 4S) 1->2 1.995359
CMTST (register, 4S) 1->3 1.995922
CMTST (register, 4S) throughput 0.497137
CMTST (register, 2D) 1->2 1.992997
CMTST (register, 2D) 1->3 2.034011
CMTST (register, 2D) throughput 0.499068
CMTST (register, D) 1->2 2.057699
CMTST (register, D) 1->3 2.003239
CMTST (register, D) throughput 0.503841
CNT (8B) 1->2 2.011778
CNT (8B) throughput 0.482074
CNT (16B) 1->2 1.834248
CNT (16B) throughput 0.481441
DUP (element, vector, 8B) 1->2 2.148641
DUP (element, vector, 8B) throughput 0.531609
DUP (element, vector, 16B) 1->2 2.004050
DUP (element, vector, 16B) throughput 0.507774
DUP (element, vector, 4H) 1->2 2.040938
DUP (element, vector, 4H) throughput 0.572068
DUP (element, vector, 8H) 1->2 1.976484
DUP (element, vector, 8H) throughput 0.502851
DUP (element, vector, 2S) 1->2 1.996122
DUP (element, vector, 2S) throughput 0.500768
DUP (element, vector, 4S) 1->2 2.005224
DUP (element, vector, 4S) throughput 0.499849
DUP (element, vector, 2D) 1->2 1.983744
DUP (element, vector, 2D) throughput 0.526271
DUP (element, scalar, B) 1->2 2.001419
DUP (element, scalar, B) throughput 0.501853
DUP (element, scalar, H) 1->2 2.003967
DUP (element, scalar, H) throughput 0.501200
DUP (element, scalar, S) 1->2 2.005183
DUP (element, scalar, S) throughput 0.608651
DUP (element, scalar, D) 1->2 1.954670
DUP (element, scalar, D) throughput 0.507820
DUP (general, 8B) 1->2 roundtrip 9.066604
DUP (general, 8B) throughput 0.501842
DUP (general, 16B) 1->2 roundtrip 9.039889
DUP (general, 16B) throughput 0.501340
DUP (general, 4H) 1->2 roundtrip 9.482526
DUP (general, 4H) throughput 0.433388
DUP (general, 8H) 1->2 roundtrip 9.025672
DUP (general, 8H) throughput 0.502759
DUP (general, 2S) 1->2 roundtrip 9.032102
DUP (general, 2S) throughput 0.500430
DUP (general, 4S) 1->2 roundtrip 9.012267
DUP (general, 4S) throughput 0.502364
DUP (general, 2D) 1->2 roundtrip 9.078728
DUP (general, 2D) throughput 0.501462
EOR (vector, 8B) 1->2 2.002408
EOR (vector, 8B) 1->3 2.084994
EOR (vector, 8B) throughput 0.517456
EOR (vector, 16B) 1->2 1.995085
EOR (vector, 16B) 1->3 2.004534
EOR (vector, 16B) throughput 0.502114
EXT (vector, 8B) 1->2 1.999919
EXT (vector, 8B) 1->3 2.043421
EXT (vector, 8B) throughput 0.505026
EXT (vector, 16B) 1->2 1.997172
EXT (vector, 16B) 1->3 2.014941
EXT (vector, 16B) throughput 0.500496
FABD (vector, 2S) 1->2 2.997977
FABD (vector, 2S) 1->3 3.017332
FABD (vector, 2S) throughput 0.545956
FABD (vector, 4S) 1->2 2.999115
FABD (vector, 4S) 1->3 2.987863
FABD (vector, 4S) throughput 0.498720
FABD (vector, 2D) 1->2 3.041788
FABD (vector, 2D) 1->3 3.003807
FABD (vector, 2D) throughput 0.499232
FABD (scalar, S) 1->2 3.002386
FABD (scalar, S) 1->3 3.136969
FABD (scalar, S) throughput 0.504517
FABD (scalar, D) 1->2 3.021868
FABD (scalar, D) 1->3 3.013891
FABD (scalar, D) throughput 0.500425
FABS (scalar, S) 1->2 2.002389
FABS (scalar, S) throughput 0.501584
FABS (scalar, D) 1->2 1.987221
FABS (scalar, D) throughput 0.501301
FABS (vector, 2S) 1->2 2.002505
FABS (vector, 2S) throughput 0.502177
FABS (vector, 4S) 1->2 2.002507
FABS (vector, 4S) throughput 0.508736
FABS (vector, 2D) 1->2 2.002551
FABS (vector, 2D) throughput 0.500835
FACGE (vector, 2S) 1->2 2.002752
FACGE (vector, 2S) 1->3 2.006364
FACGE (vector, 2S) throughput 0.500379
FACGE (vector, 4S) 1->2 1.996688
FACGE (vector, 4S) 1->3 1.877471
FACGE (vector, 4S) throughput 0.499821
FACGE (vector, 2D) 1->2 1.991583
FACGE (vector, 2D) 1->3 1.996003
FACGE (vector, 2D) throughput 0.499525
FACGE (scalar, S) 1->2 2.004414
FACGE (scalar, S) 1->3 1.999913
FACGE (scalar, S) throughput 0.501225
FACGE (scalar, D) 1->2 2.006074
FACGE (scalar, D) 1->3 2.000203
FACGE (scalar, D) throughput 0.504247
FACGT (vector, 2S) 1->2 2.008307
FACGT (vector, 2S) 1->3 1.999915
FACGT (vector, 2S) throughput 0.501108
FACGT (vector, 4S) 1->2 2.058673
FACGT (vector, 4S) 1->3 2.001821
FACGT (vector, 4S) throughput 0.501898
FACGT (vector, 2D) 1->2 2.001417
FACGT (vector, 2D) 1->3 2.016845
FACGT (vector, 2D) throughput 0.498137
FACGT (scalar, S) 1->2 1.904207
FACGT (scalar, S) 1->3 2.002876
FACGT (scalar, S) throughput 0.505936
FACGT (scalar, D) 1->2 2.004412
FACGT (scalar, D) 1->3 1.997336
FACGT (scalar, D) throughput 0.500476
FADD (vector, 2S) 1->2 3.039415
FADD (vector, 2S) 1->3 3.042276
FADD (vector, 2S) throughput 0.500370
FADD (vector, 4S) 1->2 3.005264
FADD (vector, 4S) 1->3 2.216904
FADD (vector, 4S) throughput 0.512810
FADD (vector, 2D) 1->2 3.064412
FADD (vector, 2D) 1->3 3.024382
FADD (vector, 2D) throughput 0.512824
FADD (scalar, S) 1->2 3.017706
FADD (scalar, S) 1->3 3.137789
FADD (scalar, S) throughput 0.499500
FADD (scalar, D) 1->2 2.997294
FADD (scalar, D) 1->3 3.000085
FADD (scalar, D) throughput 0.500457
FADDP (scalar, S) 1->2 3.017252
FADDP (scalar, S) throughput 0.500131
FADDP (scalar, D) 1->2 3.048722
FADDP (scalar, D) throughput 0.514400
FADDP (vector, 2S) 1->2 2.980663
FADDP (vector, 2S) 1->3 3.023402
FADDP (vector, 2S) throughput 0.500871
FADDP (vector, 4S) 1->2 2.942020
FADDP (vector, 4S) 1->3 3.047022
FADDP (vector, 4S) throughput 0.517003
FADDP (vector, 2D) 1->2 2.997292
FADDP (vector, 2D) 1->3 3.043937
FADDP (vector, 2D) throughput 0.499511
FCCMP (scalar, S) 3->1 1.997130
FCCMP (scalar, S) 3->2 2.021058
FCCMP (scalar, S) 3->3 2.005629
FCCMP (scalar, S) throughput 1.058537
FCCMP (scalar, D) 3->1 2.010040
FCCMP (scalar, D) 3->2 2.042685
FCCMP (scalar, D) 3->3 2.004211
FCCMP (scalar, D) throughput 1.076800
FCCMPE (scalar, S) 3->1 2.005750
FCCMPE (scalar, S) 3->2 2.001013
FCCMPE (scalar, S) 3->3 1.997357
FCCMPE (scalar, S) throughput 0.917496
FCCMPE (scalar, D) 3->1 2.001292
FCCMPE (scalar, D) 3->2 2.005608
FCCMPE (scalar, D) 3->3 2.002110
FCCMPE (scalar, D) throughput 1.078223
FCMEQ (vector, 2S) 1->2 2.006604
FCMEQ (vector, 2S) 1->3 1.867895
FCMEQ (vector, 2S) throughput 0.506825
FCMEQ (vector, 4S) 1->2 2.008853
FCMEQ (vector, 4S) 1->3 1.609889
FCMEQ (vector, 4S) throughput 0.479951
FCMEQ (vector, 2D) 1->2 2.179490
FCMEQ (vector, 2D) 1->3 1.998181
FCMEQ (vector, 2D) throughput 0.500106
FCMEQ (scalar, S) 1->2 1.846760
FCMEQ (scalar, S) 1->3 2.008772
FCMEQ (scalar, S) throughput 0.500204
FCMEQ (scalar, D) 1->2 2.008257
FCMEQ (scalar, D) 1->3 1.976208
FCMEQ (scalar, D) throughput 0.485502
FCMEQ (vector, zero, 2S) 1->2 2.025717
FCMEQ (vector, zero, 2S) throughput 0.502888
FCMEQ (vector, zero, 4S) 1->2 1.997710
FCMEQ (vector, zero, 4S) throughput 0.509201
FCMEQ (vector, zero, 2D) 1->2 2.475723
FCMEQ (vector, zero, 2D) throughput 0.500729
FCMEQ (scalar, zero, S) 1->2 2.064302
FCMEQ (scalar, zero, S) throughput 0.513147
FCMEQ (scalar, zero, D) 1->2 2.003712
FCMEQ (scalar, zero, D) throughput 0.503948
FCMGE (vector, 2S) 1->2 2.004453
FCMGE (vector, 2S) 1->3 1.999152
FCMGE (vector, 2S) throughput 0.500764
FCMGE (vector, 4S) 1->2 2.001011
FCMGE (vector, 4S) 1->3 1.988415
FCMGE (vector, 4S) throughput 0.500795
FCMGE (vector, 2D) 1->2 2.002828
FCMGE (vector, 2D) 1->3 2.002794
FCMGE (vector, 2D) throughput 0.501266
FCMGE (scalar, S) 1->2 2.004365
FCMGE (scalar, S) 1->3 2.001819
FCMGE (scalar, S) throughput 0.499687
FCMGE (scalar, D) 1->2 2.012643
FCMGE (scalar, D) 1->3 2.007910
FCMGE (scalar, D) throughput 0.501121
FCMGE (vector, zero, 2S) 1->2 2.050359
FCMGE (vector, zero, 2S) throughput 0.504173
FCMGE (vector, zero, 4S) 1->2 1.957363
FCMGE (vector, zero, 4S) throughput 0.502483
FCMGE (vector, zero, 2D) 1->2 1.996365
FCMGE (vector, zero, 2D) throughput 0.505976
FCMGE (scalar, zero, S) 1->2 2.006184
FCMGE (scalar, zero, S) throughput 0.500101
FCMGE (scalar, zero, D) 1->2 1.986841
FCMGE (scalar, zero, D) throughput 0.507546
FCMGT (vector, 2S) 1->2 2.007614
FCMGT (vector, 2S) 1->3 1.592154
FCMGT (vector, 2S) throughput 0.491833
FCMGT (vector, 4S) 1->2 2.002790
FCMGT (vector, 4S) 1->3 2.001863
FCMGT (vector, 4S) throughput 0.501749
FCMGT (vector, 2D) 1->2 2.001904
FCMGT (vector, 2D) 1->3 2.005222
FCMGT (vector, 2D) throughput 0.500749
FCMGT (scalar, S) 1->2 2.000807
FCMGT (scalar, S) 1->3 2.002748
FCMGT (scalar, S) throughput 0.514061
FCMGT (scalar, D) 1->2 2.018995
FCMGT (scalar, D) 1->3 2.007733
FCMGT (scalar, D) throughput 0.605849
FCMGT (vector, zero, 2S) 1->2 1.953895
FCMGT (vector, zero, 2S) throughput 0.504525
FCMGT (vector, zero, 4S) 1->2 2.021187
FCMGT (vector, zero, 4S) throughput 0.487339
FCMGT (vector, zero, 2D) 1->2 2.023363
FCMGT (vector, zero, 2D) throughput 0.767000
FCMGT (scalar, zero, S) 1->2 3.610909
FCMGT (scalar, zero, S) throughput 0.474408
FCMGT (scalar, zero, D) 1->2 1.980071
FCMGT (scalar, zero, D) throughput 0.659380
FCMLE (vector, zero, 2S) 1->2 1.973843
FCMLE (vector, zero, 2S) throughput 0.502182
FCMLE (vector, zero, 4S) 1->2 2.006478
FCMLE (vector, zero, 4S) throughput 0.501270
FCMLE (vector, zero, 2D) 1->2 2.001941
FCMLE (vector, zero, 2D) throughput 0.613984
FCMLE (scalar, zero, S) 1->2 2.001172
FCMLE (scalar, zero, S) throughput 0.525673
FCMLE (scalar, zero, D) 1->2 2.018028
FCMLE (scalar, zero, D) throughput 0.492320
FCMLT (vector, zero, 2S) 1->2 1.995014
FCMLT (vector, zero, 2S) throughput 0.488937
FCMLT (vector, zero, 4S) 1->2 2.220259
FCMLT (vector, zero, 4S) throughput 0.503450
FCMLT (vector, zero, 2D) 1->2 2.011091
FCMLT (vector, zero, 2D) throughput 0.459751
FCMLT (scalar, zero, S) 1->2 2.014313
FCMLT (scalar, zero, S) throughput 0.499631
FCMLT (scalar, zero, D) 1->2 2.002511
FCMLT (scalar, zero, D) throughput 0.441847
FCMP (scalar, S) 3->1 2.003352
FCMP (scalar, S) 3->2 2.003953
FCMP (scalar, S) throughput 0.984456
FCMP (scalar, D) 3->1 1.974929
FCMP (scalar, D) 3->2 2.038346
FCMP (scalar, D) throughput 1.002465
FCMPE (scalar, S) 3->1 1.998951
FCMPE (scalar, S) 3->2 2.007125
FCMPE (scalar, S) throughput 1.002804
FCMPE (scalar, D) 3->1 2.007896
FCMPE (scalar, D) 3->2 2.038025
FCMPE (scalar, D) throughput 1.007608
FCSEL (scalar, S) 1->2 1.997943
FCSEL (scalar, S) 1->3 2.006446
FCSEL (scalar, S) 1->4 2.074971
FCSEL (scalar, S) throughput 0.478855
FCSEL (scalar, D) 1->2 1.996204
FCSEL (scalar, D) 1->3 1.996890
FCSEL (scalar, D) 1->4 2.009475
FCSEL (scalar, D) throughput 0.514374
FCVT (S to D) 1->2 2.993055
FCVT (S to D) throughput 0.521553
FCVT (D to S) 1->2 2.997575
FCVT (D to S) throughput 0.396494
FCVTAS (scalar, S to 32-bit GPR) 1->2 roundtrip 11.603616
FCVTAS (scalar, S to 32-bit GPR) throughput 1.036901
FCVTAS (scalar, S to 64-bit GPR) 1->2 roundtrip 13.430579
FCVTAS (scalar, S to 64-bit GPR) throughput 1.067614
FCVTAS (scalar, D to 32-bit GPR) 1->2 roundtrip 10.658934
FCVTAS (scalar, D to 32-bit GPR) throughput 0.849991
FCVTAS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.003498
FCVTAS (scalar, D to 64-bit GPR) throughput 1.221277
FCVTAS (scalar, S to S) 1->2 3.031498
FCVTAS (scalar, S to S) throughput 0.505448
FCVTAS (scalar, S to S) 1->2 2.869918
FCVTAS (scalar, S to S) throughput 0.560695
FCVTAS (scalar, D to D) 1->2 3.169740
FCVTAS (scalar, D to D) throughput 0.468550
FCVTAS (scalar, D to D) 1->2 2.782702
FCVTAS (scalar, D to D) throughput 0.568616
FCVTAS (vector, 2S) 1->2 3.999831
FCVTAS (vector, 2S) throughput 0.509222
FCVTAS (vector, 4S) 1->2 2.866493
FCVTAS (vector, 4S) throughput 0.501625
FCVTAS (vector, 2D) 1->2 2.986932
FCVTAS (vector, 2D) throughput 0.549963
FCVTAU (scalar, S to 32-bit GPR) 1->2 roundtrip 8.818556
FCVTAU (scalar, S to 32-bit GPR) throughput 1.029471
FCVTAU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.525896
FCVTAU (scalar, S to 64-bit GPR) throughput 1.223013
FCVTAU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.470033
FCVTAU (scalar, D to 32-bit GPR) throughput 1.035602
FCVTAU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.495007
FCVTAU (scalar, D to 64-bit GPR) throughput 1.046352
FCVTAU (scalar, S to S) 1->2 2.986819
FCVTAU (scalar, S to S) throughput 0.500172
FCVTAU (scalar, S to S) 1->2 2.889492
FCVTAU (scalar, S to S) throughput 0.506585
FCVTAU (scalar, D to D) 1->2 3.264348
FCVTAU (scalar, D to D) throughput 0.543383
FCVTAU (scalar, D to D) 1->2 2.892832
FCVTAU (scalar, D to D) throughput 0.589352
FCVTAU (vector, 2S) 1->2 2.990128
FCVTAU (vector, 2S) throughput 0.548115
FCVTAU (vector, 4S) 1->2 2.990863
FCVTAU (vector, 4S) throughput 0.658071
FCVTAU (vector, 2D) 1->2 2.956689
FCVTAU (vector, 2D) throughput 0.503791
FCVTL (vector, 4S) 1->2 3.037293
FCVTL (vector, 4S) throughput 0.500050
FCVTL (vector, 2D) 1->2 3.042538
FCVTL (vector, 2D) throughput 0.500810
FCVTL2 (vector, 4S) 1->2 3.116172
FCVTL2 (vector, 4S) throughput 0.507461
FCVTL2 (vector, 2D) 1->2 3.030071
FCVTL2 (vector, 2D) throughput 0.499581
FCVTMS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.101886
FCVTMS (scalar, S to 32-bit GPR) throughput 1.010978
FCVTMS (scalar, S to 64-bit GPR) 1->2 roundtrip 10.024300
FCVTMS (scalar, S to 64-bit GPR) throughput 0.999414
FCVTMS (scalar, D to 32-bit GPR) 1->2 roundtrip 9.993498
FCVTMS (scalar, D to 32-bit GPR) throughput 1.180365
FCVTMS (scalar, D to 64-bit GPR) 1->2 roundtrip 9.539635
FCVTMS (scalar, D to 64-bit GPR) throughput 1.572712
FCVTMS (scalar, S to S) 1->2 2.661297
FCVTMS (scalar, S to S) throughput 0.497103
FCVTMS (scalar, S to S) 1->2 2.834145
FCVTMS (scalar, S to S) throughput 0.641390
FCVTMS (scalar, D to D) 1->2 3.007653
FCVTMS (scalar, D to D) throughput 0.571661
FCVTMS (scalar, D to D) 1->2 2.941764
FCVTMS (scalar, D to D) throughput 0.571747
FCVTMS (vector, 2S) 1->2 3.691594
FCVTMS (vector, 2S) throughput 0.510426
FCVTMS (vector, 4S) 1->2 3.250913
FCVTMS (vector, 4S) throughput 0.464128
FCVTMS (vector, 2D) 1->2 3.285809
FCVTMS (vector, 2D) throughput 0.566691
FCVTMU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.195782
FCVTMU (scalar, S to 32-bit GPR) throughput 0.864335
FCVTMU (scalar, S to 64-bit GPR) 1->2 roundtrip 12.414602
FCVTMU (scalar, S to 64-bit GPR) throughput 1.018955
FCVTMU (scalar, D to 32-bit GPR) 1->2 roundtrip 10.727224
FCVTMU (scalar, D to 32-bit GPR) throughput 1.038673
FCVTMU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.574723
FCVTMU (scalar, D to 64-bit GPR) throughput 1.073544
FCVTMU (scalar, S to S) 1->2 2.859100
FCVTMU (scalar, S to S) throughput 0.535914
FCVTMU (scalar, S to S) 1->2 3.069142
FCVTMU (scalar, S to S) throughput 0.551570
FCVTMU (scalar, D to D) 1->2 2.664293
FCVTMU (scalar, D to D) throughput 0.533239
FCVTMU (scalar, D to D) 1->2 2.839785
FCVTMU (scalar, D to D) throughput 0.577255
FCVTMU (vector, 2S) 1->2 3.009597
FCVTMU (vector, 2S) throughput 0.558682
FCVTMU (vector, 4S) 1->2 5.205633
FCVTMU (vector, 4S) throughput 0.507248
FCVTMU (vector, 2D) 1->2 2.992145
FCVTMU (vector, 2D) throughput 0.534786
FCVTN (vector, 4S) 1->2 3.375417
FCVTN (vector, 4S) throughput 0.546083
FCVTN (vector, 2D) 1->2 3.008991
FCVTN (vector, 2D) throughput 0.597376
FCVTN2 (vector, 4S) 1->1 2.825799
FCVTN2 (vector, 4S) 1->2 3.068806
FCVTN2 (vector, 4S) throughput 0.520936
FCVTN2 (vector, 4S) throughput 0.519790
FCVTN2 (vector, 2D) 1->1 3.562772
FCVTN2 (vector, 2D) 1->2 3.884408
FCVTN2 (vector, 2D) throughput 0.459407
FCVTN2 (vector, 2D) throughput 0.523584
FCVTNS (scalar, S to 32-bit GPR) 1->2 roundtrip 11.140945
FCVTNS (scalar, S to 32-bit GPR) throughput 0.915345
FCVTNS (scalar, S to 64-bit GPR) 1->2 roundtrip 11.418873
FCVTNS (scalar, S to 64-bit GPR) throughput 1.033333
FCVTNS (scalar, D to 32-bit GPR) 1->2 roundtrip 10.348987
FCVTNS (scalar, D to 32-bit GPR) throughput 1.108204
FCVTNS (scalar, D to 64-bit GPR) 1->2 roundtrip 11.268111
FCVTNS (scalar, D to 64-bit GPR) throughput 1.066431
FCVTNS (scalar, S to S) 1->2 1.997886
FCVTNS (scalar, S to S) throughput 0.534696
FCVTNS (scalar, S to S) 1->2 3.027061
FCVTNS (scalar, S to S) throughput 0.499325
FCVTNS (scalar, D to D) 1->2 3.003336
FCVTNS (scalar, D to D) throughput 0.519442
FCVTNS (scalar, D to D) 1->2 3.006380
FCVTNS (scalar, D to D) throughput 0.500126
FCVTNS (vector, 2S) 1->2 3.035755
FCVTNS (vector, 2S) throughput 0.534818
FCVTNS (vector, 4S) 1->2 2.993950
FCVTNS (vector, 4S) throughput 0.549736
FCVTNS (vector, 2D) 1->2 3.008178
FCVTNS (vector, 2D) throughput 0.558094
FCVTNU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.574919
FCVTNU (scalar, S to 32-bit GPR) throughput 1.055316
FCVTNU (scalar, S to 64-bit GPR) 1->2 roundtrip 10.450965
FCVTNU (scalar, S to 64-bit GPR) throughput 1.087463
FCVTNU (scalar, D to 32-bit GPR) 1->2 roundtrip 11.447600
FCVTNU (scalar, D to 32-bit GPR) throughput 1.001384
FCVTNU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.571521
FCVTNU (scalar, D to 64-bit GPR) throughput 1.029495
FCVTNU (scalar, S to S) 1->2 2.965977
FCVTNU (scalar, S to S) throughput 0.492871
FCVTNU (scalar, S to S) 1->2 2.941985
FCVTNU (scalar, S to S) throughput 0.500926
FCVTNU (scalar, D to D) 1->2 3.646507
FCVTNU (scalar, D to D) throughput 0.752619
FCVTNU (scalar, D to D) 1->2 3.282938
FCVTNU (scalar, D to D) throughput 0.718735
FCVTNU (vector, 2S) 1->2 4.070864
FCVTNU (vector, 2S) throughput 0.571288
FCVTNU (vector, 4S) 1->2 3.004129
FCVTNU (vector, 4S) throughput 0.576835
FCVTNU (vector, 2D) 1->2 2.538863
FCVTNU (vector, 2D) throughput 0.628689
FCVTPS (scalar, S to 32-bit GPR) 1->2 roundtrip 10.717571
FCVTPS (scalar, S to 32-bit GPR) throughput 1.073475
FCVTPS (scalar, S to 64-bit GPR) 1->2 roundtrip 13.414665
FCVTPS (scalar, S to 64-bit GPR) throughput 1.101739
FCVTPS (scalar, D to 32-bit GPR) 1->2 roundtrip 12.572712
FCVTPS (scalar, D to 32-bit GPR) throughput 1.423803
FCVTPS (scalar, D to 64-bit GPR) 1->2 roundtrip 13.185186
FCVTPS (scalar, D to 64-bit GPR) throughput 1.017215
FCVTPS (scalar, S to S) 1->2 3.145810
FCVTPS (scalar, S to S) throughput 0.575828
FCVTPS (scalar, S to S) 1->2 3.146623
FCVTPS (scalar, S to S) throughput 0.502160
FCVTPS (scalar, D to D) 1->2 3.056686
FCVTPS (scalar, D to D) throughput 0.505396
FCVTPS (scalar, D to D) 1->2 3.215751
FCVTPS (scalar, D to D) throughput 0.591519
FCVTPS (vector, 2S) 1->2 3.006195
FCVTPS (vector, 2S) throughput 0.568417
FCVTPS (vector, 4S) 1->2 2.995118
FCVTPS (vector, 4S) throughput 0.495055
FCVTPS (vector, 2D) 1->2 2.999836
FCVTPS (vector, 2D) throughput 0.819517
FCVTPU (scalar, S to 32-bit GPR) 1->2 roundtrip 10.446611
FCVTPU (scalar, S to 32-bit GPR) throughput 0.913548
FCVTPU (scalar, S to 64-bit GPR) 1->2 roundtrip 9.523916
FCVTPU (scalar, S to 64-bit GPR) throughput 1.070175
FCVTPU (scalar, D to 32-bit GPR) 1->2 roundtrip 11.426467
FCVTPU (scalar, D to 32-bit GPR) throughput 1.037729
FCVTPU (scalar, D to 64-bit GPR) 1->2 roundtrip 10.584542
FCVTPU (scalar, D to 64-bit GPR) throughput 1.070458
FCVTPU (scalar, S to S) 1->2 3.041627
FCVTPU (scalar, S to S) throughput 0.527059
FCVTPU (scalar, S to S) 1->2 2.829353
FCVTPU (scalar, S to S) throughput 0.506241
FCVTPU (scalar, D to D) 1->2 3.523051
FCVTPU (scalar, D to D) throughput 0.480371
FCVTPU (scalar, D to D) 1->2 3.007856
FCVTPU (scalar, D to D) throughput 0.526266
FCVTPU (vector, 2S) 1->2 2.711960
FCVTPU (vector, 2S) throughput 0.466104
FCVTPU (vector, 4S) 1->2 2.925887
FCVTPU (vector, 4S) throughput 0.568106
FCVTPU (vector, 2D) 1->2 3.004981
FCVTPU (vector, 2D) throughput 0.491573
FCVTXN (scalar, D) 1->2 3.041785
FCVTXN (scalar, D) throughput 0.500409
FCVTXN (vector, 2D) 1->2 3.009921
FCVTXN (vector, 2D) throughput 0.513317
FCVTXN2 (vector, 2D) 1->1 3.064671
FCVTXN2 (vector, 2D) 1->2 3.527573
FCVTXN2 (vector, 2D) throughput 0.438883
FCVTXN2 (vector, 2D) throughput 0.521689
FCVTZS (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 10.041067
FCVTZS (scalar, integer, S to 32-bit GPR) throughput 1.039943
FCVTZS (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 13.757289
FCVTZS (scalar, integer, S to 64-bit GPR) throughput 1.749955
FCVTZS (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 10.774294
FCVTZS (scalar, integer, D to 32-bit GPR) throughput 1.056831
FCVTZS (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 10.506338
FCVTZS (scalar, integer, D to 64-bit GPR) throughput 1.103998
FCVTZS (scalar, integer, S to S) 1->2 2.999879
FCVTZS (scalar, integer, S to S) throughput 0.583946
FCVTZS (scalar, integer, S to S) 1->2 3.072139
FCVTZS (scalar, integer, S to S) throughput 0.626512
FCVTZS (scalar, integer, D to D) 1->2 3.119773
FCVTZS (scalar, integer, D to D) throughput 0.453751
FCVTZS (scalar, integer, D to D) 1->2 2.892484
FCVTZS (scalar, integer, D to D) throughput 0.601942
FCVTZS (vector, integer, 2S) 1->2 3.007290
FCVTZS (vector, integer, 2S) throughput 0.373931
FCVTZS (vector, integer, 4S) 1->2 3.387503
FCVTZS (vector, integer, 4S) throughput 0.498858
FCVTZS (vector, integer, 2D) 1->2 3.004697
FCVTZS (vector, integer, 2D) throughput 0.499621
FCVTZS (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 10.398485
FCVTZS (scalar, fixed-point, S to 32-bit GPR) throughput 1.005020
FCVTZS (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.056076
FCVTZS (scalar, fixed-point, S to 64-bit GPR) throughput 1.006716
FCVTZS (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 10.816793
FCVTZS (scalar, fixed-point, D to 32-bit GPR) throughput 1.008710
FCVTZS (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 10.099570
FCVTZS (scalar, fixed-point, D to 64-bit GPR) throughput 1.002450
FCVTZS (scalar, fixed-point, S to S) 1->2 3.006682
FCVTZS (scalar, fixed-point, S to S) throughput 0.503361
FCVTZS (scalar, fixed-point, S to S) 1->2 3.004332
FCVTZS (scalar, fixed-point, S to S) throughput 0.501524
FCVTZS (scalar, fixed-point, D to D) 1->2 3.004171
FCVTZS (scalar, fixed-point, D to D) throughput 0.512814
FCVTZS (scalar, fixed-point, D to D) 1->2 2.985498
FCVTZS (scalar, fixed-point, D to D) throughput 0.414732
FCVTZS (vector, fixed-point, 2S) 1->2 3.020926
FCVTZS (vector, fixed-point, 2S) throughput 0.559360
FCVTZS (vector, fixed-point, 4S) 1->2 5.346975
FCVTZS (vector, fixed-point, 4S) throughput 0.498832
FCVTZS (vector, fixed-point, 2D) 1->2 3.007552
FCVTZS (vector, fixed-point, 2D) throughput 0.495271
FCVTZU (scalar, integer, S to 32-bit GPR) 1->2 roundtrip 10.145493
FCVTZU (scalar, integer, S to 32-bit GPR) throughput 1.034596
FCVTZU (scalar, integer, S to 64-bit GPR) 1->2 roundtrip 11.097452
FCVTZU (scalar, integer, S to 64-bit GPR) throughput 1.060496
FCVTZU (scalar, integer, D to 32-bit GPR) 1->2 roundtrip 12.247709
FCVTZU (scalar, integer, D to 32-bit GPR) throughput 1.035967
FCVTZU (scalar, integer, D to 64-bit GPR) 1->2 roundtrip 11.609988
FCVTZU (scalar, integer, D to 64-bit GPR) throughput 0.976461
FCVTZU (scalar, integer, S to S) 1->2 3.016074
FCVTZU (scalar, integer, S to S) throughput 0.466359
FCVTZU (scalar, integer, S to S) 1->2 3.004079
FCVTZU (scalar, integer, S to S) throughput 0.498811
FCVTZU (scalar, integer, D to D) 1->2 3.009625
FCVTZU (scalar, integer, D to D) throughput 0.459779
FCVTZU (scalar, integer, D to D) 1->2 3.222534
FCVTZU (scalar, integer, D to D) throughput 0.546707
FCVTZU (vector, integer, 2S) 1->2 3.676983
FCVTZU (vector, integer, 2S) throughput 0.572760
FCVTZU (vector, integer, 4S) 1->2 3.831612
FCVTZU (vector, integer, 4S) throughput 0.547168
FCVTZU (vector, integer, 2D) 1->2 3.278282
FCVTZU (vector, integer, 2D) throughput 0.502514
FCVTZU (scalar, fixed-point, S to 32-bit GPR) 1->2 roundtrip 8.421535
FCVTZU (scalar, fixed-point, S to 32-bit GPR) throughput 1.001660
FCVTZU (scalar, fixed-point, S to 64-bit GPR) 1->2 roundtrip 10.507977
FCVTZU (scalar, fixed-point, S to 64-bit GPR) throughput 1.019138
FCVTZU (scalar, fixed-point, D to 32-bit GPR) 1->2 roundtrip 10.230273
FCVTZU (scalar, fixed-point, D to 32-bit GPR) throughput 0.912596
FCVTZU (scalar, fixed-point, D to 64-bit GPR) 1->2 roundtrip 12.176381
FCVTZU (scalar, fixed-point, D to 64-bit GPR) throughput 1.092583
FCVTZU (scalar, fixed-point, S to S) 1->2 3.238013
FCVTZU (scalar, fixed-point, S to S) throughput 0.550197
FCVTZU (scalar, fixed-point, S to S) 1->2 2.951210
FCVTZU (scalar, fixed-point, S to S) throughput 0.500965
FCVTZU (scalar, fixed-point, D to D) 1->2 3.234381
FCVTZU (scalar, fixed-point, D to D) throughput 0.506615
FCVTZU (scalar, fixed-point, D to D) 1->2 3.012834
FCVTZU (scalar, fixed-point, D to D) throughput 0.377403
FCVTZU (vector, fixed-point, 2S) 1->2 3.011905
FCVTZU (vector, fixed-point, 2S) throughput 0.568191
FCVTZU (vector, fixed-point, 4S) 1->2 2.986890
FCVTZU (vector, fixed-point, 4S) throughput 0.426544
FCVTZU (vector, fixed-point, 2D) 1->2 2.998424
FCVTZU (vector, fixed-point, 2D) throughput 0.604811
FDIV (vector, 2S) 1->2 8.524710
FDIV (vector, 2S) 1->3 8.484255
FDIV (vector, 2S) throughput 1.042281
FDIV (vector, 4S) 1->2 8.820111
FDIV (vector, 4S) 1->3 11.110762
FDIV (vector, 4S) throughput 1.584961
FDIV (vector, 2D) 1->2 11.640530
FDIV (vector, 2D) 1->3 11.415876
FDIV (vector, 2D) throughput 2.203663
FDIV (scalar, S) 1->2 8.402121
FDIV (scalar, S) 1->3 8.582267
FDIV (scalar, S) throughput 1.072185
FDIV (scalar, D) 1->2 10.786627
FDIV (scalar, D) 1->3 8.159040
FDIV (scalar, D) throughput 1.001098
FMADD (scalar, S) 1->2 4.003807
FMADD (scalar, S) 1->3 4.005183
FMADD (scalar, S) 1->4 4.005709
FMADD (scalar, S) throughput 0.504211
FMADD (scalar, D) 1->2 4.009919
FMADD (scalar, D) 1->3 4.051048
FMADD (scalar, D) 1->4 3.995396
FMADD (scalar, D) throughput 0.499899
FMAX (vector, 2S) 1->2 1.989215
FMAX (vector, 2S) 1->3 1.996366
FMAX (vector, 2S) throughput 0.503665
FMAX (vector, 4S) 1->2 2.001537
FMAX (vector, 4S) 1->3 2.010570
FMAX (vector, 4S) throughput 0.500496
FMAX (vector, 2D) 1->2 2.001564
FMAX (vector, 2D) 1->3 2.025912
FMAX (vector, 2D) throughput 0.649603
FMAX (scalar, S) 1->2 2.005848
FMAX (scalar, S) 1->3 2.005427
FMAX (scalar, S) throughput 0.537431
FMAX (scalar, D) 1->2 1.774451
FMAX (scalar, D) 1->3 2.105003
FMAX (scalar, D) throughput 0.618553
FMAXNM (vector, 2S) 1->2 1.992366
FMAXNM (vector, 2S) 1->3 6.331647
FMAXNM (vector, 2S) throughput 0.489815
FMAXNM (vector, 4S) 1->2 1.804837
FMAXNM (vector, 4S) 1->3 2.040985
FMAXNM (vector, 4S) throughput 0.564068
FMAXNM (vector, 2D) 1->2 2.419328
FMAXNM (vector, 2D) 1->3 2.105843
FMAXNM (vector, 2D) throughput 0.885836
FMAXNM (scalar, S) 1->2 5.322586
FMAXNM (scalar, S) 1->3 2.043239
FMAXNM (scalar, S) throughput 0.679429
FMAXNM (scalar, D) 1->2 2.160803
FMAXNM (scalar, D) 1->3 2.849304
FMAXNM (scalar, D) throughput 0.922383
FMAXNMP (scalar) 1->2 0.576800
FMAXNMP (scalar) throughput 0.499077
FMAXNMP (vector, 2S) 1->2 1.999655
FMAXNMP (vector, 2S) 1->3 1.966016
FMAXNMP (vector, 2S) throughput 0.509083
FMAXNMP (vector, 4S) 1->2 2.003527
FMAXNMP (vector, 4S) 1->3 2.023556
FMAXNMP (vector, 4S) throughput 0.721581
FMAXNMP (vector, 2D) 1->2 2.360014
FMAXNMP (vector, 2D) 1->3 1.996395
FMAXNMP (vector, 2D) throughput 0.500226
FMAXNMV (4S) 1->2 3.010154
FMAXNMV (4S) throughput 0.550183
FMAXP (scalar) 1->2 2.223625
FMAXP (scalar) throughput 0.516044
FMAXP (vector, 2S) 1->2 2.085603
FMAXP (vector, 2S) 1->3 1.996345
FMAXP (vector, 2S) throughput 0.568192
FMAXP (vector, 4S) 1->2 1.998261
FMAXP (vector, 4S) 1->3 2.001661
FMAXP (vector, 4S) throughput 0.507957
FMAXP (vector, 2D) 1->2 2.019602
FMAXP (vector, 2D) 1->3 3.541763
FMAXP (vector, 2D) throughput 0.542707
FMAXV (4S) 1->2 3.079730
FMAXV (4S) throughput 0.537902
FMIN (vector, 2S) 1->2 2.004649
FMIN (vector, 2S) 1->3 2.001660
FMIN (vector, 2S) throughput 0.528470
FMIN (vector, 4S) 1->2 1.859301
FMIN (vector, 4S) 1->3 1.996568
FMIN (vector, 4S) throughput 0.559163
FMIN (vector, 2D) 1->2 2.042478
FMIN (vector, 2D) 1->3 1.985137
FMIN (vector, 2D) throughput 0.527988
FMIN (scalar, S) 1->2 2.001660
FMIN (scalar, S) 1->3 1.893246
FMIN (scalar, S) throughput 0.499292
FMIN (scalar, D) 1->2 1.938327
FMIN (scalar, D) 1->3 2.014141
FMIN (scalar, D) throughput 0.552703
FMINNM (vector, 2S) 1->2 1.998302
FMINNM (vector, 2S) 1->3 2.002712
FMINNM (vector, 2S) throughput 0.500273
FMINNM (vector, 4S) 1->2 2.235015
FMINNM (vector, 4S) 1->3 1.991421
FMINNM (vector, 4S) throughput 0.443074
FMINNM (vector, 2D) 1->2 2.108962
FMINNM (vector, 2D) 1->3 2.025632
FMINNM (vector, 2D) throughput 0.508916
FMINNM (scalar, S) 1->2 2.596537
FMINNM (scalar, S) 1->3 1.922710
FMINNM (scalar, S) throughput 0.547823
FMINNM (scalar, D) 1->2 2.124868
FMINNM (scalar, D) 1->3 1.952563
FMINNM (scalar, D) throughput 0.569998
FMINNMP (scalar) 1->2 2.042396
FMINNMP (scalar) throughput 0.582984
FMINNMP (vector, 2S) 1->2 2.001051
FMINNMP (vector, 2S) 1->3 2.002632
FMINNMP (vector, 2S) throughput 0.552115
FMINNMP (vector, 4S) 1->2 1.948231
FMINNMP (vector, 4S) 1->3 2.001200
FMINNMP (vector, 4S) throughput 0.552306
FMINNMP (vector, 2D) 1->2 2.001619
FMINNMP (vector, 2D) 1->3 2.284364
FMINNMP (vector, 2D) throughput 0.520191
FMINNMV (4S) 1->2 3.261729
FMINNMV (4S) throughput 0.530006
FMINP (scalar) 1->2 2.112255
FMINP (scalar) throughput 0.534266
FMINP (vector, 2S) 1->2 2.009556
FMINP (vector, 2S) 1->3 1.966754
FMINP (vector, 2S) throughput 0.536985
FMINP (vector, 4S) 1->2 1.990420
FMINP (vector, 4S) 1->3 2.237802
FMINP (vector, 4S) throughput 0.663710
FMINP (vector, 2D) 1->2 1.983723
FMINP (vector, 2D) 1->3 2.004440
FMINP (vector, 2D) throughput 0.536236
FMINV (4S) 1->2 3.236783
FMINV (4S) throughput 0.471801
FMLA (by element, S) 1->1 4.510554
FMLA (by element, S) 1->2 4.481910
FMLA (by element, S) 1->3 4.062277
FMLA (by element, S) throughput 0.582743
FMLA (by element, S) throughput 0.529383
FMLA (by element, D) 1->1 4.530256
FMLA (by element, D) 1->2 4.094104
FMLA (by element, D) 1->3 3.994275
FMLA (by element, D) throughput 0.540946
FMLA (by element, D) throughput 0.498731
FMLA (by element, 2S) 1->1 4.859091
FMLA (by element, 2S) 1->2 4.123458
FMLA (by element, 2S) 1->3 3.212912
FMLA (by element, 2S) throughput 0.500852
FMLA (by element, 2S) throughput 0.524038
FMLA (by element, 4S) 1->1 4.158025
FMLA (by element, 4S) 1->2 4.016117
FMLA (by element, 4S) 1->3 4.487265
FMLA (by element, 4S) throughput 0.502025
FMLA (by element, 4S) throughput 0.501211
FMLA (by element, 2D) 1->1 4.317128
FMLA (by element, 2D) 1->2 4.020326
FMLA (by element, 2D) 1->3 4.189396
FMLA (by element, 2D) throughput 0.541501
FMLA (by element, 2D) throughput 0.466347
FMLA (vector, 2S) 1->1 4.276643
FMLA (vector, 2S) 1->2 4.325360
FMLA (vector, 2S) 1->3 4.652830
FMLA (vector, 2S) throughput 0.500935
FMLA (vector, 2S) throughput 0.528830
FMLA (vector, 4S) 1->1 4.569606
FMLA (vector, 4S) 1->2 4.003032
FMLA (vector, 4S) 1->3 3.997783
FMLA (vector, 4S) throughput 0.563335
FMLA (vector, 4S) throughput 0.532048
FMLA (vector, 2D) 1->1 3.391062
FMLA (vector, 2D) 1->2 4.005708
FMLA (vector, 2D) 1->3 4.572441
FMLA (vector, 2D) throughput 0.563265
FMLA (vector, 2D) throughput 0.560861
FMLS (by element, S) 1->1 4.605425
FMLS (by element, S) 1->2 3.092525
FMLS (by element, S) 1->3 4.012552
FMLS (by element, S) throughput 0.543858
FMLS (by element, S) throughput 0.558041
FMLS (by element, D) 1->1 5.271855
FMLS (by element, D) 1->2 4.060299
FMLS (by element, D) 1->3 4.334880
FMLS (by element, D) throughput 0.501913
FMLS (by element, D) throughput 0.522991
FMLS (by element, 2S) 1->1 4.648784
FMLS (by element, 2S) 1->2 4.029255
FMLS (by element, 2S) 1->3 3.655245
FMLS (by element, 2S) throughput 0.927654
FMLS (by element, 2S) throughput 0.687133
FMLS (by element, 4S) 1->1 3.863717
FMLS (by element, 4S) 1->2 4.357051
FMLS (by element, 4S) 1->3 3.688806
FMLS (by element, 4S) throughput 0.523209
FMLS (by element, 4S) throughput 0.531783
FMLS (by element, 2D) 1->1 4.364202
FMLS (by element, 2D) 1->2 4.489878
FMLS (by element, 2D) 1->3 3.998704
FMLS (by element, 2D) throughput 0.504997
FMLS (by element, 2D) throughput 0.506586
FMLS (vector, 2S) 1->1 4.252711
FMLS (vector, 2S) 1->2 3.949617
FMLS (vector, 2S) 1->3 4.765188
FMLS (vector, 2S) throughput 0.500541
FMLS (vector, 2S) throughput 0.541270
FMLS (vector, 4S) 1->1 4.011409
FMLS (vector, 4S) 1->2 4.270038
FMLS (vector, 4S) 1->3 4.024905
FMLS (vector, 4S) throughput 0.534261
FMLS (vector, 4S) throughput 0.536869
FMLS (vector, 2D) 1->1 4.005548
FMLS (vector, 2D) 1->2 3.482984
FMLS (vector, 2D) 1->3 4.002141
FMLS (vector, 2D) throughput 0.502889
FMLS (vector, 2D) throughput 0.509518
FMOV (S to 32-bit GPR) 1->2 roundtrip 7.262409
FMOV (S to 32-bit GPR) throughput 1.138684
FMOV (D to 64-bit GPR) 1->2 roundtrip 7.532818
FMOV (D to 64-bit GPR) throughput 1.114386
FMOV (D[1] to 64-bit GPR) 1->2 roundtrip 7.387935
FMOV (D[1] to 64-bit GPR) throughput 1.098989
FMOV (S from 32-bit GPR) 1->2 roundtrip 7.336815
FMOV (S from 32-bit GPR) throughput 0.550816
FMOV (D from 64-bit GPR) 1->2 roundtrip 7.315407
FMOV (D from 64-bit GPR) throughput 0.535606
FMOV (D[1] from 64-bit GPR) 1->2 roundtrip 9.993927
FMOV (D[1] from 64-bit GPR) throughput 0.532862
FMSUB (scalar, S) 1->2 4.882739
FMSUB (scalar, S) 1->3 4.128418
FMSUB (scalar, S) 1->4 4.246758
FMSUB (scalar, S) throughput 0.531251
FMSUB (scalar, D) 1->2 4.327763
FMSUB (scalar, D) 1->3 4.469558
FMSUB (scalar, D) 1->4 4.775833
FMSUB (scalar, D) throughput 0.520588
FMUL (by element, 2S) 1->2 4.186752
FMUL (by element, 2S) 1->3 4.001174
FMUL (by element, 2S) throughput 0.437302
FMUL (by element, 4S) 1->2 4.127683
FMUL (by element, 4S) 1->3 4.001418
FMUL (by element, 4S) throughput 0.549621
FMUL (by element, 2D) 1->2 4.036038
FMUL (by element, 2D) 1->3 4.007074
FMUL (by element, 2D) throughput 0.649999
FMUL (vector, 2S) 1->2 4.045654
FMUL (vector, 2S) 1->3 4.262703
FMUL (vector, 2S) throughput 0.498826
FMUL (vector, 4S) 1->2 4.905140
FMUL (vector, 4S) 1->3 4.243440
FMUL (vector, 4S) throughput 0.500774
FMUL (vector, 2D) 1->2 4.002231
FMUL (vector, 2D) 1->3 4.000685
FMUL (vector, 2D) throughput 0.531302
FMUL (scalar, S) 1->2 4.107715
FMUL (scalar, S) 1->3 4.478802
FMUL (scalar, S) throughput 0.535267
FMUL (scalar, D) 1->2 4.002019
FMUL (scalar, D) 1->3 4.081938
FMUL (scalar, D) throughput 0.511456
FMULX (vector, 2S) 1->2 4.075666
FMULX (vector, 2S) 1->3 4.104338
FMULX (vector, 2S) throughput 0.617016
FMULX (vector, 4S) 1->2 4.052475
FMULX (vector, 4S) 1->3 4.568221
FMULX (vector, 4S) throughput 0.586538
FMULX (vector, 2D) 1->2 4.291056
FMULX (vector, 2D) 1->3 4.038773
FMULX (vector, 2D) throughput 0.501289
FMULX (scalar, S) 1->2 4.525187
FMULX (scalar, S) 1->3 5.017153
FMULX (scalar, S) throughput 0.528340
FMULX (scalar, D) 1->2 4.084123
FMULX (scalar, D) 1->3 4.098681
FMULX (scalar, D) throughput 0.507264
FMULX (by element, 2S) 1->2 4.200224
FMULX (by element, 2S) 1->3 4.011031
FMULX (by element, 2S) throughput 0.569895
FMULX (by element, 4S) 1->2 4.537691
FMULX (by element, 4S) 1->3 4.338192
FMULX (by element, 4S) throughput 0.566259
FMULX (by element, 2D) 1->2 4.499638
FMULX (by element, 2D) 1->3 4.005844
FMULX (by element, 2D) throughput 0.536081
FNEG (scalar, S) 1->2 2.013848
FNEG (scalar, S) throughput 0.533214
FNEG (scalar, D) 1->2 2.005183
FNEG (scalar, D) throughput 0.557990
FNEG (vector, 2S) 1->2 2.008787
FNEG (vector, 2S) throughput 0.782475
FNEG (vector, 4S) 1->2 2.013588
FNEG (vector, 4S) throughput 0.490816
FNEG (vector, 2D) 1->2 2.002299
FNEG (vector, 2D) throughput 0.603482
FNMADD (scalar, S) 1->2 3.502479
FNMADD (scalar, S) 1->3 4.642398
FNMADD (scalar, S) 1->4 4.241711
FNMADD (scalar, S) throughput 0.580311
FNMADD (scalar, D) 1->2 4.304259
FNMADD (scalar, D) 1->3 4.088608
FNMADD (scalar, D) 1->4 3.949732
FNMADD (scalar, D) throughput 0.571812
FNMSUB (scalar, S) 1->2 6.207944
FNMSUB (scalar, S) 1->3 4.475177
FNMSUB (scalar, S) 1->4 4.184392
FNMSUB (scalar, S) throughput 0.509574
FNMSUB (scalar, D) 1->2 4.173437
FNMSUB (scalar, D) 1->3 3.948129
FNMSUB (scalar, D) 1->4 4.049152
FNMSUB (scalar, D) throughput 0.511223
FNMUL (scalar, S) 1->2 4.283304
FNMUL (scalar, S) 1->3 3.865035
FNMUL (scalar, S) throughput 0.497975
FNMUL (scalar, D) 1->2 4.072647
FNMUL (scalar, D) 1->3 4.518981
FNMUL (scalar, D) throughput 0.500854
FRECPE (scalar, S) 1->2 3.032585
FRECPE (scalar, S) throughput 1.057892
FRECPE (scalar, D) 1->2 2.646545
FRECPE (scalar, D) throughput 1.041507
FRECPE (vector, 2S) 1->2 3.541906
FRECPE (vector, 2S) throughput 1.152230
FRECPE (vector, 4S) 1->2 4.388301
FRECPE (vector, 4S) throughput 2.123917
FRECPE (vector, 2D) 1->2 4.826315
FRECPE (vector, 2D) throughput 1.762349
FRECPS (vector, 2S) 1->2 3.871055
FRECPS (vector, 2S) 1->3 4.210768
FRECPS (vector, 2S) throughput 0.499793
FRECPS (vector, 4S) 1->2 3.732778
FRECPS (vector, 4S) 1->3 4.434309
FRECPS (vector, 4S) throughput 0.532724
FRECPS (vector, 2D) 1->2 4.123998
FRECPS (vector, 2D) 1->3 5.038989
FRECPS (vector, 2D) throughput 0.590944
FRECPS (scalar, S) 1->2 3.956695
FRECPS (scalar, S) 1->3 3.521623
FRECPS (scalar, S) throughput 0.570431
FRECPS (scalar, D) 1->2 2.521919
FRECPS (scalar, D) 1->3 4.282170
FRECPS (scalar, D) throughput 0.609473
FRECPX (scalar, S) 1->2 2.815102
FRECPX (scalar, S) throughput 1.049834
FRECPX (scalar, D) 1->2 2.996078
FRECPX (scalar, D) throughput 1.052182
FRINTA (scalar, S) 1->2 3.169889
FRINTA (scalar, S) throughput 0.580934
FRINTA (scalar, D) 1->2 3.003875
FRINTA (scalar, D) throughput 0.510362
FRINTA (vector, 2S) 1->2 3.268979
FRINTA (vector, 2S) throughput 0.508253
FRINTA (vector, 4S) 1->2 3.019030
FRINTA (vector, 4S) throughput 0.562411
FRINTA (vector, 2D) 1->2 3.047137
FRINTA (vector, 2D) throughput 0.658554
FRINTI (scalar, S) 1->2 3.028459
FRINTI (scalar, S) throughput 0.548685
FRINTI (scalar, D) 1->2 2.996289
FRINTI (scalar, D) throughput 0.500429
FRINTI (vector, 2S) 1->2 3.554761
FRINTI (vector, 2S) throughput 0.505823
FRINTI (vector, 4S) 1->2 3.005918
FRINTI (vector, 4S) throughput 0.471539
FRINTI (vector, 2D) 1->2 3.265465
FRINTI (vector, 2D) throughput 0.539730
FRINTM (scalar, S) 1->2 3.561399
FRINTM (scalar, S) throughput 0.411683
FRINTM (scalar, D) 1->2 3.158083
FRINTM (scalar, D) throughput 0.576857
FRINTM (vector, 2S) 1->2 3.008568
FRINTM (vector, 2S) throughput 0.592108
FRINTM (vector, 4S) 1->2 2.997332
FRINTM (vector, 4S) throughput 0.573206
FRINTM (vector, 2D) 1->2 2.955198
FRINTM (vector, 2D) throughput 0.535116
FRINTN (scalar, S) 1->2 3.231363
FRINTN (scalar, S) throughput 0.506383
FRINTN (scalar, D) 1->2 3.094444
FRINTN (scalar, D) throughput 0.503457
FRINTN (vector, 2S) 1->2 3.611484
FRINTN (vector, 2S) throughput 0.476992
FRINTN (vector, 4S) 1->2 3.009720
FRINTN (vector, 4S) throughput 0.522611
FRINTN (vector, 2D) 1->2 2.374987
FRINTN (vector, 2D) throughput 0.503199
FRINTP (scalar, S) 1->2 3.008043
FRINTP (scalar, S) throughput 0.505219
FRINTP (scalar, D) 1->2 3.073819
FRINTP (scalar, D) throughput 0.545873
FRINTP (vector, 2S) 1->2 2.974266
FRINTP (vector, 2S) throughput 0.589942
FRINTP (vector, 4S) 1->2 2.980045
FRINTP (vector, 4S) throughput 0.583509
FRINTP (vector, 2D) 1->2 4.459906
FRINTP (vector, 2D) throughput 0.516048
FRINTX (scalar, S) 1->2 3.010046
FRINTX (scalar, S) throughput 0.526302
FRINTX (scalar, D) 1->2 3.064139
FRINTX (scalar, D) throughput 0.560471
FRINTX (vector, 2S) 1->2 3.004931
FRINTX (vector, 2S) throughput 0.580727
FRINTX (vector, 4S) 1->2 2.825430
FRINTX (vector, 4S) throughput 0.576709
FRINTX (vector, 2D) 1->2 3.191691
FRINTX (vector, 2D) throughput 0.582464
FRINTZ (scalar, S) 1->2 2.999111
FRINTZ (scalar, S) throughput 0.574588
FRINTZ (scalar, D) 1->2 3.001292
FRINTZ (scalar, D) throughput 0.501257
FRINTZ (vector, 2S) 1->2 3.220337
FRINTZ (vector, 2S) throughput 0.491193
FRINTZ (vector, 4S) 1->2 3.759242
FRINTZ (vector, 4S) throughput 0.503341
FRINTZ (vector, 2D) 1->2 3.917434
FRINTZ (vector, 2D) throughput 0.562542
FRSQRTE (scalar, S) 1->2 3.022026
FRSQRTE (scalar, S) throughput 1.093789
FRSQRTE (scalar, D) 1->2 2.882875
FRSQRTE (scalar, D) throughput 1.217684
FRSQRTE (vector, 2S) 1->2 3.706096
FRSQRTE (vector, 2S) throughput 1.005127
FRSQRTE (vector, 4S) 1->2 3.121202
FRSQRTE (vector, 4S) throughput 2.319847
FRSQRTE (vector, 2D) 1->2 4.861275
FRSQRTE (vector, 2D) throughput 2.019493
FRSQRTS (vector, 2S) 1->2 3.996850
FRSQRTS (vector, 2S) 1->3 4.006966
FRSQRTS (vector, 2S) throughput 0.505021
FRSQRTS (vector, 4S) 1->2 3.999151
FRSQRTS (vector, 4S) 1->3 4.044041
FRSQRTS (vector, 4S) throughput 0.500739
FRSQRTS (vector, 2D) 1->2 3.999272
FRSQRTS (vector, 2D) 1->3 3.998989
FRSQRTS (vector, 2D) throughput 0.500142
FRSQRTS (scalar, S) 1->2 3.987286
FRSQRTS (scalar, S) 1->3 4.005225
FRSQRTS (scalar, S) throughput 0.500627
FRSQRTS (scalar, D) 1->2 4.038138
FRSQRTS (scalar, D) 1->3 4.005337
FRSQRTS (scalar, D) throughput 0.500820
FSQRT (scalar, S) 1->2 10.082928
FSQRT (scalar, S) throughput 2.015929
FSQRT (scalar, D) 1->2 13.037594
FSQRT (scalar, D) throughput 2.232481
FSQRT (vector, 2S) 1->2 9.151747
FSQRT (vector, 2S) throughput 2.009602
FSQRT (vector, 4S) 1->2 12.848573
FSQRT (vector, 4S) throughput 3.237254
FSQRT (vector, 2D) 1->2 15.225853
FSQRT (vector, 2D) throughput 4.182419
FSUB (vector, 2S) 1->2 2.997502
FSUB (vector, 2S) 1->3 5.328179
FSUB (vector, 2S) throughput 0.537653
FSUB (vector, 4S) 1->2 3.532436
FSUB (vector, 4S) 1->3 3.004130
FSUB (vector, 4S) throughput 0.499550
FSUB (vector, 2D) 1->2 3.002221
FSUB (vector, 2D) 1->3 2.993416
FSUB (vector, 2D) throughput 0.501114
FSUB (scalar, S) 1->2 3.010752
FSUB (scalar, S) 1->3 3.188527
FSUB (scalar, S) throughput 0.506147
FSUB (scalar, D) 1->2 3.004645
FSUB (scalar, D) 1->3 3.022191
FSUB (scalar, D) throughput 0.460967
INS (element, B) 1->1 1.988376
INS (element, B) 1->2 2.000774
INS (element, B) throughput 0.508113
INS (element, B) throughput 0.489807
INS (element, H) 1->1 2.008252
INS (element, H) 1->2 1.909536
INS (element, H) throughput 0.503999
INS (element, H) throughput 0.504581
INS (element, S) 1->1 1.981414
INS (element, S) 1->2 2.001459
INS (element, S) throughput 0.500328
INS (element, S) throughput 0.501240
INS (element, D) 1->1 2.002874
INS (element, D) 1->2 2.002227
INS (element, D) throughput 0.519410
INS (element, D) throughput 0.521239
INS (general, B) 1->1 1.524256
INS (general, B) 1->2 roundtrip 9.034778
INS (general, B) throughput 0.757842
INS (general, H) 1->1 1.958713
INS (general, H) 1->2 roundtrip 9.091431
INS (general, H) throughput 0.753812
INS (general, S) 1->1 2.002261
INS (general, S) 1->2 roundtrip 9.011616
INS (general, S) throughput 0.761611
INS (general, D) 1->1 1.987297
INS (general, D) 1->2 roundtrip 9.160768
INS (general, D) throughput 0.750141
LD1 (multiple, one register, 8B) 1->2 roundtrip 7.496864
LD1 (multiple, one register, 8B) throughput 0.492584
LD1 (multiple, one register, 16B) 1->2 roundtrip 6.905282
LD1 (multiple, one register, 16B) throughput 0.500638
LD1 (multiple, one register, 4H) 1->2 roundtrip 7.037339
LD1 (multiple, one register, 4H) throughput 0.501437
LD1 (multiple, one register, 8H) 1->2 roundtrip 7.033447
LD1 (multiple, one register, 8H) throughput 0.499682
LD1 (multiple, one register, 2S) 1->2 roundtrip 6.995109
LD1 (multiple, one register, 2S) throughput 0.456356
LD1 (multiple, one register, 4S) 1->2 roundtrip 7.020445
LD1 (multiple, one register, 4S) throughput 0.501930
LD1 (multiple, one register, 1D) 1->2 roundtrip 7.064167
LD1 (multiple, one register, 1D) throughput 0.501518
LD1 (multiple, one register, 2D) 1->2 roundtrip 7.034202
LD1 (multiple, one register, 2D) throughput 0.500762
LD1 (multiple, two registers, 8B) 1->3 roundtrip 7.011493
LD1 (multiple, two registers, 8B) 2->3 roundtrip 7.186841
LD1 (multiple, two registers, 8B) throughput 0.999586
LD1 (multiple, two registers, 16B) 1->3 roundtrip 7.120672
LD1 (multiple, two registers, 16B) 2->3 roundtrip 7.392044
LD1 (multiple, two registers, 16B) throughput 1.090149
LD1 (multiple, two registers, 4H) 1->3 roundtrip 7.154723
LD1 (multiple, two registers, 4H) 2->3 roundtrip 7.095731
LD1 (multiple, two registers, 4H) throughput 1.001407
LD1 (multiple, two registers, 8H) 1->3 roundtrip 7.077802
LD1 (multiple, two registers, 8H) 2->3 roundtrip 7.102367
LD1 (multiple, two registers, 8H) throughput 0.999379
LD1 (multiple, two registers, 2S) 1->3 roundtrip 8.869814
LD1 (multiple, two registers, 2S) 2->3 roundtrip 6.627348
LD1 (multiple, two registers, 2S) throughput 1.001687
LD1 (multiple, two registers, 4S) 1->3 roundtrip 7.089131
LD1 (multiple, two registers, 4S) 2->3 roundtrip 7.070537
LD1 (multiple, two registers, 4S) throughput 1.015609
LD1 (multiple, two registers, 1D) 1->3 roundtrip 6.990187
LD1 (multiple, two registers, 1D) 2->3 roundtrip 7.239993
LD1 (multiple, two registers, 1D) throughput 1.002241
LD1 (multiple, two registers, 2D) 1->3 roundtrip 7.013282
LD1 (multiple, two registers, 2D) 2->3 roundtrip 8.550091
LD1 (multiple, two registers, 2D) throughput 1.034062
LD1 (multiple, three registers, 8B) throughput 1.517993
LD1 (multiple, three registers, 16B) throughput 1.510336
LD1 (multiple, three registers, 4H) throughput 1.511464
LD1 (multiple, three registers, 8H) throughput 1.502121
LD1 (multiple, three registers, 2S) throughput 1.518675
LD1 (multiple, three registers, 4S) throughput 1.508180
LD1 (multiple, three registers, 1D) throughput 1.503664
LD1 (multiple, three registers, 2D) throughput 1.515746
LD1 (multiple, four registers, 8B) throughput 1.902273
LD1 (multiple, four registers, 16B) throughput 2.024473
LD1 (multiple, four registers, 4H) throughput 2.030051
LD1 (multiple, four registers, 8H) throughput 2.073090
LD1 (multiple, four registers, 2S) throughput 2.034621
LD1 (multiple, four registers, 4S) throughput 2.031685
LD1 (multiple, four registers, 1D) throughput 2.018009
LD1 (multiple, four registers, 2D) throughput 2.003928
LD1 (single structure, B) 1->2 roundtrip 9.237689
LD1 (single structure, B) throughput 2.013528
LD1 (single structure, H) 1->2 roundtrip 9.105559
LD1 (single structure, H) throughput 2.047518
LD1 (single structure, S) 1->2 roundtrip 9.015028
LD1 (single structure, S) throughput 2.010168
LD1 (single structure, D) 1->2 roundtrip 8.175292
LD1 (single structure, D) throughput 2.252838
LD1R (8B) 1->2 roundtrip 9.032480
LD1R (8B) throughput 0.507917
LD1R (16B) 1->2 roundtrip 8.973735
LD1R (16B) throughput 0.502272
LD1R (4H) 1->2 roundtrip 9.197005
LD1R (4H) throughput 0.501230
LD1R (8H) 1->2 roundtrip 9.032142
LD1R (8H) throughput 0.501438
LD1R (2S) 1->2 roundtrip 9.083982
LD1R (2S) throughput 0.820501
LD1R (4S) 1->2 roundtrip 11.281370
LD1R (4S) throughput 0.500339
LD1R (1D) 1->2 roundtrip 9.281731
LD1R (1D) throughput 0.501154
LD1R (2D) 1->2 roundtrip 9.010712
LD1R (2D) throughput 0.501346
LD2 (multiple structures, 8B) 1->3 roundtrip 9.105677
LD2 (multiple structures, 8B) 2->3 roundtrip 9.083969
LD2 (multiple structures, 8B) throughput 1.009460
LD2 (multiple structures, 16B) 1->3 roundtrip 9.218350
LD2 (multiple structures, 16B) 2->3 roundtrip 9.042767
LD2 (multiple structures, 16B) throughput 0.979966
LD2 (multiple structures, 4H) 1->3 roundtrip 7.646544
LD2 (multiple structures, 4H) 2->3 roundtrip 9.016837
LD2 (multiple structures, 4H) throughput 1.001600
LD2 (multiple structures, 8H) 1->3 roundtrip 9.016312
LD2 (multiple structures, 8H) 2->3 roundtrip 9.527361
LD2 (multiple structures, 8H) throughput 1.047649
LD2 (multiple structures, 2S) 1->3 roundtrip 9.056487
LD2 (multiple structures, 2S) 2->3 roundtrip 9.027327
LD2 (multiple structures, 2S) throughput 1.002177
LD2 (multiple structures, 4S) 1->3 roundtrip 9.228103
LD2 (multiple structures, 4S) 2->3 roundtrip 11.305949
LD2 (multiple structures, 4S) throughput 1.115639
LD2 (multiple structures, 2D) 1->3 roundtrip 9.130457
LD2 (multiple structures, 2D) 2->3 roundtrip 9.018417
LD2 (multiple structures, 2D) throughput 1.003042
LD2 (single structure, B) 1->3 roundtrip 9.129090
LD2 (single structure, B) 2->3 roundtrip 9.060652
LD2 (single structure, B) throughput 1.250162
LD2 (single structure, H) 1->3 roundtrip 9.018789
LD2 (single structure, H) 2->3 roundtrip 9.088440
LD2 (single structure, H) throughput 0.633054
LD2 (single structure, S) 1->3 roundtrip 9.102636
LD2 (single structure, S) 2->3 roundtrip 9.164477
LD2 (single structure, S) throughput 1.332354
LD2 (single structure, D) 1->3 roundtrip 8.996682
LD2 (single structure, D) 2->3 roundtrip 9.171231
LD2 (single structure, D) throughput 1.363277
LD2R (8B) 1->3 roundtrip 8.151490
LD2R (8B) 2->3 roundtrip 9.055318
LD2R (8B) throughput 1.002141
LD2R (16B) 1->3 roundtrip 9.095853
LD2R (16B) 2->3 roundtrip 9.056817
LD2R (16B) throughput 0.998748
LD2R (4H) 1->3 roundtrip 8.842675
LD2R (4H) 2->3 roundtrip 9.147393
LD2R (4H) throughput 1.006504
LD2R (8H) 1->3 roundtrip 9.031579
LD2R (8H) 2->3 roundtrip 9.784526
LD2R (8H) throughput 1.010883
LD2R (2S) 1->3 roundtrip 9.166738
LD2R (2S) 2->3 roundtrip 9.096007
LD2R (2S) throughput 1.006395
LD2R (4S) 1->3 roundtrip 8.813419
LD2R (4S) 2->3 roundtrip 12.025902
LD2R (4S) throughput 1.042096
LD2R (1D) 1->3 roundtrip 9.013919
LD2R (1D) 2->3 roundtrip 9.474108
LD2R (1D) throughput 1.061019
LD2R (2D) 1->3 roundtrip 9.009001
LD2R (2D) 2->3 roundtrip 9.014220
LD2R (2D) throughput 1.001026
LD3 (multiple structures, 8B) throughput 1.985801
LD3 (multiple structures, 16B) throughput 2.020741
LD3 (multiple structures, 4H) throughput 2.425026
LD3 (multiple structures, 8H) throughput 2.012357
LD3 (multiple structures, 2S) throughput 2.027656
LD3 (multiple structures, 4S) throughput 2.003097
LD3 (multiple structures, 2D) throughput 2.009708
LD3 (single structure, B) throughput 2.034950
LD3 (single structure, H) throughput 2.042560
LD3 (single structure, S) throughput 2.031384
LD3 (single structure, D) throughput 2.024957
LD3R (8B) throughput 1.459800
LD3R (16B) throughput 1.507324
LD3R (4H) throughput 1.506469
LD3R (8H) throughput 1.479041
LD3R (2S) throughput 1.514882
LD3R (4S) throughput 1.519725
LD3R (1D) throughput 2.033667
LD3R (2D) throughput 2.028952
LD4 (multiple structures, 8B) throughput 2.066458
LD4 (multiple structures, 16B) throughput 4.011827
LD4 (multiple structures, 4H) throughput 2.009874
LD4 (multiple structures, 8H) throughput 4.051732
LD4 (multiple structures, 2S) throughput 2.000805
LD4 (multiple structures, 4S) throughput 4.020400
LD4 (multiple structures, 2D) throughput 4.098653
LD4 (multiple, 8B, post-index) throughput 2.286229
LD4 (multiple, 16B, post-index) throughput 4.078362
LD4 (multiple, 4H, post-index) throughput 2.016536
LD4 (multiple, 8H, post-index) throughput 3.831365
LD4 (multiple, 2S, post-index) throughput 2.006732
LD4 (multiple, 4S, post-index) throughput 4.019459
LD4 (multiple, 2D, post-index) throughput 4.011161
LD4 (single structure, B) throughput 2.032379
LD4 (single structure, H) throughput 2.001791
LD4 (single structure, S) throughput 2.087423
LD4 (single structure, D) throughput 2.002626
LD4 (single, B, post-index) throughput 2.008120
LD4 (single, H, post-index) throughput 2.033647
LD4 (single, S, post-index) throughput 1.780095
LD4 (single, D, post-index) throughput 2.013273
LD4R (8B) throughput 2.022372
LD4R (16B) throughput 2.034385
LD4R (4H) throughput 2.021174
LD4R (8H) throughput 2.007375
LD4R (2S) throughput 2.020434
LD4R (4S) throughput 2.010825
LD4R (1D) throughput 2.000273
LD4R (2D) throughput 2.022669
LDNP (S) 1->3 roundtrip 7.129860
LDNP (S) 2->3 roundtrip 7.035929
LDNP (S) throughput 1.001716
LDNP (D) 1->3 roundtrip 7.058020
LDNP (D) 2->3 roundtrip 7.215500
LDNP (D) throughput 1.006747
LDNP (Q) 1->3 roundtrip 7.033402
LDNP (Q) 2->3 roundtrip 4.138727
LDNP (Q) throughput 0.992942
LDNP (signed offset, S) 1->3 roundtrip 7.098021
LDNP (signed offset, S) 2->3 roundtrip 7.080957
LDNP (signed offset, S) throughput 1.001782
LDNP (signed offset, D) 1->3 roundtrip 7.070861
LDNP (signed offset, D) 2->3 roundtrip 7.151612
LDNP (signed offset, D) throughput 1.001563
LDNP (signed offset, Q) 1->3 roundtrip 6.995954
LDNP (signed offset, Q) 2->3 roundtrip 7.163104
LDNP (signed offset, Q) throughput 1.007582
LDP (S) 1->3 roundtrip 7.237894
LDP (S) 2->3 roundtrip 7.097876
LDP (S) throughput 1.004284
LDP (D) 1->3 roundtrip 6.999353
LDP (D) 2->3 roundtrip 7.555351
LDP (D) throughput 1.662848
LDP (Q) 1->3 roundtrip 8.648435
LDP (Q) 2->3 roundtrip 7.959010
LDP (Q) throughput 1.004211
LDP (post-index, S) 1->3 roundtrip 7.761390
LDP (post-index, S) 2->3 roundtrip 7.084072
LDP (post-index, S) throughput 1.102760
LDP (post-index, D) 1->3 roundtrip 6.691063
LDP (post-index, D) 2->3 roundtrip 9.554171
LDP (post-index, D) throughput 1.249167
LDP (post-index, Q) 1->3 roundtrip 6.596199
LDP (post-index, Q) 2->3 roundtrip 7.253716
LDP (post-index, Q) throughput 2.055624
LDP (pre-index, S) 1->3 roundtrip 6.779389
LDP (pre-index, S) 2->3 roundtrip 7.762803
LDP (pre-index, S) throughput 1.245505
LDP (pre-index, D) 1->3 roundtrip 8.912175
LDP (pre-index, D) 2->3 roundtrip 8.151609
LDP (pre-index, D) throughput 1.333533
LDP (pre-index, Q) 1->3 roundtrip 8.130755
LDP (pre-index, Q) 2->3 roundtrip 8.356180
LDP (pre-index, Q) throughput 1.191473
LDP (signed offset, S) 1->3 roundtrip 7.574074
LDP (signed offset, S) 2->3 roundtrip 7.816022
LDP (signed offset, S) throughput 1.072364
LDP (signed offset, D) 1->3 roundtrip 8.323065
LDP (signed offset, D) 2->3 roundtrip 8.036336
LDP (signed offset, D) throughput 1.023175
LDP (signed offset, Q) 1->3 roundtrip 9.448364
LDP (signed offset, Q) 2->3 roundtrip 7.566425
LDP (signed offset, Q) throughput 1.051238
LDR (S) 1->2 roundtrip 7.746717
LDR (S) throughput 0.499114
LDR (D) 1->2 roundtrip 6.939503
LDR (D) throughput 0.535427
LDR (Q) 1->2 roundtrip 10.153085
LDR (Q) throughput 0.506620
LDR (post-index, S) 1->2 roundtrip 7.068683
LDR (post-index, S) throughput 0.660144
LDR (post-index, D) 1->2 roundtrip 4.896653
LDR (post-index, D) throughput 0.506181
LDR (post-index, Q) 1->2 roundtrip 17.190853
LDR (post-index, Q) throughput 0.673746
LDR (pre-index, S) 1->2 roundtrip 8.663215
LDR (pre-index, S) throughput 0.475521
LDR (pre-index, D) 1->2 roundtrip 9.099869
LDR (pre-index, D) throughput 0.463561
LDR (pre-index, Q) 1->2 roundtrip 18.213276
LDR (pre-index, Q) throughput 0.497461
LDR (unsigned offset, S) 1->2 roundtrip 8.594967
LDR (unsigned offset, S) throughput 0.500802
LDR (unsigned offset, D) 1->2 roundtrip 6.723945
LDR (unsigned offset, D) throughput 0.553039
LDR (unsigned offset, Q) 1->2 roundtrip 5.054318
LDR (unsigned offset, Q) throughput 0.517326
LDR (literal, S) throughput 0.501477
LDR (literal, D) throughput 0.540213
LDR (literal, Q) throughput 0.495296
LDR (register, S) 1->2 roundtrip 7.468402
LDR (register, S) 1->3 roundtrip 8.114378
LDR (register, S) throughput 0.504002
LDR (register, D) 1->2 roundtrip 8.178510
LDR (register, D) 1->3 roundtrip 7.503237
LDR (register, D) throughput 0.566229
LDR (register, Q) 1->2 roundtrip 8.265896
LDR (register, Q) 1->3 roundtrip 7.296197
LDR (register, Q) throughput 0.271045
LDR (register, uxtw, S) 1->2 roundtrip 9.204787
LDR (register, uxtw, S) 1->3 roundtrip 7.163584
LDR (register, uxtw, S) throughput 0.570560
LDR (register, uxtw, D) 1->2 roundtrip 7.521730
LDR (register, uxtw, D) 1->3 roundtrip 7.347308
LDR (register, uxtw, D) throughput 0.482927
LDR (register, uxtw, Q) 1->2 roundtrip 7.301325
LDR (register, uxtw, Q) 1->3 roundtrip 7.270031
LDR (register, uxtw, Q) throughput 0.575110
LDR (register, sxtw, S) 1->2 roundtrip 7.537086
LDR (register, sxtw, S) 1->3 roundtrip 6.562206
LDR (register, sxtw, S) throughput 0.541562
LDR (register, sxtw, D) 1->2 roundtrip 7.555881
LDR (register, sxtw, D) 1->3 roundtrip 8.085580
LDR (register, sxtw, D) throughput 0.434113
LDR (register, sxtw, Q) 1->2 roundtrip 8.257919
LDR (register, sxtw, Q) 1->3 roundtrip 7.470659
LDR (register, sxtw, Q) throughput 0.534491
LDR (register, lsl, S) 1->2 roundtrip 5.657690
LDR (register, lsl, S) 1->3 roundtrip 8.016595
LDR (register, lsl, S) throughput 0.500096
LDR (register, lsl, D) 1->2 roundtrip 7.409360
LDR (register, lsl, D) 1->3 roundtrip 7.796671
LDR (register, lsl, D) throughput 0.549861
LDR (register, lsl, Q) 1->2 roundtrip 7.604372
LDR (register, lsl, Q) 1->3 roundtrip 9.329175
LDR (register, lsl, Q) throughput 0.502061
LDUR (S) 1->2 roundtrip 7.952640
LDUR (S) throughput 0.552248
LDUR (D) 1->2 roundtrip 7.898053
LDUR (D) throughput 0.542755
LDUR (Q) 1->2 roundtrip 6.684032
LDUR (Q) throughput 0.502185
MLA (by element, 4H) 1->1 2.651300
MLA (by element, 4H) 1->2 2.919289
MLA (by element, 4H) 1->3 3.005718
MLA (by element, 4H) throughput 0.416492
MLA (by element, 4H) throughput 0.551420
MLA (by element, 8H) 1->1 3.004814
MLA (by element, 8H) 1->2 2.980716
MLA (by element, 8H) 1->3 4.867201
MLA (by element, 8H) throughput 0.421445
MLA (by element, 8H) throughput 0.562966
MLA (by element, 2S) 1->1 2.991071
MLA (by element, 2S) 1->2 3.026394
MLA (by element, 2S) 1->3 3.025500
MLA (by element, 2S) throughput 0.542866
MLA (by element, 2S) throughput 0.563320
MLA (by element, 4S) 1->1 2.905926
MLA (by element, 4S) 1->2 3.038773
MLA (by element, 4S) 1->3 3.046851
MLA (by element, 4S) throughput 0.507081
MLA (by element, 4S) throughput 0.529790
MLA (vector, 8B) 1->1 3.014456
MLA (vector, 8B) 1->2 3.010854
MLA (vector, 8B) 1->3 3.623993
MLA (vector, 8B) throughput 0.499141
MLA (vector, 8B) throughput 0.533033
MLA (vector, 16B) 1->1 3.424985
MLA (vector, 16B) 1->2 3.524273
MLA (vector, 16B) 1->3 2.997898
MLA (vector, 16B) throughput 0.574058
MLA (vector, 16B) throughput 0.537056
MLA (vector, 4H) 1->1 3.011372
MLA (vector, 4H) 1->2 3.005741
MLA (vector, 4H) 1->3 3.364581
MLA (vector, 4H) throughput 0.558823
MLA (vector, 4H) throughput 0.447700
MLA (vector, 8H) 1->1 3.457334
MLA (vector, 8H) 1->2 3.010852
MLA (vector, 8H) 1->3 2.120557
MLA (vector, 8H) throughput 0.980558
MLA (vector, 8H) throughput 0.510988
MLA (vector, 2S) 1->1 3.000564
MLA (vector, 2S) 1->2 3.671882
MLA (vector, 2S) 1->3 3.077388
MLA (vector, 2S) throughput 0.530708
MLA (vector, 2S) throughput 0.512276
MLA (vector, 4S) 1->1 4.062491
MLA (vector, 4S) 1->2 2.988690
MLA (vector, 4S) 1->3 2.979174
MLA (vector, 4S) throughput 0.504614
MLA (vector, 4S) throughput 0.507961
MLS (by element, 4H) 1->1 2.957716
MLS (by element, 4H) 1->2 3.106535
MLS (by element, 4H) 1->3 3.535993
MLS (by element, 4H) throughput 0.498676
MLS (by element, 4H) throughput 0.556905
MLS (by element, 8H) 1->1 2.986776
MLS (by element, 8H) 1->2 2.962045
MLS (by element, 8H) 1->3 3.000323
MLS (by element, 8H) throughput 0.579716
MLS (by element, 8H) throughput 0.546364
MLS (by element, 2S) 1->1 3.607027
MLS (by element, 2S) 1->2 3.431976
MLS (by element, 2S) 1->3 2.872706
MLS (by element, 2S) throughput 0.500515
MLS (by element, 2S) throughput 0.526552
MLS (by element, 4S) 1->1 3.451403
MLS (by element, 4S) 1->2 3.019598
MLS (by element, 4S) 1->3 2.999959
MLS (by element, 4S) throughput 0.590091
MLS (by element, 4S) throughput 0.517215
MLS (vector, 8B) 1->1 3.181490
MLS (vector, 8B) 1->2 3.001738
MLS (vector, 8B) 1->3 3.307790
MLS (vector, 8B) throughput 0.529128
MLS (vector, 8B) throughput 0.443394
MLS (vector, 16B) 1->1 3.013403
MLS (vector, 16B) 1->2 3.216815
MLS (vector, 16B) 1->3 4.056943
MLS (vector, 16B) throughput 0.503182
MLS (vector, 16B) throughput 0.515519
MLS (vector, 4H) 1->1 3.012123
MLS (vector, 4H) 1->2 3.081776
MLS (vector, 4H) 1->3 3.751146
MLS (vector, 4H) throughput 0.501721
MLS (vector, 4H) throughput 0.533658
MLS (vector, 8H) 1->1 2.988269
MLS (vector, 8H) 1->2 3.000808
MLS (vector, 8H) 1->3 3.458138
MLS (vector, 8H) throughput 0.563467
MLS (vector, 8H) throughput 0.531125
MLS (vector, 2S) 1->1 2.993176
MLS (vector, 2S) 1->2 3.438311
MLS (vector, 2S) 1->3 2.991025
MLS (vector, 2S) throughput 0.501554
MLS (vector, 2S) throughput 0.392994
MLS (vector, 4S) 1->1 3.051061
MLS (vector, 4S) 1->2 2.742058
MLS (vector, 4S) 1->3 3.001465
MLS (vector, 4S) throughput 0.528878
MLS (vector, 4S) throughput 0.516196
MOV (vector, 8B) 1->2 1.999597
MOV (vector, 8B) throughput 0.590581
MOV (vector, 16B) 1->2 1.247376
MOV (vector, 16B) throughput 0.248444
MOVI (vector, 4H, zero) throughput 0.251940
MOVI (vector, 8H, zero) throughput 0.252171
MOVI (vector, 2S, zero) throughput 0.258209
MOVI (vector, 4S, zero) throughput 0.219978
MOVI (vector, 4H) throughput 0.537250
MOVI (vector, 8H) throughput 0.501582
MOVI (vector, 2S) throughput 0.535327
MOVI (vector, 4S) throughput 0.430070
MOVI (vector, 4H, lsl) throughput 0.529857
MOVI (vector, 8H, lsl) throughput 0.500820
MOVI (vector, 2S, lsl) throughput 0.594961
MOVI (vector, 4S, lsl) throughput 0.472988
MOVI (vector, 2S, msl) throughput 0.503963
MOVI (vector, 4S, msl) throughput 0.609193
MOVI (vector, 64-bit, 2D) throughput 0.502693
MOVI (vector, 64-bit, D) throughput 0.584680
MUL (by element, 4H) 1->2 3.001615
MUL (by element, 4H) 1->3 2.877025
MUL (by element, 4H) throughput 0.568466
MUL (by element, 8H) 1->2 3.021402
MUL (by element, 8H) 1->3 3.673616
MUL (by element, 8H) throughput 0.500567
MUL (by element, 2S) 1->2 3.597019
MUL (by element, 2S) 1->3 3.446838
MUL (by element, 2S) throughput 0.503001
MUL (by element, 4S) 1->2 3.228488
MUL (by element, 4S) 1->3 3.008003
MUL (by element, 4S) throughput 0.542315
MUL (vector, 8B) 1->2 3.010203
MUL (vector, 8B) 1->3 3.293325
MUL (vector, 8B) throughput 0.537211
MUL (vector, 16B) 1->2 2.885216
MUL (vector, 16B) 1->3 2.667125
MUL (vector, 16B) throughput 0.464154
MUL (vector, 4H) 1->2 3.332741
MUL (vector, 4H) 1->3 2.987184
MUL (vector, 4H) throughput 0.499980
MUL (vector, 8H) 1->2 3.616624
MUL (vector, 8H) 1->3 3.296550
MUL (vector, 8H) throughput 0.467136
MUL (vector, 2S) 1->2 3.282825
MUL (vector, 2S) 1->3 3.022032
MUL (vector, 2S) throughput 0.538245
MUL (vector, 4S) 1->2 3.175769
MUL (vector, 4S) 1->3 2.998626
MUL (vector, 4S) throughput 0.556138
MVNI (vector, 4H) throughput 0.573552
MVNI (vector, 8H) throughput 0.497753
MVNI (vector, 2S) throughput 0.595928
MVNI (vector, 4S) throughput 0.528199
MVNI (vector, 4H, lsl) throughput 0.437369
MVNI (vector, 8H, lsl) throughput 0.565829
MVNI (vector, 2S, lsl) throughput 0.500683
MVNI (vector, 4S, lsl) throughput 0.501579
MVNI (vector, 2S, msl) throughput 0.579338
MVNI (vector, 4S, msl) throughput 0.488313
NEG (vector, 8B) 1->2 2.013323
NEG (vector, 8B) throughput 0.508059
NEG (vector, 16B) 1->2 1.917469
NEG (vector, 16B) throughput 0.501281
NEG (vector, 4H) 1->2 2.008097
NEG (vector, 4H) throughput 0.527345
NEG (vector, 8H) 1->2 1.993436
NEG (vector, 8H) throughput 0.497871
NEG (vector, 2S) 1->2 2.057741
NEG (vector, 2S) throughput 0.548211
NEG (vector, 4S) 1->2 1.995479
NEG (vector, 4S) throughput 0.507916
NEG (vector, 2D) 1->2 2.010483
NEG (vector, 2D) throughput 0.830210
NOT (vector, 8B) 1->2 2.005074
NOT (vector, 8B) throughput 0.512108
NOT (vector, 16B) 1->2 2.185401
NOT (vector, 16B) throughput 0.497840
ORN (vector, 8B) 1->2 1.994586
ORN (vector, 8B) 1->3 2.015508
ORN (vector, 8B) throughput 0.508539
ORN (vector, 16B) 1->2 2.162469
ORN (vector, 16B) 1->3 2.228269
ORN (vector, 16B) throughput 0.467943
ORR (vector, immediate, 4H) 1->1 2.029656
ORR (vector, immediate, 4H) throughput 0.481015
ORR (vector, immediate, 4H) throughput 0.568080
ORR (vector, immediate, 8H) 1->1 2.064586
ORR (vector, immediate, 8H) throughput 0.776736
ORR (vector, immediate, 8H) throughput 0.473577
ORR (vector, immediate, 2S) 1->1 2.223360
ORR (vector, immediate, 2S) throughput 0.515830
ORR (vector, immediate, 2S) throughput 0.412866
ORR (vector, immediate, 4S) 1->1 2.002350
ORR (vector, immediate, 4S) throughput 0.506261
ORR (vector, immediate, 4S) throughput 0.502726
ORR (vector, 8B) 1->2 1.999571
ORR (vector, 8B) 1->3 2.002751
ORR (vector, 8B) throughput 0.392356
ORR (vector, 16B) 1->2 2.667962
ORR (vector, 16B) 1->3 2.002825
ORR (vector, 16B) throughput 0.349877
PMUL (8B) 1->2 3.778660
PMUL (8B) 1->3 1.895518
PMUL (8B) throughput 0.712787
PMUL (16B) 1->2 4.838376
PMUL (16B) 1->3 2.448032
PMUL (16B) throughput 0.512456
PMULL (1Q) 1->2 3.028828
PMULL (1Q) 1->3 2.997009
PMULL (1Q) throughput 0.500430
PMULL (8H) 1->2 3.007371
PMULL (8H) 1->3 3.092862
PMULL (8H) throughput 0.513959
PMULL2 (1Q) 1->2 3.095582
PMULL2 (1Q) 1->3 2.933247
PMULL2 (1Q) throughput 0.617001
PMULL2 (8H) 1->2 2.849522
PMULL2 (8H) 1->3 5.254128
PMULL2 (8H) throughput 0.770522
PMULL + EOR (1Q) 1->2 4.188216
PMULL + EOR (1Q) 1->3 4.196204
PMULL + EOR (1Q) 1->4 4.247578
PMULL + EOR (1Q) throughput 0.504712
PMULL + EOR (8H) 1->2 4.180045
PMULL + EOR (8H) 1->3 4.939697
PMULL + EOR (8H) 1->4 5.004923
PMULL + EOR (8H) throughput 0.578347
RADDHN (8H) 1->2 3.005673
RADDHN (8H) 1->3 3.030740
RADDHN (8H) throughput 0.497274
RADDHN (4S) 1->2 3.023591
RADDHN (4S) 1->3 3.025277
RADDHN (4S) throughput 0.502520
RADDHN (2D) 1->2 2.996732
RADDHN (2D) 1->3 3.056888
RADDHN (2D) throughput 0.503019
RADDHN2 (8H) 1->1 3.156757
RADDHN2 (8H) 1->2 3.623012
RADDHN2 (8H) 1->3 3.010248
RADDHN2 (8H) throughput 0.505579
RADDHN2 (8H) throughput 0.371679
RADDHN2 (4S) 1->1 3.150907
RADDHN2 (4S) 1->2 3.745533
RADDHN2 (4S) 1->3 2.694099
RADDHN2 (4S) throughput 0.501379
RADDHN2 (4S) throughput 0.500970
RADDHN2 (2D) 1->1 2.999084
RADDHN2 (2D) 1->2 3.005224
RADDHN2 (2D) 1->3 3.002875
RADDHN2 (2D) throughput 0.501953
RADDHN2 (2D) throughput 0.555179
RBIT (vector, 8B) 1->2 2.004617
RBIT (vector, 8B) throughput 0.574678
RBIT (vector, 16B) 1->2 1.898328
RBIT (vector, 16B) throughput 0.659125
REV16 (vector, 8B) 1->2 2.012634
REV16 (vector, 8B) throughput 0.624666
REV16 (vector, 16B) 1->2 2.005384
REV16 (vector, 16B) throughput 0.498250
REV32 (vector, 8B) 1->2 2.015148
REV32 (vector, 8B) throughput 0.500532
REV32 (vector, 16B) 1->2 1.848963
REV32 (vector, 16B) throughput 0.566067
REV32 (vector, 4H) 1->2 2.537875
REV32 (vector, 4H) throughput 0.498430
REV32 (vector, 8H) 1->2 1.983819
REV32 (vector, 8H) throughput 0.624582
REV64 (vector, 8B) 1->2 2.504411
REV64 (vector, 8B) throughput 0.496911
REV64 (vector, 16B) 1->2 2.015870
REV64 (vector, 16B) throughput 0.581281
REV64 (vector, 4H) 1->2 1.972754
REV64 (vector, 4H) throughput 0.499645
REV64 (vector, 8H) 1->2 1.983081
REV64 (vector, 8H) throughput 0.452944
REV64 (vector, 2S) 1->2 1.992644
REV64 (vector, 2S) throughput 0.590899
REV64 (vector, 4S) 1->2 1.914631
REV64 (vector, 4S) throughput 0.602440
RSHRN (8H) 1->2 3.014105
RSHRN (8H) throughput 0.597651
RSHRN (4S) 1->2 3.103403
RSHRN (4S) throughput 0.601344
RSHRN (2D) 1->2 3.009539
RSHRN (2D) throughput 0.567662
RSHRN2 (8H) 1->1 3.009717
RSHRN2 (8H) 1->2 3.617428
RSHRN2 (8H) throughput 0.501913
RSHRN2 (8H) throughput 0.543457
RSHRN2 (4S) 1->1 3.004216
RSHRN2 (4S) 1->2 3.026157
RSHRN2 (4S) throughput 0.417719
RSHRN2 (4S) throughput 0.566053
RSHRN2 (2D) 1->1 3.668177
RSHRN2 (2D) 1->2 2.581415
RSHRN2 (2D) throughput 0.451969
RSHRN2 (2D) throughput 0.539838
RSUBHN (8H) 1->2 2.982652
RSUBHN (8H) 1->3 2.994159
RSUBHN (8H) throughput 0.421083
RSUBHN (4S) 1->2 3.008464
RSUBHN (4S) 1->3 2.989121
RSUBHN (4S) throughput 0.564006
RSUBHN (2D) 1->2 3.557729
RSUBHN (2D) 1->3 3.031143
RSUBHN (2D) throughput 0.627254
RSUBHN2 (8H) 1->1 3.019074
RSUBHN2 (8H) 1->2 3.041829
RSUBHN2 (8H) 1->3 3.072098
RSUBHN2 (8H) throughput 0.540833
RSUBHN2 (8H) throughput 0.542734
RSUBHN2 (4S) 1->1 4.368918
RSUBHN2 (4S) 1->2 3.020609
RSUBHN2 (4S) 1->3 3.240532
RSUBHN2 (4S) throughput 0.483360
RSUBHN2 (4S) throughput 0.621139
RSUBHN2 (2D) 1->1 2.689037
RSUBHN2 (2D) 1->2 3.174594
RSUBHN2 (2D) 1->3 3.023611
RSUBHN2 (2D) throughput 0.604233
RSUBHN2 (2D) throughput 0.544724
SABA (vector, 8B) 1->1 2.998870
SABA (vector, 8B) 1->2 2.868923
SABA (vector, 8B) 1->3 3.009475
SABA (vector, 8B) throughput 0.599454
SABA (vector, 8B) throughput 0.536865
SABA (vector, 16B) 1->1 2.775812
SABA (vector, 16B) 1->2 3.006600
SABA (vector, 16B) 1->3 2.826876
SABA (vector, 16B) throughput 0.518390
SABA (vector, 16B) throughput 0.523267
SABA (vector, 4H) 1->1 3.005871
SABA (vector, 4H) 1->2 2.860097
SABA (vector, 4H) 1->3 3.009153
SABA (vector, 4H) throughput 0.503031
SABA (vector, 4H) throughput 0.519432
SABA (vector, 8H) 1->1 3.012635
SABA (vector, 8H) 1->2 3.639463
SABA (vector, 8H) 1->3 3.465409
SABA (vector, 8H) throughput 0.684085
SABA (vector, 8H) throughput 0.495022
SABA (vector, 2S) 1->1 3.011420
SABA (vector, 2S) 1->2 3.020408
SABA (vector, 2S) 1->3 2.880928
SABA (vector, 2S) throughput 0.568116
SABA (vector, 2S) throughput 0.426606
SABA (vector, 4S) 1->1 3.610996
SABA (vector, 4S) 1->2 3.028709
SABA (vector, 4S) 1->3 2.521606
SABA (vector, 4S) throughput 0.451987
SABA (vector, 4S) throughput 0.542796
SABAL (vector, 8H) 1->1 3.052599
SABAL (vector, 8H) 1->2 2.862481
SABAL (vector, 8H) 1->3 2.901147
SABAL (vector, 8H) throughput 0.532919
SABAL (vector, 8H) throughput 0.519014
SABAL (vector, 4S) 1->1 2.955786
SABAL (vector, 4S) 1->2 3.014858
SABAL (vector, 4S) 1->3 3.396685
SABAL (vector, 4S) throughput 0.482266
SABAL (vector, 4S) throughput 0.442707
SABAL (vector, 2D) 1->1 3.055825
SABAL (vector, 2D) 1->2 3.021014
SABAL (vector, 2D) 1->3 3.685871
SABAL (vector, 2D) throughput 0.403480
SABAL (vector, 2D) throughput 0.432392
SABAL2 (vector, 8H) 1->1 3.546463
SABAL2 (vector, 8H) 1->2 3.742812
SABAL2 (vector, 8H) 1->3 2.668307
SABAL2 (vector, 8H) throughput 0.503630
SABAL2 (vector, 8H) throughput 0.521434
SABAL2 (vector, 4S) 1->1 3.619215
SABAL2 (vector, 4S) 1->2 3.009677
SABAL2 (vector, 4S) 1->3 3.381162
SABAL2 (vector, 4S) throughput 0.434552
SABAL2 (vector, 4S) throughput 0.556649
SABAL2 (vector, 2D) 1->1 3.637452
SABAL2 (vector, 2D) 1->2 3.000567
SABAL2 (vector, 2D) 1->3 3.003960
SABAL2 (vector, 2D) throughput 0.570905
SABAL2 (vector, 2D) throughput 0.539789
SABD (vector, 8B) 1->2 3.312987
SABD (vector, 8B) 1->3 2.769235
SABD (vector, 8B) throughput 0.502359
SABD (vector, 16B) 1->2 3.567490
SABD (vector, 16B) 1->3 2.989731
SABD (vector, 16B) throughput 0.614586
SABD (vector, 4H) 1->2 3.355146
SABD (vector, 4H) 1->3 3.081927
SABD (vector, 4H) throughput 0.520798
SABD (vector, 8H) 1->2 3.045998
SABD (vector, 8H) 1->3 3.092206
SABD (vector, 8H) throughput 0.500790
SABD (vector, 2S) 1->2 2.996344
SABD (vector, 2S) 1->3 3.032403
SABD (vector, 2S) throughput 0.639312
SABD (vector, 4S) 1->2 2.964865
SABD (vector, 4S) 1->3 2.935895
SABD (vector, 4S) throughput 0.536731
SABDL (vector, 8H) 1->2 3.013685
SABDL (vector, 8H) 1->3 3.092662
SABDL (vector, 8H) throughput 0.508309
SABDL (vector, 4S) 1->2 2.797812
SABDL (vector, 4S) 1->3 3.014093
SABDL (vector, 4S) throughput 0.522330
SABDL (vector, 2D) 1->2 3.010288
SABDL (vector, 2D) 1->3 3.593778
SABDL (vector, 2D) throughput 0.586483
SABDL2 (vector, 8H) 1->2 2.997375
SABDL2 (vector, 8H) 1->3 3.006155
SABDL2 (vector, 8H) throughput 0.589966
SABDL2 (vector, 4S) 1->2 3.121799
SABDL2 (vector, 4S) 1->3 3.394735
SABDL2 (vector, 4S) throughput 0.500081
SABDL2 (vector, 2D) 1->2 3.685663
SABDL2 (vector, 2D) 1->3 3.000686
SABDL2 (vector, 2D) throughput 0.561512
SADALP (vector, 4H) 1->1 3.004996
SADALP (vector, 4H) 1->2 3.007693
SADALP (vector, 4H) throughput 0.394840
SADALP (vector, 4H) throughput 0.541280
SADALP (vector, 8H) 1->1 3.383163
SADALP (vector, 8H) 1->2 3.309917
SADALP (vector, 8H) throughput 0.518616
SADALP (vector, 8H) throughput 0.940106
SADALP (vector, 2S) 1->1 2.822840
SADALP (vector, 2S) 1->2 2.997860
SADALP (vector, 2S) throughput 0.629988
SADALP (vector, 2S) throughput 0.504692
SADALP (vector, 4S) 1->1 2.996531
SADALP (vector, 4S) 1->2 3.189474
SADALP (vector, 4S) throughput 0.544209
SADALP (vector, 4S) throughput 0.621157
SADALP (vector, 1D) 1->1 3.822951
SADALP (vector, 1D) 1->2 2.815154
SADALP (vector, 1D) throughput 0.538329
SADALP (vector, 1D) throughput 0.547505
SADALP (vector, 2D) 1->1 3.184567
SADALP (vector, 2D) 1->2 2.998870
SADALP (vector, 2D) throughput 0.501885
SADALP (vector, 2D) throughput 0.518254
SADDL (vector, 8H) 1->2 2.074264
SADDL (vector, 8H) 1->3 1.947960
SADDL (vector, 8H) throughput 0.539708
SADDL (vector, 4S) 1->2 2.461415
SADDL (vector, 4S) 1->3 2.002957
SADDL (vector, 4S) throughput 0.500723
SADDL (vector, 2D) 1->2 1.996255
SADDL (vector, 2D) 1->3 2.002956
SADDL (vector, 2D) throughput 0.409344
SADDL2 (vector, 8H) 1->2 2.003802
SADDL2 (vector, 8H) 1->3 2.135944
SADDL2 (vector, 8H) throughput 0.520689
SADDL2 (vector, 4S) 1->2 1.760975
SADDL2 (vector, 4S) 1->3 1.991428
SADDL2 (vector, 4S) throughput 0.526342
SADDL2 (vector, 2D) 1->2 2.067015
SADDL2 (vector, 2D) 1->3 1.995926
SADDL2 (vector, 2D) throughput 0.572240
SADDLP (vector, 4H) 1->2 2.094150
SADDLP (vector, 4H) throughput 0.501276
SADDLP (vector, 8H) 1->2 2.005872
SADDLP (vector, 8H) throughput 0.403376
SADDLP (vector, 2S) 1->2 1.925611
SADDLP (vector, 2S) throughput 0.701203
SADDLP (vector, 4S) 1->2 1.999915
SADDLP (vector, 4S) throughput 0.626526
SADDLP (vector, 1D) 1->2 1.896445
SADDLP (vector, 1D) throughput 0.515437
SADDLP (vector, 2D) 1->2 2.322773
SADDLP (vector, 2D) throughput 0.550845
SADDLV (vector, 8B) 1->2 2.999071
SADDLV (vector, 8B) throughput 0.636241
SADDLV (vector, 16B) 1->2 3.019153
SADDLV (vector, 16B) throughput 0.575372
SADDLV (vector, 4H) 1->2 3.030367
SADDLV (vector, 4H) throughput 0.578967
SADDLV (vector, 8H) 1->2 2.999272
SADDLV (vector, 8H) throughput 0.635103
SADDLV (vector, 4S) 1->2 3.055673
SADDLV (vector, 4S) throughput 0.278926
SADDW (vector, 8H) 1->2 2.022393
SADDW (vector, 8H) 1->3 1.939439
SADDW (vector, 8H) throughput 0.809379
SADDW (vector, 4S) 1->2 2.142636
SADDW (vector, 4S) 1->3 2.007137
SADDW (vector, 4S) throughput 0.429380
SADDW (vector, 2D) 1->2 2.003597
SADDW (vector, 2D) 1->3 1.467885
SADDW (vector, 2D) throughput 0.580798
SADDW2 (vector, 8H) 1->2 2.026643
SADDW2 (vector, 8H) 1->3 2.024498
SADDW2 (vector, 8H) throughput 0.503265
SADDW2 (vector, 4S) 1->2 2.004170
SADDW2 (vector, 4S) 1->3 2.010447
SADDW2 (vector, 4S) throughput 0.499028
SADDW2 (vector, 2D) 1->2 2.020449
SADDW2 (vector, 2D) 1->3 1.996132
SADDW2 (vector, 2D) throughput 0.504555
SCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 10.747176
SCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 0.514834
SCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 10.475107
SCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 0.508775
SCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 10.016344
SCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 0.502420
SCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 11.965442
SCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 1.263495
SCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 11.604058
SCVTF (scalar, integer, S from 32-bit GPR) throughput 0.447383
SCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.031948
SCVTF (scalar, integer, D from 32-bit GPR) throughput 0.503654
SCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 10.027090
SCVTF (scalar, integer, S from 64-bit GPR) throughput 0.505900
SCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 11.149041
SCVTF (scalar, integer, D from 64-bit GPR) throughput 0.499189
SCVTF (scalar, fixed-point, S from S) 1->2 3.291355
SCVTF (scalar, fixed-point, S from S) throughput 0.560211
SCVTF (scalar, fixed-point, D from D) 1->2 3.128059
SCVTF (scalar, fixed-point, D from D) throughput 0.571024
SCVTF (vector, fixed-point, 2S from 2S) 1->2 3.236119
SCVTF (vector, fixed-point, 2S from 2S) throughput 0.357278
SCVTF (vector, fixed-point, 4S from 4S) 1->2 3.007992
SCVTF (vector, fixed-point, 4S from 4S) throughput 0.447131
SCVTF (vector, fixed-point, 2D from 2D) 1->2 3.082175
SCVTF (vector, fixed-point, 2D from 2D) throughput 0.604206
SCVTF (scalar, integer, S from S) 1->2 3.059918
SCVTF (scalar, integer, S from S) throughput 0.328453
SCVTF (scalar, integer, D from D) 1->2 3.025104
SCVTF (scalar, integer, D from D) throughput 0.486123
SCVTF (vector, integer, 2S from 2S) 1->2 3.003636
SCVTF (vector, integer, 2S from 2S) throughput 0.478421
SCVTF (vector, integer, 4S from 4S) 1->2 3.049093
SCVTF (vector, integer, 4S from 4S) throughput 0.638962
SCVTF (vector, integer, 2D from 2D) 1->2 3.003180
SCVTF (vector, integer, 2D from 2D) throughput 0.496317
SHA1C 1->1 4.051272
SHA1C 1->2 5.016176
SHA1C 1->3 3.424852
SHA1C throughput 4.260635
SHA1C throughput 9.748699
SHA1H 1->2 2.212774
SHA1H throughput 1.138778
SHA1M 1->1 6.134673
SHA1M 1->2 4.875985
SHA1M 1->3 3.976708
SHA1M throughput 4.697822
SHA1M throughput 5.052569
SHA1P 1->1 4.413201
SHA1P 1->2 5.006492
SHA1P 1->3 4.649625
SHA1P throughput 4.088976
SHA1P throughput 5.241915
SHA1SU0 1->1 1.317131
SHA1SU0 1->2 1.779497
SHA1SU0 1->3 2.030424
SHA1SU0 throughput 1.039081
SHA1SU0 throughput 1.166148
SHA1SU1 1->1 1.998886
SHA1SU1 1->2 1.985158
SHA1SU1 throughput 1.008883
SHA1SU1 throughput 1.008367
SHA256H 1->1 4.006509
SHA256H 1->2 5.040704
SHA256H 1->3 5.554881
SHA256H throughput 2.902630
SHA256H throughput 2.328667
SHA256H2 1->1 4.008517
SHA256H2 1->2 5.028506
SHA256H2 1->3 5.271121
SHA256H2 throughput 1.938203
SHA256H2 throughput 2.837457
SHA256SU0 1->1 2.110497
SHA256SU0 1->2 2.008347
SHA256SU0 throughput 1.018245
SHA256SU0 throughput 1.167320
SHA256SU1 1->1 3.045821
SHA256SU1 1->2 2.995285
SHA256SU1 1->3 3.006964
SHA256SU1 throughput 1.207115
SHA256SU1 throughput 1.178254
SHADD (vector, 8B) 1->2 1.626995
SHADD (vector, 8B) 1->3 1.903988
SHADD (vector, 8B) throughput 0.514430
SHADD (vector, 16B) 1->2 1.915966
SHADD (vector, 16B) 1->3 1.993592
SHADD (vector, 16B) throughput 0.520691
SHADD (vector, 4H) 1->2 1.854030
SHADD (vector, 4H) 1->3 2.035794
SHADD (vector, 4H) throughput 0.399484
SHADD (vector, 8H) 1->2 2.370750
SHADD (vector, 8H) 1->3 2.111178
SHADD (vector, 8H) throughput 0.421902
SHADD (vector, 2S) 1->2 3.026012
SHADD (vector, 2S) 1->3 1.934143
SHADD (vector, 2S) throughput 0.559937
SHADD (vector, 4S) 1->2 1.672959
SHADD (vector, 4S) 1->3 5.758187
SHADD (vector, 4S) throughput 0.566609
SHL (vector, 8B) 1->2 2.105348
SHL (vector, 8B) throughput 0.574626
SHL (vector, 16B) 1->2 2.003999
SHL (vector, 16B) throughput 0.445028
SHL (vector, 4H) 1->2 2.044053
SHL (vector, 4H) throughput 0.700117
SHL (vector, 8H) 1->2 1.990233
SHL (vector, 8H) throughput 0.500839
SHL (vector, 2S) 1->2 2.002233
SHL (vector, 2S) throughput 0.620495
SHL (vector, 4S) 1->2 1.999740
SHL (vector, 4S) throughput 0.455981
SHL (vector, 2D) 1->2 2.038344
SHL (vector, 2D) throughput 0.528903
SHL (scalar, D) 1->2 1.947001
SHL (scalar, D) throughput 0.592678
SHLL (vector, 8H) 1->2 1.859510
SHLL (vector, 8H) throughput 0.475248
SHLL (vector, 4S) 1->2 2.162726
SHLL (vector, 4S) throughput 0.516511
SHLL (vector, 2D) 1->2 2.620067
SHLL (vector, 2D) throughput 0.491338
SHLL2 (vector, 8H) 1->2 1.862501
SHLL2 (vector, 8H) throughput 0.545502
SHLL2 (vector, 4S) 1->2 1.959241
SHLL2 (vector, 4S) throughput 0.582233
SHLL2 (vector, 2D) 1->2 1.994776
SHLL2 (vector, 2D) throughput 0.525899
SHRN (8H) 1->2 3.078216
SHRN (8H) throughput 0.661883
SHRN (4S) 1->2 4.815494
SHRN (4S) throughput 0.371687
SHRN (2D) 1->2 3.316214
SHRN (2D) throughput 0.507678
SHRN2 (8H) 1->1 3.007827
SHRN2 (8H) 1->2 3.006640
SHRN2 (8H) throughput 0.503419
SHRN2 (8H) throughput 0.499430
SHRN2 (4S) 1->1 2.997977
SHRN2 (4S) 1->2 3.003805
SHRN2 (4S) throughput 0.500379
SHRN2 (4S) throughput 0.501873
SHRN2 (2D) 1->1 2.997010
SHRN2 (2D) 1->2 3.002507
SHRN2 (2D) throughput 0.501437
SHRN2 (2D) throughput 0.507549
SHSUB (vector, 8B) 1->2 2.004899
SHSUB (vector, 8B) 1->3 2.004039
SHSUB (vector, 8B) throughput 0.501795
SHSUB (vector, 16B) 1->2 2.000342
SHSUB (vector, 16B) 1->3 1.957339
SHSUB (vector, 16B) throughput 0.446011
SHSUB (vector, 4H) 1->2 2.011948
SHSUB (vector, 4H) 1->3 1.726524
SHSUB (vector, 4H) throughput 0.487286
SHSUB (vector, 8H) 1->2 1.990778
SHSUB (vector, 8H) 1->3 2.006438
SHSUB (vector, 8H) throughput 0.501997
SHSUB (vector, 2S) 1->2 2.001495
SHSUB (vector, 2S) 1->3 2.009630
SHSUB (vector, 2S) throughput 0.504384
SHSUB (vector, 4S) 1->2 2.007168
SHSUB (vector, 4S) 1->3 1.935026
SHSUB (vector, 4S) throughput 0.501028
SLI (vector, 8B) 1->1 2.016049
SLI (vector, 8B) 1->2 1.831947
SLI (vector, 8B) throughput 0.506567
SLI (vector, 8B) throughput 0.503098
SLI (vector, 16B) 1->1 1.981928
SLI (vector, 16B) 1->2 1.999109
SLI (vector, 16B) throughput 0.542134
SLI (vector, 16B) throughput 0.503133
SLI (vector, 4H) 1->1 2.020328
SLI (vector, 4H) 1->2 2.035310
SLI (vector, 4H) throughput 0.501670
SLI (vector, 4H) throughput 0.501104
SLI (vector, 8H) 1->1 2.003808
SLI (vector, 8H) 1->2 2.018126
SLI (vector, 8H) throughput 0.501324
SLI (vector, 8H) throughput 0.500663
SLI (vector, 2S) 1->1 1.999071
SLI (vector, 2S) 1->2 2.001741
SLI (vector, 2S) throughput 0.501085
SLI (vector, 2S) throughput 0.503613
SLI (vector, 4S) 1->1 2.001417
SLI (vector, 4S) 1->2 2.000485
SLI (vector, 4S) throughput 0.501979
SLI (vector, 4S) throughput 0.505544
SLI (vector, 2D) 1->1 1.998383
SLI (vector, 2D) 1->2 2.002066
SLI (vector, 2D) throughput 0.501438
SLI (vector, 2D) throughput 0.626713
SLI (scalar, D) 1->1 2.325765
SLI (scalar, D) 1->2 1.914288
SLI (scalar, D) throughput 0.386480
SLI (scalar, D) throughput 0.343077
SMAX (vector, 8B) 1->2 2.734828
SMAX (vector, 8B) 1->3 1.611673
SMAX (vector, 8B) throughput 0.480095
SMAX (vector, 16B) 1->2 2.007288
SMAX (vector, 16B) 1->3 2.022247
SMAX (vector, 16B) throughput 0.499990
SMAX (vector, 4H) 1->2 1.994675
SMAX (vector, 4H) 1->3 1.792808
SMAX (vector, 4H) throughput 0.564807
SMAX (vector, 8H) 1->2 2.002549
SMAX (vector, 8H) 1->3 2.001720
SMAX (vector, 8H) throughput 0.515603
SMAX (vector, 2S) 1->2 1.846244
SMAX (vector, 2S) 1->3 2.043013
SMAX (vector, 2S) throughput 0.656958
SMAX (vector, 4S) 1->2 2.296030
SMAX (vector, 4S) 1->3 2.003608
SMAX (vector, 4S) throughput 0.632772
SMAXP (vector, 8B) 1->2 1.968545
SMAXP (vector, 8B) 1->3 2.082002
SMAXP (vector, 8B) throughput 0.516500
SMAXP (vector, 16B) 1->2 2.298335
SMAXP (vector, 16B) 1->3 2.024417
SMAXP (vector, 16B) throughput 0.501979
SMAXP (vector, 4H) 1->2 1.989820
SMAXP (vector, 4H) 1->3 2.053554
SMAXP (vector, 4H) throughput 0.499341
SMAXP (vector, 8H) 1->2 2.030617
SMAXP (vector, 8H) 1->3 1.991000
SMAXP (vector, 8H) throughput 0.418627
SMAXP (vector, 2S) 1->2 1.999742
SMAXP (vector, 2S) 1->3 2.034116
SMAXP (vector, 2S) throughput 0.479085
SMAXP (vector, 4S) 1->2 2.086236
SMAXP (vector, 4S) 1->3 2.012648
SMAXP (vector, 4S) throughput 0.506180
SMAXV (vector, 8B) 1->2 3.001274
SMAXV (vector, 8B) throughput 0.512884
SMAXV (vector, 16B) 1->2 3.007935
SMAXV (vector, 16B) throughput 0.501903
SMAXV (vector, 4H) 1->2 3.192244
SMAXV (vector, 4H) throughput 0.499464
SMAXV (vector, 8H) 1->2 3.014160
SMAXV (vector, 8H) throughput 0.500308
SMAXV (vector, 4S) 1->2 2.999595
SMAXV (vector, 4S) throughput 0.527162
SMIN (vector, 8B) 1->2 1.977120
SMIN (vector, 8B) 1->3 2.020017
SMIN (vector, 8B) throughput 0.498721
SMIN (vector, 16B) 1->2 1.993190
SMIN (vector, 16B) 1->3 2.000483
SMIN (vector, 16B) throughput 0.511552
SMIN (vector, 4H) 1->2 2.002025
SMIN (vector, 4H) 1->3 2.003766
SMIN (vector, 4H) throughput 0.520112
SMIN (vector, 8H) 1->2 1.988933
SMIN (vector, 8H) 1->3 2.069363
SMIN (vector, 8H) throughput 0.500815
SMIN (vector, 2S) 1->2 2.008381
SMIN (vector, 2S) 1->3 1.999271
SMIN (vector, 2S) throughput 0.504682
SMIN (vector, 4S) 1->2 1.999557
SMIN (vector, 4S) 1->3 1.995196
SMIN (vector, 4S) throughput 0.500542
SMINP (vector, 8B) 1->2 1.998342
SMINP (vector, 8B) 1->3 2.002187
SMINP (vector, 8B) throughput 0.395699
SMINP (vector, 16B) 1->2 1.847285
SMINP (vector, 16B) 1->3 1.999275
SMINP (vector, 16B) throughput 0.506697
SMINP (vector, 4H) 1->2 2.002141
SMINP (vector, 4H) 1->3 2.346024
SMINP (vector, 4H) throughput 0.523312
SMINP (vector, 8H) 1->2 1.993068
SMINP (vector, 8H) 1->3 2.034254
SMINP (vector, 8H) throughput 0.500921
SMINP (vector, 2S) 1->2 2.001130
SMINP (vector, 2S) 1->3 1.996486
SMINP (vector, 2S) throughput 0.504290
SMINP (vector, 4S) 1->2 2.049841
SMINP (vector, 4S) 1->3 2.000525
SMINP (vector, 4S) throughput 0.500491
SMINV (vector, 8B) 1->2 3.041788
SMINV (vector, 8B) throughput 0.500172
SMINV (vector, 16B) 1->2 3.005102
SMINV (vector, 16B) throughput 0.500977
SMINV (vector, 4H) 1->2 3.321040
SMINV (vector, 4H) throughput 0.524513
SMINV (vector, 8H) 1->2 3.443354
SMINV (vector, 8H) throughput 0.477800
SMINV (vector, 4S) 1->2 3.004535
SMINV (vector, 4S) throughput 0.470443
SMLAL (by element, 4S) 1->1 2.826182
SMLAL (by element, 4S) 1->2 3.007856
SMLAL (by element, 4S) 1->3 3.039942
SMLAL (by element, 4S) throughput 0.553001
SMLAL (by element, 4S) throughput 0.543355
SMLAL (by element, 2D) 1->1 3.496292
SMLAL (by element, 2D) 1->2 2.997418
SMLAL (by element, 2D) 1->3 3.418896
SMLAL (by element, 2D) throughput 0.598293
SMLAL (by element, 2D) throughput 0.579837
SMLAL2 (by element, 4S) 1->1 3.060125
SMLAL2 (by element, 4S) 1->2 3.918842
SMLAL2 (by element, 4S) 1->3 3.023200
SMLAL2 (by element, 4S) throughput 0.644695
SMLAL2 (by element, 4S) throughput 0.516338
SMLAL2 (by element, 2D) 1->1 2.881804
SMLAL2 (by element, 2D) 1->2 3.038486
SMLAL2 (by element, 2D) 1->3 4.950226
SMLAL2 (by element, 2D) throughput 0.510654
SMLAL2 (by element, 2D) throughput 0.566586
SMLAL (vector, 8H) 1->1 3.028980
SMLAL (vector, 8H) 1->2 2.949429
SMLAL (vector, 8H) 1->3 3.008869
SMLAL (vector, 8H) throughput 0.514041
SMLAL (vector, 8H) throughput 0.528330
SMLAL (vector, 4S) 1->1 3.295244
SMLAL (vector, 4S) 1->2 3.198064
SMLAL (vector, 4S) 1->3 3.296404
SMLAL (vector, 4S) throughput 0.490309
SMLAL (vector, 4S) throughput 0.450080
SMLAL (vector, 2D) 1->1 2.951768
SMLAL (vector, 2D) 1->2 3.062477
SMLAL (vector, 2D) 1->3 3.526685
SMLAL (vector, 2D) throughput 0.589969
SMLAL (vector, 2D) throughput 0.480925
SMLAL2 (vector, 8H) 1->1 2.849970
SMLAL2 (vector, 8H) 1->2 3.028642
SMLAL2 (vector, 8H) 1->3 3.156189
SMLAL2 (vector, 8H) throughput 0.500999
SMLAL2 (vector, 8H) throughput 0.478366
SMLAL2 (vector, 4S) 1->1 2.545938
SMLAL2 (vector, 4S) 1->2 3.017366
SMLAL2 (vector, 4S) 1->3 3.011268
SMLAL2 (vector, 4S) throughput 0.491994
SMLAL2 (vector, 4S) throughput 0.570666
SMLAL2 (vector, 2D) 1->1 4.350049
SMLAL2 (vector, 2D) 1->2 2.883715
SMLAL2 (vector, 2D) 1->3 3.649403
SMLAL2 (vector, 2D) throughput 0.508877
SMLAL2 (vector, 2D) throughput 0.499391
SMLSL (by element, 4S) 1->1 3.005588
SMLSL (by element, 4S) 1->2 3.004089
SMLSL (by element, 4S) 1->3 3.028720
SMLSL (by element, 4S) throughput 0.502365
SMLSL (by element, 4S) throughput 0.553020
SMLSL (by element, 2D) 1->1 2.947778
SMLSL (by element, 2D) 1->2 3.028599
SMLSL (by element, 2D) 1->3 3.217926
SMLSL (by element, 2D) throughput 0.494125
SMLSL (by element, 2D) throughput 0.504300
SMLSL2 (by element, 4S) 1->1 3.014650
SMLSL2 (by element, 4S) 1->2 4.463565
SMLSL2 (by element, 4S) 1->3 2.993067
SMLSL2 (by element, 4S) throughput 0.506629
SMLSL2 (by element, 4S) throughput 0.505831
SMLSL2 (by element, 2D) 1->1 3.001555
SMLSL2 (by element, 2D) 1->2 3.023283
SMLSL2 (by element, 2D) 1->3 3.963480
SMLSL2 (by element, 2D) throughput 0.462687
SMLSL2 (by element, 2D) throughput 0.570078
SMLSL (vector, 8H) 1->1 3.540537
SMLSL (vector, 8H) 1->2 2.898409
SMLSL (vector, 8H) 1->3 4.891859
SMLSL (vector, 8H) throughput 0.503461
SMLSL (vector, 8H) throughput 0.505369
SMLSL (vector, 4S) 1->1 2.998453
SMLSL (vector, 4S) 1->2 5.692473
SMLSL (vector, 4S) 1->3 4.123894
SMLSL (vector, 4S) throughput 0.542829
SMLSL (vector, 4S) throughput 0.509587
SMLSL (vector, 2D) 1->1 3.108366
SMLSL (vector, 2D) 1->2 2.997250
SMLSL (vector, 2D) 1->3 3.046002
SMLSL (vector, 2D) throughput 0.520369
SMLSL (vector, 2D) throughput 0.505021
SMLSL2 (vector, 8H) 1->1 3.000003
SMLSL2 (vector, 8H) 1->2 3.017474
SMLSL2 (vector, 8H) 1->3 3.006357
SMLSL2 (vector, 8H) throughput 0.476535
SMLSL2 (vector, 8H) throughput 0.503824
SMLSL2 (vector, 4S) 1->1 2.938481
SMLSL2 (vector, 4S) 1->2 3.004765
SMLSL2 (vector, 4S) 1->3 2.884797
SMLSL2 (vector, 4S) throughput 0.596734
SMLSL2 (vector, 4S) throughput 0.730539
SMLSL2 (vector, 2D) 1->1 2.598412
SMLSL2 (vector, 2D) 1->2 3.208983
SMLSL2 (vector, 2D) 1->3 3.115093
SMLSL2 (vector, 2D) throughput 0.500903
SMLSL2 (vector, 2D) throughput 0.419695
SMULL (by element, 4S) 1->2 3.005669
SMULL (by element, 4S) 1->3 3.255625
SMULL (by element, 4S) throughput 0.512605
SMULL (by element, 2D) 1->2 2.996527
SMULL (by element, 2D) 1->3 3.049404
SMULL (by element, 2D) throughput 0.486058
SMULL2 (by element, 4S) 1->2 3.016030
SMULL2 (by element, 4S) 1->3 2.945528
SMULL2 (by element, 4S) throughput 0.473397
SMULL2 (by element, 2D) 1->2 3.840938
SMULL2 (by element, 2D) 1->3 3.574636
SMULL2 (by element, 2D) throughput 0.641237
SMULL (vector, 8H) 1->2 2.752544
SMULL (vector, 8H) 1->3 2.584957
SMULL (vector, 8H) throughput 0.532668
SMULL (vector, 4S) 1->2 3.103032
SMULL (vector, 4S) 1->3 2.928861
SMULL (vector, 4S) throughput 0.439363
SMULL (vector, 2D) 1->2 2.989323
SMULL (vector, 2D) 1->3 3.100492
SMULL (vector, 2D) throughput 0.451012
SMULL2 (vector, 8H) 1->2 2.896022
SMULL2 (vector, 8H) 1->3 3.165623
SMULL2 (vector, 8H) throughput 0.451292
SMULL2 (vector, 4S) 1->2 2.779296
SMULL2 (vector, 4S) 1->3 3.345433
SMULL2 (vector, 4S) throughput 0.617391
SMULL2 (vector, 2D) 1->2 3.195457
SMULL2 (vector, 2D) 1->3 2.999587
SMULL2 (vector, 2D) throughput 0.604326
SQABS (scalar, B) 1->2 3.472400
SQABS (scalar, B) throughput 0.627009
SQABS (scalar, H) 1->2 2.303872
SQABS (scalar, H) throughput 0.548397
SQABS (scalar, S) 1->2 3.005932
SQABS (scalar, S) throughput 0.512640
SQABS (scalar, D) 1->2 3.088566
SQABS (scalar, D) throughput 0.507566
SQABS (vector, 8B) 1->2 3.003698
SQABS (vector, 8B) throughput 0.515342
SQABS (vector, 16B) 1->2 3.011217
SQABS (vector, 16B) throughput 0.508145
SQABS (vector, 4H) 1->2 3.005674
SQABS (vector, 4H) throughput 0.502244
SQABS (vector, 8H) 1->2 3.023524
SQABS (vector, 8H) throughput 0.500373
SQABS (vector, 2S) 1->2 2.864715
SQABS (vector, 2S) throughput 0.498016
SQABS (vector, 4S) 1->2 3.005181
SQABS (vector, 4S) throughput 0.499646
SQABS (vector, 2D) 1->2 3.009597
SQABS (vector, 2D) throughput 0.508673
SQADD (scalar, B) 1->2 3.048914
SQADD (scalar, B) 1->3 3.003927
SQADD (scalar, B) throughput 0.518105
SQADD (scalar, H) 1->2 3.001883
SQADD (scalar, H) 1->3 2.993458
SQADD (scalar, H) throughput 0.500476
SQADD (scalar, S) 1->2 3.012473
SQADD (scalar, S) 1->3 3.157192
SQADD (scalar, S) throughput 0.534810
SQADD (scalar, D) 1->2 3.026652
SQADD (scalar, D) 1->3 4.288236
SQADD (scalar, D) throughput 0.152139
SQADD (vector, 8B) 1->2 3.021192
SQADD (vector, 8B) 1->3 3.055438
SQADD (vector, 8B) throughput 0.505976
SQADD (vector, 16B) 1->2 3.008360
SQADD (vector, 16B) 1->3 2.996531
SQADD (vector, 16B) throughput 0.623501
SQADD (vector, 4H) 1->2 2.543183
SQADD (vector, 4H) 1->3 3.006992
SQADD (vector, 4H) throughput 0.500116
SQADD (vector, 8H) 1->2 3.129619
SQADD (vector, 8H) 1->3 3.234857
SQADD (vector, 8H) throughput 0.497800
SQADD (vector, 2S) 1->2 3.006642
SQADD (vector, 2S) 1->3 3.027332
SQADD (vector, 2S) throughput 0.500525
SQADD (vector, 4S) 1->2 2.990222
SQADD (vector, 4S) 1->3 3.020003
SQADD (vector, 4S) throughput 0.520818
SQADD (vector, 2D) 1->2 3.010245
SQADD (vector, 2D) 1->3 2.998262
SQADD (vector, 2D) throughput 0.498087
SQDMLAL (by element, S) 1->1 3.003928
SQDMLAL (by element, S) 1->2 3.049528
SQDMLAL (by element, S) 1->3 3.000242
SQDMLAL (by element, S) throughput 0.522752
SQDMLAL (by element, S) throughput 0.504277
SQDMLAL (by element, D) 1->1 3.017898
SQDMLAL (by element, D) 1->2 2.999677
SQDMLAL (by element, D) 1->3 2.985283
SQDMLAL (by element, D) throughput 0.526290
SQDMLAL (by element, D) throughput 0.541117
SQDMLAL (by element, 4S) 1->1 2.949351
SQDMLAL (by element, 4S) 1->2 3.192838
SQDMLAL (by element, 4S) 1->3 3.086383
SQDMLAL (by element, 4S) throughput 0.499584
SQDMLAL (by element, 4S) throughput 0.505966
SQDMLAL (by element, 2D) 1->1 3.009272
SQDMLAL (by element, 2D) 1->2 3.030329
SQDMLAL (by element, 2D) 1->3 3.044863
SQDMLAL (by element, 2D) throughput 0.507021
SQDMLAL (by element, 2D) throughput 0.514776
SQDMLAL2 (by element, 4S) 1->1 3.343962
SQDMLAL2 (by element, 4S) 1->2 2.995472
SQDMLAL2 (by element, 4S) 1->3 3.096468
SQDMLAL2 (by element, 4S) throughput 0.510379
SQDMLAL2 (by element, 4S) throughput 0.509917
SQDMLAL2 (by element, 2D) 1->1 2.020210
SQDMLAL2 (by element, 2D) 1->2 3.084143
SQDMLAL2 (by element, 2D) 1->3 2.998184
SQDMLAL2 (by element, 2D) throughput 0.504110
SQDMLAL2 (by element, 2D) throughput 0.502440
SQDMLAL (scalar, S) 1->1 2.974457
SQDMLAL (scalar, S) 1->2 3.009234
SQDMLAL (scalar, S) 1->3 3.187135
SQDMLAL (scalar, S) throughput 0.501851
SQDMLAL (scalar, S) throughput 0.500837
SQDMLAL (scalar, D) 1->1 3.012650
SQDMLAL (scalar, D) 1->2 3.009392
SQDMLAL (scalar, D) 1->3 3.008543
SQDMLAL (scalar, D) throughput 0.501549
SQDMLAL (scalar, D) throughput 0.509604
SQDMLAL (vector, 4S) 1->1 3.065719
SQDMLAL (vector, 4S) 1->2 2.812892
SQDMLAL (vector, 4S) 1->3 3.027910
SQDMLAL (vector, 4S) throughput 0.636140
SQDMLAL (vector, 4S) throughput 0.502144
SQDMLAL (vector, 2D) 1->1 3.006803
SQDMLAL (vector, 2D) 1->2 3.006519
SQDMLAL (vector, 2D) 1->3 2.999516
SQDMLAL (vector, 2D) throughput 0.507610
SQDMLAL (vector, 2D) throughput 0.536091
SQDMLAL2 (vector, 4S) 1->1 2.838606
SQDMLAL2 (vector, 4S) 1->2 3.033688
SQDMLAL2 (vector, 4S) 1->3 3.027577
SQDMLAL2 (vector, 4S) throughput 0.506570
SQDMLAL2 (vector, 4S) throughput 0.501430
SQDMLAL2 (vector, 2D) 1->1 2.821224
SQDMLAL2 (vector, 2D) 1->2 3.032901
SQDMLAL2 (vector, 2D) 1->3 3.007089
SQDMLAL2 (vector, 2D) throughput 0.504479
SQDMLAL2 (vector, 2D) throughput 0.505002
SQDMLSL (by element, S) 1->1 3.013400
SQDMLSL (by element, S) 1->2 2.996241
SQDMLSL (by element, S) 1->3 2.961169
SQDMLSL (by element, S) throughput 0.552747
SQDMLSL (by element, S) throughput 0.507680
SQDMLSL (by element, D) 1->1 3.130264
SQDMLSL (by element, D) 1->2 2.993860
SQDMLSL (by element, D) 1->3 2.996121
SQDMLSL (by element, D) throughput 0.490468
SQDMLSL (by element, D) throughput 0.501998
SQDMLSL (by element, 4S) 1->1 2.988384
SQDMLSL (by element, 4S) 1->2 3.001554
SQDMLSL (by element, 4S) 1->3 3.041623
SQDMLSL (by element, 4S) throughput 0.501407
SQDMLSL (by element, 4S) throughput 0.523436
SQDMLSL (by element, 2D) 1->1 2.999554
SQDMLSL (by element, 2D) 1->2 3.024981
SQDMLSL (by element, 2D) 1->3 3.039034
SQDMLSL (by element, 2D) throughput 0.495917
SQDMLSL (by element, 2D) throughput 0.500798
SQDMLSL2 (by element, 4S) 1->1 3.006561
SQDMLSL2 (by element, 4S) 1->2 3.004820
SQDMLSL2 (by element, 4S) 1->3 3.006520
SQDMLSL2 (by element, 4S) throughput 0.503037
SQDMLSL2 (by element, 4S) throughput 0.505295
SQDMLSL2 (by element, 2D) 1->1 2.900908
SQDMLSL2 (by element, 2D) 1->2 2.993828
SQDMLSL2 (by element, 2D) 1->3 3.001615
SQDMLSL2 (by element, 2D) throughput 0.501341
SQDMLSL2 (by element, 2D) throughput 0.501989
SQDMLSL (scalar, S) 1->1 2.982338
SQDMLSL (scalar, S) 1->2 3.004048
SQDMLSL (scalar, S) 1->3 3.012401
SQDMLSL (scalar, S) throughput 0.500045
SQDMLSL (scalar, S) throughput 0.500863
SQDMLSL (scalar, D) 1->1 3.007563
SQDMLSL (scalar, D) 1->2 3.017593
SQDMLSL (scalar, D) 1->3 2.997225
SQDMLSL (scalar, D) throughput 0.522114
SQDMLSL (scalar, D) throughput 0.502184
SQDMLSL (vector, 4S) 1->1 2.993659
SQDMLSL (vector, 4S) 1->2 3.019274
SQDMLSL (vector, 4S) 1->3 2.999233
SQDMLSL (vector, 4S) throughput 0.509191
SQDMLSL (vector, 4S) throughput 0.506240
SQDMLSL (vector, 2D) 1->1 3.106844
SQDMLSL (vector, 2D) 1->2 3.141441
SQDMLSL (vector, 2D) 1->3 2.979331
SQDMLSL (vector, 2D) throughput 0.504981
SQDMLSL (vector, 2D) throughput 0.517781
SQDMLSL2 (vector, 4S) 1->1 2.998583
SQDMLSL2 (vector, 4S) 1->2 3.021074
SQDMLSL2 (vector, 4S) 1->3 3.253337
SQDMLSL2 (vector, 4S) throughput 0.499372
SQDMLSL2 (vector, 4S) throughput 0.414922
SQDMLSL2 (vector, 2D) 1->1 2.998100
SQDMLSL2 (vector, 2D) 1->2 2.997576
SQDMLSL2 (vector, 2D) 1->3 3.009111
SQDMLSL2 (vector, 2D) throughput 0.500768
SQDMLSL2 (vector, 2D) throughput 0.500617
SQDMULH (by element, H) 1->2 3.061268
SQDMULH (by element, H) 1->3 3.034944
SQDMULH (by element, H) throughput 0.500101
SQDMULH (by element, S) 1->2 3.000768
SQDMULH (by element, S) 1->3 3.005669
SQDMULH (by element, S) throughput 0.612429
SQDMULH (by element, 4H) 1->2 3.153676
SQDMULH (by element, 4H) 1->3 3.092952
SQDMULH (by element, 4H) throughput 0.583670
SQDMULH (by element, 8H) 1->2 3.633149
SQDMULH (by element, 8H) 1->3 2.857361
SQDMULH (by element, 8H) throughput 0.591613
SQDMULH (by element, 2S) 1->2 3.010811
SQDMULH (by element, 2S) 1->3 3.013068
SQDMULH (by element, 2S) throughput 0.476230
SQDMULH (by element, 4S) 1->2 2.405200
SQDMULH (by element, 4S) 1->3 3.000484
SQDMULH (by element, 4S) throughput 0.508685
SQDMULH (scalar, H) 1->2 2.057005
SQDMULH (scalar, H) 1->3 3.003720
SQDMULH (scalar, H) throughput 0.723287
SQDMULH (scalar, S) 1->2 3.009720
SQDMULH (scalar, S) 1->3 3.008325
SQDMULH (scalar, S) throughput 0.502558
SQDMULH (vector, 4H) 1->2 2.909365
SQDMULH (vector, 4H) 1->3 4.915345
SQDMULH (vector, 4H) throughput 0.587159
SQDMULH (vector, 8H) 1->2 3.871524
SQDMULH (vector, 8H) 1->3 3.007936
SQDMULH (vector, 8H) throughput 0.556455
SQDMULH (vector, 2S) 1->2 3.009840
SQDMULH (vector, 2S) 1->3 3.242582
SQDMULH (vector, 2S) throughput 0.578232
SQDMULH (vector, 4S) 1->2 3.008221
SQDMULH (vector, 4S) 1->3 3.063854
SQDMULH (vector, 4S) throughput 0.546736
SQDMULL (by element, S) 1->2 3.069566
SQDMULL (by element, S) 1->3 3.005752
SQDMULL (by element, S) throughput 0.503520
SQDMULL (by element, D) 1->2 3.509877
SQDMULL (by element, D) 1->3 3.441182
SQDMULL (by element, D) throughput 0.593149
SQDMULL (by element, 4S) 1->2 2.979054
SQDMULL (by element, 4S) 1->3 3.257607
SQDMULL (by element, 4S) throughput 0.501907
SQDMULL (by element, 2D) 1->2 3.022577
SQDMULL (by element, 2D) 1->3 3.006814
SQDMULL (by element, 2D) throughput 0.499682
SQDMULL2 (by element, 4S) 1->2 3.059888
SQDMULL2 (by element, 4S) 1->3 3.067782
SQDMULL2 (by element, 4S) throughput 0.534277
SQDMULL2 (by element, 2D) 1->2 3.002980
SQDMULL2 (by element, 2D) 1->3 3.060039
SQDMULL2 (by element, 2D) throughput 0.535131
SQDMULL (scalar, S) 1->2 3.007775
SQDMULL (scalar, S) 1->3 5.528016
SQDMULL (scalar, S) throughput 0.593691
SQDMULL (scalar, D) 1->2 3.167561
SQDMULL (scalar, D) 1->3 2.713940
SQDMULL (scalar, D) throughput 0.491888
SQDMULL (vector, 4S) 1->2 3.006153
SQDMULL (vector, 4S) 1->3 3.554972
SQDMULL (vector, 4S) throughput 0.506529
SQDMULL (vector, 2D) 1->2 3.003226
SQDMULL (vector, 2D) 1->3 3.034560
SQDMULL (vector, 2D) throughput 0.568673
SQDMULL2 (vector, 4S) 1->2 2.993657
SQDMULL2 (vector, 4S) 1->3 3.801023
SQDMULL2 (vector, 4S) throughput 0.578436
SQDMULL2 (vector, 2D) 1->2 2.985018
SQDMULL2 (vector, 2D) 1->3 2.990198
SQDMULL2 (vector, 2D) throughput 0.517305
SQNEG (scalar, B) 1->2 2.706624
SQNEG (scalar, B) throughput 0.561048
SQNEG (scalar, H) 1->2 3.091304
SQNEG (scalar, H) throughput 0.581881
SQNEG (scalar, S) 1->2 2.822549
SQNEG (scalar, S) throughput 0.416655
SQNEG (scalar, D) 1->2 3.270133
SQNEG (scalar, D) throughput 0.454799
SQNEG (vector, 8B) 1->2 2.842037
SQNEG (vector, 8B) throughput 0.505356
SQNEG (vector, 16B) 1->2 3.290256
SQNEG (vector, 16B) throughput 0.536155
SQNEG (vector, 4H) 1->2 3.228559
SQNEG (vector, 4H) throughput 0.510857
SQNEG (vector, 8H) 1->2 3.268666
SQNEG (vector, 8H) throughput 0.573829
SQNEG (vector, 2S) 1->2 3.066290
SQNEG (vector, 2S) throughput 0.485769
SQNEG (vector, 4S) 1->2 2.986320
SQNEG (vector, 4S) throughput 0.519163
SQNEG (vector, 2D) 1->2 4.952564
SQNEG (vector, 2D) throughput 0.610702
SQRDMULH (by element, H) 1->2 3.956337
SQRDMULH (by element, H) 1->3 3.023799
SQRDMULH (by element, H) throughput 0.500678
SQRDMULH (by element, S) 1->2 3.039815
SQRDMULH (by element, S) 1->3 3.004089
SQRDMULH (by element, S) throughput 0.500623
SQRDMULH (by element, 4H) 1->2 2.997493
SQRDMULH (by element, 4H) 1->3 2.997089
SQRDMULH (by element, 4H) throughput 0.502153
SQRDMULH (by element, 8H) 1->2 2.999072
SQRDMULH (by element, 8H) 1->3 3.006400
SQRDMULH (by element, 8H) throughput 0.500661
SQRDMULH (by element, 2S) 1->2 3.004778
SQRDMULH (by element, 2S) 1->3 2.993297
SQRDMULH (by element, 2S) throughput 0.507419
SQRDMULH (by element, 4S) 1->2 3.010327
SQRDMULH (by element, 4S) 1->3 3.005468
SQRDMULH (by element, 4S) throughput 0.566450
SQRDMULH (scalar, H) 1->2 3.004778
SQRDMULH (scalar, H) 1->3 2.998587
SQRDMULH (scalar, H) throughput 0.500910
SQRDMULH (scalar, S) 1->2 3.003889
SQRDMULH (scalar, S) 1->3 3.000767
SQRDMULH (scalar, S) throughput 0.504085
SQRDMULH (vector, 4H) 1->2 2.992858
SQRDMULH (vector, 4H) 1->3 3.015767
SQRDMULH (vector, 4H) throughput 0.549746
SQRDMULH (vector, 8H) 1->2 3.006185
SQRDMULH (vector, 8H) 1->3 2.991169
SQRDMULH (vector, 8H) throughput 0.458436
SQRDMULH (vector, 2S) 1->2 3.008182
SQRDMULH (vector, 2S) 1->3 3.027859
SQRDMULH (vector, 2S) throughput 0.500982
SQRDMULH (vector, 4S) 1->2 3.006791
SQRDMULH (vector, 4S) 1->3 2.996568
SQRDMULH (vector, 4S) throughput 0.513310
SQRSHL (scalar, B) 1->2 2.997212
SQRSHL (scalar, B) 1->3 3.003806
SQRSHL (scalar, B) throughput 0.502941
SQRSHL (scalar, H) 1->2 2.999030
SQRSHL (scalar, H) 1->3 3.005306
SQRSHL (scalar, H) throughput 0.499329
SQRSHL (scalar, S) 1->2 2.997331
SQRSHL (scalar, S) 1->3 3.018343
SQRSHL (scalar, S) throughput 0.502596
SQRSHL (scalar, D) 1->2 2.998705
SQRSHL (scalar, D) 1->3 3.003076
SQRSHL (scalar, D) throughput 0.500071
SQRSHL (vector, 8B) 1->2 3.003153
SQRSHL (vector, 8B) 1->3 2.998908
SQRSHL (vector, 8B) throughput 0.501647
SQRSHL (vector, 16B) 1->2 2.994152
SQRSHL (vector, 16B) 1->3 2.959500
SQRSHL (vector, 16B) throughput 0.502137
SQRSHL (vector, 4H) 1->2 3.010105
SQRSHL (vector, 4H) 1->3 2.999029
SQRSHL (vector, 4H) throughput 0.527158
SQRSHL (vector, 8H) 1->2 2.998867
SQRSHL (vector, 8H) 1->3 3.003509
SQRSHL (vector, 8H) throughput 0.493901
SQRSHL (vector, 2S) 1->2 2.997212
SQRSHL (vector, 2S) 1->3 2.992458
SQRSHL (vector, 2S) throughput 0.499939
SQRSHL (vector, 4S) 1->2 4.260130
SQRSHL (vector, 4S) 1->3 2.994592
SQRSHL (vector, 4S) throughput 0.529662
SQRSHL (vector, 2D) 1->2 2.934866
SQRSHL (vector, 2D) 1->3 3.007734
SQRSHL (vector, 2D) throughput 0.500106
SQRSHRN (H) 1->2 2.998666
SQRSHRN (H) throughput 0.499641
SQRSHRN (S) 1->2 2.998949
SQRSHRN (S) throughput 0.500439
SQRSHRN (D) 1->2 3.010974
SQRSHRN (D) throughput 0.546988
SQRSHRN (8H) 1->2 3.888693
SQRSHRN (8H) throughput 0.500157
SQRSHRN (4S) 1->2 3.004079
SQRSHRN (4S) throughput 0.500658
SQRSHRN (2D) 1->2 2.998384
SQRSHRN (2D) throughput 0.504177
SQRSHRN2 (8H) 1->1 3.065678
SQRSHRN2 (8H) 1->2 3.012512
SQRSHRN2 (8H) throughput 0.501463
SQRSHRN2 (8H) throughput 0.505671
SQRSHRN2 (4S) 1->1 3.003928
SQRSHRN2 (4S) 1->2 3.002771
SQRSHRN2 (4S) throughput 0.503267
SQRSHRN2 (4S) throughput 0.501387
SQRSHRN2 (2D) 1->1 2.995481
SQRSHRN2 (2D) 1->2 3.040777
SQRSHRN2 (2D) throughput 0.506258
SQRSHRN2 (2D) throughput 0.504319
SQRSHRUN (H) 1->2 3.033688
SQRSHRUN (H) throughput 0.500688
SQRSHRUN (S) 1->2 3.033933
SQRSHRUN (S) throughput 0.500952
SQRSHRUN (D) 1->2 3.022434
SQRSHRUN (D) throughput 0.544000
SQRSHRUN (8H) 1->2 3.185138
SQRSHRUN (8H) throughput 0.519237
SQRSHRUN (4S) 1->2 3.006963
SQRSHRUN (4S) throughput 0.505507
SQRSHRUN (2D) 1->2 2.997414
SQRSHRUN (2D) throughput 0.513439
SQRSHRUN2 (8H) 1->1 2.995600
SQRSHRUN2 (8H) 1->2 2.999959
SQRSHRUN2 (8H) throughput 0.513489
SQRSHRUN2 (8H) throughput 0.508692
SQRSHRUN2 (4S) 1->1 2.997089
SQRSHRUN2 (4S) 1->2 2.996284
SQRSHRUN2 (4S) throughput 0.503411
SQRSHRUN2 (4S) throughput 0.499205
SQRSHRUN2 (2D) 1->1 3.006195
SQRSHRUN2 (2D) 1->2 3.338471
SQRSHRUN2 (2D) throughput 0.507309
SQRSHRUN2 (2D) throughput 0.501063
SQSHL (immediate, scalar, B) 1->2 2.001579
SQSHL (immediate, scalar, B) throughput 0.581757
SQSHL (immediate, scalar, H) 1->2 2.000406
SQSHL (immediate, scalar, H) throughput 0.504105
SQSHL (immediate, scalar, S) 1->2 2.004332
SQSHL (immediate, scalar, S) throughput 0.508069
SQSHL (immediate, scalar, D) 1->2 2.022099
SQSHL (immediate, scalar, D) throughput 0.501779
SQSHL (immediate, vector, 8B) 1->2 2.045715
SQSHL (immediate, vector, 8B) throughput 0.507496
SQSHL (immediate, vector, 16B) 1->2 1.996849
SQSHL (immediate, vector, 16B) throughput 0.504752
SQSHL (immediate, vector, 4H) 1->2 1.958491
SQSHL (immediate, vector, 4H) throughput 0.475284
SQSHL (immediate, vector, 8H) 1->2 1.924322
SQSHL (immediate, vector, 8H) throughput 0.499945
SQSHL (immediate, vector, 2S) 1->2 2.006104
SQSHL (immediate, vector, 2S) throughput 0.499651
SQSHL (immediate, vector, 4S) 1->2 1.999071
SQSHL (immediate, vector, 4S) throughput 0.513914
SQSHL (immediate, vector, 2D) 1->2 2.003401
SQSHL (immediate, vector, 2D) throughput 0.501179
SQSHL (scalar, B) 1->2 1.966152
SQSHL (scalar, B) 1->3 1.995559
SQSHL (scalar, B) throughput 0.501056
SQSHL (scalar, H) 1->2 2.070208
SQSHL (scalar, H) 1->3 2.067836
SQSHL (scalar, H) throughput 0.473409
SQSHL (scalar, S) 1->2 2.001609
SQSHL (scalar, S) 1->3 2.003694
SQSHL (scalar, S) throughput 0.546499
SQSHL (scalar, D) 1->2 2.006682
SQSHL (scalar, D) 1->3 2.005103
SQSHL (scalar, D) throughput 0.499636
SQSHL (vector, 8B) 1->2 1.998302
SQSHL (vector, 8B) 1->3 2.002590
SQSHL (vector, 8B) throughput 0.505026
SQSHL (vector, 16B) 1->2 2.001416
SQSHL (vector, 16B) 1->3 2.001658
SQSHL (vector, 16B) throughput 0.500805
SQSHL (vector, 4H) 1->2 1.990189
SQSHL (vector, 4H) 1->3 1.999433
SQSHL (vector, 4H) throughput 0.529579
SQSHL (vector, 8H) 1->2 1.998546
SQSHL (vector, 8H) 1->3 1.999193
SQSHL (vector, 8H) throughput 0.597052
SQSHL (vector, 2S) 1->2 2.001546
SQSHL (vector, 2S) 1->3 1.975511
SQSHL (vector, 2S) throughput 0.517262
SQSHL (vector, 4S) 1->2 2.002390
SQSHL (vector, 4S) 1->3 2.003684
SQSHL (vector, 4S) throughput 0.500851
SQSHL (vector, 2D) 1->2 2.003402
SQSHL (vector, 2D) 1->3 2.000243
SQSHL (vector, 2D) throughput 0.506491
SQSHLU (scalar, B) 1->2 2.005021
SQSHLU (scalar, B) throughput 0.558506
SQSHLU (scalar, H) 1->2 1.946845
SQSHLU (scalar, H) throughput 0.501963
SQSHLU (scalar, S) 1->2 2.013523
SQSHLU (scalar, S) throughput 0.499652
SQSHLU (scalar, D) 1->2 1.996244
SQSHLU (scalar, D) throughput 0.507304
SQSHLU (vector, 8B) 1->2 2.007560
SQSHLU (vector, 8B) throughput 0.500947
SQSHLU (vector, 16B) 1->2 2.004050
SQSHLU (vector, 16B) throughput 0.501488
SQSHLU (vector, 4H) 1->2 2.004009
SQSHLU (vector, 4H) throughput 0.500957
SQSHLU (vector, 8H) 1->2 2.003806
SQSHLU (vector, 8H) throughput 0.501832
SQSHLU (vector, 2S) 1->2 2.002956
SQSHLU (vector, 2S) throughput 0.502141
SQSHLU (vector, 4S) 1->2 2.002025
SQSHLU (vector, 4S) throughput 0.501807
SQSHLU (vector, 2D) 1->2 2.002670
SQSHLU (vector, 2D) throughput 0.501908
SQSHRN (H) 1->2 3.002505
SQSHRN (H) throughput 0.500192
SQSHRN (S) 1->2 2.999718
SQSHRN (S) throughput 0.504466
SQSHRN (D) 1->2 3.014901
SQSHRN (D) throughput 0.500673
SQSHRN (8H) 1->2 3.004859
SQSHRN (8H) throughput 0.500081
SQSHRN (4S) 1->2 3.003968
SQSHRN (4S) throughput 0.499925
SQSHRN (2D) 1->2 3.004373
SQSHRN (2D) throughput 0.520810
SQSHRN2 (8H) 1->1 3.053101
SQSHRN2 (8H) 1->2 2.998788
SQSHRN2 (8H) throughput 0.501544
SQSHRN2 (8H) throughput 0.506965
SQSHRN2 (4S) 1->1 3.562566
SQSHRN2 (4S) 1->2 3.062928
SQSHRN2 (4S) throughput 0.507279
SQSHRN2 (4S) throughput 0.449896
SQSHRN2 (2D) 1->1 3.006561
SQSHRN2 (2D) 1->2 2.998746
SQSHRN2 (2D) throughput 0.501675
SQSHRN2 (2D) throughput 0.500635
SQSHRUN (H) 1->2 2.997495
SQSHRUN (H) throughput 0.500825
SQSHRUN (S) 1->2 3.007733
SQSHRUN (S) throughput 0.499692
SQSHRUN (D) 1->2 3.001255
SQSHRUN (D) throughput 0.462327
SQSHRUN (8H) 1->2 3.001550
SQSHRUN (8H) throughput 0.514202
SQSHRUN (4S) 1->2 3.017884
SQSHRUN (4S) throughput 0.501276
SQSHRUN (2D) 1->2 2.819247
SQSHRUN (2D) throughput 0.499607
SQSHRUN2 (8H) 1->1 3.002262
SQSHRUN2 (8H) 1->2 2.993182
SQSHRUN2 (8H) throughput 0.546163
SQSHRUN2 (8H) throughput 0.500744
SQSHRUN2 (4S) 1->1 3.059402
SQSHRUN2 (4S) 1->2 3.002915
SQSHRUN2 (4S) throughput 0.460710
SQSHRUN2 (4S) throughput 0.503715
SQSHRUN2 (2D) 1->1 3.003928
SQSHRUN2 (2D) 1->2 2.997452
SQSHRUN2 (2D) throughput 0.501465
SQSHRUN2 (2D) throughput 0.503077
SQSUB (scalar, B) 1->2 3.479109
SQSUB (scalar, B) 1->3 3.039398
SQSUB (scalar, B) throughput 0.496778
SQSUB (scalar, H) 1->2 2.917777
SQSUB (scalar, H) 1->3 3.018289
SQSUB (scalar, H) throughput 0.506563
SQSUB (scalar, S) 1->2 3.006438
SQSUB (scalar, S) 1->3 3.004051
SQSUB (scalar, S) throughput 0.500106
SQSUB (scalar, D) 1->2 3.035833
SQSUB (scalar, D) 1->3 2.993780
SQSUB (scalar, D) throughput 0.500987
SQSUB (vector, 8B) 1->2 2.994675
SQSUB (vector, 8B) 1->3 2.998746
SQSUB (vector, 8B) throughput 0.500668
SQSUB (vector, 16B) 1->2 3.004051
SQSUB (vector, 16B) 1->3 3.004128
SQSUB (vector, 16B) throughput 0.501326
SQSUB (vector, 4H) 1->2 3.021299
SQSUB (vector, 4H) 1->3 3.003805
SQSUB (vector, 4H) throughput 0.500927
SQSUB (vector, 8H) 1->2 3.005508
SQSUB (vector, 8H) 1->3 3.001179
SQSUB (vector, 8H) throughput 0.520061
SQSUB (vector, 2S) 1->2 3.037698
SQSUB (vector, 2S) 1->3 3.015782
SQSUB (vector, 2S) throughput 0.504872
SQSUB (vector, 4S) 1->2 3.006562
SQSUB (vector, 4S) 1->3 3.689516
SQSUB (vector, 4S) throughput 0.497996
SQSUB (vector, 2D) 1->2 3.019276
SQSUB (vector, 2D) 1->3 3.012534
SQSUB (vector, 2D) throughput 0.500789
SQXTN (H) 1->2 3.056994
SQXTN (H) throughput 0.486639
SQXTN (S) 1->2 3.035064
SQXTN (S) throughput 0.500693
SQXTN (D) 1->2 3.052006
SQXTN (D) throughput 0.508721
SQXTN (8H) 1->2 3.006112
SQXTN (8H) throughput 0.490159
SQXTN (4S) 1->2 3.058840
SQXTN (4S) throughput 0.483904
SQXTN (2D) 1->2 3.137458
SQXTN (2D) throughput 0.501248
SQXTN2 (8H) 1->1 2.994751
SQXTN2 (8H) 1->2 3.003726
SQXTN2 (8H) throughput 0.501473
SQXTN2 (8H) throughput 0.502220
SQXTN2 (4S) 1->1 2.999313
SQXTN2 (4S) 1->2 3.002951
SQXTN2 (4S) throughput 0.501544
SQXTN2 (4S) throughput 0.504336
SQXTN2 (2D) 1->1 3.002870
SQXTN2 (2D) 1->2 3.004010
SQXTN2 (2D) throughput 0.500470
SQXTN2 (2D) throughput 0.501084
SQXTUN (H) 1->2 3.006763
SQXTUN (H) throughput 0.501007
SQXTUN (S) 1->2 3.008366
SQXTUN (S) throughput 0.465829
SQXTUN (D) 1->2 2.999960
SQXTUN (D) throughput 0.500070
SQXTUN (8H) 1->2 3.055923
SQXTUN (8H) throughput 0.500217
SQXTUN (4S) 1->2 2.537115
SQXTUN (4S) throughput 0.487003
SQXTUN (2D) 1->2 3.010213
SQXTUN (2D) throughput 0.507268
SQXTUN2 (8H) 1->1 3.004082
SQXTUN2 (8H) 1->2 3.064016
SQXTUN2 (8H) throughput 0.503944
SQXTUN2 (8H) throughput 0.500930
SQXTUN2 (4S) 1->1 2.999432
SQXTUN2 (4S) 1->2 2.991989
SQXTUN2 (4S) throughput 0.501832
SQXTUN2 (4S) throughput 0.501009
SQXTUN2 (2D) 1->1 2.998667
SQXTUN2 (2D) 1->2 3.003564
SQXTUN2 (2D) throughput 0.500733
SQXTUN2 (2D) throughput 0.500245
SRHADD (vector, 8B) 1->2 2.002430
SRHADD (vector, 8B) 1->3 1.996245
SRHADD (vector, 8B) throughput 0.527845
SRHADD (vector, 16B) 1->2 2.001455
SRHADD (vector, 16B) 1->3 2.002509
SRHADD (vector, 16B) throughput 0.500481
SRHADD (vector, 4H) 1->2 1.998302
SRHADD (vector, 4H) 1->3 2.003888
SRHADD (vector, 4H) throughput 0.500132
SRHADD (vector, 8H) 1->2 2.034964
SRHADD (vector, 8H) 1->3 1.999655
SRHADD (vector, 8H) throughput 0.496352
SRHADD (vector, 2S) 1->2 2.011419
SRHADD (vector, 2S) 1->3 2.008199
SRHADD (vector, 2S) throughput 0.486575
SRHADD (vector, 4S) 1->2 2.001778
SRHADD (vector, 4S) 1->3 1.998302
SRHADD (vector, 4S) throughput 0.501200
SRI (vector, 8B) 1->1 2.007837
SRI (vector, 8B) 1->2 1.994514
SRI (vector, 8B) throughput 0.501534
SRI (vector, 8B) throughput 0.499879
SRI (vector, 16B) 1->1 2.001659
SRI (vector, 16B) 1->2 2.003643
SRI (vector, 16B) throughput 0.711092
SRI (vector, 16B) throughput 0.754190
SRI (vector, 4H) 1->1 2.004212
SRI (vector, 4H) 1->2 2.184784
SRI (vector, 4H) throughput 0.500882
SRI (vector, 4H) throughput 0.531105
SRI (vector, 8H) 1->1 2.186553
SRI (vector, 8H) 1->2 2.123729
SRI (vector, 8H) throughput 0.518402
SRI (vector, 8H) throughput 0.505493
SRI (vector, 2S) 1->1 2.006189
SRI (vector, 2S) 1->2 1.907257
SRI (vector, 2S) throughput 0.465679
SRI (vector, 2S) throughput 0.544131
SRI (vector, 4S) 1->1 10.931492
SRI (vector, 4S) 1->2 14.516744
SRI (vector, 4S) throughput 0.508750
SRI (vector, 4S) throughput 0.505805
SRI (vector, 2D) 1->1 2.025453
SRI (vector, 2D) 1->2 2.001374
SRI (vector, 2D) throughput 0.500781
SRI (vector, 2D) throughput 0.500091
SRI (vector, D) 1->1 2.003644
SRI (vector, D) 1->2 2.001539
SRI (vector, D) throughput 0.501739
SRI (vector, D) throughput 0.502198
SRSHL (vector, 8B) 1->2 3.018916
SRSHL (vector, 8B) 1->3 3.868938
SRSHL (vector, 8B) throughput 0.497979
SRSHL (vector, 16B) 1->2 3.034883
SRSHL (vector, 16B) 1->3 3.002868
SRSHL (vector, 16B) throughput 0.508326
SRSHL (vector, 4H) 1->2 3.007155
SRSHL (vector, 4H) 1->3 3.031486
SRSHL (vector, 4H) throughput 0.499748
SRSHL (vector, 8H) 1->2 3.004088
SRSHL (vector, 8H) 1->3 3.000079
SRSHL (vector, 8H) throughput 0.501984
SRSHL (vector, 2S) 1->2 3.055885
SRSHL (vector, 2S) 1->3 3.010813
SRSHL (vector, 2S) throughput 0.501821
SRSHL (vector, 4S) 1->2 3.035877
SRSHL (vector, 4S) 1->3 2.992651
SRSHL (vector, 4S) throughput 0.499924
SRSHL (vector, 2D) 1->2 3.007206
SRSHL (vector, 2D) 1->3 2.999312
SRSHL (vector, 2D) throughput 0.500091
SRSHL (vector, D) 1->2 3.005426
SRSHL (vector, D) 1->3 3.009760
SRSHL (vector, D) throughput 0.500227
SRSHR (vector, 8B) 1->2 3.011845
SRSHR (vector, 8B) throughput 0.501372
SRSHR (vector, 16B) 1->2 3.002796
SRSHR (vector, 16B) throughput 0.500866
SRSHR (vector, 4H) 1->2 3.003765
SRSHR (vector, 4H) throughput 0.500151
SRSHR (vector, 8H) 1->2 3.000605
SRSHR (vector, 8H) throughput 0.501129
SRSHR (vector, 2S) 1->2 3.003969
SRSHR (vector, 2S) throughput 0.500749
SRSHR (vector, 4S) 1->2 2.616290
SRSHR (vector, 4S) throughput 0.506145
SRSHR (vector, 2D) 1->2 3.005104
SRSHR (vector, 2D) throughput 0.499243
SRSHR (vector, D) 1->2 2.998666
SRSHR (vector, D) throughput 0.501806
SRSRA (vector, 8B) 1->1 3.033130
SRSRA (vector, 8B) 1->2 3.109935
SRSRA (vector, 8B) throughput 0.511310
SRSRA (vector, 8B) throughput 0.511036
SRSRA (vector, 16B) 1->1 3.061165
SRSRA (vector, 16B) 1->2 2.990229
SRSRA (vector, 16B) throughput 0.500599
SRSRA (vector, 16B) throughput 0.502277
SRSRA (vector, 4H) 1->1 2.999274
SRSRA (vector, 4H) 1->2 3.005337
SRSRA (vector, 4H) throughput 0.499869
SRSRA (vector, 4H) throughput 0.635998
SRSRA (vector, 8H) 1->1 2.993943
SRSRA (vector, 8H) 1->2 3.004010
SRSRA (vector, 8H) throughput 0.502081
SRSRA (vector, 8H) throughput 0.504531
SRSRA (vector, 2S) 1->1 3.009961
SRSRA (vector, 2S) 1->2 3.005792
SRSRA (vector, 2S) throughput 0.503159
SRSRA (vector, 2S) throughput 0.499235
SRSRA (vector, 4S) 1->1 2.997251
SRSRA (vector, 4S) 1->2 3.039173
SRSRA (vector, 4S) throughput 0.501164
SRSRA (vector, 4S) throughput 0.500768
SRSRA (vector, 2D) 1->1 3.027452
SRSRA (vector, 2D) 1->2 3.001657
SRSRA (vector, 2D) throughput 0.503194
SRSRA (vector, 2D) throughput 0.502951
SRSRA (vector, D) 1->1 3.008625
SRSRA (vector, D) 1->2 3.006600
SRSRA (vector, D) throughput 0.503149
SRSRA (vector, D) throughput 0.507012
SSHL (vector, 8B) 1->2 2.004649
SSHL (vector, 8B) 1->3 2.022005
SSHL (vector, 8B) throughput 0.500693
SSHL (vector, 16B) 1->2 1.996004
SSHL (vector, 16B) 1->3 2.000406
SSHL (vector, 16B) throughput 0.649285
SSHL (vector, 4H) 1->2 1.966226
SSHL (vector, 4H) 1->3 1.790707
SSHL (vector, 4H) throughput 0.500085
SSHL (vector, 8H) 1->2 2.201452
SSHL (vector, 8H) 1->3 2.001886
SSHL (vector, 8H) throughput 0.500476
SSHL (vector, 2S) 1->2 2.001414
SSHL (vector, 2S) 1->3 2.002389
SSHL (vector, 2S) throughput 0.507486
SSHL (vector, 4S) 1->2 2.009136
SSHL (vector, 4S) 1->3 2.000566
SSHL (vector, 4S) throughput 0.500957
SSHL (vector, 2D) 1->2 1.998181
SSHL (vector, 2D) 1->3 2.001378
SSHL (vector, 2D) throughput 0.501248
SSHL (vector, D) 1->2 2.005710
SSHL (vector, D) 1->3 2.001011
SSHL (vector, D) throughput 0.474733
SSHLL (8H) 1->2 1.995962
SSHLL (8H) throughput 0.504129
SSHLL (4S) 1->2 2.002511
SSHLL (4S) throughput 0.503263
SSHLL (2D) 1->2 1.998989
SSHLL (2D) throughput 0.500491
SSHLL2 (8H) 1->2 1.999192
SSHLL2 (8H) throughput 0.503169
SSHLL2 (4S) 1->2 1.999556
SSHLL2 (4S) throughput 0.500658
SSHLL2 (2D) 1->2 2.008643
SSHLL2 (2D) throughput 0.502465
SSHR (vector, 8B) 1->2 2.067298
SSHR (vector, 8B) throughput 0.500162
SSHR (vector, 16B) 1->2 1.921283
SSHR (vector, 16B) throughput 0.500937
SSHR (vector, 4H) 1->2 1.997497
SSHR (vector, 4H) throughput 0.502946
SSHR (vector, 8H) 1->2 2.082301
SSHR (vector, 8H) throughput 0.549584
SSHR (vector, 2S) 1->2 2.004048
SSHR (vector, 2S) throughput 0.501718
SSHR (vector, 4S) 1->2 1.999798
SSHR (vector, 4S) throughput 0.500881
SSHR (vector, 2D) 1->2 2.001498
SSHR (vector, 2D) throughput 0.500663
SSHR (vector, D) 1->2 2.004696
SSHR (vector, D) throughput 0.501124
SSRA (vector, 8B) 1->1 3.005590
SSRA (vector, 8B) 1->2 3.012716
SSRA (vector, 8B) throughput 0.499995
SSRA (vector, 8B) throughput 0.502622
SSRA (vector, 16B) 1->1 3.007085
SSRA (vector, 16B) 1->2 3.000365
SSRA (vector, 16B) throughput 0.505188
SSRA (vector, 16B) throughput 0.500040
SSRA (vector, 4H) 1->1 3.005343
SSRA (vector, 4H) 1->2 3.004172
SSRA (vector, 4H) throughput 0.501270
SSRA (vector, 4H) throughput 0.599157
SSRA (vector, 8H) 1->1 3.028021
SSRA (vector, 8H) 1->2 3.007656
SSRA (vector, 8H) throughput 0.501559
SSRA (vector, 8H) throughput 0.552259
SSRA (vector, 2S) 1->1 3.002941
SSRA (vector, 2S) 1->2 3.002506
SSRA (vector, 2S) throughput 0.505633
SSRA (vector, 2S) throughput 0.507162
SSRA (vector, 4S) 1->1 2.999678
SSRA (vector, 4S) 1->2 3.003969
SSRA (vector, 4S) throughput 0.504049
SSRA (vector, 4S) throughput 0.504373
SSRA (vector, 2D) 1->1 3.078916
SSRA (vector, 2D) 1->2 3.017494
SSRA (vector, 2D) throughput 0.507461
SSRA (vector, 2D) throughput 0.500593
SSRA (vector, D) 1->1 3.034883
SSRA (vector, D) 1->2 2.997292
SSRA (vector, D) throughput 0.501989
SSRA (vector, D) throughput 0.500010
SSUBL (vector, 8H) 1->2 1.996770
SSUBL (vector, 8H) 1->3 1.993630
SSUBL (vector, 8H) throughput 0.500780
SSUBL (vector, 4S) 1->2 1.996286
SSUBL (vector, 4S) 1->3 1.998546
SSUBL (vector, 4S) throughput 0.500096
SSUBL (vector, 2D) 1->2 1.992507
SSUBL (vector, 2D) 1->3 1.998060
SSUBL (vector, 2D) throughput 0.499889
SSUBL2 (vector, 8H) 1->2 2.005992
SSUBL2 (vector, 8H) 1->3 2.003108
SSUBL2 (vector, 8H) throughput 0.500536
SSUBL2 (vector, 4S) 1->2 2.010679
SSUBL2 (vector, 4S) 1->3 2.038265
SSUBL2 (vector, 4S) throughput 0.369727
SSUBL2 (vector, 2D) 1->2 2.003269
SSUBL2 (vector, 2D) 1->3 2.001702
SSUBL2 (vector, 2D) throughput 0.500886
SSUBW (vector, 8H) 1->2 1.998060
SSUBW (vector, 8H) 1->3 2.001618
SSUBW (vector, 8H) throughput 0.505043
SSUBW (vector, 4S) 1->2 1.985226
SSUBW (vector, 4S) 1->3 2.017752
SSUBW (vector, 4S) throughput 0.501812
SSUBW (vector, 2D) 1->2 2.010447
SSUBW (vector, 2D) 1->3 2.022046
SSUBW (vector, 2D) throughput 0.502539
SSUBW2 (vector, 8H) 1->2 2.001740
SSUBW2 (vector, 8H) 1->3 2.001375
SSUBW2 (vector, 8H) throughput 0.739194
SSUBW2 (vector, 4S) 1->2 2.015306
SSUBW2 (vector, 4S) 1->3 1.987937
SSUBW2 (vector, 4S) throughput 0.502058
SSUBW2 (vector, 2D) 1->2 2.006924
SSUBW2 (vector, 2D) 1->3 2.001539
SSUBW2 (vector, 2D) throughput 0.498596
STNP (S) throughput 1.025963
STNP (D) throughput 1.043861
STNP (Q) throughput 2.078856
STNP (signed offset, S) throughput 1.038707
STNP (signed offset, D) throughput 1.041593
STNP (signed offset, Q) throughput 2.090197
STP (S) throughput 0.999960
STP (D) throughput 1.139441
STP (Q) throughput 2.008707
STP (post-index, S) 3->3 1.272286
STP (post-index, S) throughput 1.024868
STP (post-index, D) 3->3 1.322916
STP (post-index, D) throughput 1.014627
STP (post-index, Q) 3->3 2.014026
STP (post-index, Q) throughput 1.990785
STP (pre-index, S) 3->3 1.486040
STP (pre-index, S) throughput 1.007733
STP (pre-index, D) 3->3 1.194301
STP (pre-index, D) throughput 1.104939
STP (pre-index, Q) 3->3 2.037017
STP (pre-index, Q) throughput 2.112572
STP (signed offset, S) throughput 0.950166
STP (signed offset, D) throughput 1.072080
STP (signed offset, Q) throughput 2.012709
STR (S) throughput 1.056329
STR (D) throughput 1.051909
STR (Q) throughput 1.006660
STR (post-index, S) 3->3 1.283754
STR (post-index, S) throughput 0.948258
STR (post-index, D) 3->3 1.221233
STR (post-index, D) throughput 1.210981
STR (post-index, Q) 3->3 1.154575
STR (post-index, Q) throughput 0.598137
STR (pre-index, S) 3->3 1.535841
STR (pre-index, S) throughput 1.019057
STR (pre-index, D) 3->3 1.046766
STR (pre-index, D) throughput 1.056794
STR (pre-index, Q) 3->3 1.472915
STR (pre-index, Q) throughput 0.731031
STR (signed offset, S) throughput 1.053465
STR (signed offset, D) throughput 1.032076
STR (signed offset, Q) throughput 1.022392
STR (register, S) throughput 1.042215
STR (register, D) throughput 1.008838
STR (register, Q) throughput 1.001341
STR (register, uxtw, S) throughput 1.002501
STR (register, uxtw, D) throughput 1.095900
STR (register, uxtw, Q) throughput 1.069675
STR (register, sxtw, S) throughput 0.997972
STR (register, sxtw, D) throughput 0.992167
STR (register, sxtw, Q) throughput 1.278306
STR (register, lsl, S) throughput 1.000283
STR (register, lsl, D) throughput 0.999631
STR (register, lsl, Q) throughput 1.269661
STUR (S) throughput 1.304385
STUR (D) throughput 1.038822
STUR (Q) throughput 1.023398
SUB (vector, 8B) 1->2 1.884181
SUB (vector, 8B) 1->3 2.003527
SUB (vector, 8B) throughput 0.431326
SUB (vector, 16B) 1->2 2.000161
SUB (vector, 16B) 1->3 2.013864
SUB (vector, 16B) throughput 0.503737
SUB (vector, 4H) 1->2 1.790820
SUB (vector, 4H) 1->3 2.054359
SUB (vector, 4H) throughput 0.500081
SUB (vector, 8H) 1->2 1.881353
SUB (vector, 8H) 1->3 2.370348
SUB (vector, 8H) throughput 0.443212
SUB (vector, 2S) 1->2 2.011606
SUB (vector, 2S) 1->3 1.597435
SUB (vector, 2S) throughput 0.502015
SUB (vector, 4S) 1->2 1.997079
SUB (vector, 4S) 1->3 2.006958
SUB (vector, 4S) throughput 0.509589
SUB (vector, 2D) 1->2 2.001537
SUB (vector, 2D) 1->3 2.001619
SUB (vector, 2D) throughput 0.499879
SUB (vector, D) 1->2 2.003887
SUB (vector, D) 1->3 2.002632
SUB (vector, D) throughput 0.633295
SUBHN (vector, 8H) 1->2 4.381085
SUBHN (vector, 8H) 1->3 2.936640
SUBHN (vector, 8H) throughput 0.534330
SUBHN (vector, 4S) 1->2 6.561866
SUBHN (vector, 4S) 1->3 3.026140
SUBHN (vector, 4S) throughput 0.528292
SUBHN (vector, 2D) 1->2 2.633643
SUBHN (vector, 2D) 1->3 3.037886
SUBHN (vector, 2D) throughput 0.534753
SUBHN2 (vector, 8H) 1->1 2.937879
SUBHN2 (vector, 8H) 1->2 3.372501
SUBHN2 (vector, 8H) 1->3 3.031820
SUBHN2 (vector, 8H) throughput 0.519114
SUBHN2 (vector, 8H) throughput 0.501891
SUBHN2 (vector, 4S) 1->1 3.043085
SUBHN2 (vector, 4S) 1->2 3.004639
SUBHN2 (vector, 4S) 1->3 3.009285
SUBHN2 (vector, 4S) throughput 0.477181
SUBHN2 (vector, 4S) throughput 0.503986
SUBHN2 (vector, 2D) 1->1 3.000526
SUBHN2 (vector, 2D) 1->2 3.004292
SUBHN2 (vector, 2D) 1->3 3.005792
SUBHN2 (vector, 2D) throughput 0.501655
SUBHN2 (vector, 2D) throughput 0.502291
SUQADD (scalar, B) 1->1 3.014213
SUQADD (scalar, B) 1->2 3.020005
SUQADD (scalar, B) throughput 0.499380
SUQADD (scalar, B) throughput 0.500921
SUQADD (scalar, H) 1->1 3.006482
SUQADD (scalar, H) 1->2 3.003038
SUQADD (scalar, H) throughput 0.500773
SUQADD (scalar, H) throughput 0.503409
SUQADD (scalar, S) 1->1 3.010488
SUQADD (scalar, S) 1->2 2.998261
SUQADD (scalar, S) throughput 0.500445
SUQADD (scalar, S) throughput 0.607076
SUQADD (scalar, D) 1->1 2.995514
SUQADD (scalar, D) 1->2 2.998545
SUQADD (scalar, D) throughput 0.501564
SUQADD (scalar, D) throughput 0.500741
SUQADD (vector, 8B) 1->1 3.004292
SUQADD (vector, 8B) 1->2 3.009880
SUQADD (vector, 8B) throughput 0.506692
SUQADD (vector, 8B) throughput 0.501252
SUQADD (vector, 16B) 1->1 3.003079
SUQADD (vector, 16B) 1->2 1.968559
SUQADD (vector, 16B) throughput 0.493915
SUQADD (vector, 16B) throughput 0.499827
SUQADD (vector, 4H) 1->1 2.996972
SUQADD (vector, 4H) 1->2 3.004819
SUQADD (vector, 4H) throughput 0.536828
SUQADD (vector, 4H) throughput 0.499889
SUQADD (vector, 8H) 1->1 2.996281
SUQADD (vector, 8H) 1->2 3.004743
SUQADD (vector, 8H) throughput 0.509034
SUQADD (vector, 8H) throughput 0.503601
SUQADD (vector, 2S) 1->1 2.995599
SUQADD (vector, 2S) 1->2 3.003887
SUQADD (vector, 2S) throughput 0.502420
SUQADD (vector, 2S) throughput 0.500083
SUQADD (vector, 4S) 1->1 2.995682
SUQADD (vector, 4S) 1->2 2.993902
SUQADD (vector, 4S) throughput 0.458729
SUQADD (vector, 4S) throughput 0.761102
SUQADD (vector, 2D) 1->1 2.858124
SUQADD (vector, 2D) 1->2 3.452689
SUQADD (vector, 2D) throughput 0.488925
SUQADD (vector, 2D) throughput 0.503596
SXTL (8H) 1->2 2.008300
SXTL (8H) throughput 0.500658
SXTL (4S) 1->2 1.977197
SXTL (4S) throughput 0.502157
SXTL (2D) 1->2 1.989008
SXTL (2D) throughput 0.540235
SXTL2 (8H) 1->2 1.999657
SXTL2 (8H) throughput 0.514695
SXTL2 (4S) 1->2 2.064282
SXTL2 (4S) throughput 0.545762
SXTL2 (2D) 1->2 1.996387
SXTL2 (2D) throughput 0.583660
TBL (single register table, 8B) 1->2 2.025149
TBL (single register table, 8B) 1->3 2.035841
TBL (single register table, 8B) throughput 0.645442
TBL (single register table, 16B) 1->2 2.003846
TBL (single register table, 16B) 1->3 1.953104
TBL (single register table, 16B) throughput 0.501682
TBL (two register table, 8B) 1->2 1.944759
TBL (two register table, 8B) 1->3 1.942964
TBL (two register table, 8B) 1->4 1.990899
TBL (two register table, 8B) throughput 0.503022
TBL (two register table, 16B) 1->2 2.449338
TBL (two register table, 16B) 1->3 2.026723
TBL (two register table, 16B) 1->4 2.027946
TBL (two register table, 16B) throughput 0.546886
TBL (three register table, 8B) 1->2 4.387148
TBL (three register table, 8B) 1->3 4.742329
TBL (three register table, 8B) 1->4 1.743119
TBL (three register table, 8B) 1->5 4.164090
TBL (three register table, 8B) throughput 1.116361
TBL (three register table, 16B) 1->2 3.498044
TBL (three register table, 16B) 1->3 5.012106
TBL (three register table, 16B) 1->4 2.184351
TBL (three register table, 16B) 1->5 4.004693
TBL (three register table, 16B) throughput 1.082302
TBL (four register table, 8B) 1->2 5.270441
TBL (four register table, 8B) 1->3 4.002271
TBL (four register table, 8B) 1->4 4.041704
TBL (four register table, 8B) 1->5 6.698875
TBL (four register table, 8B) 1->6 3.868545
TBL (four register table, 8B) throughput 1.168290
TBL (four register table, 16B) 1->2 4.224936
TBL (four register table, 16B) 1->3 3.988502
TBL (four register table, 16B) 1->4 4.245066
TBL (four register table, 16B) 1->5 5.019683
TBL (four register table, 16B) 1->6 3.940122
TBL (four register table, 16B) throughput 1.726255
TBX (single register table, 8B) 1->1 2.105161
TBX (single register table, 8B) 1->2 2.035628
TBX (single register table, 8B) 1->3 2.354108
TBX (single register table, 8B) throughput 0.372355
TBX (single register table, 8B) throughput 0.543319
TBX (single register table, 16B) 1->1 2.003685
TBX (single register table, 16B) 1->2 2.008868
TBX (single register table, 16B) 1->3 1.995202
TBX (single register table, 16B) throughput 0.581042
TBX (single register table, 16B) throughput 0.570306
TBX (two register table, 8B) 1->1 4.189078
TBX (two register table, 8B) 1->2 4.246723
TBX (two register table, 8B) 1->3 1.991254
TBX (two register table, 8B) 1->4 4.386379
TBX (two register table, 8B) throughput 1.129195
TBX (two register table, 16B) 1->1 4.467096
TBX (two register table, 16B) 1->2 4.558736
TBX (two register table, 16B) 1->3 2.116799
TBX (two register table, 16B) 1->4 3.800919
TBX (two register table, 16B) throughput 1.063570
TBX (three register table, 8B) 1->1 6.387795
TBX (three register table, 8B) 1->2 4.213440
TBX (three register table, 8B) 1->3 2.958374
TBX (three register table, 8B) 1->4 2.989796
TBX (three register table, 8B) 1->5 6.086680
TBX (three register table, 8B) throughput 1.662913
TBX (three register table, 16B) 1->1 6.525186
TBX (three register table, 16B) 1->2 6.559284
TBX (three register table, 16B) 1->3 4.520745
TBX (three register table, 16B) 1->4 2.707402
TBX (three register table, 16B) 1->5 6.332463
TBX (three register table, 16B) throughput 1.735098
TBX (four register table, 8B) 1->1 11.984942
TBX (four register table, 8B) 1->2 8.525770
TBX (four register table, 8B) 1->3 7.290039
TBX (four register table, 8B) 1->4 4.365438
TBX (four register table, 8B) 1->5 2.023570
TBX (four register table, 8B) 1->6 8.652513
TBX (four register table, 8B) throughput 1.933548
TBX (four register table, 16B) 1->1 9.206829
TBX (four register table, 16B) 1->2 8.447791
TBX (four register table, 16B) 1->3 7.034643
TBX (four register table, 16B) 1->4 4.389924
TBX (four register table, 16B) 1->5 2.012877
TBX (four register table, 16B) 1->6 8.522473
TBX (four register table, 16B) throughput 2.084390
TRN1 (vector, 8B) 1->2 2.040816
TRN1 (vector, 8B) 1->3 2.038938
TRN1 (vector, 8B) throughput 0.506167
TRN1 (vector, 16B) 1->2 2.001782
TRN1 (vector, 16B) 1->3 2.001414
TRN1 (vector, 16B) throughput 0.557463
TRN1 (vector, 4H) 1->2 1.927152
TRN1 (vector, 4H) 1->3 1.997783
TRN1 (vector, 4H) throughput 0.394201
TRN1 (vector, 8H) 1->2 1.864094
TRN1 (vector, 8H) 1->3 1.998667
TRN1 (vector, 8H) throughput 0.472379
TRN1 (vector, 2S) 1->2 1.798929
TRN1 (vector, 2S) 1->3 2.114264
TRN1 (vector, 2S) throughput 0.549210
TRN1 (vector, 4S) 1->2 1.758556
TRN1 (vector, 4S) 1->3 1.963672
TRN1 (vector, 4S) throughput 0.529229
TRN1 (vector, 2D) 1->2 2.035978
TRN1 (vector, 2D) 1->3 1.976727
TRN1 (vector, 2D) throughput 0.834853
TRN2 (vector, 8B) 1->2 2.009228
TRN2 (vector, 8B) 1->3 2.205048
TRN2 (vector, 8B) throughput 0.484013
TRN2 (vector, 16B) 1->2 2.030527
TRN2 (vector, 16B) 1->3 1.995024
TRN2 (vector, 16B) throughput 0.539536
TRN2 (vector, 4H) 1->2 2.000162
TRN2 (vector, 4H) 1->3 2.010245
TRN2 (vector, 4H) throughput 0.345918
TRN2 (vector, 8H) 1->2 2.004454
TRN2 (vector, 8H) 1->3 2.009575
TRN2 (vector, 8H) throughput 0.547113
TRN2 (vector, 2S) 1->2 1.997097
TRN2 (vector, 2S) 1->3 2.006387
TRN2 (vector, 2S) throughput 0.499409
TRN2 (vector, 4S) 1->2 2.014498
TRN2 (vector, 4S) 1->3 2.036159
TRN2 (vector, 4S) throughput 0.545007
TRN2 (vector, 2D) 1->2 2.721746
TRN2 (vector, 2D) 1->3 2.001173
TRN2 (vector, 2D) throughput 0.557248
UABA (vector, 8B) 1->1 3.012376
UABA (vector, 8B) 1->2 3.015588
UABA (vector, 8B) 1->3 3.043489
UABA (vector, 8B) throughput 0.584227
UABA (vector, 8B) throughput 0.659806
UABA (vector, 16B) 1->1 3.606709
UABA (vector, 16B) 1->2 2.973085
UABA (vector, 16B) 1->3 3.424069
UABA (vector, 16B) throughput 0.521626
UABA (vector, 16B) throughput 0.505386
UABA (vector, 4H) 1->1 3.004285
UABA (vector, 4H) 1->2 3.004890
UABA (vector, 4H) 1->3 3.004782
UABA (vector, 4H) throughput 0.516618
UABA (vector, 4H) throughput 0.509410
UABA (vector, 8H) 1->1 2.998547
UABA (vector, 8H) 1->2 2.999555
UABA (vector, 8H) 1->3 2.998910
UABA (vector, 8H) throughput 0.499657
UABA (vector, 8H) throughput 0.501855
UABA (vector, 2S) 1->1 2.998988
UABA (vector, 2S) 1->2 3.004938
UABA (vector, 2S) 1->3 3.004981
UABA (vector, 2S) throughput 0.500450
UABA (vector, 2S) throughput 0.545581
UABA (vector, 4S) 1->1 3.000541
UABA (vector, 4S) 1->2 3.020530
UABA (vector, 4S) 1->3 3.041100
UABA (vector, 4S) throughput 0.480946
UABA (vector, 4S) throughput 0.499843
UABAL (vector, 8H) 1->1 2.999192
UABAL (vector, 8H) 1->2 2.999313
UABAL (vector, 8H) 1->3 3.937148
UABAL (vector, 8H) throughput 0.510577
UABAL (vector, 8H) throughput 0.513663
UABAL (vector, 4S) 1->1 3.884978
UABAL (vector, 4S) 1->2 3.001205
UABAL (vector, 4S) 1->3 2.751259
UABAL (vector, 4S) throughput 0.501767
UABAL (vector, 4S) throughput 0.506250
UABAL (vector, 2D) 1->1 3.102301
UABAL (vector, 2D) 1->2 3.013262
UABAL (vector, 2D) 1->3 3.071869
UABAL (vector, 2D) throughput 0.503591
UABAL (vector, 2D) throughput 0.517312
UABAL2 (vector, 8H) 1->1 3.069438
UABAL2 (vector, 8H) 1->2 3.000484
UABAL2 (vector, 8H) 1->3 3.013500
UABAL2 (vector, 8H) throughput 0.501271
UABAL2 (vector, 8H) throughput 0.499413
UABAL2 (vector, 4S) 1->1 3.016508
UABAL2 (vector, 4S) 1->2 2.149381
UABAL2 (vector, 4S) 1->3 2.968712
UABAL2 (vector, 4S) throughput 0.832119
UABAL2 (vector, 4S) throughput 0.521277
UABAL2 (vector, 2D) 1->1 2.955655
UABAL2 (vector, 2D) 1->2 2.866641
UABAL2 (vector, 2D) 1->3 2.743871
UABAL2 (vector, 2D) throughput 0.489441
UABAL2 (vector, 2D) throughput 0.815735
UABD (vector, 8B) 1->2 3.542901
UABD (vector, 8B) 1->3 3.458645
UABD (vector, 8B) throughput 0.736395
UABD (vector, 16B) 1->2 3.777826
UABD (vector, 16B) 1->3 3.014343
UABD (vector, 16B) throughput 0.607980
UABD (vector, 4H) 1->2 2.968354
UABD (vector, 4H) 1->3 3.257936
UABD (vector, 4H) throughput 0.582999
UABD (vector, 8H) 1->2 3.530229
UABD (vector, 8H) 1->3 6.125389
UABD (vector, 8H) throughput 0.546821
UABD (vector, 2S) 1->2 3.091853
UABD (vector, 2S) 1->3 2.688652
UABD (vector, 2S) throughput 0.504279
UABD (vector, 4S) 1->2 3.323197
UABD (vector, 4S) 1->3 3.126994
UABD (vector, 4S) throughput 0.515446
UABDL (vector, 8H) 1->2 2.940565
UABDL (vector, 8H) 1->3 2.894123
UABDL (vector, 8H) throughput 0.525405
UABDL (vector, 4S) 1->2 3.007571
UABDL (vector, 4S) 1->3 3.490585
UABDL (vector, 4S) throughput 0.554998
UABDL (vector, 2D) 1->2 4.685013
UABDL (vector, 2D) 1->3 2.950823
UABDL (vector, 2D) throughput 0.589253
UABDL2 (vector, 8H) 1->2 3.011346
UABDL2 (vector, 8H) 1->3 2.478033
UABDL2 (vector, 8H) throughput 0.456040
UABDL2 (vector, 4S) 1->2 2.384823
UABDL2 (vector, 4S) 1->3 2.737678
UABDL2 (vector, 4S) throughput 0.499366
UABDL2 (vector, 2D) 1->2 3.006923
UABDL2 (vector, 2D) 1->3 4.055362
UABDL2 (vector, 2D) throughput 0.436354
UADALP (vector, 4H) 1->1 3.005983
UADALP (vector, 4H) 1->2 2.897276
UADALP (vector, 4H) throughput 0.507710
UADALP (vector, 4H) throughput 0.511346
UADALP (vector, 8H) 1->1 3.007908
UADALP (vector, 8H) 1->2 3.022958
UADALP (vector, 8H) throughput 0.511524
UADALP (vector, 8H) throughput 0.610777
UADALP (vector, 2S) 1->1 2.961537
UADALP (vector, 2S) 1->2 2.992182
UADALP (vector, 2S) throughput 0.504693
UADALP (vector, 2S) throughput 0.495274
UADALP (vector, 4S) 1->1 2.994225
UADALP (vector, 4S) 1->2 3.008910
UADALP (vector, 4S) throughput 0.503443
UADALP (vector, 4S) throughput 0.498574
UADALP (vector, 1D) 1->1 3.021862
UADALP (vector, 1D) 1->2 2.347373
UADALP (vector, 1D) throughput 0.559677
UADALP (vector, 1D) throughput 0.697949
UADALP (vector, 2D) 1->1 3.150477
UADALP (vector, 2D) 1->2 3.185844
UADALP (vector, 2D) throughput 0.500667
UADALP (vector, 2D) throughput 0.508501
UADDL (vector, 8H) 1->2 2.008423
UADDL (vector, 8H) 1->3 1.998180
UADDL (vector, 8H) throughput 0.499430
UADDL (vector, 4S) 1->2 1.986666
UADDL (vector, 4S) 1->3 1.927467
UADDL (vector, 4S) throughput 0.207887
UADDL (vector, 2D) 1->2 4.468753
UADDL (vector, 2D) 1->3 2.010923
UADDL (vector, 2D) throughput 0.596283
UADDL2 (vector, 8H) 1->2 2.069321
UADDL2 (vector, 8H) 1->3 1.957000
UADDL2 (vector, 8H) throughput 0.628416
UADDL2 (vector, 4S) 1->2 2.301002
UADDL2 (vector, 4S) 1->3 2.038065
UADDL2 (vector, 4S) throughput 0.515035
UADDL2 (vector, 2D) 1->2 2.371403
UADDL2 (vector, 2D) 1->3 1.990831
UADDL2 (vector, 2D) throughput 0.562891
UADDLP (vector, 4H) 1->2 1.602521
UADDLP (vector, 4H) throughput 0.790884
UADDLP (vector, 8H) 1->2 2.007051
UADDLP (vector, 8H) throughput 0.873410
UADDLP (vector, 2S) 1->2 1.917857
UADDLP (vector, 2S) throughput 0.443352
UADDLP (vector, 4S) 1->2 2.044721
UADDLP (vector, 4S) throughput 0.518982
UADDLP (vector, 1D) 1->2 1.997618
UADDLP (vector, 1D) throughput 0.500838
UADDLP (vector, 2D) 1->2 3.511391
UADDLP (vector, 2D) throughput 0.687573
UADDLV (vector, 8B) 1->2 2.841930
UADDLV (vector, 8B) throughput 0.609047
UADDLV (vector, 16B) 1->2 2.831086
UADDLV (vector, 16B) throughput 0.463608
UADDLV (vector, 4H) 1->2 2.854879
UADDLV (vector, 4H) throughput 0.560475
UADDLV (vector, 8H) 1->2 3.003847
UADDLV (vector, 8H) throughput 0.499995
UADDLV (vector, 4S) 1->2 3.012268
UADDLV (vector, 4S) throughput 0.507081
UADDW (vector, 8H) 1->2 2.000041
UADDW (vector, 8H) 1->3 2.004939
UADDW (vector, 8H) throughput 0.501410
UADDW (vector, 4S) 1->2 2.001418
UADDW (vector, 4S) 1->3 1.992226
UADDW (vector, 4S) throughput 0.499940
UADDW (vector, 2D) 1->2 2.026859
UADDW (vector, 2D) 1->3 2.001660
UADDW (vector, 2D) throughput 0.500916
UADDW2 (vector, 8H) 1->2 2.002429
UADDW2 (vector, 8H) 1->3 2.007305
UADDW2 (vector, 8H) throughput 0.506165
UADDW2 (vector, 4S) 1->2 2.019518
UADDW2 (vector, 4S) 1->3 1.737755
UADDW2 (vector, 4S) throughput 0.531340
UADDW2 (vector, 2D) 1->2 2.004495
UADDW2 (vector, 2D) 1->3 2.434771
UADDW2 (vector, 2D) throughput 0.504535
UCVTF (scalar, fixed-point, S from 32-bit GPR) 1->2 roundtrip 10.457200
UCVTF (scalar, fixed-point, S from 32-bit GPR) throughput 0.698316
UCVTF (scalar, fixed-point, D from 32-bit GPR) 1->2 roundtrip 7.191101
UCVTF (scalar, fixed-point, D from 32-bit GPR) throughput 0.512621
UCVTF (scalar, fixed-point, S from 64-bit GPR) 1->2 roundtrip 11.849162
UCVTF (scalar, fixed-point, S from 64-bit GPR) throughput 0.619549
UCVTF (scalar, fixed-point, D from 64-bit GPR) 1->2 roundtrip 10.648273
UCVTF (scalar, fixed-point, D from 64-bit GPR) throughput 0.566978
UCVTF (scalar, integer, S from 32-bit GPR) 1->2 roundtrip 10.610343
UCVTF (scalar, integer, S from 32-bit GPR) throughput 0.528529
UCVTF (scalar, integer, D from 32-bit GPR) 1->2 roundtrip 10.524389
UCVTF (scalar, integer, D from 32-bit GPR) throughput 0.556105
UCVTF (scalar, integer, S from 64-bit GPR) 1->2 roundtrip 9.621484
UCVTF (scalar, integer, S from 64-bit GPR) throughput 0.432386
UCVTF (scalar, integer, D from 64-bit GPR) 1->2 roundtrip 10.426746
UCVTF (scalar, integer, D from 64-bit GPR) throughput 0.475069
UCVTF (scalar, fixed-point, S from S) 1->2 2.998061
UCVTF (scalar, fixed-point, S from S) throughput 0.466083
UCVTF (scalar, fixed-point, D from D) 1->2 3.176450
UCVTF (scalar, fixed-point, D from D) throughput 0.436402
UCVTF (vector, fixed-point, 2S from 2S) 1->2 3.280614
UCVTF (vector, fixed-point, 2S from 2S) throughput 0.503234
UCVTF (vector, fixed-point, 4S from 4S) 1->2 3.073526
UCVTF (vector, fixed-point, 4S from 4S) throughput 0.506796
UCVTF (vector, fixed-point, 2D from 2D) 1->2 2.747564
UCVTF (vector, fixed-point, 2D from 2D) throughput 0.500288
UCVTF (scalar, integer, S from S) 1->2 3.064312
UCVTF (scalar, integer, S from S) throughput 0.555895
UCVTF (scalar, integer, D from D) 1->2 2.887607
UCVTF (scalar, integer, D from D) throughput 0.679989
UCVTF (vector, integer, 2S from 2S) 1->2 3.152348
UCVTF (vector, integer, 2S from 2S) throughput 0.540997
UCVTF (vector, integer, 4S from 4S) 1->2 3.006375
UCVTF (vector, integer, 4S from 4S) throughput 0.506009
UCVTF (vector, integer, 2D from 2D) 1->2 3.224033
UCVTF (vector, integer, 2D from 2D) throughput 0.461208
UHADD (vector, 8B) 1->2 1.822367
UHADD (vector, 8B) 1->3 2.006196
UHADD (vector, 8B) throughput 0.510822
UHADD (vector, 16B) 1->2 2.101755
UHADD (vector, 16B) 1->3 2.019476
UHADD (vector, 16B) throughput 0.707074
UHADD (vector, 4H) 1->2 2.001859
UHADD (vector, 4H) 1->3 2.010407
UHADD (vector, 4H) throughput 0.557679
UHADD (vector, 8H) 1->2 2.113739
UHADD (vector, 8H) 1->3 1.977806
UHADD (vector, 8H) throughput 0.548886
UHADD (vector, 2S) 1->2 2.124150
UHADD (vector, 2S) 1->3 2.000000
UHADD (vector, 2S) throughput 0.530922
UHADD (vector, 4S) 1->2 2.019514
UHADD (vector, 4S) 1->3 2.091906
UHADD (vector, 4S) throughput 0.838553
UHSUB (vector, 8B) 1->2 1.995457
UHSUB (vector, 8B) 1->3 1.677583
UHSUB (vector, 8B) throughput 0.519878
UHSUB (vector, 16B) 1->2 2.002910
UHSUB (vector, 16B) 1->3 1.965772
UHSUB (vector, 16B) throughput 0.443214
UHSUB (vector, 4H) 1->2 2.012813
UHSUB (vector, 4H) 1->3 2.195572
UHSUB (vector, 4H) throughput 0.811867
UHSUB (vector, 8H) 1->2 2.010722
UHSUB (vector, 8H) 1->3 2.004332
UHSUB (vector, 8H) throughput 0.503533
UHSUB (vector, 2S) 1->2 1.985147
UHSUB (vector, 2S) 1->3 2.002343
UHSUB (vector, 2S) throughput 0.499525
UHSUB (vector, 4S) 1->2 1.996174
UHSUB (vector, 4S) 1->3 2.006441
UHSUB (vector, 4S) throughput 0.500719
UMAX (vector, 8B) 1->2 2.246729
UMAX (vector, 8B) 1->3 2.016166
UMAX (vector, 8B) throughput 0.500709
UMAX (vector, 16B) 1->2 2.034838
UMAX (vector, 16B) 1->3 2.002227
UMAX (vector, 16B) throughput 0.806014
UMAX (vector, 4H) 1->2 2.485459
UMAX (vector, 4H) 1->3 2.042385
UMAX (vector, 4H) throughput 0.500302
UMAX (vector, 8H) 1->2 1.998223
UMAX (vector, 8H) 1->3 2.003523
UMAX (vector, 8H) throughput 0.550813
UMAX (vector, 2S) 1->2 2.001579
UMAX (vector, 2S) 1->3 1.995321
UMAX (vector, 2S) throughput 0.500653
UMAX (vector, 4S) 1->2 2.003945
UMAX (vector, 4S) 1->3 2.484453
UMAX (vector, 4S) throughput 0.542465
UMAXP (vector, 8B) 1->2 1.995252
UMAXP (vector, 8B) 1->3 2.011448
UMAXP (vector, 8B) throughput 0.498453
UMAXP (vector, 16B) 1->2 2.000404
UMAXP (vector, 16B) 1->3 2.129050
UMAXP (vector, 16B) throughput 0.522371
UMAXP (vector, 4H) 1->2 2.040910
UMAXP (vector, 4H) 1->3 2.008082
UMAXP (vector, 4H) throughput 0.433786
UMAXP (vector, 8H) 1->2 1.998452
UMAXP (vector, 8H) 1->3 2.001662
UMAXP (vector, 8H) throughput 0.483485
UMAXP (vector, 2S) 1->2 1.775554
UMAXP (vector, 2S) 1->3 2.007881
UMAXP (vector, 2S) throughput 0.501685
UMAXP (vector, 4S) 1->2 2.003274
UMAXP (vector, 4S) 1->3 2.004558
UMAXP (vector, 4S) throughput 0.491161
UMAXV (vector, 8B) 1->2 3.412627
UMAXV (vector, 8B) throughput 0.675986
UMAXV (vector, 16B) 1->2 3.030833
UMAXV (vector, 16B) throughput 0.502907
UMAXV (vector, 4H) 1->2 3.019840
UMAXV (vector, 4H) throughput 0.499808
UMAXV (vector, 8H) 1->2 3.004698
UMAXV (vector, 8H) throughput 0.500673
UMAXV (vector, 4S) 1->2 3.003847
UMAXV (vector, 4S) throughput 0.509343
UMIN (vector, 8B) 1->2 2.001459
UMIN (vector, 8B) 1->3 2.002390
UMIN (vector, 8B) throughput 0.497056
UMIN (vector, 16B) 1->2 1.999401
UMIN (vector, 16B) 1->3 2.000554
UMIN (vector, 16B) throughput 0.511671
UMIN (vector, 4H) 1->2 2.497732
UMIN (vector, 4H) 1->3 2.454370
UMIN (vector, 4H) throughput 0.502363
UMIN (vector, 8H) 1->2 2.001539
UMIN (vector, 8H) 1->3 2.007289
UMIN (vector, 8H) throughput 0.504434
UMIN (vector, 2S) 1->2 1.996405
UMIN (vector, 2S) 1->3 2.005142
UMIN (vector, 2S) throughput 0.499692
UMIN (vector, 4S) 1->2 1.997416
UMIN (vector, 4S) 1->3 1.994159
UMIN (vector, 4S) throughput 0.500925
UMINP (vector, 8B) 1->2 2.002469
UMINP (vector, 8B) 1->3 1.999192
UMINP (vector, 8B) throughput 0.506676
UMINP (vector, 16B) 1->2 2.024820
UMINP (vector, 16B) 1->3 2.003846
UMINP (vector, 16B) throughput 0.428618
UMINP (vector, 4H) 1->2 2.009754
UMINP (vector, 4H) 1->3 1.971866
UMINP (vector, 4H) throughput 0.497687
UMINP (vector, 8H) 1->2 1.997458
UMINP (vector, 8H) 1->3 2.004089
UMINP (vector, 8H) throughput 0.501812
UMINP (vector, 2S) 1->2 2.103062
UMINP (vector, 2S) 1->3 2.001458
UMINP (vector, 2S) throughput 0.551793
UMINP (vector, 4S) 1->2 1.991057
UMINP (vector, 4S) 1->3 1.993393
UMINP (vector, 4S) throughput 0.500491
UMINV (vector, 8B) 1->2 3.000484
UMINV (vector, 8B) throughput 0.509009
UMINV (vector, 16B) 1->2 3.006317
UMINV (vector, 16B) throughput 0.500971
UMINV (vector, 4H) 1->2 3.004088
UMINV (vector, 4H) throughput 0.499263
UMINV (vector, 8H) 1->2 3.003928
UMINV (vector, 8H) throughput 0.509704
UMINV (vector, 4S) 1->2 3.005669
UMINV (vector, 4S) throughput 0.501567
UMLAL (by element, 4S) 1->1 2.999839
UMLAL (by element, 4S) 1->2 3.654811
UMLAL (by element, 4S) 1->3 2.994952
UMLAL (by element, 4S) throughput 0.501321
UMLAL (by element, 4S) throughput 0.501198
UMLAL (by element, 2D) 1->1 3.004009
UMLAL (by element, 2D) 1->2 3.004371
UMLAL (by element, 2D) 1->3 2.998261
UMLAL (by element, 2D) throughput 0.501119
UMLAL (by element, 2D) throughput 0.500744
UMLAL2 (by element, 4S) 1->1 3.008909
UMLAL2 (by element, 4S) 1->2 3.005014
UMLAL2 (by element, 4S) 1->3 3.003887
UMLAL2 (by element, 4S) throughput 0.495833
UMLAL2 (by element, 4S) throughput 0.525928
UMLAL2 (by element, 2D) 1->1 2.993212
UMLAL2 (by element, 2D) 1->2 2.993753
UMLAL2 (by element, 2D) 1->3 2.958075
UMLAL2 (by element, 2D) throughput 0.502463
UMLAL2 (by element, 2D) throughput 0.499606
UMLAL (vector, 8H) 1->1 3.006318
UMLAL (vector, 8H) 1->2 3.004172
UMLAL (vector, 8H) 1->3 2.995597
UMLAL (vector, 8H) throughput 0.502310
UMLAL (vector, 8H) throughput 0.501108
UMLAL (vector, 4S) 1->1 3.005658
UMLAL (vector, 4S) 1->2 3.006318
UMLAL (vector, 4S) 1->3 2.998707
UMLAL (vector, 4S) throughput 0.501954
UMLAL (vector, 4S) throughput 0.500863
UMLAL (vector, 2D) 1->1 2.998503
UMLAL (vector, 2D) 1->2 3.003765
UMLAL (vector, 2D) 1->3 3.003059
UMLAL (vector, 2D) throughput 0.506492
UMLAL (vector, 2D) throughput 0.547277
UMLAL2 (vector, 8H) 1->1 3.006034
UMLAL2 (vector, 8H) 1->2 3.003969
UMLAL2 (vector, 8H) 1->3 3.004293
UMLAL2 (vector, 8H) throughput 0.501139
UMLAL2 (vector, 8H) throughput 0.500333
UMLAL2 (vector, 4S) 1->1 2.944956
UMLAL2 (vector, 4S) 1->2 2.999759
UMLAL2 (vector, 4S) 1->3 3.021586
UMLAL2 (vector, 4S) throughput 0.513671
UMLAL2 (vector, 4S) throughput 0.500454
UMLAL2 (vector, 2D) 1->1 3.117640
UMLAL2 (vector, 2D) 1->2 2.989494
UMLAL2 (vector, 2D) 1->3 2.995324
UMLAL2 (vector, 2D) throughput 0.502070
UMLAL2 (vector, 2D) throughput 0.535454
UMLSL (by element, 4S) 1->1 2.998826
UMLSL (by element, 4S) 1->2 2.979390
UMLSL (by element, 4S) 1->3 3.005345
UMLSL (by element, 4S) throughput 0.505578
UMLSL (by element, 4S) throughput 0.504171
UMLSL (by element, 2D) 1->1 3.005872
UMLSL (by element, 2D) 1->2 3.045706
UMLSL (by element, 2D) 1->3 2.999595
UMLSL (by element, 2D) throughput 0.502419
UMLSL (by element, 2D) throughput 0.504071
UMLSL2 (by element, 4S) 1->1 2.999473
UMLSL2 (by element, 4S) 1->2 3.032361
UMLSL2 (by element, 4S) 1->3 3.007817
UMLSL2 (by element, 4S) throughput 0.511140
UMLSL2 (by element, 4S) throughput 0.506101
UMLSL2 (by element, 2D) 1->1 3.008587
UMLSL2 (by element, 2D) 1->2 3.003928
UMLSL2 (by element, 2D) 1->3 3.001823
UMLSL2 (by element, 2D) throughput 0.500258
UMLSL2 (by element, 2D) throughput 0.509506
UMLSL (vector, 8H) 1->1 2.988960
UMLSL (vector, 8H) 1->2 3.516208
UMLSL (vector, 8H) 1->3 2.996323
UMLSL (vector, 8H) throughput 0.500490
UMLSL (vector, 8H) throughput 0.588719
UMLSL (vector, 4S) 1->1 3.000284
UMLSL (vector, 4S) 1->2 3.043092
UMLSL (vector, 4S) 1->3 3.009342
UMLSL (vector, 4S) throughput 0.502667
UMLSL (vector, 4S) throughput 0.500668
UMLSL (vector, 2D) 1->1 3.006399
UMLSL (vector, 2D) 1->2 3.000729
UMLSL (vector, 2D) 1->3 3.000041
UMLSL (vector, 2D) throughput 0.508637
UMLSL (vector, 2D) throughput 0.505424
UMLSL2 (vector, 8H) 1->1 2.982321
UMLSL2 (vector, 8H) 1->2 3.026275
UMLSL2 (vector, 8H) 1->3 3.003522
UMLSL2 (vector, 8H) throughput 0.507018
UMLSL2 (vector, 8H) throughput 0.507179
UMLSL2 (vector, 4S) 1->1 3.033138
UMLSL2 (vector, 4S) 1->2 3.008344
UMLSL2 (vector, 4S) 1->3 3.004212
UMLSL2 (vector, 4S) throughput 0.511546
UMLSL2 (vector, 4S) throughput 0.500382
UMLSL2 (vector, 2D) 1->1 3.003846
UMLSL2 (vector, 2D) 1->2 3.028994
UMLSL2 (vector, 2D) 1->3 2.991364
UMLSL2 (vector, 2D) throughput 0.567365
UMLSL2 (vector, 2D) throughput 0.499688
UMULL (by element, 4S) 1->2 2.999758
UMULL (by element, 4S) 1->3 3.010757
UMULL (by element, 4S) throughput 0.510123
UMULL (by element, 2D) 1->2 3.044865
UMULL (by element, 2D) 1->3 3.000604
UMULL (by element, 2D) throughput 0.500536
UMULL2 (by element, 4S) 1->2 3.011146
UMULL2 (by element, 4S) 1->3 2.997209
UMULL2 (by element, 4S) throughput 0.511687
UMULL2 (by element, 2D) 1->2 2.732818
UMULL2 (by element, 2D) 1->3 2.643800
UMULL2 (by element, 2D) throughput 0.505435
UMULL (vector, 8H) 1->2 3.250536
UMULL (vector, 8H) 1->3 3.234991
UMULL (vector, 8H) throughput 0.500527
UMULL (vector, 4S) 1->2 3.001032
UMULL (vector, 4S) 1->3 3.079452
UMULL (vector, 4S) throughput 0.527473
UMULL (vector, 2D) 1->2 3.001379
UMULL (vector, 2D) 1->3 3.001637
UMULL (vector, 2D) throughput 0.528311
UMULL2 (vector, 8H) 1->2 3.052963
UMULL2 (vector, 8H) 1->3 3.001457
UMULL2 (vector, 8H) throughput 0.501687
UMULL2 (vector, 4S) 1->2 3.168095
UMULL2 (vector, 4S) 1->3 3.113388
UMULL2 (vector, 4S) throughput 0.513803
UMULL2 (vector, 2D) 1->2 3.231967
UMULL2 (vector, 2D) 1->3 3.006792
UMULL2 (vector, 2D) throughput 0.521272
UQADD (scalar, B) 1->2 3.017023
UQADD (scalar, B) 1->3 3.009117
UQADD (scalar, B) throughput 0.500387
UQADD (scalar, H) 1->2 3.001292
UQADD (scalar, H) 1->3 3.355431
UQADD (scalar, H) throughput 0.499081
UQADD (scalar, S) 1->2 3.001723
UQADD (scalar, S) 1->3 2.996384
UQADD (scalar, S) throughput 0.501174
UQADD (scalar, D) 1->2 3.004698
UQADD (scalar, D) 1->3 3.002997
UQADD (scalar, D) throughput 0.500501
UQADD (vector, 8B) 1->2 3.042355
UQADD (vector, 8B) 1->3 3.007143
UQADD (vector, 8B) throughput 0.442692
UQADD (vector, 16B) 1->2 3.203748
UQADD (vector, 16B) 1->3 3.003240
UQADD (vector, 16B) throughput 0.501757
UQADD (vector, 4H) 1->2 3.003078
UQADD (vector, 4H) 1->3 3.004050
UQADD (vector, 4H) throughput 0.501683
UQADD (vector, 8H) 1->2 3.009369
UQADD (vector, 8H) 1->3 3.006356
UQADD (vector, 8H) throughput 0.500719
UQADD (vector, 2S) 1->2 3.010152
UQADD (vector, 2S) 1->3 3.045059
UQADD (vector, 2S) throughput 0.503016
UQADD (vector, 4S) 1->2 3.497136
UQADD (vector, 4S) 1->3 3.036524
UQADD (vector, 4S) throughput 0.501969
UQADD (vector, 2D) 1->2 3.020203
UQADD (vector, 2D) 1->3 3.006276
UQADD (vector, 2D) throughput 0.501473
UQRSHL (scalar, B) 1->2 2.994993
UQRSHL (scalar, B) 1->3 3.028942
UQRSHL (scalar, B) throughput 0.499328
UQRSHL (scalar, H) 1->2 3.005913
UQRSHL (scalar, H) 1->3 2.999392
UQRSHL (scalar, H) throughput 0.505886
UQRSHL (scalar, S) 1->2 3.004048
UQRSHL (scalar, S) 1->3 3.013089
UQRSHL (scalar, S) throughput 0.499495
UQRSHL (scalar, D) 1->2 3.005224
UQRSHL (scalar, D) 1->3 3.005655
UQRSHL (scalar, D) throughput 0.500658
UQRSHL (vector, 8B) 1->2 3.007126
UQRSHL (vector, 8B) 1->3 3.024457
UQRSHL (vector, 8B) throughput 0.503057
UQRSHL (vector, 16B) 1->2 3.005468
UQRSHL (vector, 16B) 1->3 3.008626
UQRSHL (vector, 16B) throughput 0.503547
UQRSHL (vector, 4H) 1->2 2.780224
UQRSHL (vector, 4H) 1->3 3.007546
UQRSHL (vector, 4H) throughput 0.987806
UQRSHL (vector, 8H) 1->2 2.832677
UQRSHL (vector, 8H) 1->3 3.602381
UQRSHL (vector, 8H) throughput 0.574786
UQRSHL (vector, 2S) 1->2 3.007937
UQRSHL (vector, 2S) 1->3 3.538890
UQRSHL (vector, 2S) throughput 0.514295
UQRSHL (vector, 4S) 1->2 3.014361
UQRSHL (vector, 4S) 1->3 2.637247
UQRSHL (vector, 4S) throughput 0.664171
UQRSHL (vector, 2D) 1->2 2.719851
UQRSHL (vector, 2D) 1->3 2.971758
UQRSHL (vector, 2D) throughput 0.521039
UQRSHRN (H) 1->2 3.285268
UQRSHRN (H) throughput 0.514991
UQRSHRN (S) 1->2 3.573747
UQRSHRN (S) throughput 0.525182
UQRSHRN (D) 1->2 3.079974
UQRSHRN (D) throughput 0.508211
UQRSHRN (8H) 1->2 6.921608
UQRSHRN (8H) throughput 0.500784
UQRSHRN (4S) 1->2 8.954140
UQRSHRN (4S) throughput 0.505460
UQRSHRN (2D) 1->2 3.302136
UQRSHRN (2D) throughput 0.607179
UQRSHRN2 (8H) 1->1 2.679829
UQRSHRN2 (8H) 1->2 4.643218
UQRSHRN2 (8H) throughput 0.626300
UQRSHRN2 (8H) throughput 0.612307
UQRSHRN2 (4S) 1->1 2.806391
UQRSHRN2 (4S) 1->2 4.021526
UQRSHRN2 (4S) throughput 0.523523
UQRSHRN2 (4S) throughput 0.507590
UQRSHRN2 (2D) 1->1 3.029720
UQRSHRN2 (2D) 1->2 3.010251
UQRSHRN2 (2D) throughput 0.512515
UQRSHRN2 (2D) throughput 0.506653
UQSHL (immediate, scalar, B) 1->2 2.011004
UQSHL (immediate, scalar, B) throughput 0.501908
UQSHL (immediate, scalar, H) 1->2 2.138286
UQSHL (immediate, scalar, H) throughput 0.504419
UQSHL (immediate, scalar, S) 1->2 2.006277
UQSHL (immediate, scalar, S) throughput 0.537711
UQSHL (immediate, scalar, D) 1->2 2.012642
UQSHL (immediate, scalar, D) throughput 0.491334
UQSHL (immediate, vector, 8B) 1->2 1.910435
UQSHL (immediate, vector, 8B) throughput 0.502025
UQSHL (immediate, vector, 16B) 1->2 2.005506
UQSHL (immediate, vector, 16B) throughput 0.501063
UQSHL (immediate, vector, 4H) 1->2 1.873589
UQSHL (immediate, vector, 4H) throughput 0.500495
UQSHL (immediate, vector, 8H) 1->2 1.999960
UQSHL (immediate, vector, 8H) throughput 0.483492
UQSHL (immediate, vector, 2S) 1->2 1.995321
UQSHL (immediate, vector, 2S) throughput 0.510465
UQSHL (immediate, vector, 4S) 1->2 2.003523
UQSHL (immediate, vector, 4S) throughput 0.500481
UQSHL (immediate, vector, 2D) 1->2 2.189327
UQSHL (immediate, vector, 2D) throughput 0.511200
UQSHL (scalar, B) 1->2 2.005699
UQSHL (scalar, B) 1->3 2.002309
UQSHL (scalar, B) throughput 0.500141
UQSHL (scalar, H) 1->2 1.999029
UQSHL (scalar, H) 1->3 2.001499
UQSHL (scalar, H) throughput 0.502495
UQSHL (scalar, S) 1->2 2.002713
UQSHL (scalar, S) 1->3 2.027615
UQSHL (scalar, S) throughput 0.500989
UQSHL (scalar, D) 1->2 2.011378
UQSHL (scalar, D) 1->3 2.005669
UQSHL (scalar, D) throughput 0.516109
UQSHL (vector, 8B) 1->2 1.998342
UQSHL (vector, 8B) 1->3 2.004252
UQSHL (vector, 8B) throughput 0.505346
UQSHL (vector, 16B) 1->2 2.004697
UQSHL (vector, 16B) 1->3 2.003767
UQSHL (vector, 16B) throughput 0.499464
UQSHL (vector, 4H) 1->2 2.003962
UQSHL (vector, 4H) 1->3 2.001657
UQSHL (vector, 4H) throughput 0.500810
UQSHL (vector, 8H) 1->2 1.995239
UQSHL (vector, 8H) 1->3 2.508322
UQSHL (vector, 8H) throughput 0.497732
UQSHL (vector, 2S) 1->2 1.988098
UQSHL (vector, 2S) 1->3 2.015091
UQSHL (vector, 2S) throughput 0.502505
UQSHL (vector, 4S) 1->2 1.996445
UQSHL (vector, 4S) 1->3 2.004730
UQSHL (vector, 4S) throughput 0.499939
UQSHL (vector, 2D) 1->2 2.042112
UQSHL (vector, 2D) 1->3 2.001620
UQSHL (vector, 2D) throughput 0.477072
UQSHRN (H) 1->2 3.392759
UQSHRN (H) throughput 0.500182
UQSHRN (S) 1->2 3.012148
UQSHRN (S) throughput 0.500121
UQSHRN (D) 1->2 3.009032
UQSHRN (D) throughput 0.500921
UQSHRN (8H) 1->2 2.997577
UQSHRN (8H) throughput 0.501807
UQSHRN (4S) 1->2 3.007369
UQSHRN (4S) throughput 0.499591
UQSHRN (2D) 1->2 2.995474
UQSHRN (2D) throughput 0.500622
UQSHRN2 (8H) 1->1 3.004051
UQSHRN2 (8H) 1->2 3.006347
UQSHRN2 (8H) throughput 0.501301
UQSHRN2 (8H) throughput 0.591390
UQSHRN2 (4S) 1->1 2.945206
UQSHRN2 (4S) 1->2 3.006018
UQSHRN2 (4S) throughput 0.471756
UQSHRN2 (4S) throughput 0.513580
UQSHRN2 (2D) 1->1 3.052552
UQSHRN2 (2D) 1->2 2.997815
UQSHRN2 (2D) throughput 0.501554
UQSHRN2 (2D) throughput 0.502942
UQSUB (scalar, B) 1->2 3.002060
UQSUB (scalar, B) 1->3 3.268618
UQSUB (scalar, B) throughput 0.502396
UQSUB (scalar, H) 1->2 2.997059
UQSUB (scalar, H) 1->3 3.033528
UQSUB (scalar, H) throughput 0.501882
UQSUB (scalar, S) 1->2 3.004132
UQSUB (scalar, S) 1->3 3.001090
UQSUB (scalar, S) throughput 0.500780
UQSUB (scalar, D) 1->2 3.004332
UQSUB (scalar, D) 1->3 3.007651
UQSUB (scalar, D) throughput 0.505710
UQSUB (vector, 8B) 1->2 2.999029
UQSUB (vector, 8B) 1->3 2.999517
UQSUB (vector, 8B) throughput 0.500086
UQSUB (vector, 16B) 1->2 3.004657
UQSUB (vector, 16B) 1->3 2.997735
UQSUB (vector, 16B) throughput 0.499303
UQSUB (vector, 4H) 1->2 3.010649
UQSUB (vector, 4H) 1->3 2.998220
UQSUB (vector, 4H) throughput 0.504550
UQSUB (vector, 8H) 1->2 3.086739
UQSUB (vector, 8H) 1->3 3.016121
UQSUB (vector, 8H) throughput 0.489916
UQSUB (vector, 2S) 1->2 3.059730
UQSUB (vector, 2S) 1->3 1.996872
UQSUB (vector, 2S) throughput 0.573582
UQSUB (vector, 4S) 1->2 4.739487
UQSUB (vector, 4S) 1->3 2.591698
UQSUB (vector, 4S) throughput 1.121370
UQSUB (vector, 2D) 1->2 3.148319
UQSUB (vector, 2D) 1->3 4.411490
UQSUB (vector, 2D) throughput 0.599011
UQXTN (H) 1->2 2.977129
UQXTN (H) throughput 0.586779
UQXTN (S) 1->2 3.033272
UQXTN (S) throughput 0.756405
UQXTN (D) 1->2 3.023385
UQXTN (D) throughput 0.495257
UQXTN (8H) 1->2 4.256384
UQXTN (8H) throughput 0.832011
UQXTN (4S) 1->2 4.341512
UQXTN (4S) throughput 0.505235
UQXTN (2D) 1->2 3.025028
UQXTN (2D) throughput 0.504063
UQXTN2 (8H) 1->1 2.587378
UQXTN2 (8H) 1->2 4.379300
UQXTN2 (8H) throughput 0.520148
UQXTN2 (8H) throughput 0.727016
UQXTN2 (4S) 1->1 2.773772
UQXTN2 (4S) 1->2 2.628527
UQXTN2 (4S) throughput 0.624190
UQXTN2 (4S) throughput 0.510067
UQXTN2 (2D) 1->1 3.435462
UQXTN2 (2D) 1->2 2.918382
UQXTN2 (2D) throughput 0.300850
UQXTN2 (2D) throughput 0.603676
URECPE (vector, 2S) 1->2 3.013039
URECPE (vector, 2S) throughput 1.010032
URECPE (vector, 4S) 1->2 4.005748
URECPE (vector, 4S) throughput 2.037750
URHADD (vector, 8B) 1->2 2.004009
URHADD (vector, 8B) 1->3 2.461513
URHADD (vector, 8B) throughput 0.384811
URHADD (vector, 16B) 1->2 2.120179
URHADD (vector, 16B) 1->3 1.919364
URHADD (vector, 16B) throughput 0.770809
URHADD (vector, 4H) 1->2 1.303291
URHADD (vector, 4H) 1->3 2.494275
URHADD (vector, 4H) throughput 0.569315
URHADD (vector, 8H) 1->2 2.002632
URHADD (vector, 8H) 1->3 2.003766
URHADD (vector, 8H) throughput 0.501377
URHADD (vector, 2S) 1->2 2.004901
URHADD (vector, 2S) 1->3 1.988352
URHADD (vector, 2S) throughput 0.508580
URHADD (vector, 4S) 1->2 2.021343
URHADD (vector, 4S) 1->3 1.998022
URHADD (vector, 4S) throughput 0.509050
URSHL (vector, 8B) 1->2 3.007653
URSHL (vector, 8B) 1->3 2.835124
URSHL (vector, 8B) throughput 0.501543
URSHL (vector, 16B) 1->2 3.745608
URSHL (vector, 16B) 1->3 3.000669
URSHL (vector, 16B) throughput 0.388147
URSHL (vector, 4H) 1->2 5.821141
URSHL (vector, 4H) 1->3 2.970527
URSHL (vector, 4H) throughput 0.610191
URSHL (vector, 8H) 1->2 3.286739
URSHL (vector, 8H) 1->3 3.360378
URSHL (vector, 8H) throughput 0.319726
URSHL (vector, 2S) 1->2 2.488568
URSHL (vector, 2S) 1->3 3.125006
URSHL (vector, 2S) throughput 0.423143
URSHL (vector, 4S) 1->2 4.679257
URSHL (vector, 4S) 1->3 3.022717
URSHL (vector, 4S) throughput 0.509860
URSHL (vector, 2D) 1->2 3.009247
URSHL (vector, 2D) 1->3 3.068553
URSHL (vector, 2D) throughput 0.500045
URSHL (vector, D) 1->2 3.006317
URSHL (vector, D) 1->3 3.001816
URSHL (vector, D) throughput 0.500673
URSHR (vector, 8B) 1->2 2.998058
URSHR (vector, 8B) throughput 0.500698
URSHR (vector, 16B) 1->2 3.009920
URSHR (vector, 16B) throughput 0.500678
URSHR (vector, 4H) 1->2 3.004941
URSHR (vector, 4H) throughput 0.509312
URSHR (vector, 8H) 1->2 2.954608
URSHR (vector, 8H) throughput 0.501139
URSHR (vector, 2S) 1->2 2.946709
URSHR (vector, 2S) throughput 0.531821
URSHR (vector, 4S) 1->2 2.997495
URSHR (vector, 4S) throughput 0.501422
URSHR (vector, 2D) 1->2 2.993660
URSHR (vector, 2D) throughput 0.508366
URSHR (vector, D) 1->2 3.045716
URSHR (vector, D) throughput 0.499031
URSQRTE (vector, 2S) 1->2 3.004246
URSQRTE (vector, 2S) throughput 1.003665
URSQRTE (vector, 4S) 1->2 4.040411
URSQRTE (vector, 4S) throughput 2.008019
URSRA (vector, 8B) 1->1 3.003355
URSRA (vector, 8B) 1->2 3.003968
URSRA (vector, 8B) throughput 0.503118
URSRA (vector, 8B) throughput 0.512767
URSRA (vector, 16B) 1->1 3.244222
URSRA (vector, 16B) 1->2 3.004008
URSRA (vector, 16B) throughput 0.514830
URSRA (vector, 16B) throughput 0.502523
URSRA (vector, 4H) 1->1 3.028914
URSRA (vector, 4H) 1->2 2.998747
URSRA (vector, 4H) throughput 0.537276
URSRA (vector, 4H) throughput 0.499389
URSRA (vector, 8H) 1->1 3.047338
URSRA (vector, 8H) 1->2 2.995999
URSRA (vector, 8H) throughput 0.577622
URSRA (vector, 8H) throughput 0.523333
URSRA (vector, 2S) 1->1 3.011381
URSRA (vector, 2S) 1->2 3.006235
URSRA (vector, 2S) throughput 0.504517
URSRA (vector, 2S) throughput 0.503449
URSRA (vector, 4S) 1->1 3.079847
URSRA (vector, 4S) 1->2 3.000443
URSRA (vector, 4S) throughput 0.500712
URSRA (vector, 4S) throughput 0.500036
URSRA (vector, 2D) 1->1 3.000202
URSRA (vector, 2D) 1->2 2.998706
URSRA (vector, 2D) throughput 0.502314
URSRA (vector, 2D) throughput 0.502417
URSRA (vector, D) 1->1 3.013890
URSRA (vector, D) 1->2 3.039845
URSRA (vector, D) throughput 0.501417
URSRA (vector, D) throughput 0.501303
USHL (vector, 8B) 1->2 1.989495
USHL (vector, 8B) 1->3 1.999152
USHL (vector, 8B) throughput 0.502374
USHL (vector, 16B) 1->2 1.994234
USHL (vector, 16B) 1->3 1.991499
USHL (vector, 16B) throughput 0.526541
USHL (vector, 4H) 1->2 1.996527
USHL (vector, 4H) 1->3 1.996398
USHL (vector, 4H) throughput 0.501022
USHL (vector, 8H) 1->2 1.996244
USHL (vector, 8H) 1->3 1.997013
USHL (vector, 8H) throughput 0.500187
USHL (vector, 2S) 1->2 1.996202
USHL (vector, 2S) 1->3 1.997496
USHL (vector, 2S) throughput 0.499995
USHL (vector, 4S) 1->2 1.927933
USHL (vector, 4S) 1->3 2.036458
USHL (vector, 4S) throughput 0.457522
USHL (vector, 2D) 1->2 2.034435
USHL (vector, 2D) 1->3 2.138450
USHL (vector, 2D) throughput 0.589878
USHL (vector, D) 1->2 1.999655
USHL (vector, D) 1->3 1.996566
USHL (vector, D) throughput 0.477089
USHLL (8H) 1->2 2.001011
USHLL (8H) throughput 0.501119
USHLL (4S) 1->2 1.998870
USHLL (4S) throughput 0.505809
USHLL (2D) 1->2 1.875590
USHLL (2D) throughput 0.504577
USHLL2 (8H) 1->2 2.052146
USHLL2 (8H) throughput 0.446601
USHLL2 (4S) 1->2 1.999394
USHLL2 (4S) throughput 0.502997
USHLL2 (2D) 1->2 2.018545
USHLL2 (2D) throughput 0.502672
USHR (vector, 8B) 1->2 2.005020
USHR (vector, 8B) throughput 0.477208
USHR (vector, 16B) 1->2 2.004049
USHR (vector, 16B) throughput 0.504016
USHR (vector, 4H) 1->2 2.001456
USHR (vector, 4H) throughput 0.502161
USHR (vector, 8H) 1->2 2.001416
USHR (vector, 8H) throughput 0.501605
USHR (vector, 2S) 1->2 1.749885
USHR (vector, 2S) throughput 0.550078
USHR (vector, 4S) 1->2 2.005507
USHR (vector, 4S) throughput 0.502693
USHR (vector, 2D) 1->2 1.995962
USHR (vector, 2D) throughput 0.500121
USHR (vector, D) 1->2 2.041423
USHR (vector, D) throughput 0.494804
USQADD (scalar, B) 1->1 2.997816
USQADD (scalar, B) 1->2 3.003724
USQADD (scalar, B) throughput 0.503462
USQADD (scalar, B) throughput 0.501184
USQADD (scalar, H) 1->1 6.566940
USQADD (scalar, H) 1->2 2.973582
USQADD (scalar, H) throughput 0.503567
USQADD (scalar, H) throughput 0.500800
USQADD (scalar, S) 1->1 3.007369
USQADD (scalar, S) 1->2 2.998706
USQADD (scalar, S) throughput 0.502809
USQADD (scalar, S) throughput 0.500716
USQADD (scalar, D) 1->1 3.000890
USQADD (scalar, D) 1->2 3.033487
USQADD (scalar, D) throughput 0.513847
USQADD (scalar, D) throughput 0.499771
USQADD (vector, 8B) 1->1 2.999030
USQADD (vector, 8B) 1->2 2.997737
USQADD (vector, 8B) throughput 0.502815
USQADD (vector, 8B) throughput 0.506357
USQADD (vector, 16B) 1->1 2.998708
USQADD (vector, 16B) 1->2 3.000404
USQADD (vector, 16B) throughput 0.501625
USQADD (vector, 16B) throughput 0.510639
USQADD (vector, 4H) 1->1 3.000766
USQADD (vector, 4H) 1->2 2.994345
USQADD (vector, 4H) throughput 0.500242
USQADD (vector, 4H) throughput 0.504551
USQADD (vector, 8H) 1->1 2.997130
USQADD (vector, 8H) 1->2 2.985403
USQADD (vector, 8H) throughput 0.475672
USQADD (vector, 8H) throughput 0.505702
USQADD (vector, 2S) 1->1 3.004091
USQADD (vector, 2S) 1->2 3.003768
USQADD (vector, 2S) throughput 0.590591
USQADD (vector, 2S) throughput 0.493307
USQADD (vector, 4S) 1->1 3.036975
USQADD (vector, 4S) 1->2 3.004050
USQADD (vector, 4S) throughput 0.504719
USQADD (vector, 4S) throughput 0.502641
USQADD (vector, 2D) 1->1 2.998869
USQADD (vector, 2D) 1->2 2.865945
USQADD (vector, 2D) throughput 0.501979
USQADD (vector, 2D) throughput 0.514321
USRA (vector, 8B) 1->1 3.001335
USRA (vector, 8B) 1->2 2.947156
USRA (vector, 8B) throughput 0.501108
USRA (vector, 8B) throughput 0.503346
USRA (vector, 16B) 1->1 3.010770
USRA (vector, 16B) 1->2 3.003035
USRA (vector, 16B) throughput 0.504724
USRA (vector, 16B) throughput 0.589196
USRA (vector, 4H) 1->1 2.993619
USRA (vector, 4H) 1->2 2.998624
USRA (vector, 4H) throughput 0.501589
USRA (vector, 4H) throughput 0.500894
USRA (vector, 8H) 1->1 3.803372
USRA (vector, 8H) 1->2 2.293926
USRA (vector, 8H) throughput 0.603799
USRA (vector, 8H) throughput 0.759979
USRA (vector, 2S) 1->1 2.623391
USRA (vector, 2S) 1->2 3.010146
USRA (vector, 2S) throughput 0.314360
USRA (vector, 2S) throughput 0.873608
USRA (vector, 4S) 1->1 2.998970
USRA (vector, 4S) 1->2 3.022581
USRA (vector, 4S) throughput 0.509754
USRA (vector, 4S) throughput 0.505011
USRA (vector, 2D) 1->1 2.812467
USRA (vector, 2D) 1->2 2.506845
USRA (vector, 2D) throughput 0.380819
USRA (vector, 2D) throughput 0.506094
USRA (vector, D) 1->1 2.992441
USRA (vector, D) 1->2 2.139951
USRA (vector, D) throughput 0.556692
USRA (vector, D) throughput 0.817266
USUBL (vector, 8H) 1->2 2.753544
USUBL (vector, 8H) 1->3 2.007528
USUBL (vector, 8H) throughput 0.517599
USUBL (vector, 4S) 1->2 1.994681
USUBL (vector, 4S) 1->3 2.013432
USUBL (vector, 4S) throughput 0.520950
USUBL (vector, 2D) 1->2 2.013319
USUBL (vector, 2D) 1->3 2.005394
USUBL (vector, 2D) throughput 0.500893
USUBL2 (vector, 8H) 1->2 2.014213
USUBL2 (vector, 8H) 1->3 2.003954
USUBL2 (vector, 8H) throughput 0.472374
USUBL2 (vector, 4S) 1->2 1.577588
USUBL2 (vector, 4S) 1->3 1.999742
USUBL2 (vector, 4S) throughput 0.372265
USUBL2 (vector, 2D) 1->2 2.098169
USUBL2 (vector, 2D) 1->3 2.038050
USUBL2 (vector, 2D) throughput 1.150694
USUBW (vector, 8H) 1->2 1.682018
USUBW (vector, 8H) 1->3 2.008733
USUBW (vector, 8H) throughput 0.965066
USUBW (vector, 4S) 1->2 1.996234
USUBW (vector, 4S) 1->3 2.033434
USUBW (vector, 4S) throughput 0.499165
USUBW (vector, 2D) 1->2 1.365220
USUBW (vector, 2D) 1->3 2.073189
USUBW (vector, 2D) throughput 0.499442
USUBW2 (vector, 8H) 1->2 1.997746
USUBW2 (vector, 8H) 1->3 2.001944
USUBW2 (vector, 8H) throughput 0.522561
USUBW2 (vector, 4S) 1->2 2.337822
USUBW2 (vector, 4S) 1->3 0.591963
USUBW2 (vector, 4S) throughput 1.147010
USUBW2 (vector, 2D) 1->2 1.948438
USUBW2 (vector, 2D) 1->3 1.832933
USUBW2 (vector, 2D) throughput 0.336697
UXTL (8H) 1->2 1.664685
UXTL (8H) throughput 0.530700
UXTL (4S) 1->2 2.001617
UXTL (4S) throughput 0.501073
UXTL (2D) 1->2 1.996365
UXTL (2D) throughput 0.384898
UXTL2 (8H) 1->2 1.933410
UXTL2 (8H) throughput 0.503815
UXTL2 (4S) 1->2 1.927957
UXTL2 (4S) throughput 0.782738
UXTL2 (2D) 1->2 1.862616
UXTL2 (2D) throughput 0.862132
UZP1 (vector, 8B) 1->2 2.007480
UZP1 (vector, 8B) 1->3 2.048063
UZP1 (vector, 8B) throughput 0.529028
UZP1 (vector, 16B) 1->2 2.028290
UZP1 (vector, 16B) 1->3 1.998299
UZP1 (vector, 16B) throughput 0.510807
UZP1 (vector, 4H) 1->2 2.080379
UZP1 (vector, 4H) 1->3 2.001418
UZP1 (vector, 4H) throughput 0.448635
UZP1 (vector, 8H) 1->2 2.013215
UZP1 (vector, 8H) 1->3 2.060852
UZP1 (vector, 8H) throughput 0.432702
UZP1 (vector, 2S) 1->2 2.005752
UZP1 (vector, 2S) 1->3 2.007434
UZP1 (vector, 2S) throughput 0.536685
UZP1 (vector, 4S) 1->2 1.400319
UZP1 (vector, 4S) 1->3 2.638996
UZP1 (vector, 4S) throughput 0.539946
UZP1 (vector, 2D) 1->2 2.003700
UZP1 (vector, 2D) 1->3 2.259963
UZP1 (vector, 2D) throughput 0.528597
UZP2 (vector, 8B) 1->2 1.940772
UZP2 (vector, 8B) 1->3 2.443712
UZP2 (vector, 8B) throughput 0.501203
UZP2 (vector, 16B) 1->2 1.937812
UZP2 (vector, 16B) 1->3 1.913362
UZP2 (vector, 16B) throughput 0.623700
UZP2 (vector, 4H) 1->2 1.822546
UZP2 (vector, 4H) 1->3 1.977655
UZP2 (vector, 4H) throughput 0.513033
UZP2 (vector, 8H) 1->2 2.005845
UZP2 (vector, 8H) 1->3 1.998345
UZP2 (vector, 8H) throughput 0.503386
UZP2 (vector, 2S) 1->2 2.011903
UZP2 (vector, 2S) 1->3 1.991578
UZP2 (vector, 2S) throughput 0.310717
UZP2 (vector, 4S) 1->2 4.347735
UZP2 (vector, 4S) 1->3 1.997954
UZP2 (vector, 4S) throughput 0.548875
UZP2 (vector, 2D) 1->2 2.001721
UZP2 (vector, 2D) 1->3 2.774334
UZP2 (vector, 2D) throughput 0.375724
XTN (8H) 1->2 2.000087
XTN (8H) throughput 0.433367
XTN (4S) 1->2 2.017537
XTN (4S) throughput 0.514384
XTN (2D) 1->2 1.927030
XTN (2D) throughput 0.503530
XTN2 (8H) 1->1 1.999742
XTN2 (8H) 1->2 2.023476
XTN2 (8H) throughput 0.540852
XTN2 (8H) throughput 0.512553
XTN2 (4S) 1->1 2.052769
XTN2 (4S) 1->2 2.039766
XTN2 (4S) throughput 0.579195
XTN2 (4S) throughput 0.537616
XTN2 (2D) 1->1 1.990717
XTN2 (2D) 1->2 2.163577
XTN2 (2D) throughput 0.671383
XTN2 (2D) throughput 0.612017
ZIP1 (vector, 8B) 1->2 2.863807
ZIP1 (vector, 8B) 1->3 1.739579
ZIP1 (vector, 8B) throughput 0.557652
ZIP1 (vector, 16B) 1->2 1.804700
ZIP1 (vector, 16B) 1->3 2.086436
ZIP1 (vector, 16B) throughput 0.584670
ZIP1 (vector, 4H) 1->2 2.678185
ZIP1 (vector, 4H) 1->3 1.765431
ZIP1 (vector, 4H) throughput 0.744936
ZIP1 (vector, 8H) 1->2 2.069380
ZIP1 (vector, 8H) 1->3 0.972466
ZIP1 (vector, 8H) throughput 0.469937
ZIP1 (vector, 2S) 1->2 1.958476
ZIP1 (vector, 2S) 1->3 1.795895
ZIP1 (vector, 2S) throughput 0.517368
ZIP1 (vector, 4S) 1->2 1.592768
ZIP1 (vector, 4S) 1->3 1.996014
ZIP1 (vector, 4S) throughput 0.645911
ZIP1 (vector, 2D) 1->2 2.002308
ZIP1 (vector, 2D) 1->3 1.772047
ZIP1 (vector, 2D) throughput 0.499879
ZIP2 (vector, 8B) 1->2 2.007073
ZIP2 (vector, 8B) 1->3 2.026400
ZIP2 (vector, 8B) throughput 0.500886
ZIP2 (vector, 16B) 1->2 1.906780
ZIP2 (vector, 16B) 1->3 2.001659
ZIP2 (vector, 16B) throughput 0.500764
ZIP2 (vector, 4H) 1->2 2.001617
ZIP2 (vector, 4H) 1->3 2.004940
ZIP2 (vector, 4H) throughput 0.500015
ZIP2 (vector, 8H) 1->2 2.001500
ZIP2 (vector, 8H) 1->3 2.085533
ZIP2 (vector, 8H) throughput 0.618345
ZIP2 (vector, 2S) 1->2 1.944943
ZIP2 (vector, 2S) 1->3 1.967563
ZIP2 (vector, 2S) throughput 0.507152
ZIP2 (vector, 4S) 1->2 1.962262
ZIP2 (vector, 4S) 1->3 2.004400
ZIP2 (vector, 4S) throughput 0.433273
ZIP2 (vector, 2D) 1->2 1.553542
ZIP2 (vector, 2D) 1->3 4.027645
ZIP2 (vector, 2D) throughput 0.426050
ADC (32-bit) 1->2 1.001724
ADC (32-bit) 1->3 0.997957
ADC (32-bit) 1->4 1.031768
ADC (32-bit) throughput 0.252231
ADC (64-bit) 1->2 1.001924
ADC (64-bit) 1->3 1.006011
ADC (64-bit) 1->4 1.003799
ADC (64-bit) throughput 0.253425
ADCS (32-bit) 1->2 1.000050
ADCS (32-bit) 1->3 1.013846
ADCS (32-bit) 1->4 1.011756
ADCS (32-bit) 4->2 0.997830
ADCS (32-bit) 4->3 1.016450
ADCS (32-bit) 4->4 0.998229
ADCS (32-bit) throughput 0.753265
ADCS (32-bit) throughput 0.574634
ADCS (64-bit) 1->2 0.998628
ADCS (64-bit) 1->3 1.011108
ADCS (64-bit) 1->4 0.998153
ADCS (64-bit) 4->2 1.005563
ADCS (64-bit) 4->3 1.005975
ADCS (64-bit) 4->4 0.985275
ADCS (64-bit) throughput 0.751609
ADCS (64-bit) throughput 0.567030
ADD (sxtb, 32-bit) 1->2 2.005350
ADD (sxtb, 32-bit) 1->3 2.009502
ADD (sxtb, 32-bit) throughput 0.503392
ADD (sxtb, 64-bit) 1->2 1.997356
ADD (sxtb, 64-bit) 1->3 1.990158
ADD (sxtb, 64-bit) throughput 0.500940
ADD (uxtb, 32-bit) 1->2 0.999027
ADD (uxtb, 32-bit) 1->3 0.998802
ADD (uxtb, 32-bit) throughput 0.250812
ADD (uxtb, 64-bit) 1->2 1.000599
ADD (uxtb, 64-bit) 1->3 1.000549
ADD (uxtb, 64-bit) throughput 0.250596
ADD (sxth, 32-bit) 1->2 1.997554
ADD (sxth, 32-bit) 1->3 1.997131
ADD (sxth, 32-bit) throughput 0.500022
ADD (sxth, 64-bit) 1->2 2.002900
ADD (sxth, 64-bit) 1->3 2.003974
ADD (sxth, 64-bit) throughput 0.500025
ADD (uxth, 32-bit) 1->2 1.002500
ADD (uxth, 32-bit) 1->3 1.003875
ADD (uxth, 32-bit) throughput 0.253688
ADD (uxth, 64-bit) 1->2 0.999526
ADD (uxth, 64-bit) 1->3 1.002025
ADD (uxth, 64-bit) throughput 0.250703
ADD (sxtw, 32-bit) 1->2 0.999801
ADD (sxtw, 32-bit) 1->3 1.001573
ADD (sxtw, 32-bit) throughput 0.251906
ADD (sxtw, 64-bit) 1->2 2.002474
ADD (sxtw, 64-bit) 1->3 2.006486
ADD (sxtw, 64-bit) throughput 0.502009
ADD (uxtw, 32-bit) 1->2 0.999401
ADD (uxtw, 32-bit) 1->3 0.998253
ADD (uxtw, 32-bit) throughput 0.250781
ADD (uxtw, 64-bit) 1->2 1.001400
ADD (uxtw, 64-bit) 1->3 0.999051
ADD (uxtw, 64-bit) throughput 0.250359
ADD (uxtx, 64-bit) 1->2 1.000699
ADD (uxtx, 64-bit) 1->3 1.000000
ADD (uxtx, 64-bit) throughput 0.250515
ADD (sxtx, 64-bit) 1->2 0.998129
ADD (sxtx, 64-bit) 1->3 0.998153
ADD (sxtx, 64-bit) throughput 0.250690
ADD (immediate, 32-bit) 1->2 0.999801
ADD (immediate, 32-bit) throughput 0.250362
ADD (immediate, 64-bit) 1->2 1.001824
ADD (immediate, 64-bit) throughput 0.250340
ADD (shifted immediate, 32-bit) 1->2 1.000849
ADD (shifted immediate, 32-bit) throughput 0.250800
ADD (shifted immediate, 64-bit) 1->2 1.001800
ADD (shifted immediate, 64-bit) throughput 0.250268
ADD (register, 32-bit) 1->2 1.002400
ADD (register, 32-bit) 1->3 1.001825
ADD (register, 32-bit) throughput 0.250306
ADD (register, 64-bit) 1->2 0.998154
ADD (register, 64-bit) 1->3 0.900994
ADD (register, 64-bit) throughput 0.251913
ADD (register, lsl, 32-bit) 1->2 1.999230
ADD (register, lsl, 32-bit) 1->3 2.017136
ADD (register, lsl, 32-bit) throughput 0.501649
ADD (register, lsl, 64-bit) 1->2 2.001312
ADD (register, lsl, 64-bit) 1->3 1.995312
ADD (register, lsl, 64-bit) throughput 0.504281
ADD (register, lsr, 32-bit) 1->2 2.007464
ADD (register, lsr, 32-bit) 1->3 2.017951
ADD (register, lsr, 32-bit) throughput 0.501388
ADD (register, lsr, 64-bit) 1->2 2.013573
ADD (register, lsr, 64-bit) 1->3 2.011425
ADD (register, lsr, 64-bit) throughput 0.502631
ADD (register, asr, 32-bit) 1->2 1.999724
ADD (register, asr, 32-bit) 1->3 2.003575
ADD (register, asr, 32-bit) throughput 0.501634
ADD (register, asr, 64-bit) 1->2 2.015700
ADD (register, asr, 64-bit) 1->3 2.003551
ADD (register, asr, 64-bit) throughput 0.497531
ADDS (sxtb, 32-bit) 1->2 1.994942
ADDS (sxtb, 32-bit) 1->3 1.999376
ADDS (sxtb, 32-bit) 4->2 2.007260
ADDS (sxtb, 32-bit) 4->3 2.002750
ADDS (sxtb, 32-bit) throughput 0.501850
ADDS (sxtb, 64-bit) 1->2 2.003599
ADDS (sxtb, 64-bit) 1->3 2.002845
ADDS (sxtb, 64-bit) 4->2 1.999176
ADDS (sxtb, 64-bit) 4->3 2.002700
ADDS (sxtb, 64-bit) throughput 0.502311
ADDS (uxtb, 32-bit) 1->2 1.001822
ADDS (uxtb, 32-bit) 1->3 1.001950
ADDS (uxtb, 32-bit) 4->2 0.999301
ADDS (uxtb, 32-bit) 4->3 1.006266
ADDS (uxtb, 32-bit) throughput 0.500996
ADDS (uxtb, 64-bit) 1->2 1.001899
ADDS (uxtb, 64-bit) 1->3 1.009424
ADDS (uxtb, 64-bit) 4->2 1.006266
ADDS (uxtb, 64-bit) 4->3 1.007464
ADDS (uxtb, 64-bit) throughput 0.501490
ADDS (sxth, 32-bit) 1->2 2.002974
ADDS (sxth, 32-bit) 1->3 2.009724
ADDS (sxth, 32-bit) 4->2 1.627990
ADDS (sxth, 32-bit) 4->3 1.939374
ADDS (sxth, 32-bit) throughput 0.546155
ADDS (sxth, 64-bit) 1->2 1.984059
ADDS (sxth, 64-bit) 1->3 1.998253
ADDS (sxth, 64-bit) 4->2 2.018298
ADDS (sxth, 64-bit) 4->3 1.951331
ADDS (sxth, 64-bit) throughput 0.502869
ADDS (uxth, 32-bit) 1->2 1.002321
ADDS (uxth, 32-bit) 1->3 0.998154
ADDS (uxth, 32-bit) 4->2 0.997979
ADDS (uxth, 32-bit) 4->3 0.999178
ADDS (uxth, 32-bit) throughput 0.503615
ADDS (uxth, 64-bit) 1->2 0.999875
ADDS (uxth, 64-bit) 1->3 0.998128
ADDS (uxth, 64-bit) 4->2 1.000125
ADDS (uxth, 64-bit) 4->3 1.078642
ADDS (uxth, 64-bit) throughput 0.500879
ADDS (sxtw, 32-bit) 1->2 0.998378
ADDS (sxtw, 32-bit) 1->3 0.998403
ADDS (sxtw, 32-bit) 4->2 0.999276
ADDS (sxtw, 32-bit) 4->3 1.025513
ADDS (sxtw, 32-bit) throughput 0.501026
ADDS (sxtw, 64-bit) 1->2 2.002749
ADDS (sxtw, 64-bit) 1->3 2.012298
ADDS (sxtw, 64-bit) 4->2 1.997255
ADDS (sxtw, 64-bit) 4->3 1.999451
ADDS (sxtw, 64-bit) throughput 0.499841
ADDS (uxtw, 32-bit) 1->2 0.998851
ADDS (uxtw, 32-bit) 1->3 0.998802
ADDS (uxtw, 32-bit) 4->2 1.002724
ADDS (uxtw, 32-bit) 4->3 0.986746
ADDS (uxtw, 32-bit) throughput 0.501441
ADDS (uxtw, 64-bit) 1->2 1.001975
ADDS (uxtw, 64-bit) 1->3 1.001225
ADDS (uxtw, 64-bit) 4->2 1.002801
ADDS (uxtw, 64-bit) 4->3 0.999301
ADDS (uxtw, 64-bit) throughput 0.499794
ADDS (uxtx, 64-bit) 1->2 0.998252
ADDS (uxtx, 64-bit) 1->3 0.998079
ADDS (uxtx, 64-bit) 4->2 0.999251
ADDS (uxtx, 64-bit) 4->3 1.010952
ADDS (uxtx, 64-bit) throughput 0.501163
ADDS (sxtx, 64-bit) 1->2 0.998951
ADDS (sxtx, 64-bit) 1->3 1.001274
ADDS (sxtx, 64-bit) 4->2 1.007417
ADDS (sxtx, 64-bit) 4->3 1.003523
ADDS (sxtx, 64-bit) throughput 0.500452
ADDS (immediate, 32-bit) 1->2 0.998802
ADDS (immediate, 32-bit) 3->2 1.002825
ADDS (immediate, 32-bit) throughput 0.500680
ADDS (immediate, 64-bit) 1->2 1.000623
ADDS (immediate, 64-bit) 3->2 1.003692
ADDS (immediate, 64-bit) throughput 0.500884
ADDS (shifted immediate, 32-bit) 1->2 0.998851
ADDS (shifted immediate, 32-bit) 3->2 0.999351
ADDS (shifted immediate, 32-bit) throughput 0.501528
ADDS (shifted immediate, 64-bit) 1->2 1.000574
ADDS (shifted immediate, 64-bit) 3->2 0.999226
ADDS (shifted immediate, 64-bit) throughput 0.499782
ADDS (register, 32-bit) 1->2 0.998079
ADDS (register, 32-bit) 1->3 1.001972
ADDS (register, 32-bit) 4->2 1.002950
ADDS (register, 32-bit) 4->3 1.006691
ADDS (register, 32-bit) throughput 0.502149
ADDS (register, 64-bit) 1->2 0.999476
ADDS (register, 64-bit) 1->3 0.998128
ADDS (register, 64-bit) 4->2 1.006088
ADDS (register, 64-bit) 4->3 0.997930
ADDS (register, 64-bit) throughput 0.500820
ADDS (register, lsl, 32-bit) 1->2 2.003225
ADDS (register, lsl, 32-bit) 1->3 2.009483
ADDS (register, lsl, 32-bit) 4->2 2.002700
ADDS (register, lsl, 32-bit) 4->3 2.009255
ADDS (register, lsl, 32-bit) throughput 0.501276
ADDS (register, lsl, 64-bit) 1->2 2.003100
ADDS (register, lsl, 64-bit) 1->3 2.002949
ADDS (register, lsl, 64-bit) 4->2 2.011675
ADDS (register, lsl, 64-bit) 4->3 1.992481
ADDS (register, lsl, 64-bit) throughput 0.502322
ADDS (register, lsr, 32-bit) 1->2 2.004125
ADDS (register, lsr, 32-bit) 1->3 2.010249
ADDS (register, lsr, 32-bit) 4->2 2.003549
ADDS (register, lsr, 32-bit) 4->3 2.002774
ADDS (register, lsr, 32-bit) throughput 0.502409
ADDS (register, lsr, 64-bit) 1->2 1.997729
ADDS (register, lsr, 64-bit) 1->3 2.002951
ADDS (register, lsr, 64-bit) 4->2 1.998054
ADDS (register, lsr, 64-bit) 4->3 1.997230
ADDS (register, lsr, 64-bit) throughput 0.495598
ADDS (register, asr, 32-bit) 1->2 2.003100
ADDS (register, asr, 32-bit) 1->3 1.999327
ADDS (register, asr, 32-bit) 4->2 1.997355
ADDS (register, asr, 32-bit) 4->3 2.010573
ADDS (register, asr, 32-bit) throughput 0.501348
ADDS (register, asr, 64-bit) 1->2 1.999026
ADDS (register, asr, 64-bit) 1->3 1.999301
ADDS (register, asr, 64-bit) 4->2 2.012235
ADDS (register, asr, 64-bit) 4->3 1.998777
ADDS (register, asr, 64-bit) throughput 0.501207
ADR throughput 0.250932
ADRP throughput 0.250684
AND (immediate, 32-bit) 1->2 1.003515
AND (immediate, 32-bit) throughput 0.250353
AND (immediate, 64-bit) 1->2 1.005138
AND (immediate, 64-bit) throughput 0.250112
AND (register, 32-bit) 1->2 1.001225
AND (register, 32-bit) 1->3 1.001225
AND (register, 32-bit) throughput 0.250787
AND (register, 64-bit) 1->2 1.001850
AND (register, 64-bit) 1->3 1.000574
AND (register, 64-bit) throughput 0.251455
AND (register, lsl, 32-bit) 1->2 2.012233
AND (register, lsl, 32-bit) 1->3 2.006241
AND (register, lsl, 32-bit) throughput 0.500041
AND (register, lsl, 64-bit) 1->2 2.008009
AND (register, lsl, 64-bit) 1->3 1.999200
AND (register, lsl, 64-bit) throughput 0.500253
AND (register, lsr, 32-bit) 1->2 2.012505
AND (register, lsr, 32-bit) 1->3 2.005789
AND (register, lsr, 32-bit) throughput 0.501500
AND (register, lsr, 64-bit) 1->2 1.997929
AND (register, lsr, 64-bit) 1->3 1.994108
AND (register, lsr, 64-bit) throughput 0.502383
AND (register, asr, 32-bit) 1->2 1.998703
AND (register, asr, 32-bit) 1->3 2.002849
AND (register, asr, 32-bit) throughput 0.501784
AND (register, asr, 64-bit) 1->2 1.999250
AND (register, asr, 64-bit) 1->3 1.999251
AND (register, asr, 64-bit) throughput 0.496600
AND (register, ror, 32-bit) 1->2 1.999825
AND (register, ror, 32-bit) 1->3 2.001872
AND (register, ror, 32-bit) throughput 0.500060
AND (register, ror, 64-bit) 1->2 1.998054
AND (register, ror, 64-bit) 1->3 1.991853
AND (register, ror, 64-bit) throughput 0.500069
ANDS (immediate, 32-bit) 1->2 0.999900
ANDS (immediate, 32-bit) 3->2 1.014133
ANDS (immediate, 32-bit) throughput 0.501335
ANDS (immediate, 64-bit) 1->2 0.999026
ANDS (immediate, 64-bit) 3->2 1.003050
ANDS (immediate, 64-bit) throughput 0.501131
ANDS (register, 32-bit) 1->2 0.999026
ANDS (register, 32-bit) 1->3 1.007533
ANDS (register, 32-bit) 4->2 0.995536
ANDS (register, 32-bit) 4->3 0.999551
ANDS (register, 32-bit) throughput 0.500766
ANDS (register, 64-bit) 1->2 1.001800
ANDS (register, 64-bit) 1->3 0.999925
ANDS (register, 64-bit) 4->2 1.006450
ANDS (register, 64-bit) 4->3 0.998054
ANDS (register, 64-bit) throughput 0.500234
ANDS (register, lsl, 32-bit) 1->2 2.000299
ANDS (register, lsl, 32-bit) 1->3 1.998129
ANDS (register, lsl, 32-bit) 4->2 1.999825
ANDS (register, lsl, 32-bit) 4->3 2.011886
ANDS (register, lsl, 32-bit) throughput 0.501613
ANDS (register, lsl, 64-bit) 1->2 2.002850
ANDS (register, lsl, 64-bit) 1->3 2.005558
ANDS (register, lsl, 64-bit) 4->2 2.009274
ANDS (register, lsl, 64-bit) 4->3 2.002548
ANDS (register, lsl, 64-bit) throughput 0.500944
ANDS (register, lsr, 32-bit) 1->2 2.002951
ANDS (register, lsr, 32-bit) 1->3 2.003475
ANDS (register, lsr, 32-bit) 4->2 2.003817
ANDS (register, lsr, 32-bit) 4->3 1.997581
ANDS (register, lsr, 32-bit) throughput 0.501497
ANDS (register, lsr, 64-bit) 1->2 2.002899
ANDS (register, lsr, 64-bit) 1->3 1.999427
ANDS (register, lsr, 64-bit) 4->2 1.999326
ANDS (register, lsr, 64-bit) 4->3 2.004575
ANDS (register, lsr, 64-bit) throughput 0.499851
ANDS (register, asr, 32-bit) 1->2 2.004824
ANDS (register, asr, 32-bit) 1->3 2.005592
ANDS (register, asr, 32-bit) 4->2 2.004248
ANDS (register, asr, 32-bit) 4->3 2.011249
ANDS (register, asr, 32-bit) throughput 0.502803
ANDS (register, asr, 64-bit) 1->2 2.003394
ANDS (register, asr, 64-bit) 1->3 2.004893
ANDS (register, asr, 64-bit) 4->2 2.002700
ANDS (register, asr, 64-bit) 4->3 1.998980
ANDS (register, asr, 64-bit) throughput 0.500670
ANDS (register, ror, 32-bit) 1->2 2.010674
ANDS (register, ror, 32-bit) 1->3 2.005569
ANDS (register, ror, 32-bit) 4->2 2.002649
ANDS (register, ror, 32-bit) 4->3 2.003325
ANDS (register, ror, 32-bit) throughput 0.500187
ANDS (register, ror, 64-bit) 1->2 1.999401
ANDS (register, ror, 64-bit) 1->3 1.998228
ANDS (register, ror, 64-bit) 4->2 1.998005
ANDS (register, ror, 64-bit) 4->3 2.010586
ANDS (register, ror, 64-bit) throughput 0.496239
ASR (immediate, 32-bit) 1->2 1.000574
ASR (immediate, 32-bit) throughput 0.250125
ASR (immediate, 64-bit) 1->2 1.001250
ASR (immediate, 64-bit) throughput 0.250962
ASR (register, 32-bit) 1->2 1.000050
ASR (register, 32-bit) 1->3 1.000025
ASR (register, 32-bit) throughput 0.250125
ASR (register, 64-bit) 1->2 1.004844
ASR (register, 64-bit) 1->3 0.999975
ASR (register, 64-bit) throughput 0.250368
B throughput 1.003642
B.cc (not taken) throughput 0.501071
B.cc (taken) throughput 1.003006
BFC (32-bit) 1->1 0.998129
BFC (32-bit) throughput 1.001878
BFC (64-bit) 1->1 1.000575
BFC (64-bit) throughput 1.000896
BFI (32-bit) 1->1 1.001925
BFI (32-bit) 1->2 0.999276
BFI (32-bit) throughput 1.001425
BFI (64-bit) 1->1 0.999401
BFI (64-bit) 1->2 1.003024
BFI (64-bit) throughput 0.999807
BFXIL (32-bit) 1->1 1.001925
BFXIL (32-bit) 1->2 0.999352
BFXIL (32-bit) throughput 1.000415
BFXIL (64-bit) 1->1 0.998253
BFXIL (64-bit) 1->2 1.006591
BFXIL (64-bit) throughput 1.001947
BIC (register, 32-bit) 1->2 1.000624
BIC (register, 32-bit) 1->3 1.008371
BIC (register, 32-bit) throughput 0.250371
BIC (register, 64-bit) 1->2 1.000000
BIC (register, 64-bit) 1->3 1.000650
BIC (register, 64-bit) throughput 0.250888
BIC (register, lsl, 32-bit) 1->2 2.003550
BIC (register, lsl, 32-bit) 1->3 2.005691
BIC (register, lsl, 32-bit) throughput 0.501860
BIC (register, lsl, 64-bit) 1->2 2.004267
BIC (register, lsl, 64-bit) 1->3 2.003100
BIC (register, lsl, 64-bit) throughput 0.500078
BIC (register, lsr, 32-bit) 1->2 2.011073
BIC (register, lsr, 32-bit) 1->3 2.005068
BIC (register, lsr, 32-bit) throughput 0.499838
BIC (register, lsr, 64-bit) 1->2 2.011235
BIC (register, lsr, 64-bit) 1->3 2.003524
BIC (register, lsr, 64-bit) throughput 0.500804
BIC (register, asr, 32-bit) 1->2 1.997654
BIC (register, asr, 32-bit) 1->3 2.036382
BIC (register, asr, 32-bit) throughput 0.500702
BIC (register, asr, 64-bit) 1->2 2.002999
BIC (register, asr, 64-bit) 1->3 2.002975
BIC (register, asr, 64-bit) throughput 0.503371
BIC (register, ror, 32-bit) 1->2 2.005589
BIC (register, ror, 32-bit) 1->3 1.999225
BIC (register, ror, 32-bit) throughput 0.501612
BIC (register, ror, 64-bit) 1->2 1.997555
BIC (register, ror, 64-bit) 1->3 2.003049
BIC (register, ror, 64-bit) throughput 0.501768
BICS (register, 32-bit) 1->2 1.000624
BICS (register, 32-bit) 1->3 1.000150
BICS (register, 32-bit) 4->2 1.005089
BICS (register, 32-bit) 4->3 1.005243
BICS (register, 32-bit) throughput 0.503072
BICS (register, 64-bit) 1->2 1.001224
BICS (register, 64-bit) 1->3 0.998286
BICS (register, 64-bit) 4->2 0.997530
BICS (register, 64-bit) 4->3 0.998876
BICS (register, 64-bit) throughput 0.501400
BICS (register, lsl, 32-bit) 1->2 2.011003
BICS (register, lsl, 32-bit) 1->3 1.999252
BICS (register, lsl, 32-bit) 4->2 2.006848
BICS (register, lsl, 32-bit) 4->3 1.999525
BICS (register, lsl, 32-bit) throughput 0.500864
BICS (register, lsl, 64-bit) 1->2 1.999601
BICS (register, lsl, 64-bit) 1->3 2.003250
BICS (register, lsl, 64-bit) 4->2 1.998103
BICS (register, lsl, 64-bit) 4->3 2.003668
BICS (register, lsl, 64-bit) throughput 0.497650
BICS (register, lsr, 32-bit) 1->2 2.003226
BICS (register, lsr, 32-bit) 1->3 2.003075
BICS (register, lsr, 32-bit) 4->2 2.002899
BICS (register, lsr, 32-bit) 4->3 2.005489
BICS (register, lsr, 32-bit) throughput 0.502021
BICS (register, lsr, 64-bit) 1->2 2.002975
BICS (register, lsr, 64-bit) 1->3 2.009680
BICS (register, lsr, 64-bit) 4->2 2.002724
BICS (register, lsr, 64-bit) 4->3 2.002850
BICS (register, lsr, 64-bit) throughput 0.500945
BICS (register, asr, 32-bit) 1->2 1.998228
BICS (register, asr, 32-bit) 1->3 2.002850
BICS (register, asr, 32-bit) 4->2 2.002775
BICS (register, asr, 32-bit) 4->3 2.002900
BICS (register, asr, 32-bit) throughput 0.499991
BICS (register, asr, 64-bit) 1->2 1.999301
BICS (register, asr, 64-bit) 1->3 2.002900
BICS (register, asr, 64-bit) 4->2 1.997207
BICS (register, asr, 64-bit) 4->3 2.010032
BICS (register, asr, 64-bit) throughput 0.499969
BICS (register, ror, 32-bit) 1->2 1.997904
BICS (register, ror, 32-bit) 1->3 1.997830
BICS (register, ror, 32-bit) 4->2 2.011977
BICS (register, ror, 32-bit) 4->3 1.999527
BICS (register, ror, 32-bit) throughput 0.500025
BICS (register, ror, 64-bit) 1->2 2.011604
BICS (register, ror, 64-bit) 1->3 1.999250
BICS (register, ror, 64-bit) 4->2 2.002650
BICS (register, ror, 64-bit) 4->3 2.020721
BICS (register, ror, 64-bit) throughput 0.499844
BL throughput 1.005465
CBNZ (not taken) throughput 0.501323
CBNZ (taken) throughput 1.006102
CBZ (not taken) throughput 0.502231
CBZ (taken) throughput 1.003983
CCMN (immediate, 32-bit) 2->1 1.002900
CCMN (immediate, 32-bit) 2->2 1.001525
CCMN (immediate, 32-bit) throughput 0.630423
CCMN (immediate, 32-bit) throughput 0.519367
CCMN (immediate, 64-bit) 2->1 0.997829
CCMN (immediate, 64-bit) 2->2 0.999476
CCMN (immediate, 64-bit) throughput 0.631394
CCMN (immediate, 64-bit) throughput 0.518466
CCMN (register, 32-bit) 3->1 1.015423
CCMN (register, 32-bit) 3->2 0.999825
CCMN (register, 32-bit) 3->3 1.001575
CCMN (register, 32-bit) throughput 0.631630
CCMN (register, 32-bit) throughput 0.518700
CCMN (register, 64-bit) 3->1 1.008433
CCMN (register, 64-bit) 3->2 0.997854
CCMN (register, 64-bit) 3->3 1.000350
CCMN (register, 64-bit) throughput 0.631065
CCMN (register, 64-bit) throughput 0.517972
CCMP (immediate, 32-bit) 2->1 0.997579
CCMP (immediate, 32-bit) 2->2 1.001450
CCMP (immediate, 32-bit) throughput 0.631015
CCMP (immediate, 32-bit) throughput 0.519040
CCMP (immediate, 64-bit) 2->1 1.002824
CCMP (immediate, 64-bit) 2->2 1.001800
CCMP (immediate, 64-bit) throughput 0.629861
CCMP (immediate, 64-bit) throughput 0.505524
CCMP (register, 32-bit) 3->1 0.997978
CCMP (register, 32-bit) 3->2 0.998977
CCMP (register, 32-bit) 3->3 0.998752
CCMP (register, 32-bit) throughput 0.631543
CCMP (register, 32-bit) throughput 0.519354
CCMP (register, 64-bit) 3->1 1.002675
CCMP (register, 64-bit) 3->2 1.011399
CCMP (register, 64-bit) 3->3 1.000524
CCMP (register, 64-bit) throughput 0.631163
CCMP (register, 64-bit) throughput 0.520718
CINC (32-bit) 1->2 1.000100
CINC (32-bit) 1->3 0.997480
CINC (32-bit) throughput 0.250062
CINC (64-bit) 1->2 1.001250
CINC (64-bit) 1->3 0.999251
CINC (64-bit) throughput 0.250778
CINV (32-bit) 1->2 1.001900
CINV (32-bit) 1->3 1.002824
CINV (32-bit) throughput 0.250147
CINV (64-bit) 1->2 0.999377
CINV (64-bit) 1->3 0.998029
CINV (64-bit) throughput 0.250806
CLREX throughput 5.012049
CLREX throughput 5.005650
CLS (32-bit) 1->2 1.008857
CLS (32-bit) throughput 0.251572
CLS (64-bit) 1->2 1.000499
CLS (64-bit) throughput 0.250909
CLZ (32-bit) 1->2 1.001150
CLZ (32-bit) throughput 0.250091
CLZ (64-bit) 1->2 0.998876
CLZ (64-bit) throughput 0.248278
CMN (sxtb, 32-bit) 3->1 2.003024
CMN (sxtb, 32-bit) 3->2 1.999026
CMN (sxtb, 32-bit) throughput 0.502303
CMN (sxtb, 64-bit) 3->1 2.003618
CMN (sxtb, 64-bit) 3->2 2.009198
CMN (sxtb, 64-bit) throughput 0.500708
CMN (uxtb, 32-bit) 3->1 0.974359
CMN (uxtb, 32-bit) 3->2 1.008283
CMN (uxtb, 32-bit) throughput 0.293192
CMN (uxtb, 64-bit) 3->1 1.007249
CMN (uxtb, 64-bit) 3->2 1.006548
CMN (uxtb, 64-bit) throughput 0.292180
CMN (sxth, 32-bit) 3->1 2.003941
CMN (sxth, 32-bit) 3->2 2.026015
CMN (sxth, 32-bit) throughput 0.500693
CMN (sxth, 64-bit) 3->1 1.998202
CMN (sxth, 64-bit) 3->2 2.002723
CMN (sxth, 64-bit) throughput 0.500683
CMN (uxth, 32-bit) 3->1 1.004499
CMN (uxth, 32-bit) 3->2 0.997530
CMN (uxth, 32-bit) throughput 0.291944
CMN (uxth, 64-bit) 3->1 1.005917
CMN (uxth, 64-bit) 3->2 1.004474
CMN (uxth, 64-bit) throughput 0.292589
CMN (sxtw, 32-bit) 3->1 0.997830
CMN (sxtw, 32-bit) 3->2 0.998029
CMN (sxtw, 32-bit) throughput 0.292710
CMN (sxtw, 64-bit) 3->1 2.009079
CMN (sxtw, 64-bit) 3->2 2.002775
CMN (sxtw, 64-bit) throughput 0.501884
CMN (uxtw, 32-bit) 3->1 0.998029
CMN (uxtw, 32-bit) 3->2 0.997978
CMN (uxtw, 32-bit) throughput 0.293496
CMN (uxtw, 64-bit) 3->1 1.011302
CMN (uxtw, 64-bit) 3->2 1.012908
CMN (uxtw, 64-bit) throughput 0.292717
CMN (uxtx, 64-bit) 3->1 0.998876
CMN (uxtx, 64-bit) 3->2 0.999551
CMN (uxtx, 64-bit) throughput 0.291955
CMN (sxtx, 64-bit) 3->1 1.008923
CMN (sxtx, 64-bit) 3->2 0.999327
CMN (sxtx, 64-bit) throughput 0.292627
CMN (immediate, 32-bit) 2->1 0.997930
CMN (immediate, 32-bit) throughput 0.293324
CMN (immediate, 64-bit) 2->1 1.005101
CMN (immediate, 64-bit) throughput 0.292095
CMN (shifted immediate, 32-bit) 2->1 0.997380
CMN (shifted immediate, 32-bit) throughput 0.294060
CMN (shifted immediate, 64-bit) 2->1 0.999426
CMN (shifted immediate, 64-bit) throughput 0.291615
CMN (register, 32-bit) 3->1 0.998403
CMN (register, 32-bit) 3->2 1.007790
CMN (register, 32-bit) throughput 0.293121
CMN (register, 64-bit) 3->1 1.002799
CMN (register, 64-bit) 3->2 0.998478
CMN (register, 64-bit) throughput 0.293687
CMN (register, lsl, 32-bit) 3->1 2.003176
CMN (register, lsl, 32-bit) 3->2 2.003349
CMN (register, lsl, 32-bit) throughput 0.500671
CMN (register, lsl, 64-bit) 3->1 2.003350
CMN (register, lsl, 64-bit) 3->2 2.005750
CMN (register, lsl, 64-bit) throughput 0.501875
CMN (register, lsr, 32-bit) 3->1 1.999950
CMN (register, lsr, 32-bit) 3->2 2.002850
CMN (register, lsr, 32-bit) throughput 0.499863
CMN (register, lsr, 64-bit) 3->1 2.003598
CMN (register, lsr, 64-bit) 3->2 2.003450
CMN (register, lsr, 64-bit) throughput 0.500555
CMN (register, asr, 32-bit) 3->1 2.003349
CMN (register, asr, 32-bit) 3->2 2.005217
CMN (register, asr, 32-bit) throughput 0.500953
CMN (register, asr, 64-bit) 3->1 2.027215
CMN (register, asr, 64-bit) 3->2 1.998727
CMN (register, asr, 64-bit) throughput 0.500900
CMP (sxtb, 32-bit) 3->1 1.997878
CMP (sxtb, 32-bit) 3->2 2.009930
CMP (sxtb, 32-bit) throughput 0.502275
CMP (sxtb, 64-bit) 3->1 2.002948
CMP (sxtb, 64-bit) 3->2 1.997855
CMP (sxtb, 64-bit) throughput 0.500865
CMP (uxtb, 32-bit) 3->1 0.999625
CMP (uxtb, 32-bit) 3->2 0.999376
CMP (uxtb, 32-bit) throughput 0.295216
CMP (uxtb, 64-bit) 3->1 0.998003
CMP (uxtb, 64-bit) 3->2 1.013148
CMP (uxtb, 64-bit) throughput 0.293722
CMP (sxth, 32-bit) 3->1 2.006311
CMP (sxth, 32-bit) 3->2 2.015798
CMP (sxth, 32-bit) throughput 0.501815
CMP (sxth, 64-bit) 3->1 1.997879
CMP (sxth, 64-bit) 3->2 1.999226
CMP (sxth, 64-bit) throughput 0.501350
CMP (uxth, 32-bit) 3->1 1.005941
CMP (uxth, 32-bit) 3->2 0.999252
CMP (uxth, 32-bit) throughput 0.292656
CMP (uxth, 64-bit) 3->1 1.002950
CMP (uxth, 64-bit) 3->2 1.002950
CMP (uxth, 64-bit) throughput 0.292058
CMP (sxtw, 32-bit) 3->1 1.007809
CMP (sxtw, 32-bit) 3->2 1.001423
CMP (sxtw, 32-bit) throughput 0.292150
CMP (sxtw, 64-bit) 3->1 2.004115
CMP (sxtw, 64-bit) 3->2 2.005474
CMP (sxtw, 64-bit) throughput 0.501884
CMP (uxtw, 32-bit) 3->1 0.998129
CMP (uxtw, 32-bit) 3->2 1.002775
CMP (uxtw, 32-bit) throughput 0.292259
CMP (uxtw, 64-bit) 3->1 1.003075
CMP (uxtw, 64-bit) 3->2 0.998078
CMP (uxtw, 64-bit) throughput 0.292053
CMP (uxtx, 64-bit) 3->1 0.997980
CMP (uxtx, 64-bit) 3->2 1.005838
CMP (uxtx, 64-bit) throughput 0.292599
CMP (sxtx, 64-bit) 3->1 1.002950
CMP (sxtx, 64-bit) 3->2 1.002874
CMP (sxtx, 64-bit) throughput 0.293237
CMP (immediate, 32-bit) 2->1 0.999151
CMP (immediate, 32-bit) throughput 0.292355
CMP (immediate, 64-bit) 2->1 1.005543
CMP (immediate, 64-bit) throughput 0.292698
CMP (shifted immediate, 32-bit) 2->1 0.999352
CMP (shifted immediate, 32-bit) throughput 0.292642
CMP (shifted immediate, 64-bit) 2->1 1.006589
CMP (shifted immediate, 64-bit) throughput 0.294072
CMP (register, 32-bit) 3->1 0.999650
CMP (register, 32-bit) 3->2 0.997888
CMP (register, 32-bit) throughput 0.292622
CMP (register, 64-bit) 3->1 1.000896
CMP (register, 64-bit) 3->2 0.997331
CMP (register, 64-bit) throughput 0.292715
CMP (register, lsl, 32-bit) 3->1 1.997504
CMP (register, lsl, 32-bit) 3->2 2.010810
CMP (register, lsl, 32-bit) throughput 0.501080
CMP (register, lsl, 64-bit) 3->1 2.002650
CMP (register, lsl, 64-bit) 3->2 1.999376
CMP (register, lsl, 64-bit) throughput 0.501568
CMP (register, lsr, 32-bit) 3->1 2.014695
CMP (register, lsr, 32-bit) 3->2 1.998801
CMP (register, lsr, 32-bit) throughput 0.500714
CMP (register, lsr, 64-bit) 3->1 1.998403
CMP (register, lsr, 64-bit) 3->2 1.999000
CMP (register, lsr, 64-bit) throughput 0.500840
CMP (register, asr, 32-bit) 3->1 2.006361
CMP (register, asr, 32-bit) 3->2 1.999202
CMP (register, asr, 32-bit) throughput 0.500901
CMP (register, asr, 64-bit) 3->1 1.999425
CMP (register, asr, 64-bit) 3->2 2.003500
CMP (register, asr, 64-bit) throughput 0.503881
CNEG (32-bit) 1->2 1.001824
CNEG (32-bit) 1->3 1.003768
CNEG (32-bit) throughput 0.250343
CNEG (64-bit) 1->2 1.001350
CNEG (64-bit) 1->3 1.003849
CNEG (64-bit) throughput 0.251497
CSDB throughput 0.203890
CSEL (32-bit) 1->2 1.000924
CSEL (32-bit) 1->3 1.001423
CSEL (32-bit) 1->4 1.010389
CSEL (32-bit) throughput 0.262741
CSEL (64-bit) 1->2 1.403981
CSEL (64-bit) 1->3 0.998168
CSEL (64-bit) 1->4 0.993724
CSEL (64-bit) throughput 0.250844
CSET (32-bit) 1->2 0.999600
CSET (32-bit) throughput 0.250362
CSET (64-bit) 1->2 0.998877
CSET (64-bit) throughput 0.250964
CSETM (32-bit) 1->2 0.998777
CSETM (32-bit) throughput 0.250750
CSETM (64-bit) 1->2 1.007863
CSETM (64-bit) throughput 0.250744
CSINC (32-bit) 1->2 1.000999
CSINC (32-bit) 1->3 1.000775
CSINC (32-bit) 1->4 0.999177
CSINC (32-bit) throughput 0.252337
CSINC (64-bit) 1->2 1.001825
CSINC (64-bit) 1->3 1.001850
CSINC (64-bit) 1->4 1.002800
CSINC (64-bit) throughput 0.251876
CSINV (32-bit) 1->2 0.998851
CSINV (32-bit) 1->3 0.998827
CSINV (32-bit) 1->4 0.999925
CSINV (32-bit) throughput 0.249922
CSINV (64-bit) 1->2 0.998203
CSINV (64-bit) 1->3 1.001775
CSINV (64-bit) 1->4 1.003343
CSINV (64-bit) throughput 0.250797
CSNEG (32-bit) 1->2 0.999075
CSNEG (32-bit) 1->3 1.000100
CSNEG (32-bit) 1->4 1.013328
CSNEG (32-bit) throughput 0.251188
CSNEG (64-bit) 1->2 1.001850
CSNEG (64-bit) 1->3 1.001349
CSNEG (64-bit) 1->4 1.007264
CSNEG (64-bit) throughput 0.250784
DMB (SY) throughput 4.019746
DMB (ST) throughput 3.997205
DMB (LD) throughput 3.975467
DMB (ISH) throughput 3.974113
DMB (ISHST) throughput 3.997902
DMB (ISHLD) throughput 3.986851
DMB (NSH) throughput 4.011021
DMB (NSHST) throughput 4.014446
DMB (NSHLD) throughput 4.003493
DMB (OSH) throughput 4.002045
DMB (OSHST) throughput 4.001422
DMB (OSHLD) throughput 4.023491
DSB (SY) throughput 16.045199
DSB (ST) throughput 16.024004
DSB (LD) throughput 16.011502
DSB (ISH) throughput 15.950617
DSB (ISHST) throughput 16.022783
DSB (ISHLD) throughput 16.022867
DSB (NSH) throughput 16.029418
DSB (NSHST) throughput 16.002994
DSB (NSHLD) throughput 16.006345
DSB (OSH) throughput 16.018524
DSB (OSHST) throughput 16.027838
DSB (OSHLD) throughput 16.049716
EON (register, 32-bit) 1->2 0.998253
EON (register, 32-bit) 1->3 1.002051
EON (register, 32-bit) throughput 0.250356
EON (register, 64-bit) 1->2 1.000050
EON (register, 64-bit) 1->3 1.000250
EON (register, 64-bit) throughput 0.250909
EON (register, lsl, 32-bit) 1->2 2.003550
EON (register, lsl, 32-bit) 1->3 2.003750
EON (register, lsl, 32-bit) throughput 0.501207
EON (register, lsl, 64-bit) 1->2 1.999377
EON (register, lsl, 64-bit) 1->3 2.006587
EON (register, lsl, 64-bit) throughput 0.502593
EON (register, lsr, 32-bit) 1->2 1.999325
EON (register, lsr, 32-bit) 1->3 2.002824
EON (register, lsr, 32-bit) throughput 0.500844
EON (register, lsr, 64-bit) 1->2 2.004418
EON (register, lsr, 64-bit) 1->3 2.010007
EON (register, lsr, 64-bit) throughput 0.501241
EON (register, asr, 32-bit) 1->2 2.005390
EON (register, asr, 32-bit) 1->3 2.003175
EON (register, asr, 32-bit) throughput 0.500184
EON (register, asr, 64-bit) 1->2 2.004966
EON (register, asr, 64-bit) 1->3 1.999151
EON (register, asr, 64-bit) throughput 0.501491
EON (register, ror, 32-bit) 1->2 1.997355
EON (register, ror, 32-bit) 1->3 1.997854
EON (register, ror, 32-bit) throughput 0.502700
EON (register, ror, 64-bit) 1->2 1.999201
EON (register, ror, 64-bit) 1->3 2.007165
EON (register, ror, 64-bit) throughput 0.500843
EOR (immediate, 32-bit) 1->2 1.001825
EOR (immediate, 32-bit) throughput 0.251678
EOR (immediate, 64-bit) 1->2 0.999376
EOR (immediate, 64-bit) throughput 0.250371
EOR (register, 32-bit) 1->2 0.998129
EOR (register, 32-bit) 1->3 0.999650
EOR (register, 32-bit) throughput 0.251932
EOR (register, 64-bit) 1->2 1.001200
EOR (register, 64-bit) 1->3 0.998826
EOR (register, 64-bit) throughput 0.250869
EOR (register, lsl, 32-bit) 1->2 2.002649
EOR (register, lsl, 32-bit) 1->3 2.003125
EOR (register, lsl, 32-bit) throughput 0.501360
EOR (register, lsl, 64-bit) 1->2 1.998103
EOR (register, lsl, 64-bit) 1->3 1.999350
EOR (register, lsl, 64-bit) throughput 0.501497
EOR (register, lsr, 32-bit) 1->2 2.003149
EOR (register, lsr, 32-bit) 1->3 1.997554
EOR (register, lsr, 32-bit) throughput 0.503515
EOR (register, lsr, 64-bit) 1->2 2.005992
EOR (register, lsr, 64-bit) 1->3 2.005664
EOR (register, lsr, 64-bit) throughput 0.500887
EOR (register, asr, 32-bit) 1->2 2.002625
EOR (register, asr, 32-bit) 1->3 1.998976
EOR (register, asr, 32-bit) throughput 0.501519
EOR (register, asr, 64-bit) 1->2 2.003550
EOR (register, asr, 64-bit) 1->3 2.003499
EOR (register, asr, 64-bit) throughput 0.500406
EOR (register, ror, 32-bit) 1->2 2.003600
EOR (register, ror, 32-bit) 1->3 2.017399
EOR (register, ror, 32-bit) throughput 0.499997
EOR (register, ror, 64-bit) 1->2 1.999250
EOR (register, ror, 64-bit) 1->3 2.002376
EOR (register, ror, 64-bit) throughput 0.501550
EXTR (register, 32-bit) 1->2 1.000674
EXTR (register, 32-bit) 1->3 2.008863
EXTR (register, 32-bit) throughput 1.002214
EXTR (register, 64-bit) 1->2 1.001250
EXTR (register, 64-bit) 1->3 1.999375
EXTR (register, 64-bit) throughput 1.001978
ISB (SY) throughput 25.067329
LDNP (32-bit) 1->3 (with chain penalty) 4.030096
LDNP (32-bit) 2->3 (with chain penalty) 4.063344
LDNP (32-bit) throughput 0.501579
LDNP (64-bit) 1->3 (with chain penalty) 4.015050
LDNP (64-bit) 2->3 (with chain penalty) 4.016775
LDNP (64-bit) throughput 0.502062
LDP (32-bit) 1->3 (with chain penalty) 4.001899
LDP (32-bit) 2->3 (with chain penalty) 4.007611
LDP (32-bit) throughput 0.501006
LDP (64-bit) 1->3 (with chain penalty) 4.017296
LDP (64-bit) 2->3 (with chain penalty) 4.013423
LDP (64-bit) throughput 0.498423
LDP (post-index, 32-bit) 1->3 (with chain penalty) 4.141027
LDP (post-index, 32-bit) 2->3 (with chain penalty) 4.395065
LDP (post-index, 32-bit) throughput 0.820692
LDP (post-index, 64-bit) 1->3 (with chain penalty) 7.154856
LDP (post-index, 64-bit) 2->3 (with chain penalty) 4.799932
LDP (post-index, 64-bit) throughput 0.766868
LDP (pre-index, 32-bit) 1->3 (with chain penalty) 4.043588
LDP (pre-index, 32-bit) 2->3 (with chain penalty) 4.015238
LDP (pre-index, 32-bit) throughput 0.757041
LDP (pre-index, 64-bit) 1->3 (with chain penalty) 4.441302
LDP (pre-index, 64-bit) 2->3 (with chain penalty) 4.051144
LDP (pre-index, 64-bit) throughput 0.768783
LDP (signed offset, 32-bit) 1->3 (with chain penalty) 4.005763
LDP (signed offset, 32-bit) 2->3 (with chain penalty) 4.005661
LDP (signed offset, 32-bit) throughput 0.502540
LDP (signed offset, 64-bit) 1->3 (with chain penalty) 4.009807
LDP (signed offset, 64-bit) 2->3 (with chain penalty) 4.005790
LDP (signed offset, 64-bit) throughput 0.500922
LDPSW (post-index) 1->3 (with chain penalty) 4.072938
LDPSW (post-index) 2->3 (with chain penalty) 4.035088
LDPSW (post-index) throughput 0.755173
LDPSW (pre-index) 1->3 (with chain penalty) 4.070140
LDPSW (pre-index) 2->3 (with chain penalty) 3.968215
LDPSW (pre-index) throughput 0.756324
LDPSW (signed offset) 1->3 (with chain penalty) 4.010297
LDPSW (signed offset) 2->3 (with chain penalty) 4.007974
LDPSW (signed offset) throughput 0.500834
LDR (32-bit) 1->2 (with chain penalty) 4.017048
LDR (32-bit) throughput 0.500134
LDR (64-bit) 1->2 (with chain penalty) 4.021792
LDR (64-bit) throughput 0.501223
LDR (post-index, 32-bit) 1->2 (with chain penalty) 4.334915
LDR (post-index, 32-bit) throughput 0.580015
LDR (post-index, 64-bit) 1->2 (with chain penalty) 4.245224
LDR (post-index, 64-bit) throughput 0.561074
LDR (pre-index, 32-bit) 1->2 (with chain penalty) 4.082148
LDR (pre-index, 32-bit) throughput 0.553236
LDR (pre-index, 64-bit) 1->2 (with chain penalty) 4.185985
LDR (pre-index, 64-bit) throughput 0.559594
LDR (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.026397
LDR (unsigned offset, 32-bit) throughput 0.502865
LDR (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.008151
LDR (unsigned offset, 64-bit) throughput 0.501223
LDR (literal, 32-bit) throughput 0.499729
LDR (literal, 64-bit) throughput 0.503729
LDR (register, 32-bit) 1->2 (with chain penalty) 4.006511
LDR (register, 32-bit) 1->3 (with chain penalty) 4.007987
LDR (register, 32-bit) throughput 0.501176
LDR (register, 64-bit) 1->2 (with chain penalty) 4.031471
LDR (register, 64-bit) 1->3 (with chain penalty) 4.017546
LDR (register, 64-bit) throughput 0.499794
LDR (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.026899
LDR (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.027169
LDR (register, uxtw, 32-bit) throughput 0.499738
LDR (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.020587
LDR (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.020409
LDR (register, uxtw, 64-bit) throughput 0.501560
LDR (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.004694
LDR (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.928103
LDR (register, sxtw, 32-bit) throughput 0.499094
LDR (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.011432
LDR (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.020095
LDR (register, sxtw, 64-bit) throughput 0.501357
LDR (register, lsl, 32-bit) 1->2 (with chain penalty) 4.071491
LDR (register, lsl, 32-bit) 1->3 (with chain penalty) 4.024270
LDR (register, lsl, 32-bit) throughput 0.503504
LDR (register, lsl, 64-bit) 1->2 (with chain penalty) 4.006266
LDR (register, lsl, 64-bit) 1->3 (with chain penalty) 4.007374
LDR (register, lsl, 64-bit) throughput 0.500668
LDRB 1->2 (with chain penalty) 4.014158
LDRB throughput 0.499869
LDRB (post-index) 1->2 (with chain penalty) 4.316805
LDRB (post-index) throughput 0.591563
LDRB (pre-index) 1->2 (with chain penalty) 4.162812
LDRB (pre-index) throughput 0.572155
LDRB (unsigned offset) 1->2 (with chain penalty) 4.009531
LDRB (unsigned offset) throughput 0.502548
LDRB (register) 1->2 (with chain penalty) 4.003618
LDRB (register) 1->3 (with chain penalty) 4.017565
LDRB (register) throughput 0.501220
LDRB (register, uxtw) 1->2 (with chain penalty) 4.021651
LDRB (register, uxtw) 1->3 (with chain penalty) 4.005988
LDRB (register, uxtw) throughput 0.499719
LDRB (register, sxtw) 1->2 (with chain penalty) 4.007448
LDRB (register, sxtw) 1->3 (with chain penalty) 3.944590
LDRB (register, sxtw) throughput 0.500109
LDRH 1->2 (with chain penalty) 4.012178
LDRH throughput 0.500621
LDRH (post-index) 1->2 (with chain penalty) 4.090203
LDRH (post-index) throughput 0.560347
LDRH (pre-index) 1->2 (with chain penalty) 4.558019
LDRH (pre-index) throughput 0.580368
LDRH (unsigned offset) 1->2 (with chain penalty) 3.999175
LDRH (unsigned offset) throughput 0.501345
LDRH (register) 1->2 (with chain penalty) 4.012034
LDRH (register) 1->3 (with chain penalty) 4.017408
LDRH (register) throughput 0.499869
LDRH (register, uxtw) 1->2 (with chain penalty) 4.005716
LDRH (register, uxtw) 1->3 (with chain penalty) 4.000771
LDRH (register, uxtw) throughput 0.501997
LDRH (register, sxtw) 1->2 (with chain penalty) 4.031594
LDRH (register, sxtw) 1->3 (with chain penalty) 3.998403
LDRH (register, sxtw) throughput 0.500452
LDRH (register, lsl) 1->2 (with chain penalty) 3.994460
LDRH (register, lsl) 1->3 (with chain penalty) 4.008461
LDRH (register, lsl) throughput 0.499991
LDRSB (32-bit) 1->2 (with chain penalty) 4.004769
LDRSB (32-bit) throughput 0.500976
LDRSB (64-bit) 1->2 (with chain penalty) 4.010871
LDRSB (64-bit) throughput 0.501706
LDRSB (post-index, 32-bit) 1->2 (with chain penalty) 4.171054
LDRSB (post-index, 32-bit) throughput 0.599933
LDRSB (post-index, 64-bit) 1->2 (with chain penalty) 4.557743
LDRSB (post-index, 64-bit) throughput 0.604768
LDRSB (pre-index, 32-bit) 1->2 (with chain penalty) 4.556681
LDRSB (pre-index, 32-bit) throughput 0.605927
LDRSB (pre-index, 64-bit) 1->2 (with chain penalty) 4.447756
LDRSB (pre-index, 64-bit) throughput 0.597689
LDRSB (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.008184
LDRSB (unsigned offset, 32-bit) throughput 0.500124
LDRSB (unsigned offset, 64-bit) 1->2 (with chain penalty) 3.993088
LDRSB (unsigned offset, 64-bit) throughput 0.501294
LDRSB (register, 32-bit) 1->2 (with chain penalty) 4.018748
LDRSB (register, 32-bit) 1->3 (with chain penalty) 4.003118
LDRSB (register, 32-bit) throughput 0.499716
LDRSB (register, 64-bit) 1->2 (with chain penalty) 4.008370
LDRSB (register, 64-bit) 1->3 (with chain penalty) 4.025615
LDRSB (register, 64-bit) throughput 0.497757
LDRSB (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.014577
LDRSB (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.018925
LDRSB (register, uxtw, 32-bit) throughput 0.502091
LDRSB (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.005093
LDRSB (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.007334
LDRSB (register, uxtw, 64-bit) throughput 0.501025
LDRSB (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.009398
LDRSB (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.914832
LDRSB (register, sxtw, 32-bit) throughput 0.500056
LDRSB (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.029021
LDRSB (register, sxtw, 64-bit) 1->3 (with chain penalty) 4.021049
LDRSB (register, sxtw, 64-bit) throughput 0.500622
LDRSH (32-bit) 1->2 (with chain penalty) 4.009380
LDRSH (32-bit) throughput 0.501507
LDRSH (64-bit) 1->2 (with chain penalty) 4.006037
LDRSH (64-bit) throughput 0.500979
LDRSH (post-index, 32-bit) 1->2 (with chain penalty) 4.247946
LDRSH (post-index, 32-bit) throughput 0.587233
LDRSH (post-index, 64-bit) 1->2 (with chain penalty) 4.618051
LDRSH (post-index, 64-bit) throughput 0.617256
LDRSH (pre-index, 32-bit) 1->2 (with chain penalty) 4.492430
LDRSH (pre-index, 32-bit) throughput 0.555038
LDRSH (pre-index, 64-bit) 1->2 (with chain penalty) 4.494413
LDRSH (pre-index, 64-bit) throughput 0.598118
LDRSH (unsigned offset, 32-bit) 1->2 (with chain penalty) 4.031631
LDRSH (unsigned offset, 32-bit) throughput 0.501308
LDRSH (unsigned offset, 64-bit) 1->2 (with chain penalty) 4.008537
LDRSH (unsigned offset, 64-bit) throughput 0.500528
LDRSH (register, 32-bit) 1->2 (with chain penalty) 4.006938
LDRSH (register, 32-bit) 1->3 (with chain penalty) 4.003145
LDRSH (register, 32-bit) throughput 0.499856
LDRSH (register, 64-bit) 1->2 (with chain penalty) 4.016650
LDRSH (register, 64-bit) 1->3 (with chain penalty) 4.012051
LDRSH (register, 64-bit) throughput 0.500724
LDRSH (register, uxtw, 32-bit) 1->2 (with chain penalty) 4.000874
LDRSH (register, uxtw, 32-bit) 1->3 (with chain penalty) 4.008335
LDRSH (register, uxtw, 32-bit) throughput 0.503037
LDRSH (register, uxtw, 64-bit) 1->2 (with chain penalty) 4.014573
LDRSH (register, uxtw, 64-bit) 1->3 (with chain penalty) 4.029748
LDRSH (register, uxtw, 64-bit) throughput 0.499732
LDRSH (register, sxtw, 32-bit) 1->2 (with chain penalty) 4.004316
LDRSH (register, sxtw, 32-bit) 1->3 (with chain penalty) 3.923781
LDRSH (register, sxtw, 32-bit) throughput 0.500647
LDRSH (register, sxtw, 64-bit) 1->2 (with chain penalty) 4.064896
LDRSH (register, sxtw, 64-bit) 1->3 (with chain penalty) 3.998374
LDRSH (register, sxtw, 64-bit) throughput 0.500537
LDRSH (register, lsl, 32-bit) 1->2 (with chain penalty) 4.018027
LDRSH (register, lsl, 32-bit) 1->3 (with chain penalty) 4.005689
LDRSH (register, lsl, 32-bit) throughput 0.501001
LDRSH (register, lsl, 64-bit) 1->2 (with chain penalty) 3.999303
LDRSH (register, lsl, 64-bit) 1->3 (with chain penalty) 4.009365
LDRSH (register, lsl, 64-bit) throughput 0.500166
LDRSW 1->2 (with chain penalty) 3.999899
LDRSW throughput 0.499778
LDRSW (post-index) 1->2 (with chain penalty) 4.413461
LDRSW (post-index) throughput 0.594854
LDRSW (pre-index) 1->2 (with chain penalty) 4.594317
LDRSW (pre-index) throughput 0.609555
LDRSW (unsigned offset) 1->2 (with chain penalty) 4.009487
LDRSW (unsigned offset) throughput 0.501868
LDRSW (literal) throughput 0.500390
LDRSW (register) 1->2 (with chain penalty) 4.013461
LDRSW (register) 1->3 (with chain penalty) 4.258063
LDRSW (register) throughput 0.549595
LDRSW (register, uxtw) 1->2 (with chain penalty) 3.983973
LDRSW (register, uxtw) 1->3 (with chain penalty) 4.080187
LDRSW (register, uxtw) throughput 0.500647
LDRSW (register, sxtw) 1->2 (with chain penalty) 4.036974
LDRSW (register, sxtw) 1->3 (with chain penalty) 4.036920
LDRSW (register, sxtw) throughput 0.501317
LDRSW (register, lsl) 1->2 (with chain penalty) 4.009140
LDRSW (register, lsl) 1->3 (with chain penalty) 3.998153
LDRSW (register, lsl) throughput 0.499916
LDUR (32-bit) 1->2 (with chain penalty) 4.017552
LDUR (32-bit) throughput 0.499384
LDUR (64-bit) 1->2 (with chain penalty) 4.010527
LDUR (64-bit) throughput 0.500430
LDURB 1->2 (with chain penalty) 4.018989
LDURB throughput 0.500309
LDURH 1->2 (with chain penalty) 3.999677
LDURH throughput 0.500578
LDURSB (32-bit) 1->2 (with chain penalty) 3.994110
LDURSB (32-bit) throughput 0.500412
LDURSB (64-bit) 1->2 (with chain penalty) 4.026342
LDURSB (64-bit) throughput 0.501109
LDURSH (32-bit) 1->2 (with chain penalty) 4.005794
LDURSH (32-bit) throughput 0.497604
LDURSH (64-bit) 1->2 (with chain penalty) 4.018915
LDURSH (64-bit) throughput 0.500368
LDURSW 1->2 (with chain penalty) 4.032879
LDURSW throughput 0.501715
LSL (immediate, 32-bit) 1->2 1.001225
LSL (immediate, 32-bit) throughput 0.250781
LSL (immediate, 64-bit) 1->2 0.998980
LSL (immediate, 64-bit) throughput 0.250859
LSL (register, 32-bit) 1->2 1.001824
LSL (register, 32-bit) 1->3 1.002199
LSL (register, 32-bit) throughput 0.250325
LSL (register, 64-bit) 1->2 1.002097
LSL (register, 64-bit) 1->3 0.998826
LSL (register, 64-bit) throughput 0.250143
LSR (immediate, 32-bit) 1->2 1.001475
LSR (immediate, 32-bit) throughput 0.250318
LSR (immediate, 64-bit) 1->2 0.999226
LSR (immediate, 64-bit) throughput 0.250947
LSR (register, 32-bit) 1->2 1.000449
LSR (register, 32-bit) 1->3 1.001800
LSR (register, 32-bit) throughput 0.250524
LSR (register, 64-bit) 1->2 1.003221
LSR (register, 64-bit) 1->3 1.002246
LSR (register, 64-bit) throughput 0.250318
MADD (32-bit) 1->2 3.002074
MADD (32-bit) 1->3 2.999299
MADD (32-bit) 1->4 1.000125
MADD (32-bit) throughput 1.002728
MADD (64-bit) 1->2 3.016174
MADD (64-bit) 1->3 3.006840
MADD (64-bit) 1->4 1.001450
MADD (64-bit) throughput 1.004038
MNEG (32-bit) 1->2 2.997455
MNEG (32-bit) 1->3 3.008650
MNEG (32-bit) throughput 0.996317
MNEG (64-bit) 1->2 3.014297
MNEG (64-bit) 1->3 3.011079
MNEG (64-bit) throughput 1.005815
MOV (bitmask immediate, 32-bit) throughput 0.251603
MOV (bitmask immediate, 64-bit) throughput 0.251812
MOV (from sp, 32-bit) throughput 0.251258
MOV (from sp, 64-bit) throughput 0.250719
MOVK (32-bit) 1->1 1.000924
MOVK (32-bit) throughput 0.252103
MOVK (64-bit) 1->1 1.007164
MOVK (64-bit) throughput 0.250325
MOVN (32-bit) throughput 0.250225
MOVN (64-bit) throughput 0.251223
MOVZ (32-bit) throughput 0.251697
MOVZ (64-bit) throughput 0.250246
MRS (CNTFRQ_EL0) throughput 16.060659
MRS (CNTPCT_EL0) throughput 1.250827
MRS (CNTVCT_EL0) throughput 1.254877
MRS (DCZID_EL0) throughput 1.253462
MRS (FPCR) throughput 1.250896
MRS (FPSR) throughput 7.027045
MRS (NZCV) throughput 0.251500
MRS (TPIDRRO_EL0) throughput 0.751328
MRS (TPIDR_EL0) throughput 0.751586
MSR (FPCR) throughput 8.011875
MSR (FPSR) throughput 10.013823
MSR (TPIDR_EL0) throughput 10.007260
MSR (NZCV) throughput 0.292710
MSUB (32-bit) 1->2 2.993969
MSUB (32-bit) 1->3 2.999301
MSUB (32-bit) 1->4 1.000724
MSUB (32-bit) throughput 1.001202
MSUB (64-bit) 1->2 3.003200
MSUB (64-bit) 1->3 3.011005
MSUB (64-bit) 1->4 0.999252
MSUB (64-bit) throughput 1.001960
MUL (32-bit) 1->2 3.012875
MUL (32-bit) 1->3 3.003150
MUL (32-bit) throughput 1.001591
MUL (64-bit) 1->2 3.003292
MUL (64-bit) 1->3 3.006587
MUL (64-bit) throughput 1.000786
MVN (register, 32-bit) 1->2 0.998777
MVN (register, 32-bit) throughput 0.250334
MVN (register, 64-bit) 1->2 0.999975
MVN (register, 64-bit) throughput 0.250837
MVN (register, lsl, 32-bit) 1->2 1.999301
MVN (register, lsl, 32-bit) throughput 0.501083
MVN (register, lsl, 64-bit) 1->2 1.997556
MVN (register, lsl, 64-bit) throughput 0.500580
MVN (register, lsr, 32-bit) 1->2 1.980332
MVN (register, lsr, 32-bit) throughput 0.500834
MVN (register, lsr, 64-bit) 1->2 2.003549
MVN (register, lsr, 64-bit) throughput 0.500552
MVN (register, asr, 32-bit) 1->2 1.998078
MVN (register, asr, 32-bit) throughput 0.500881
MVN (register, asr, 64-bit) 1->2 2.002850
MVN (register, asr, 64-bit) throughput 0.502186
MVN (register, ror, 32-bit) 1->2 1.999401
MVN (register, ror, 32-bit) throughput 0.499875
MVN (register, ror, 64-bit) 1->2 2.009107
MVN (register, ror, 64-bit) throughput 0.501687
NEG (register, 32-bit) 1->2 0.998801
NEG (register, 32-bit) throughput 0.250787
NEG (register, 64-bit) 1->2 0.998802
NEG (register, 64-bit) throughput 0.250781
NEG (register, lsl, 32-bit) 1->2 1.998004
NEG (register, lsl, 32-bit) throughput 0.501202
NEG (register, lsl, 64-bit) 1->2 1.999051
NEG (register, lsl, 64-bit) throughput 0.500059
NEG (register, lsr, 32-bit) 1->2 2.003675
NEG (register, lsr, 32-bit) throughput 0.502004
NEG (register, lsr, 64-bit) 1->2 2.011361
NEG (register, lsr, 64-bit) throughput 0.500075
NEG (register, asr, 32-bit) 1->2 2.000623
NEG (register, asr, 32-bit) throughput 0.500861
NEG (register, asr, 64-bit) 1->2 1.998901
NEG (register, asr, 64-bit) throughput 0.501204
NEGS (register, 32-bit) 1->2 1.001850
NEGS (register, 32-bit) 3->2 0.999300
NEGS (register, 32-bit) throughput 0.500156
NEGS (register, 64-bit) 1->2 1.000824
NEGS (register, 64-bit) 3->2 1.017573
NEGS (register, 64-bit) throughput 0.500823
NEGS (register, lsl, 32-bit) 1->2 2.000274
NEGS (register, lsl, 32-bit) 3->2 2.006937
NEGS (register, lsl, 32-bit) throughput 0.501332
NEGS (register, lsl, 64-bit) 1->2 2.002874
NEGS (register, lsl, 64-bit) 3->2 1.998501
NEGS (register, lsl, 64-bit) throughput 0.500181
NEGS (register, lsr, 32-bit) 1->2 1.998005
NEGS (register, lsr, 32-bit) 3->2 2.005948
NEGS (register, lsr, 32-bit) throughput 0.501975
NEGS (register, lsr, 64-bit) 1->2 1.999226
NEGS (register, lsr, 64-bit) 3->2 1.997654
NEGS (register, lsr, 64-bit) throughput 0.501350
NEGS (register, asr, 32-bit) 1->2 2.002850
NEGS (register, asr, 32-bit) 3->2 1.999252
NEGS (register, asr, 32-bit) throughput 0.501751
NEGS (register, asr, 64-bit) 1->2 2.005768
NEGS (register, asr, 64-bit) 3->2 1.999127
NEGS (register, asr, 64-bit) throughput 0.500708
NGC (register, 32-bit) 1->2 1.000424
NGC (register, 32-bit) 1->3 1.003099
NGC (register, 32-bit) throughput 0.250913
NGC (register, 64-bit) 1->2 1.000600
NGC (register, 64-bit) 1->3 0.997101
NGC (register, 64-bit) throughput 0.250193
NGCS (register, 32-bit) 1->2 0.998927
NGCS (register, 32-bit) 1->3 1.008249
NGCS (register, 32-bit) 3->2 0.999376
NGCS (register, 32-bit) 3->3 1.000575
NGCS (register, 32-bit) throughput 0.751531
NGCS (register, 32-bit) throughput 0.568298
NGCS (register, 64-bit) 1->2 0.999451
NGCS (register, 64-bit) 1->3 1.004142
NGCS (register, 64-bit) 3->2 1.002650
NGCS (register, 64-bit) 3->3 0.999925
NGCS (register, 64-bit) throughput 0.751479
NGCS (register, 64-bit) throughput 0.566243
NOP throughput 0.200606
ORN (register, 32-bit) 1->2 1.000624
ORN (register, 32-bit) 1->3 0.998254
ORN (register, 32-bit) throughput 0.250203
ORN (register, 64-bit) 1->2 1.001900
ORN (register, 64-bit) 1->3 1.002050
ORN (register, 64-bit) throughput 0.250790
ORN (register, lsl, 32-bit) 1->2 2.003126
ORN (register, lsl, 32-bit) 1->3 1.999724
ORN (register, lsl, 32-bit) throughput 0.501787
ORN (register, lsl, 64-bit) 1->2 1.999700
ORN (register, lsl, 64-bit) 1->3 1.999126
ORN (register, lsl, 64-bit) throughput 0.494400
ORN (register, lsr, 32-bit) 1->2 1.997555
ORN (register, lsr, 32-bit) 1->3 1.997954
ORN (register, lsr, 32-bit) throughput 0.50076