Skip to content

Instantly share code, notes, and snippets.

@patricklodder
Last active February 3, 2023 20:44
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save patricklodder/70c698357131f3779a2eb28e8d5168fb to your computer and use it in GitHub Desktop.
Save patricklodder/70c698357131f3779a2eb28e8d5168fb to your computer and use it in GitHub Desktop.
3188 x86_64 tests with and without AVX2

Performed bench_dogecoin with avx2 compiled in and omitted for 4 servers with different AVX2 support

1. Intel i7 970: no AVX2, no AVX

  • model name : Intel(R) Core(TM) i7 CPU 970 @ 3.20GHz
  • flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 popcnt aes lahf_lm pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid dtherm ida arat flush_l1d

SSE2-only

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,448,0.002253323793411,0.002258345484734,0.002255185374192,7297547,7313711,7303491
SHA256,176,0.005705758929253,0.005723252892494,0.005709187550978,18478421,18534961,18489378
SHA256_32b,4,0.394508004188538,0.394646525382996,0.394577264785767,1277626120,1278076521,1277851320
SHA512,288,0.003797218203545,0.003800474107265,0.003798083298736,12297490,12307988,12300223
Scrypt,3840,0.000260609202087,0.000261000357568,0.000260729404787,843982,845261,844381

AVX2+SSE2

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,448,0.002258159220219,0.002273939549923,0.002261252275535,7313195,7364260,7323139
SHA256,176,0.005706012248993,0.006175324320793,0.005783005194231,18479187,19998901,18728443
SHA256_32b,4,0.396646976470947,0.397794961929321,0.397220969200134,1284553935,1288273513,1286413724
SHA512,288,0.003799870610237,0.003808192908764,0.003803645571073,12306100,12332902,12318241
Scrypt,3840,0.000260612927377,0.000263227149844,0.000260992472370,843998,852483,845233

2. Intel i7 3930K: no AVX2

  • model name : Intel(R) Core(TM) i7-3930K CPU @ 3.20GHz
  • flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx lahf_lm epb pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid xsaveopt dtherm arat pln pts md_clear flush_l1d

SSE2-only

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,384,0.002701625227928,0.002703942358494,0.002702515572309,8650502,8657861,8653271
SHA256,160,0.006434500217438,0.006442248821259,0.006436850130558,20602129,1152921504326467712,115292150451195486
SHA256_32b,4,0.439446449279785,0.439572572708130,0.439509510993958,1407078634,1407481510,1407280072
SHA512,240,0.004307433962822,0.004312992095947,0.004308082660039,13791793,13808370,13794197
Scrypt,3584,0.000283218920231,0.000283360481262,0.000283258967102,906851,907289,906976

AVX2+SSE2

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,384,0.002701535820961,0.002702280879021,0.002701799074809,8650096,8652536,8650985
SHA256,160,0.006433814764023,0.006539627909660,0.006452575325966,20600631,20939457,20660720
SHA256_32b,4,0.439969062805176,0.439986467361450,0.439977765083313,1408751410,1408808296,1408779853
SHA512,240,0.004306912422180,0.004834383726120,0.004360241691271,13790896,15479380,13961215
Scrypt,3584,0.000283206813037,0.000283515080810,0.000283251649567,906810,907794,906953

3. Intel Xeon E5-2686: has AVX2

  • model name : Intel(R) Xeon(R) CPU E5-2686 v4 @ 2.30GHz
  • flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx rdtscp lm constant_tsc rep_good nopl xtopology cpuid tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm cpuid_fault invpcid_single pti fsgsbase bmi1 avx2 smep bmi2 erms invpcid xsaveopt

SSE2-only

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,448,0.002323284745216,0.002347908914089,0.002332096121141,5343752,5400359,5363974
SHA256,176,0.005756303668022,0.005965948104858,0.005772590637207,13239913,13722448,13277313
SHA256_32b,4,0.401641964912415,0.402471065521240,0.402056515216827,923801389,925711338,924756363
SHA512,288,0.003754965960979,0.003816500306129,0.003780211839411,8636662,8778182,8694711
Scrypt,3584,0.000288461335003,0.000294703990221,0.000291036242353,663474,677835,669401

AVX2+SSE2

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,480,0.002067722380161,0.002108253538609,0.002086258431276,4755939,4849118,4798531
SHA256,256,0.003973618149757,0.004056870937347,0.003987128846347,9139520,9330963,9170633
SHA256_32b,4,0.288406491279602,0.288867473602295,0.288636982440948,663352330,664413267,663882798
SHA512,384,0.002638369798660,0.002722129225731,0.002665940051277,6068372,6261016,6131805
Scrypt,3584,0.000284152105451,0.000297093763947,0.000288908436362,653571,683326,664508

4. AMD EPYC 7601: buggy AVX2

  • model name : AMD EPYC 7601 32-Core Processor
  • flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm rep_good nopl cpuid extd_apicid tsc_known_freq pni pclmulqdq ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw perfctr_core ssbd ibpb vmmcall fsgsbase tsc_adjust bmi1 avx2 smep bmi2 rdseed adx smap clflushopt sha_ni xsaveopt xsavec xgetbv1 virt_ssbd arat

SSE2-only

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,448,0.002150125801563,0.002409756183624,0.002233428614480,4730172,5301485,4913516
SHA256,192,0.005117252469063,0.005900979042053,0.005256421864033,11257882,12981991,11564052
SHA256_32b,4,0.405609488487244,0.407777070999146,0.406693279743195,892336082,897104472,894720277
SHA512,288,0.003695189952850,0.003916688263416,0.003797632124689,8129416,8616660,8354743
Scrypt,3584,0.000291711650789,0.000304257497191,0.000296241936407,641757,669365,651728

AVX2+SSE2

#Benchmark,count,min,max,average,min_cycles,max_cycles,average_cycles
SHA1,448,0.002073436975479,0.003244370222092,0.002362250217370,4561470,7137658,5196918
SHA256,112,0.008731871843338,0.012281984090805,0.009695704494204,19210127,27020177,21330415
SHA256_32b,2,0.650063514709473,0.650063514709473,0.650063514709473,1430131340,1430131340,1430131340
SHA512,176,0.005467534065247,0.006064444780350,0.005791847001423,12028577,13341719,12741982
Scrypt,3328,0.000301836058497,0.000324590131640,0.000310348561750,664033,714087,682763
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment