Example AMD output:
$ lscpu | grep 'Model name'
Model name: AMD Ryzen Threadripper 3970X 32-Core Processor
$ gcc -DFAST puzzle.S -o puzzle && ./puzzle
1000000000 iterations, 2758479060 cycles, 2.76 cycles/iteration
1651816810 106000000000
$ gcc -DSLOW puzzle.S -o puzzle && ./puzzle
1000000000 iterations, 6551777517 cycles, 6.55 cycles/iteration
1651816814 110000000000
Example Intel output:
$ lscpu | grep 'Model name'
Model name: Intel(R) Core(TM) i7-5930K CPU @ 3.50GHz
$ gcc -DFAST puzzle.S -o puzzle && ./puzzle
1000000000 iterations, 6354813654 cycles, 6.35 cycles/iteration
1651817850 122000000000
$ gcc -DSLOW puzzle.S -o puzzle && ./puzzle
1000000000 iterations, 6317220244 cycles, 6.32 cycles/iteration
1651817864 136000000000