Skip to content

Instantly share code, notes, and snippets.

@Micky774
Created April 23, 2022 04:05
Show Gist options
  • Save Micky774/8d1ce86bd7ab40659c048bedaee00cbb to your computer and use it in GitHub Desktop.
Save Micky774/8d1ce86bd7ab40659c048bedaee00cbb to your computer and use it in GitHub Desktop.
Benchmarks for `_assert_all_finite` with cython
num_elements x DTYPE
Performance on BRANCH
Performance on MAIN
======================
1e2xFP64
3.11 µs ± 111 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
13 µs ± 113 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
1e3xFP64
6.19 µs ± 110 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
13.5 µs ± 111 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
1e4xFP32
38.8 µs ± 339 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
24.3 µs ± 664 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
1e4xFP64
36.1 µs ± 142 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
17.2 µs ± 205 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
5e4xFP32
33.1 µs ± 746 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
59.1 µs ± 1.2 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
5e4xFP64
171 µs ± 1.36 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
169 µs ± 248 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
1e5xFP64
337 µs ± 1.37 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
46.4 µs ± 857 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
1e5xFP32
363 µs ± 1.28 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
102 µs ± 2.21 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
1e5xFP16
210 µs ± 3.37 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
211 µs ± 3.29 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment