@zamazan4ik
Created December 27, 2023 06:19
pathfinding: PGO optimized vs Release
cargo pgo optimize bench
Running benches/algos.rs (x86_64-unknown-linux-gnu/release/deps/algos-6abe6ee237edc094)
Benchmarking corner_to_corner_astar
Benchmarking corner_to_corner_astar: Warming up for 3.0000 s
Benchmarking corner_to_corner_astar: Collecting 100 samples in estimated 5.0335 s (384k iterations)
Benchmarking corner_to_corner_astar: Analyzing
corner_to_corner_astar time: [13.168 µs 13.171 µs 13.174 µs]
change: [-54.991% -54.977% -54.964%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high severe
Benchmarking corner_to_corner_bfs
Benchmarking corner_to_corner_bfs: Warming up for 3.0000 s
Benchmarking corner_to_corner_bfs: Collecting 100 samples in estimated 5.2109 s (35k iterations)
Benchmarking corner_to_corner_bfs: Analyzing
corner_to_corner_bfs time: [146.98 µs 146.99 µs 147.01 µs]
change: [-63.600% -63.388% -63.279%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high mild
Benchmarking corner_to_corner_dfs
Benchmarking corner_to_corner_dfs: Warming up for 3.0000 s
Benchmarking corner_to_corner_dfs: Collecting 100 samples in estimated 5.6034 s (900 iterations)
Benchmarking corner_to_corner_dfs: Analyzing
corner_to_corner_dfs time: [6.3738 ms 6.5179 ms 6.6639 ms]
change: [-39.581% -38.235% -36.837%] (p = 0.00 < 0.05)
Performance has improved.
Benchmarking corner_to_corner_dijkstra
Benchmarking corner_to_corner_dijkstra: Warming up for 3.0000 s
Benchmarking corner_to_corner_dijkstra: Collecting 100 samples in estimated 5.9179 s (25k iterations)
Benchmarking corner_to_corner_dijkstra: Analyzing
corner_to_corner_dijkstra
time: [234.67 µs 234.76 µs 234.84 µs]
change: [-66.813% -66.799% -66.785%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high mild
Benchmarking corner_to_corner_fringe
Benchmarking corner_to_corner_fringe: Warming up for 3.0000 s
Benchmarking corner_to_corner_fringe: Collecting 100 samples in estimated 5.0769 s (273k iterations)
Benchmarking corner_to_corner_fringe: Analyzing
corner_to_corner_fringe time: [18.577 µs 18.626 µs 18.683 µs]
change: [-67.393% -67.304% -67.211%] (p = 0.00 < 0.05)
Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
7 (7.00%) high mild
Benchmarking corner_to_corner_idastar
Benchmarking corner_to_corner_idastar: Warming up for 3.0000 s
Benchmarking corner_to_corner_idastar: Collecting 100 samples in estimated 5.0430 s (247k iterations)
Benchmarking corner_to_corner_idastar: Analyzing
corner_to_corner_idastar
time: [20.004 µs 20.102 µs 20.223 µs]
change: [-55.086% -54.790% -54.471%] (p = 0.00 < 0.05)
Performance has improved.
Found 20 outliers among 100 measurements (20.00%)
4 (4.00%) high mild
16 (16.00%) high severe
Benchmarking corner_to_corner_iddfs
Benchmarking corner_to_corner_iddfs: Warming up for 3.0000 s
Benchmarking corner_to_corner_iddfs: Collecting 100 samples in estimated 6.0055 s (15k iterations)
Benchmarking corner_to_corner_iddfs: Analyzing
corner_to_corner_iddfs time: [396.80 µs 397.58 µs 398.28 µs]
change: [-20.031% -19.887% -19.749%] (p = 0.00 < 0.05)
Performance has improved.
Benchmarking no_path_astar
Benchmarking no_path_astar: Warming up for 3.0000 s
Benchmarking no_path_astar: Collecting 100 samples in estimated 5.8837 s (20k iterations)
Benchmarking no_path_astar: Analyzing
no_path_astar time: [291.88 µs 292.02 µs 292.17 µs]
change: [-60.666% -60.626% -60.566%] (p = 0.00 < 0.05)
Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
6 (6.00%) high mild
2 (2.00%) high severe
Benchmarking no_path_bfs
Benchmarking no_path_bfs: Warming up for 3.0000 s
Benchmarking no_path_bfs: Collecting 100 samples in estimated 5.1681 s (35k iterations)
Benchmarking no_path_bfs: Analyzing
no_path_bfs time: [146.52 µs 146.57 µs 146.64 µs]
change: [-63.325% -63.212% -63.124%] (p = 0.00 < 0.05)
Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
2 (2.00%) low mild
6 (6.00%) high mild
6 (6.00%) high severe
Benchmarking no_path_dijkstra
Benchmarking no_path_dijkstra: Warming up for 3.0000 s
Benchmarking no_path_dijkstra: Collecting 100 samples in estimated 5.9327 s (25k iterations)
Benchmarking no_path_dijkstra: Analyzing
no_path_dijkstra time: [235.00 µs 235.05 µs 235.09 µs]
change: [-66.380% -66.373% -66.367%] (p = 0.00 < 0.05)
Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
1 (1.00%) low mild
3 (3.00%) high mild
2 (2.00%) high severe
Benchmarking no_path_fringe
Benchmarking no_path_fringe: Warming up for 3.0000 s
Benchmarking no_path_fringe: Collecting 100 samples in estimated 5.0888 s (15k iterations)
Benchmarking no_path_fringe: Analyzing
no_path_fringe time: [348.28 µs 348.61 µs 349.01 µs]
change: [-66.049% -66.014% -65.978%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high mild
Benchmarking separate_components
Benchmarking separate_components: Warming up for 3.0000 s
Benchmarking separate_components: Collecting 100 samples in estimated 5.7465 s (15k iterations)
Benchmarking separate_components: Analyzing
separate_components time: [390.54 µs 396.80 µs 403.31 µs]
change: [-83.647% -83.503% -83.328%] (p = 0.00 < 0.05)
Performance has improved.
Found 19 outliers among 100 measurements (19.00%)
8 (8.00%) high mild
11 (11.00%) high severe
Running benches/algos-fill.rs (x86_64-unknown-linux-gnu/release/deps/algos_fill-dacc7246a85b3e12)
Benchmarking fill-corner_to_corner_astar
Benchmarking fill-corner_to_corner_astar: Warming up for 3.0000 s
Benchmarking fill-corner_to_corner_astar: Collecting 100 samples in estimated 5.0714 s (242k iterations)
Benchmarking fill-corner_to_corner_astar: Analyzing
fill-corner_to_corner_astar
time: [20.698 µs 20.702 µs 20.706 µs]
change: [-37.409% -37.389% -37.372%] (p = 0.00 < 0.05)
Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
2 (2.00%) low mild
4 (4.00%) high mild
2 (2.00%) high severe
Benchmarking fill-corner_to_corner_bfs
Benchmarking fill-corner_to_corner_bfs: Warming up for 3.0000 s
Benchmarking fill-corner_to_corner_bfs: Collecting 100 samples in estimated 6.5087 s (20k iterations)
Benchmarking fill-corner_to_corner_bfs: Analyzing
fill-corner_to_corner_bfs
time: [323.66 µs 323.69 µs 323.74 µs]
change: [-39.483% -39.451% -39.427%] (p = 0.00 < 0.05)
Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
4 (4.00%) high mild
2 (2.00%) high severe
Benchmarking fill-corner_to_corner_dfs
Benchmarking fill-corner_to_corner_dfs: Warming up for 3.0000 s
Benchmarking fill-corner_to_corner_dfs: Collecting 100 samples in estimated 5.1439 s (2200 iterations)
Benchmarking fill-corner_to_corner_dfs: Analyzing
fill-corner_to_corner_dfs
time: [2.3379 ms 2.3383 ms 2.3388 ms]
change: [+5.9748% +6.1466% +6.2555%] (p = 0.00 < 0.05)
Performance has regressed.
Found 13 outliers among 100 measurements (13.00%)
2 (2.00%) low severe
6 (6.00%) low mild
2 (2.00%) high mild
3 (3.00%) high severe
Benchmarking fill-corner_to_corner_dijkstra
Benchmarking fill-corner_to_corner_dijkstra: Warming up for 3.0000 s
Benchmarking fill-corner_to_corner_dijkstra: Collecting 100 samples in estimated 5.4827 s (15k iterations)
Benchmarking fill-corner_to_corner_dijkstra: Analyzing
fill-corner_to_corner_dijkstra
time: [374.63 µs 374.68 µs 374.74 µs]
change: [-39.659% -39.633% -39.600%] (p = 0.00 < 0.05)
Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
2 (2.00%) high mild
2 (2.00%) high severe
Benchmarking fill-corner_to_corner_fringe
Benchmarking fill-corner_to_corner_fringe: Warming up for 3.0000 s
Benchmarking fill-corner_to_corner_fringe: Collecting 100 samples in estimated 5.0296 s (263k iterations)
Benchmarking fill-corner_to_corner_fringe: Analyzing
fill-corner_to_corner_fringe
time: [19.144 µs 19.155 µs 19.167 µs]
change: [-51.959% -51.934% -51.906%] (p = 0.00 < 0.05)
Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
5 (5.00%) high mild
12 (12.00%) high severe
Benchmarking fill-corner_to_corner_idastar
Benchmarking fill-corner_to_corner_idastar: Warming up for 3.0000 s
Benchmarking fill-corner_to_corner_idastar: Collecting 100 samples in estimated 5.0550 s (247k iterations)
Benchmarking fill-corner_to_corner_idastar: Analyzing
fill-corner_to_corner_idastar
time: [20.617 µs 20.621 µs 20.625 µs]
change: [-26.287% -26.267% -26.248%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
2 (2.00%) high mild
1 (1.00%) high severe
Benchmarking fill-corner_to_corner_iddfs
Benchmarking fill-corner_to_corner_iddfs: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.3s, enable flat sampling, or reduce sample count to 60.
Benchmarking fill-corner_to_corner_iddfs: Collecting 100 samples in estimated 5.2743 s (5050 iterations)
Benchmarking fill-corner_to_corner_iddfs: Analyzing
fill-corner_to_corner_iddfs
time: [1.0605 ms 1.0607 ms 1.0610 ms]
change: [-34.484% -34.448% -34.403%] (p = 0.00 < 0.05)
Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
3 (3.00%) high mild
1 (1.00%) high severe
Benchmarking fill-no_path_astar
Benchmarking fill-no_path_astar: Warming up for 3.0000 s
Benchmarking fill-no_path_astar: Collecting 100 samples in estimated 6.0269 s (15k iterations)
Benchmarking fill-no_path_astar: Analyzing
fill-no_path_astar time: [399.19 µs 399.38 µs 399.61 µs]
change: [-36.714% -36.672% -36.632%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high severe
Benchmarking fill-no_path_bfs
Benchmarking fill-no_path_bfs: Warming up for 3.0000 s
Benchmarking fill-no_path_bfs: Collecting 100 samples in estimated 6.5550 s (20k iterations)
Benchmarking fill-no_path_bfs: Analyzing
fill-no_path_bfs time: [324.68 µs 324.72 µs 324.77 µs]
change: [-36.158% -36.140% -36.123%] (p = 0.00 < 0.05)
Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
1 (1.00%) low mild
5 (5.00%) high mild
1 (1.00%) high severe
Benchmarking fill-no_path_dijkstra
Benchmarking fill-no_path_dijkstra: Warming up for 3.0000 s
Benchmarking fill-no_path_dijkstra: Collecting 100 samples in estimated 5.9174 s (15k iterations)
Benchmarking fill-no_path_dijkstra: Analyzing
fill-no_path_dijkstra time: [389.85 µs 389.91 µs 389.97 µs]
change: [-37.311% -37.284% -37.263%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
3 (3.00%) high mild
Benchmarking fill-no_path_fringe
Benchmarking fill-no_path_fringe: Warming up for 3.0000 s
Benchmarking fill-no_path_fringe: Collecting 100 samples in estimated 5.3553 s (15k iterations)
Benchmarking fill-no_path_fringe: Analyzing
fill-no_path_fringe time: [350.69 µs 350.73 µs 350.78 µs]
change: [-46.251% -46.238% -46.226%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
2 (2.00%) high mild
1 (1.00%) high severe
Running benches/edmondskarp.rs (x86_64-unknown-linux-gnu/release/deps/edmondskarp-e0723be8f8fe8795)
Benchmarking wikipedia_example_dense
Benchmarking wikipedia_example_dense: Warming up for 3.0000 s
Benchmarking wikipedia_example_dense: Collecting 100 samples in estimated 5.0054 s (2.7M iterations)
Benchmarking wikipedia_example_dense: Analyzing
wikipedia_example_dense time: [1.8810 µs 1.8854 µs 1.8937 µs]
change: [-68.882% -68.834% -68.766%] (p = 0.00 < 0.05)
Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
3 (3.00%) high mild
3 (3.00%) high severe
Benchmarking wikipedia_example_sparse
Benchmarking wikipedia_example_sparse: Warming up for 3.0000 s
Benchmarking wikipedia_example_sparse: Collecting 100 samples in estimated 5.0074 s (1.2M iterations)
Benchmarking wikipedia_example_sparse: Analyzing
wikipedia_example_sparse
time: [4.4419 µs 4.4508 µs 4.4674 µs]
change: [-72.594% -72.564% -72.531%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
1 (1.00%) high mild
2 (2.00%) high severe
Running benches/kuhn_munkres.rs (x86_64-unknown-linux-gnu/release/deps/kuhn_munkres-3f6270c35034b5ff)
Benchmarking Compare kuhn_munkres with different input sizes/32
Benchmarking Compare kuhn_munkres with different input sizes/32: Warming up for 3.0000 s
Benchmarking Compare kuhn_munkres with different input sizes/32: Collecting 100 samples in estimated 5.0322 s (434k iterations)
Benchmarking Compare kuhn_munkres with different input sizes/32: Analyzing
Compare kuhn_munkres with different input sizes/32
time: [10.420 µs 10.946 µs 11.486 µs]
change: [-44.233% -41.909% -39.288%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
1 (1.00%) low mild
2 (2.00%) high mild
Benchmarking Compare kuhn_munkres with different input sizes/64
Benchmarking Compare kuhn_munkres with different input sizes/64: Warming up for 3.0000 s
Benchmarking Compare kuhn_munkres with different input sizes/64: Collecting 100 samples in estimated 5.1570 s (56k iterations)
Benchmarking Compare kuhn_munkres with different input sizes/64: Analyzing
Compare kuhn_munkres with different input sizes/64
time: [77.386 µs 81.967 µs 86.500 µs]
change: [-33.325% -29.610% -25.339%] (p = 0.00 < 0.05)
Performance has improved.
Benchmarking Compare kuhn_munkres with different input sizes/128
Benchmarking Compare kuhn_munkres with different input sizes/128: Warming up for 3.0000 s
Benchmarking Compare kuhn_munkres with different input sizes/128: Collecting 100 samples in estimated 6.8082 s (15k iterations)
Benchmarking Compare kuhn_munkres with different input sizes/128: Analyzing
Compare kuhn_munkres with different input sizes/128
time: [461.57 µs 484.36 µs 508.54 µs]
change: [-29.645% -26.086% -22.387%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high mild
Benchmarking Compare kuhn_munkres with different input sizes/256
Benchmarking Compare kuhn_munkres with different input sizes/256: Warming up for 3.0000 s
Benchmarking Compare kuhn_munkres with different input sizes/256: Collecting 100 samples in estimated 5.0296 s (1600 iterations)
Benchmarking Compare kuhn_munkres with different input sizes/256: Analyzing
Compare kuhn_munkres with different input sizes/256
time: [3.3617 ms 3.4686 ms 3.5805 ms]
change: [-23.732% -20.133% -16.863%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
3 (3.00%) high mild
Benchmarking Compare kuhn_munkres with different input sizes/512
Benchmarking Compare kuhn_munkres with different input sizes/512: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 8.8s, or reduce sample count to 50.
Benchmarking Compare kuhn_munkres with different input sizes/512: Collecting 100 samples in estimated 8.7919 s (100 iterations)
Benchmarking Compare kuhn_munkres with different input sizes/512: Analyzing
Compare kuhn_munkres with different input sizes/512
time: [89.358 ms 90.066 ms 90.774 ms]
change: [-23.026% -22.131% -21.281%] (p = 0.00 < 0.05)
Performance has improved.
Running benches/movingai.rs (x86_64-unknown-linux-gnu/release/deps/movingai-cb08caf5e0d06d06)
Benchmarking arena
Benchmarking arena: Warming up for 3.0000 s
Benchmarking arena: Collecting 100 samples in estimated 5.6933 s (600 iterations)
Benchmarking arena: Analyzing
arena time: [9.4861 ms 9.4868 ms 9.4876 ms]
change: [-51.716% -51.710% -51.705%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
2 (2.00%) high mild
1 (1.00%) high severe
Running benches/separate_components.rs (x86_64-unknown-linux-gnu/release/deps/separate_components-be44c211178c816c)
Benchmarking separate_components
Benchmarking separate_components: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 7.9s, enable flat sampling, or reduce sample count to 50.
Benchmarking separate_components: Collecting 100 samples in estimated 7.8594 s (5050 iterations)
Benchmarking separate_components: Analyzing
separate_components time: [1.5535 ms 1.5537 ms 1.5540 ms]
change: [+292.62% +296.23% +299.55%] (p = 0.00 < 0.05)
Performance has regressed.
Found 4 outliers among 100 measurements (4.00%)
2 (2.00%) low mild
2 (2.00%) high mild
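
Each `change:` line in the log gives the relative change in measured time against the previously stored run, as lower bound, estimate, and upper bound. The title suggests that stored run is the plain Release build. A minimal sketch, assuming that reading, of turning such a percentage into a speedup factor and recovering the implied earlier time. The function names are hypothetical helpers and are not part of Criterion or cargo-pgo.

```rust
/// Convert a relative time change in percent into a speedup factor (old_time / new_time).
/// Assumes new_time = old_time * (1 + change / 100).
/// Returns None for changes of -100% or below, which would imply a non-positive new time.
fn speedup_from_change(change_percent: f64) -> Option<f64> {
    let ratio = 1.0 + change_percent / 100.0; // new_time / old_time
    if ratio > 0.0 {
        Some(1.0 / ratio)
    } else {
        None
    }
}

/// Recover the implied earlier time from the new time and the reported change.
/// The result is in the same unit as `new_time`.
fn baseline_time(new_time: f64, change_percent: f64) -> Option<f64> {
    speedup_from_change(change_percent).map(|speedup| new_time * speedup)
}
```

For example, `speedup_from_change(-66.799)` for `corner_to_corner_dijkstra` comes out at roughly 3, while the `+296.23%` on the final `separate_components` run gives a factor near 0.25, meaning the newer run took about four times as long.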