Enhancement: Reduce noise by running benchmarks multiple times in multiple subprocesses #262

Open
itamarst opened this issue Jul 10, 2024 · 0 comments

Python has a built-in source of noise: the hash seed is randomized for each interpreter process. Noise is bad because you can't tell whether, say, a 3% slowdown is noise or a real effect of a code change. How can that noise be removed?
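To illustrate (a minimal sketch, not part of pytest-benchmark): each fresh interpreter picks its own random hash seed, so the hash of the same string differs between processes, and dict/set layout and thus timings can shift with it.

```python
import os
import subprocess
import sys

# Each fresh interpreter picks a new random hash seed, so hash("...") differs
# between runs; dict/set layout, and therefore timings, can shift with it.
def string_hash_in_new_process() -> int:
    env = dict(os.environ, PYTHONHASHSEED="random")  # force a fresh seed
    out = subprocess.run(
        [sys.executable, "-c", "print(hash('pytest-benchmark'))"],
        capture_output=True, text=True, env=env, check=True,
    )
    return int(out.stdout)

seen = {string_hash_in_new_process() for _ in range(5)}
print(len(seen))  # almost certainly 5: each process hashed with a different seed
```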

Setting a fixed value for the hash seed (`export PYTHONHASHSEED=123`) can give distorted results: maybe your tweaked code is faster with that particular seed but slower with other seeds. It also doesn't help with other sources of randomness.
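A quick sketch of the fixed-seed workaround (the `hash_with_seed` helper is hypothetical, just for illustration): with `PYTHONHASHSEED` pinned, `hash()` becomes reproducible across processes, which removes this noise source but may bias results toward that one seed.

```python
import os
import subprocess
import sys

# With a fixed PYTHONHASHSEED, hash() is reproducible across interpreter runs.
# That removes hash-seed noise, but results may be biased toward that seed.
def hash_with_seed(seed: str) -> int:
    env = dict(os.environ, PYTHONHASHSEED=seed)
    out = subprocess.run(
        [sys.executable, "-c", "print(hash('key'))"],
        capture_output=True, text=True, env=env, check=True,
    )
    return int(out.stdout)

print(hash_with_seed("123") == hash_with_seed("123"))  # True: reproducible
```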

The other approach is to run the benchmarks multiple times in multiple processes, so they see a variety of seeds, and then report speed from the combined results. Averaging over different random seeds reduces noise. Ideally the pytest-benchmark framework would do this itself, so it can aggregate each benchmark's per-process results into one final result.
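A rough sketch of what that could look like (hypothetical helper names, not the pytest-benchmark API): time a snippet in N fresh interpreters, each with its own random hash seed, then aggregate the timings.

```python
import json
import os
import statistics
import subprocess
import sys

# Benchmark snippet run inside each fresh interpreter; it prints its best
# timing as JSON so the parent process can collect and aggregate it.
SNIPPET = """
import json, timeit
t = timeit.repeat("d = {str(i): i for i in range(1000)}", number=200, repeat=3)
print(json.dumps(min(t)))
"""

def timings_across_processes(n: int = 4) -> list:
    """Run the snippet in n subprocesses, each with a fresh random hash seed."""
    results = []
    for _ in range(n):
        env = dict(os.environ, PYTHONHASHSEED="random")  # new seed per process
        out = subprocess.run(
            [sys.executable, "-c", SNIPPET],
            capture_output=True, text=True, env=env, check=True,
        )
        results.append(json.loads(out.stdout))
    return results

times = timings_across_processes()
# Aggregating over seeds averages out the hash-randomization noise.
print(statistics.mean(times))
```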
