Last active June 1, 2023 15:36
A Jupyter notebook showing slightly more complex stratified proportionate random sampling from a Dask dataset based on value counts.
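The notebook itself is not reproduced here, so the following is only a minimal sketch of the general idea, written in plain Python rather than Dask. It assumes that "proportionate" means each stratum contributes to the sample in proportion to its value count, and it uses the largest-remainder method to round the per-stratum sizes so they sum exactly to the requested sample size. The function names `proportionate_allocation` and `stratified_sample` are hypothetical and are not taken from the notebook.

```python
import random
from collections import defaultdict


def proportionate_allocation(counts, sample_size):
    """Split sample_size across strata in proportion to their counts.

    counts maps each stratum label to its number of rows (its value count).
    Rounding uses the largest-remainder method, so the result always sums
    exactly to sample_size and never exceeds a stratum's own count.
    """
    total = sum(counts.values())
    if not 0 <= sample_size <= total:
        raise ValueError("sample_size must be between 0 and the population size")
    # Integer arithmetic avoids floating-point rounding surprises.
    alloc = {k: sample_size * c // total for k, c in counts.items()}
    remainders = {k: sample_size * c % total for k, c in counts.items()}
    leftover = sample_size - sum(alloc.values())
    # Give the remaining slots to the strata with the largest remainders.
    for k in sorted(counts, key=lambda k: remainders[k], reverse=True)[:leftover]:
        alloc[k] += 1
    return alloc


def stratified_sample(records, key, sample_size, seed=None):
    """Draw a proportionate stratified random sample from records.

    key extracts the stratum label from a record; seed makes the draw
    reproducible.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:
        strata[key(record)].append(record)
    counts = {label: len(rows) for label, rows in strata.items()}
    alloc = proportionate_allocation(counts, sample_size)
    sample = []
    for label, rows in strata.items():
        # Sample without replacement inside each stratum.
        sample.extend(rng.sample(rows, alloc[label]))
    return sample
```

As an illustration, a population with 60, 30, and 10 rows in strata `a`, `b`, and `c` and a requested sample of 20 rows yields 12, 6, and 2 rows per stratum. In Dask, the per-stratum counts would presumably come from a value-counts computation on the stratifying column, but that step is an assumption and is not shown here.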