@sdruskat
Last active June 1, 2023 15:36
A Jupyter notebook showing slightly more complex stratified proportionate random sampling from a Dask dataset based on value counts.
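
As a rough illustration of the technique named in the description, here is a minimal sketch of stratified proportionate random sampling driven by per-value counts. It is not the notebook's code: it uses only the Python standard library as a stand-in for Dask, and the function name `stratified_proportionate_sample`, the `lang` column, and the toy records are all hypothetical. With a Dask DataFrame the per-stratum counts would more likely come from something like `value_counts()`.

```python
import random
from collections import Counter, defaultdict


def stratified_proportionate_sample(records, key, fraction, seed=0):
    """Draw the same fraction from every stratum defined by records[key].

    Hypothetical helper for illustration; plain Python, not Dask.
    """
    rng = random.Random(seed)

    # Group records by stratum value (the role value counts play).
    strata = defaultdict(list)
    for record in records:
        strata[record[key]].append(record)

    sample = []
    for members in strata.values():
        # Proportionate allocation: stratum sample size scales with stratum size.
        # At least one record per stratum is an assumption made here so that
        # small strata are not dropped entirely.
        n = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, n))
    return sample


# Toy data: 60 / 30 / 10 records across three strata.
records = (
    [{"lang": "python"}] * 60
    + [{"lang": "r"}] * 30
    + [{"lang": "julia"}] * 10
)

sample = stratified_proportionate_sample(records, key="lang", fraction=0.1)
counts = Counter(r["lang"] for r in sample)
print(counts)
```

Because every stratum is sampled at the same fraction, the sample keeps the 6 : 3 : 1 proportions of the full data set, which is the point of the proportionate variant.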