INFO:__main__:Splitting keys = ['demographics_ohe', 'continuous', 'sig_faults', 'demographics_encoded', 'horizon', 'target', 'discrete_ohe', 'discrete']
INFO:__main__:Starting iteration over chunks and split datasets for num_samples = 60739
INFO:__main__:Finished splitting files
INFO:__main__:Outer dimension in /slow1/datasets/flogistix/etl_dataset/processed/train_signal.h5 is consistent of size 3569
INFO:__main__:Outer dimension in /slow1/datasets/flogistix/etl_dataset/processed/train_background.h5 is consistent of size 57170
INFO:__main__:Split dataset with no loss, found 3569 signal samples and 57170 background samples in total of 60739 samples
INFO:__main__:Splitting keys = ['demographics_ohe', 'continuous', 'sig_faults', 'demographics_encoded', 'horizon', 'target', 'discrete_ohe', 'discrete']
INFO:__main__:Starting iteration over chunks and split datasets for num_samples = 26012
INFO:__main__:Finished splitting files
INFO:__main__:Outer dimension in /slow1/datasets/flogistix/etl_dataset/processed/val_signal.h5 is consistent of size 1573
INFO:__main__:Outer dimension in /slow1/datasets/flogistix/etl_dataset/processed/val_background.h5 is consistent of size 24439
INFO:__main__:Split dataset with no loss, found 1573 signal samples and 24439 background samples in total of 26012 samples
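
The "no loss" lines in the log above report that signal and background counts sum to the total for each split. The script that produced the log is not shown, so the following is only a minimal sketch of that kind of check, assuming each sample can be classified by a predicate. The names `split_by_target` and `is_signal` are hypothetical, not taken from the original code.

```python
def split_by_target(samples, is_signal):
    """Partition samples into (signal, background) and check nothing is lost.

    `is_signal` is a hypothetical predicate; the real script presumably
    reads the 'target' key from the HDF5 files instead.
    """
    signal = [s for s in samples if is_signal(s)]
    background = [s for s in samples if not is_signal(s)]
    # The "no loss" check: both parts together must account for every sample.
    assert len(signal) + len(background) == len(samples)
    return signal, background


# Counts copied from the log: (signal, background, total) per split.
logged_counts = {
    "train": (3569, 57170, 60739),
    "val": (1573, 24439, 26012),
}
for name, (n_sig, n_bkg, n_total) in logged_counts.items():
    assert n_sig + n_bkg == n_total, name
```

Both logged splits pass this check, which matches the "Split dataset with no loss" messages.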