Skip to content

Instantly share code, notes, and snippets.

@sanjurm16
Created January 20, 2019 20:19
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save sanjurm16/15559e48d967606c9ceecfea80b6b7b2 to your computer and use it in GitHub Desktop.
Save sanjurm16/15559e48d967606c9ceecfea80b6b7b2 to your computer and use it in GitHub Desktop.
train_df.where("cabin is null").count()/train_df.count() *100
#77% of values is missing for cabin column. So ignoring this column for the model
wo_cabin_train_df = train_df.drop("cabin")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment