@AyishaR
Created January 14, 2021 15:41
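from sklearn.model_selection import train_test_split

# Split into train, validation and test sets (80-10-10)
# First, an 80-20 split
train_df, val_test_df = train_test_split(df, test_size=0.2, random_state=3)
# Then split the remaining 20% in half: 10% validation, 10% test
val_df, test_df = train_test_split(val_test_df, test_size=0.5, random_state=3)

# train_bow, val_bow and test_bow are assumed to be sparse bag-of-words
# matrices built earlier; `languages` is assumed to be the list of label columns.
Xtrain = train_bow.toarray()
ytrain = train_df[languages]
Xval = val_bow.toarray()
yval = val_df[languages]
Xtest = test_bow.toarray()
ytest = test_df[languages]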
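
The snippet above uses `df`, `languages`, and the `*_bow` matrices without showing where they come from. The following is a minimal end-to-end sketch of one way they might be produced. The toy DataFrame, the `text` column name, the two language labels, and the use of `CountVectorizer` are all assumptions made for illustration and are not taken from the original gist. The vectorizer is fitted on the training text only, so the validation and test sets do not influence the vocabulary.

```python
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split

# Hypothetical toy data: a "text" column plus one-hot language label columns.
languages = ["english", "french"]
df = pd.DataFrame({
    "text": [f"hello world {i}" for i in range(10)]
            + [f"bonjour monde {i}" for i in range(10)],
    "english": [1] * 10 + [0] * 10,
    "french": [0] * 10 + [1] * 10,
})

# 80-10-10 split: first 80-20, then halve the 20%.
train_df, val_test_df = train_test_split(df, test_size=0.2, random_state=3)
val_df, test_df = train_test_split(val_test_df, test_size=0.5, random_state=3)

# Assumed bag-of-words step: learn the vocabulary from the training text only,
# then reuse it for the validation and test text.
vectorizer = CountVectorizer()
train_bow = vectorizer.fit_transform(train_df["text"])
val_bow = vectorizer.transform(val_df["text"])
test_bow = vectorizer.transform(test_df["text"])

# Dense feature arrays and label frames, as in the snippet above.
Xtrain, ytrain = train_bow.toarray(), train_df[languages]
Xval, yval = val_bow.toarray(), val_df[languages]
Xtest, ytest = test_bow.toarray(), test_df[languages]

print(len(train_df), len(val_df), len(test_df))
print(Xtrain.shape, Xval.shape, Xtest.shape)
```

Calling `.toarray()` converts the sparse matrices to dense NumPy arrays, which is fine for a small vocabulary but can use a lot of memory on a large corpus.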