Skip to content

Instantly share code, notes, and snippets.

@caleb-kaiser
Created October 25, 2019 15:52
Show Gist options
  • Save caleb-kaiser/da124e4763b0c1444298048bf404ddd4 to your computer and use it in GitHub Desktop.
Save caleb-kaiser/da124e4763b0c1444298048bf404ddd4 to your computer and use it in GitHub Desktop.
Split test and featureize
dvc run \
-f split.dvc \
-d code/split_train_test.py \
-d data/Posts.tsv \
-o data/Posts-train.tsv \
-o data/Posts-test.tsv \
python \
code/split_train_test.py \
data/Posts.tsv \
0.2 \
20191001 \
data/Posts-train.tsv \
data/Posts-test.tsv
dvc run \
-f featurize.dvc \
-d code/featurization.py \
-d data/Posts-train.tsv \
-d data/Posts-test.tsv \
-o data/matrix-train.pkl \
-o data/matrix-test.pkl \
python \
code/featurization.py \
data/Posts-train.tsv \
data/Posts-test.tsv \
data/matrix-train.pkl \
data/matrix-test.pkl
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment