Skip to content

Instantly share code, notes, and snippets.

@davidandrzej
Created September 17, 2013 21:12
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save davidandrzej/6600704 to your computer and use it in GitHub Desktop.
Save davidandrzej/6600704 to your computer and use it in GitHub Desktop.
Subsample CSV rows in Python
import csv
import random
outf = open('output.csv','w')
writer = csv.writer(outf)
for row in csv.reader(open('input.csv', 'rb')):
if(random.randint(0,1) == 0): # eg, drop 50%
writer.writerow(row)
outf.close()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment