@CalebFenton
Created October 7, 2016 16:35
Convert 23andMe raw data to a dumb string of G's, A's, T's, and C's
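import sys

# The path to the raw data file is the first command-line argument.
with open(sys.argv[1], 'r') as f:
    for line in f:
        # Skip the commented header lines.
        if line.startswith('#'):
            continue
        parts = line.strip().split('\t')
        # The genotype is the last tab-separated column.
        chem = parts[-1]
        sys.stdout.write(chem)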
@CalebFenton
Author

Usage:

unzip genome_Caleb_Fenton_v4_Full_20161007092404.zip
python get_dna.py <unzipped-raw-data-file> > dna.txt
shasum -a 256 dna.txt
4e238b7005bd7f2188fd1766d3a706c8c9280676a37e1ea9490641a2195c905c  dna.txt
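
To illustrate what ends up in dna.txt, here is a minimal sketch that applies the same logic to a few made-up rows. It assumes the raw data is laid out as tab-separated rsid, chromosome, position, genotype; the helper name `genotype_string` and the sample rows are hypothetical, not part of the gist.

```python
def genotype_string(lines):
    """Concatenate the last tab-separated column of every non-comment line."""
    out = []
    for line in lines:
        # Header and comment lines start with '#'.
        if line.startswith('#'):
            continue
        out.append(line.strip().split('\t')[-1])
    return ''.join(out)


# Hypothetical sample rows (assumed layout: rsid, chromosome, position, genotype).
sample = [
    "# rsid\tchromosome\tposition\tgenotype\n",
    "rs0000001\t1\t1000\tAA\n",
    "rs0000002\t1\t2000\tCT\n",
    "rs0000003\t1\t3000\t--\n",
]

print(genotype_string(sample))  # prints AACT--
```

Note that any non-base value in the last column (such as the `--` in the sample above) is written through unchanged, so the output is not guaranteed to contain only G, A, T, and C.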
