Skip to content

Instantly share code, notes, and snippets.

@luizirber
Last active December 20, 2015 03:49
Show Gist options
  • Save luizirber/6065962 to your computer and use it in GitHub Desktop.
Save luizirber/6065962 to your computer and use it in GitHub Desktop.
Quick and dirty parser
chr10_0 CTCTGCCCTGGGGTTTGAGATCCGCGGGTGCTTACTGACCCTTTTGGGTTGGAGTTAGCTCAAGCGGTTACCTCCTCAGGCTGGACTTTCTATCTGTCCAAAGCCAACtg
---------------------CCGCGGGTGCTTACTGACCCTTTT----------------------------------------------------------------- 1 teste
chr10_0 CTCTGCCCTGGGGTTTGAGATCCGCGGGTGCTTACTGACCCTTTTGGGTTGGAGTTAGCTCAAGCGGTTACCTCCTCAGGCTGGACTTTCTATCTGTCCAAAGCCAACtg
----------------------CGCGGGTGCTTACTGACC---------------------------------------------------------------------- 129 teste
chr10_0 CTCTGCCCTGGGGTTTGAGATCCGCGGGTGCTTACTGACCCTTTTGGGTTGGAGTTAGCTCAAGCGGTTACCTCCTCAGGCTGGACTTTCTATCTGTCCAAAGCCAACtg
----------------------CGCGGGTGCTTACTGACCC--------------------------------------------------------------------- 56 teste
chr10_10 CCTTGCCCGTCACGCAATGGAAGTTCGTGGGGAACCTGGCGCTAAACCATTCGTAGACTATCTGCTTCTGGGTCGGGGTT
---------------------------------ACCTGGCGCTAAACCATTCGTAGAC---------------------- 1 teste
chr10_10 CCTTGCCCGTCACGCAATGGAAGTTCGTGGGGAACCTGGCGCTAAACCATTCGTAGACTATCTGCTTCTGGGTCGGGGTT
---------------------------------------CGCTAAACCATTCGTAGA----------------------- 1 teste
chr10_0 CTCTGCCCTGGGGTTTGAGATCCGCGGGTGCTTACTGACCCTTTTGGGTTGGAGTTAGCTCAAGCGGTTACCTCCTCAGGCTGGACTTTCTATCTGTCCAAAGCCAACtg
---------------------CCGCGGGTGCTTACTGACCCTTTT----------------------------------------------------------------- 1 teste
----------------------CGCGGGTGCTTACTGACC---------------------------------------------------------------------- 129 teste
----------------------CGCGGGTGCTTACTGACCC--------------------------------------------------------------------- 56 teste
chr10_10 CCTTGCCCGTCACGCAATGGAAGTTCGTGGGGAACCTGGCGCTAAACCATTCGTAGACTATCTGCTTCTGGGTCGGGGTT
---------------------------------ACCTGGCGCTAAACCATTCGTAGAC---------------------- 1 teste
---------------------------------------CGCTAAACCATTCGTAGA----------------------- 1 teste
from __future__ import print_function
from collections import OrderedDict
parsed = OrderedDict()
with open('input', 'r') as dataset:
header = True
while header:
header = dataset.readline().rstrip()
match = dataset.readline().rstrip()
_ = dataset.readline()
if header:
key, seq = header.split()
if key not in parsed:
parsed[key] = {'seq': header, 'alignments': []}
parsed[key]['alignments'].append(match)
for k in parsed:
print(parsed[k]['seq'])
print(*(parsed[k]['alignments']), sep='\n')
print()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment