Skip to content

Instantly share code, notes, and snippets.

@mandyRae
Created May 11, 2016 18:43
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save mandyRae/ac1105947844b6c28cd037abd6f5d5ff to your computer and use it in GitHub Desktop.
Save mandyRae/ac1105947844b6c28cd037abd6f5d5ff to your computer and use it in GitHub Desktop.
data for bioinformatics research on genes and proteins, goes with bioinfo.py
'''
Data file for bioinformatics research project
All of the following are genes were parsed for this project.
All are copied from NCBI's Nucleotide Database. Links are provided.
'''
'''GENES FROM CHROMOSOME 10 USED TO FIND CG CONTENT'''
#length 3618, http://www.ncbi.nlm.nih.gov/nuccore/XM_011539816.1
mannosebindinglectin ='ACATTGCTGAGCCCAGCCTCCTCCCTCACTCTGAGGCATCTGCCAGAGCCCCAGGCTAGAGGGCCAGCGTCCTTGTCACTGAGTCCCTGCTCTGCAGAAACACCAGTGAGGACCATGTCCCTGTTTCCATCACTCCCTCTCCTTCTCCTGAGTATGGTGGCAGCGTCTTACTCAGAAACTGTGACCTGTGAGGATGCCCAAAAGACCTGCCCTGCAGTGATTGCCTGTAGCTCTCCAGGCATCAACGGCTTCCCAGGCAAAGATGGGCGTGATGGCACCAAGGGAGAAAAGGGGGAACCAGGCCAAGGGCTCAGAGGCTTACAGGGCCCCCCTGGAAAGTTGGGGCCTCCAGGAAATCCAGGGCCTTCTGGGTCACCAGGACCAAAGGGCCAAAAAGGAGACCCTGGAAAAAGTCCGGATGGTGATAGTAGCCTGGCTGCCTCAGAAAGAAAAGCTCTGCAAACAGAAATGGCACGTATCAAAAAGTGGCTCACCTTCTCTCTGGGCAAACAAGTTGGGAACAAGTTCTTCCTGACCAATGGTGAAATAATGACCTTTGAAAAAGTGAAGGCCTTGTGTGTCAAGTTCCAGGCCTCTGTGGCCACCCCCAGGAATGCTGCAGAGAATGGAGCCATTCAGAATCTCATCAAGGAGGAAGCCTTCCTGGGCATCACTGATGAGAAGACAGAAGGGCAGTTTGTGGATCTGACAGGAAATAGACTGACCTACACAAACTGGAACGAGGGTGAACCCAACAATGCTGGTTCTGATGAAGATTGTGTATTGCTACTGAAAAATGGCCAGTGGAATGACGTCCCCTGCTCCACCTCCCATCTGGCCGTCTGTGAGTTCCCTATCTGAAGGGTCATATCACTCAGGCCCTCCTTGTCTTTTTACTGCAACCCACAGGCCCACAGTATGCTTGAAAAGATAAATTATATCAATTTCCTCATATCCAGTATTGTTCCTTTTGTGGGCAATCACTAAAAATGATCACTAACAGCACCAACAAAGCAATAATAGTAGTAGTAGTAGTTAGCAGCAGCAGTAGTAGTCATGCTAATTATATAATATTTTTAATATATACTATGAGGCCCTATCTTTTGCATCCTACATTAATTATCTAGTTTAATTAATCTGTAATGCTTTCGATAGTGTTAACTTGCTGCAGTATGAAAATAAGACGGATTTATTTTTCCATTTACAACAAACACCTGTGCTCTGTTGAGCCTTCCTTTCTGTTTGGGTAGAGGGCTCCCCTAATGACATCACCACAGTTTAATACCACAGCTTTTTACCAAGTTTCAGGTATTAAGAAAATCTATTTTGTAACTTTCTCTATGAACTCTGTTTTCTTTCTAATGAGATATTAAACCATGTAAAGAACATAAATAACAAATCTCAAGCAAACAGCTTCACAAATTCTCACACACATACATACCTATATACTCACTTTCTAGATTAAGATATGGGACATTTTTGACTCCCTAGAAGCCCCGTTATAACTCCTCCTAGTACTAACTCCTAGGAAAATACTATTCTGACCTCCATGACTGCACAGTAATTTCGTCTGTTTATAAACATTGTATAGTTGGAATCATATTGTGTGTAATGTTGTATGTCTTGTTTACTCAGAATTAAGTCTGTGAGATTCATTCATGTCATGTGTACAAAAGTTTCATCCTTTTCATTGCCATGTAGGGTTCCCTTATATTAATATTCCTCAGTTCATCCATTCTATTGTTAATAGGCACTTAAGTGGCTTCCAATTTTTGGCCATGAGGAAGAGAACCCACGAACATTCCTGGACTTGTCTTTTGGTGGACATGGTGCACTAATTTCACTACCTATCCAGGAGTGGAACTGGTAGAGGATGAGGAAAGCATGTATTCAGCTTTAGTAGATATTACCAGTTTTCCTAAGTGATTGTATGAATTTATGCTCCTACCGGCAATGTGTGGCAGTCCTAGATGCTCTATGTGCTTGTAAAAAGTCAATGTTTTCAGTTCTCTTGATTTTCATTATTCCTGTGGATGTAAAGTGATATTTCCCCATGGTTTTAATCTGTATTTCCCCAACATGTAATAAGGTTGAACACTTTTTTATATGCTTATTGGGCACTTGGGTATCTTCTTTTGTGAAGTACCCGTTCACATTTTTGTATTTTGTTTAAATTAGTTAGCCAATATTTTTCTTACTGATTTTTAAGTTATTTTTACATTCTGAATATGTCCTTTTTAATGTGTATTACAAATATTTTGCTAGTTTTTGACTTGCTCCTAATGTTGAATTTTGATGAACAAAATTTCCTAATTTTGAGAAAGTCTTATTTATTCATATTTTCTTTCAAAATTAGTGCTTTTTGTGTCATGTTTAAGAAATTTTTGCCCATCCCAAAATCATAAGATATTTTTCATGATTTTGAAACCATGAAGAGATTTTTCATGATTTTGAAATCATGAAGATATTTTTCCATTTTTTTCTAATAGTTTTATTAATAAACATTCTATCTATTCCTGGTAGAATAGATATCCACTTGAGACAGCACTATGTAGGAAAGACCATTTTTCCTCCACTGAACTAGGGTGGTGCATTTTTGTAAGTTAGGTAACTGTATGTGTGTGTGTCTGTTTCTGGGCTGTCTATTCTAGTCTATTTGTTGATGCTTGTGTCAAACAGTACACTATCTTAATTATTGTACATTTATAGTTGTAACTATAGTCCAGCTTTGTTCTTCTTAAAGTCAAGATTTCCATATAAATATTAGAAACAGCTTCTCAATTTCTACAAAATCCTGATGAGGTTTCTACTGGGACCACATTGAGTCTATCAATCAACTTATGCAGAACTGGCAACTTACTACTGAATCTCTAATCAATGTTCATCATGTATCGCTTCATGTAACTAGAATTTCTTTAACTTAATTGCTATGTTTTGACATTTTTAGTTTAAAAACCTTGTATATCTTGTTTTGGTGGTTTTAGTGATTTTAATAATATATTTTAAATATTTTTTCTTTTCTATTGTTGTACACAGAAATACAGTTAAGTTTTGTGTGTAGTCTTACGATGTTTAGTAAACTCAATAAGTTTATTTCTTAAATCTAGTAATTTGTAGATTCCTCTGGATTTTGTATATGCATAGTCATGTAAGCTGAAAATATGGCAATACTTGCTTCTTCCCAATTGCTTTACCTTTTTTCTTACCTTATTGCACTGGTTAGCAACCCCAATACAGAGACCACCAGATCAGGTATAGACTCCTGAAAGACAATATAATGAAGTGCTCCAGTCAGGCCTATCTAAACTGGATTCACAGCTCTGTCACTTAATTGCTACATGATCTAGAGCCAGTTACTTTGTGTTTCAGCCATGTATTTGCAGCTGAGAGAAAATAATCATTCTTATTTCATGAAAATTGTGGGGATGATGAAATAAGTTAACACCTTTAAAGTGTGTAGTAAAGTATCAGGATACTATATTTTAGGTCTTAATACACACAGTTATGCCGCTAGATACATGCTTTTTAATGAGATAATGTGATATTATACATAACACATATCGATTTTTAAAAATTAAATCAACCTTGCTTTGATGGAATAAACTCCATTTAGTCACA'
#length 4141, http://www.ncbi.nlm.nih.gov/nuccore/XM_011540119.1
transcriptionfactor7 = 'CATTAAGGCAGTGTGTTCCTCTCGCCCTGTCAATAATCTCCGCTCCCAGACTACTCCGTTCCTCCGGATTTCGATCCCCCTTTTTCTATCTGTCAATCAGCGCCGCCTTTGAACTGAAAAGCTCTCAGTCTAACTTCAACTCACTCAAATCCGAGCGGCACGAGCACCTCCTGTATCTTCGGCTTCCCCCCCCCTTTGCTCTTTATATCTGACTTCTTGTTGTTGTTGGTGTTTTTTTTTTTTTTACCCCCCTTTTTTATTTATTATTTTTTTGCACATTGATCGGATCCTTGGGAACGAGAGAAAAAAGAAACCCAAACTCACGCGTGCAGAAGATCTCCCCCCCCTTCCCCTCCCCTCCTCCCTCTTTTCCCCTCCCCAGGAGAAAAAGACCCCCAAGCAGAAAAAAGTTCACCTTGGACTCGTCTTTTTCTTGCAATATTTTTTGGGGGGGCAAAACTTTTTGGGGGTGATTTTTTTTGGCTTTTCTTCCTCCTTCATTTTTCTTCCAAAATTGCTGCTGGTGGGTGAAAAAAAAATGCCGCAGCTGAACGGCGGTGGAGGGGATGACCTAGGCGCCAACGACGAACTGATTTCCTTCAAAGACGAGGGCGAACAGGAGGAGAAGAGCTCCGAAAACTCCTCGGCAGAGAGGGATTTAGCTGATGTCAAATCGTCTCTAGTCAATGAATCAGAAACGAATCAAAACAGCTCCTCCGATTCCGAGGCGGAAAGACGGCCTCCGCCTCGCTCCGAAAGTTTCCGAGACAAATCCCGGGAAAGTTTGGAAGAAGCGGCCAAGAGGCAAGATGGAGGGCTCTTTAAGGGGCCACCGTATCCCGGCTACCCCTTCATCATGATCCCCGACCTGACGAGCCCCTACCTCCCCAACGGATCGCTCTCGCCCACCGCCCGAACCCTCCATTTTCAGTCCGGCAGCACACATTACTCTGCGTACAAAACGATTGAACACCAGATTGCAGTTCAGTATCTCCAGATGAAATGGCCACTGCTTGATGTCCAGGCAGGGAGCCTCCAGAGTAGACAAGCCCTCAAGGATGCCCGGTCCCCATCACCGGCACACATTGTCCAGAGCCCCCTCCCTTGCTGCACTCAGGGACATGACTGTCAGCACTTCTACCCCCCCTCAGACTTCACTGTCAGCACTCAAGTCTTCAGGGACATGAAAAGGAGCCACTCCTTACAAAAAGTTGGGGAGCCCTGGTGTATTGAGTCTAACAAAGTGCCAGTGGTGCAGCACCCTCACCATGTCCACCCCCTCACGCCTCTTATCACGTACAGCAATGAACACTTCACGCCGGGAAACCCACCTCCACACTTACCAGCCGACGTAGACCCCAAAACAGGAATCCCACGGCCTCCGCACCCTCCAGATATATCCCCGTATTACCCACTATCGCCTGGCACCGTAGGACAAATCCCCCATCCGCTAGGATGGTTAGTACCACAGCAAGGTCAACCAGTGTACCCAATCACGACAGGAGGATTCAGACACCCCTACCCCACAGCTCTGACCGTCAATGCTTCCATGTCCAGCTTTCTGTCTTCTAGGTTCCCTCCCCATATGGTCCCACCACATCATACGCTACACACGACGGGCATTCCGCATCCGGCCATAGTCACACCAACAGTCAAACAGGAATCGTCCCAGAGTGATGTCGGCTCACTCCATAGTTCAAAGCATCAGGACTCCAAAAAGGAAGAAGAAAAGAAGAAGCCCCACATAAAGAAACCTCTTAATGCATTCATGTTGTATATGAAGGAAATGAGAGCAAAGGTCGTAGCTGAGTGCACGTTGAAAGAAAGCGCGGCCATCAACCAGATCCTTGGGCGGAGGTGGCATGCACTGTCCAGAGAAGAGCAAGCGAAATACTACGAGCTGGCCCGGAAGGAGCGACAGCTTCATATGCAACTGTACCCCGGCTGGTCCGCGCGGGATAACTATGGAAAGAAGAAGAAGAGGAAAAGGGACAAGCAGCCGGGAGAGACCAATGGAGAAAAAAAAAAAGTGCGTTCGCTACATACAAGGTGAAGGCAGCTGCCTCAGCCCACCCTCTTCAGATGGAAGCTTACTAGATTCGCCTCCCCCCTCCCCGAACCTGCTAGGCTCCCCTCCCCGAGACGCCAAGTCACAGACTGAGCAGACCCAGCCTCTGTCGCTGTCCCTGAAGCCCGACCCCCTGGCCCACCTGTCCATGATGCCTCCGCCACCCGCCCTCCTGCTCGCTGAGGCCACCCACAAGGCCTCCGCCCTCTGTCCCAACGGGGCCCTGGACCTGCCCCCAGCCGCTTTGCAGCCTGCCGCCCCCTCCTCATCAATTGCACAGCCGTCGACTTCTTCCTTACATTCCCACAGCTCCCTGGCCGGGACCCAGCCCCAGCCGCTGTCGCTCGTCACCAAGTCTTTAGAATAGCTTTAGCGTCGTGAACCCCGCTGCTTTGTTTATGGTTTTGTTTCACTTTTCTTAATTTGCCCCCCACCCCCACCTTGAAAGGTTTTGTTTTGTACTCTCTTAATTTTGTGCCATGTGGCTACATTAGTTGATGTTTATCGAGTTCATTGGTCAATATTTGACCCATTCTTATTTCAATTTCTCCTTTTAAATATGTAGATGAGAGAAGAACCTCATGATTCTACCAAAATTTTTATCAACAGCTGTTTAAAGTCTTTGTAGCGTTTAAAAAATATATATATATACATAACTGTTATGTAGTTCGGATAGCTTAGTTTTAAAAGACTGATTAAAAAACAAAAAGAAAAAAAAAGCAATTTTGAAGCAGCCCTCCAGAAGGAGTTGGTTCTGTATTATTTGTATTAAATACGAGCTTGCGAACCAATCATTTTACATCTGGTTTTTAAACCGTAAGGGCACCATGAATGCAGTGCCGTTACTTTTTTTTTTTTTTTCTGTGTGAAACAACTCTTATTGTGATGTTACTTGTTATTGTTTAAATGTACAGAAACAAAGGGTAAAAATGTGTTAATATACCTTGTTCCATGGTGTTGTTCTTTTGGGGGGAGGGGACGCTACTCAACACTTAATAGAATCACAACGCTGTTGGGCCAGTAGTATTTATTGCTTTAGAGATTGCTTGTCGTACCTGTATGTCGTCCCTTTTTAAATATGTTTTCCTTTTTCTTGAAACTGTATAAAGTTTTTTTCCCCCTTAGCATAAGCATCTTATATATAACAACTCATTTGTACAAGGTTTTTAAGTTTATATATAAAATGTGTATATATATTTTTGTTTCCCCTTTTTGACTTTTTTTTTTCTGTATGAAACCCAGATGTCACCAAATGGACATTAATAGTTGCATTAAGGATCAGTAGCATTAACAAAAGTTGCTTTAAAAGCCATTATGTAAAACAAGACTTGAAAATGAGTGAGGGAATTTTAGCGACACTGTCTGAGCAGCAGTGGGAACCATCTTCGTTTCCCCTTTGAACTCCCAGTGGGATGCCCTACCCTGCGCCCTTAGGACCCGGACTGACCGTGTACAAAACTTTACGTGCCAAAATTCTCAGTGAATTTAGCTTTCTCCCTCTTTTTGATGCTGTAATTTTTGTTCATCATGTTTTGCTGTGATGTTACATAGGTAGATTTGTATGTAGTTTTAATGTCACCTATAACAAAATGTGTTTGGTAGCAGATTGTCCAGAAAGCATTTTAAATGAAGAGGTATAAACCCTTAAGGGCCAAAATTCTGTATATTAGATTACTCTTAAACGAAAAACCAGCTGCCGCTTTTATGTACACATATTACATACGAGTAGGCAGCAGACTTTAAAAATAAAAAAAACCTAGGCATGTTGATGTTGCAAAATGCTGTATAAAGCTGAAACCTGTTCATTCAGTGCCATTGTAGTTGACATGAAGCGATTGTAAAACTGTCTCCGATTTTTCTCTGGTTTATTAAAATGCTAACTATAACATTTTTTGTGAATACTTTGAATGTTTCCTAACAGTTGTGATGTTACTGTTCCGTTTTATGCTCTTATTCCAAGTTCATTTTTAATGGTTTGGAAGCCATTTTTGTAATGAATAAATGTTCATGCTGTACAGTATCTGTAGCATGCCGTTCTGGATTAATAAAAGCAACTTAGTATGTGCAGATAAA'
#length 1935, http://www.ncbi.nlm.nih.gov/nuccore/1004170671
cyclindependentkinase1 = 'AGCCGCCCTTTCCTCTTTCTTTCGCGCTCTAGCCACCCGGGAAGGCCTGCCCAGCGTAGCTGGGCTCTGATTGGCTGCTTTGAAAGTCTACGGGCTACCCGATTGGTGAATCCGGGGCCCTTTAGCGCGGATCTACCATACCCATTGACTAACTATGGAAGATTATACCAAAATAGAGAAAATTGGAGAAGGTACCTATGGAGTTGTGTATAAGGGTAGACACAAAACTACAGGTCAAGTGGTAGCCATGAAAAAAATCAGACTAGAAAGTGAAGAGGAAGGGGTTCCTAGTACTGCAATTCGGGAAATTTCTCTATTAAAGGAACTTCGTCATCCAAATATAGTCAGTCTTCAGGATGTGCTTATGCAGGATTCCAGGTTATATCTCATCTTTGAGTTTCTTTCCATGGATCTGAAGAAATACTTGGATTCTATCCCTCCTGGTCAGTACATGGATTCTTCACTTGTTAAGAGTTATTTATACCAAATCCTACAGGGGATTGTGTTTTGTCACTCTAGAAGAGTTCTTCACAGAGACTTAAAACCTCAAAATCTCTTGATTGATGACAAAGGAACAATTAAACTGGCTGATTTTGGCCTTGCCAGAGCTTTTGGAATACCTATCAGAGTATATACACATGAGGTAGTAACACTCTGGTACAGATCTCCAGAAGTATTGCTGGGGTCAGCTCGTTACTCAACTCCAGTTGACATTTGGAGTATAGGCACCATATTTGCTGAACTAGCAACTAAGAAACCACTTTTCCATGGGGATTCAGAAATTGATCAACTCTTCAGGATTTTCAGAGCTTTGGGCACTCCCAATAATGAAGTGTGGCCAGAAGTGGAATCTTTACAGGACTATAAGAATACATTTCCCAAATGGAAACCAGGAAGCCTAGCATCCCATGTCAAAAACTTGGATGAAAATGGCTTGGATTTGCTCTCGAAAATGTTAATCTATGATCCAGCCAAACGAATTTCTGGCAAAATGGCACTGAATCATCCATATTTTAATGATTTGGACAATCAGATTAAGAAGATGTAGCTTTCTGACAAAAAGTTTCCATATGTTATATCAACAGATAGTTGTGTTTTTATTGTTAACTCTTGTCTATTTTTGTCTTATATATATTTCTTTGTTATCAAACTTCAGCTGTACTTCGTCTTCTAATTTCAAAAATATAACTTAAAAATGTAAATATTCTATATGAATTTAAATATAATTCTGTAAATGTGTGTAGGTCTCACTGTAACAACTATTTGTTACTATAATAAAACTATAATATTGATGTCAGGAATCAGGAAAAAATTTGAGTTGGCTTAAATCATCTCAGTCCTTATGGCAGTTTTATTTTCCTGTAGTTGGAACTACTAAAATTTAGGAAAATGCTAAGTTCAAGTTTCGTAATGCTTTGAAGTATTTTTATGCTCTGAATGTTTAAATGTTCTCATCAGTTTCTTGCCATGTTGTTAACTATACAACCTGGCTAAAGATGAATATTTTTCTACTGGTATTTTAATTTTTGACCTAAATGTTTAAGCATTCGGAATGAGAAAACTATACAGATTTGAGAAATGATGCTAAATTTATAGGAGTTTTCAGTAACTTAAAAAGCTAACATGAGAGCATGCCAAAATTTGCTAAGTCTTACAAAGATCAAGGGCTGTCCGCAACAGGGAAGAACAGTTTTGAAAATTTATGAACTATCTTATTTTTAGGTAGGTTTTGAAAGCTTTTTGTCTAAGTGAATTCTTATGCCTTGGTCAGAGTAATAACTGAAGGAGTTGCTTATCTTGGCTTTCGAGTCTGAGTTTAAAACTACACATTTTGACATAGTGTTTATTAGCAGCCATCTAAAAAGGCTCTAATGTATATTTAACTAAAATTACTAGCTTTGGGAATTAAACTGTTTAACAAATAAAAAAAAAAAA'
#length 3951, http://www.ncbi.nlm.nih.gov/nuccore/XM_005252448.1
integrinbeta1 = 'ATCAGACGCGCAGAGGAGGCGGGGCCGCGGCTGGTTTCCTGCCGGGGGGCGGCTCTGGGCCGCCGAGTCCCCTCCTCCCGCCCCTGAGGAGGAGGAGCCGCCGCCACCCGCCGCGCCCGACACCCGGGAGGCCCCGCCAGCCCGCGGGAGAGGCCCAGCGGGAGTCGCGGAACAGCAGGCCCGAGCCCACCGCGCCGGGCCCCGGACGCCGCGCGGAAAAGATGAATTTACAACCAATTTTCTGGATTGGACTGATCAGTTCAGTTTGCTGTGTGTTTGCTCAAACAGATGAAAATAGATGTTTAAAAGCAAATGCCAAATCATGTGGAGAATGTATACAAGCAGGGCCAAATTGTGGGTGGTGCACAAATTCAACATTTTTACAGGAAGGAATGCCTACTTCTGCACGATGTGATGATTTAGAAGCCTTAAAAAAGAAGGGTTGCCCTCCAGATGACATAGAAAATCCCAGAGGCTCCAAAGATATAAAGAAAAATAAAAATGTAACCAACCGTAGCAAAGGAACAGCAGAGAAGCTCAAGCCAGAGGATATTACTCAGATCCAACCACAGCAGTTGGTTTTGCGATTAAGATCAGGGGAGCCACAGACATTTACATTAAAATTCAAGAGAGCTGAAGACTATCCCATTGACCTCTACTACCTTATGGACCTGTCTTACTCAATGAAAGACGATTTGGAGAATGTAAAAAGTCTTGGAACAGATCTGATGAATGAAATGAGGAGGATTACTTCGGACTTCAGAATTGGATTTGGCTCATTTGTGGAAAAGACTGTGATGCCTTACATTAGCACAACACCAGCTAAGCTCAGGAACCCTTGCACAAGTGAACAGAACTGCACCAGCCCATTTAGCTACAAAAATGTGCTCAGTCTTACTAATAAAGGAGAAGTATTTAATGAACTTGTTGGAAAACAGCGCATATCTGGAAATTTGGATTCTCCAGAAGGTGGTTTCGATGCCATCATGCAAGTTGCAGTTTGTGGATCACTGATTGGCTGGAGGAATGTTACACGGCTGCTGGTGTTTTCCACAGATGCCGGGTTTCACTTTGCTGGAGATGGGAAACTTGGTGGCATTGTTTTACCAAATGATGGACAATGTCACCTGGAAAATAATATGTACACAATGAGCCATTATTATGATTATCCTTCTATTGCTCACCTTGTCCAGAAACTGAGTGAAAATAATATTCAGACAATTTTTGCAGTTACTGAAGAATTTCAGCCTGTTTACAAGGAGCTGAAAAACTTGATCCCTAAGTCAGCAGTAGGAACATTATCTGCAAATTCTAGCAATGTAATTCAGTTGATCATTGATGCATACAATTCCCTTTCCTCAGAAGTCATTTTGGAAAACGGCAAATTGTCAGAAGGCGTAACAATAAGTTACAAATCTTACTGCAAGAACGGGGTGAATGGAACAGGGGAAAATGGAAGAAAATGTTCCAATATTTCCATTGGAGATGAGGTTCAATTTGAAATTAGCATAACTTCAAATAAGTGTCCAAAAAAGGATTCTGACAGCTTTAAAATTAGGCCTCTGGGCTTTACGGAGGAAGTAGAGGTTATTCTTCAGTACATCTGTGAATGTGAATGCCAAAGCGAAGGCATCCCTGAAAGTCCCAAGTGTCATGAAGGAAATGGGACATTTGAGTGTGGCGCGTGCAGGTGCAATGAAGGGCGTGTTGGTAGACATTGTGAATGCAGCACAGATGAAGTTAACAGTGAAGACATGGATGCTTACTGCAGGAAAGAAAACAGTTCAGAAATCTGCAGTAACAATGGAGAGTGCGTCTGCGGACAGTGTGTTTGTAGGAAGAGGGATAATACAAATGAAATTTATTCTGGCAAATTCTGCGAGTGTGATAATTTCAACTGTGATAGATCCAATGGCTTAATTTGTGGAGGAAATGGTGTTTGCAAGTGTCGTGTGTGTGAGTGCAACCCCAACTACACTGGCAGTGCATGTGACTGTTCTTTGGATACTAGTACTTGTGAAGCCAGCAACGGACAGATCTGCAATGGCCGGGGCATCTGCGAGTGTGGTGTCTGTAAGTGTACAGATCCGAAGTTTCAAGGGCAAACGTGTGAGATGTGTCAGACCTGCCTTGGTGTCTGTGCTGAGCATAAAGAATGTGTTCAGTGCAGAGCCTTCAATAAAGGAGAAAAGAAAGACACATGCACACAGGAATGTTCCTATTTTAACATTACCAAGGTAGAAAGTCGGGACAAATTACCCCAGCCGGTCCAACCTGATCCTGTGTCCCATTGTAAGGAGAAGGATGTTGACGACTGTTGGTTCTATTTTACGTATTCAGTGAATGGGAACAACGAGGTCATGGTTCATGTTGTGGAGAATCCAGAGTGTCCCACTGGTCCAGACATCATTCCAATTGTAGCTGGTGTGGTTGCTGGAATTGTTCTTATTGGCCTTGCATTACTGCTGATATGGAAGCTTTTAATGATAATTCATGACAGAAGGGAGTTTGCTAAATTTGAAAAGGAGAAAATGAATGCCAAATGGGACACGCAAGAAAATCCGATTTACAAGAGTCCTATTAATAATTTCAAGAATCCAAACTACGGACGTAAAGCTGGTCTCTAAATTGCCGGTGAAAATCCTATTTATAAGAGTGCCGTAACAACTGTGGTCAATCCGAAGTATGAGGGAAAATGAGTACTGCCCGTGCAAATCCCACAACACTGAATGCAAAGTAGCAATTTCCATAGTCACAGTTAGGTAGCTTTAGGGCAATATTGCCATGGTTTTACTCATGTGCAGGTTTTGAAAATGTACAATATGTATAATTTTTAAAATGTTTTATTATTTTGAAAATAATGTTGTAATTCATGCCAGGGACTGACAAAAGACTTGAGACAGGATGGTTACTCTTGTCAGCTAAGGTCACATTGTGCCTTTTTGACCTTTTCTTCCTGGACTATTGAAATCAAGCTTATTGGATTAAGTGATATTTCTATAGCGATTGAAAGGGCAATAGTTAAAGTAATGAGCATGATGAGAGTTTCTGTTAATCATGTATTAAAACTGATTTTTAGCTTTACAAATATGTCAGTTTGCAGTTATGCAGAATCCAAAGTAAATGTCCTGCTAGCTAGTTAAGGATTGTTTTAAATCTGTTATTTTGCTATTTGCCTGTTAGACATGACTGATGACATATCTGAAAGACAAGTATGTTGAGAGTTGCTGGTGTAAAATACGTTTGAAATAGTTGATCTACAAAGGCCATGGGAAAAATTCAGAGAGTTAGGAAGGAAAAACCAATAGCTTTAAAACCTGTGTGCCATTTTAAGAGTTACTTAATGTTTGGTAACTTTTATGCCTTCACTTTACAAATTCAAGCCTTAGATAAAAGAACCGAGCAATTTTCTGCTAAAAAGTCCTTGATTTAGCACTATTTACATACAGGCCATACTTTACAAAGTATTTGCTGAATGGGGACCTTTTGAGTTGAATTTATTTTATTATTTTTATTTTGTTTAATGTCTGGTGCTTTCTGTCACCTCTTCTAATCTTTTAATGTATTTGTTTGCAATTTTGGGGTAAGACTTTTTTTATGAGTACTTTTTCTTTGAAGTTTTAGCGGTCAATTTGCCTTTTTAATGAACATGTGAAGTTATACTGTGGCTATGCAACAGCTCTCACCTACGCGAGTCTTACTTTGAGTTAGTGCCATAACAGACCACTGTATGTTTACTTCTCACCATTTGAGTTGCCCATCTTGTTTCACACTAGTCACATTCTTGTTTTAAGTGCCTTTAGTTTTAACAGTTCACTTTTTACAGTGCTATTTACTGAAGTTATTTATTAAATATGCCTAAAATACTTAAATCGGATGTCTTGACTCTGATGTATTTTATCAGGTTGTGTGCATGAAATTTTTATAGATTAAAGAAGTTGAGGAAAAGCA'
#length 3,009, http://www.ncbi.nlm.nih.gov/nuccore/187954982
adenosinedeaminase = 'GCGCTTTCCTGCTCAGTCCTGAAAAGTGAGCCGCTCCCGGGTTTGCAACCTCAAGCTTCGCAGCAGCGGCGGCGGCGGCTGCCGGGAAGGAGGCAGGTGCAGGTGCAGGAGGGAGGCGGCTCTGGGCTCCGCGCCTGGGTCTCGGCCATGGCCTCGGTCCTGGGGAGCGGCAGAGGGTCTGGAGGGCTGAGCAGTCAACTCAAATGCAAGTCCAAGAGGAGGAGGAGGCGGAGGTCCAAGCGGAAAGATAAAGTAAGCATATTGTCAACCTTCCTCGCTCCTTTCAAGCACCTGAGTCCTGGCATCACAAACACGGAGGATGACGACACCCTCAGTACCAGCAGCGCGGAGGTGAAGGAGAACCGCAACGTGGGCAACCTGGCCGCGCGGCCACCGCCCTCCGGGGACCGGGCCCGGGGCGGCGCGCCCGGCGCGAAGAGGAAGCGGCCGCTGGAGGAGGGGAATGGGGGCCACTTGTGCAAACTGCAGCTGGTCTGGAAGAAGCTGTCGTGGTCGGTGGCGCCCAAGAACGCGCTGGTGCAGCTGCACGAGCTGAGGCCGGGCCTGCAGTACCGGACAGTGTCGCAGACGGGCCCGGTGCATGCCCCGGTCTTCGCGGTAGCGGTGGAGGTGAACGGGCTCACGTTCGAGGGCACAGGCCCCACCAAGAAGAAGGCCAAGATGCGCGCGGCGGAGCTGGCACTCAGGTCCTTCGTGCAGTTCCCCAACGCCTGCCAGGCGCACCTGGCCATGGGCGGGGGCCCGGGCCCCGGCACGGACTTCACCTCCGACCAGGCCGATTTCCCCGACACGCTCTTCCAGGAGTTCGAGCCCCCGGCGCCGCGCCCCGGACTCGCGGGAGGCCGCCCCGGGGACGCCGCGCTTCTGTCCGCGGCCTACGGGCGACGGCGGCTGCTGTGCCGCGCGCTGGACCTGGTGGGCCCGACCCCCGCCACCCCCGCGGCCCCGGGCGAGCGCAACCCCGTGGTGCTGCTGAACCGCCTGCGCGCCGGGCTGCGCTACGTGTGTCTGGCAGAACCGGCCGAGCGGCGCGCGCGGAGCTTCGTGATGGCCGTGAGCGTGGACGGCAGGACGTTCGAGGGCTCGGGGCGCAGCAAGAAGCTGGCCCGGGGTCAGGCCGCGCAGGCCGCACTGCAGGAGCTGTTCGACATCCAGATGCCCGGCCACGCGCCCGGCAGGGCCAGGAGGACGCCAATGCCGCAGGAATTCGCAGACTCCATATCCCAGCTGGTCACACAGAAGTTCCGCGAGGTGACGACGGACCTCACGCCCATGCACGCCCGCCATAAAGCGCTGGCAGGAATCGTCATGACCAAAGGCCTGGATGCTCGGCAGGCGCAGGTCGTGGCCCTGTCCTCGGGGACCAAGTGCATCAGCGGCGAGCACCTCAGTGACCAGGGGCTGGTGGTGAATGACTGCCACGCGGAGGTCGTGGCCCGGCGGGCGTTCCTGCACTTCCTCTACACGCAGCTGGAGCTGCACCTGAGCAAGCGGCGCGAGGACTCAGAGCGATCGATATTCGTGCGGTTAAAAGAAGGTGGCTACCGGCTGCGAGAGAACATCCTCTTCCATCTCTACGTGAGCACCTCCCCCTGTGGAGACGCAAGACTCCACTCTCCCTACGAGATCACCACAGACCTGCACAGCAGCAAACACCTCGTCAGGAAGTTCCGCGGGCACCTGCGCACCAAGATCGAGTCCGGGGAAGGGACGGTCCCCGTGCGTGGCCCCAGCGCAGTGCAGACCTGGGACGGCGTCCTGCTGGGGGAGCAGCTGATCACCATGTCCTGCACGGACAAGATCGCCAGGTGGAACGTCCTGGGGCTGCAGGGCGCGCTCCTGTCCCACTTCGTGGAGCCCGTGTACCTGCAGAGCATCGTGGTGGGCAGCCTGCACCACACGGGCCACCTCGCACGCGTCATGAGCCACCGCATGGAGGGTGTCGGCCAGCTGCCCGCCTCCTACCGGCACAACCGGCCTCTCCTCAGCGGCGTGAGTGACACCGAGGCGCGCCAGCCGGGGAAGTCGCCCCCCTTCAGCATGAACTGGGTCGTGGGCAGCGCGGACCTGGAGATTATCAACGCCACCACTGGGCGGAGGAGCTGTGGGGGCCCATCCCGGCTCTGCAAGCACGTGCTGTCTGCACGGTGGGCGCGGCTGTATGGCAGGCTGAGCACACGGACACCCAGCCCTGGAGACACGCCCTCCATGTACTGTGAGGCCAAGCTGGGGGCGCACACCTACCAGTCTGTGAAACAGCAGCTGTTCAAGGCCTTTCAGAAGGCTGGCCTGGGCACCTGGGTGAGGAAACCACCGGAGCAGCAGCAGTTTCTACTGACTCTCTAGGCTGCGGGCTCCTGGCTGCTGGAGCTGAGCGGGACGCTGGAGGGATGGGACCGTGTCTGGGGGGCGACGTGGCGGGTCGGCCGGTTCCCTGCATTCGTTTTACTTTGGTGTCCCAGAAACACGTGAGTGTGCAATGTTTGGACGAGCAACAACACAAATTCAGAACGTGCCTCTTTCCAGATCGCTGGCCCCAGAACCCTGTCCCCCACACCCAGGGGCACACGCACTGTTGAGTTAGCGCCGACTCTTCCTGTGGAGTCTGAGGGAGGGGCTCCATTCAGGCAAAGGGGTTTTAGCTGCAGCCTTGGAAGGAGGCACCGACACGACACCAGGCAGGAGTGAGCCTCAGGCCCCGTCCCTGCACCCCACCCCTGCGTGCGCCTCTTGGTGATGCTGGGGTCTCACTAGCTTGAGGGGGCACATGAAGATAAGCCACAAATGAAGAGAAAAGCCATGCCCACCCCAGCCCCAGAGAAACCAATAAGAATCCTCTATTATTTTCACTATTCATTTAGGTTTTTATACTCCACCTCCTTTCAAAAAAGATTTAAGATGTACGACATTACCGAACACCTAAAATAGAACCAGAGAAACGAAAGCCATTCCCACAAAGTGAAGGAACAGTTTCCAAAACCCCTGCGA'
#length 3115, http://www.ncbi.nlm.nih.gov/nuccore/NM_001278212.1
leucinerichrepeat='CCCCGCGCGCCCCGCCCCGCCGGCTCGGAACTCGCCTGGGGCGCCGCCGGCGGCGGAGGGAGCGTGACTGCGCTGCGCAGGGCGCTAGGAGGCATTGTCGCCGCTCAGGCCCTTTTGTGAGAAGCAGACCAGCCTGGGGGCTGGCGGCAGGACACCTGTGTCTGCATGCTGAAGAAGATGGGTGAGGCCGTGGCCAGAGTAGCAAGGAAGGTCAACGAGACGGTGGAGAGCGGCTCTGACACTCTGGACCTGGCCGAGTGCAAGCTGGTCTCCTTTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCTCTGGCCAGATCCACCTCATCACCCTGGCTAACAACGAGCTTAAGTCCCTCACCAGCAAGTTCATGACCACATTCAGTCAGCTCCGAGAGCTCCACCTGGAGGGGAACTTCCTACACCGCCTCCCCAGCGAGGTCAGTGCCCTGCAGCACCTCAAGGCCATTGACCTGTCCCGGAACCAGTTCCAGGACTTCCCTGAGCAGCTTACCGCCCTGCCGGCGCTGGAGACCATCAACCTGGAGGAGAACGAGATCGTAGATGTGCCCGTGGAGAAGCTGGCCGCCATGCCAGCCTTGCGCAGCATCAACCTCCGCTTCAACCCACTCAACGCCGAGGTGCGCGTGATCGCCCCGCCGCTCATCAAGTTTGACATGCTCATGTCTCCGGAAGGCGCAAGAGCCCCCCTACCTTAGGCCACCCTCCTCATGCCCACCCAGCAAGGGACAGAGGCCACAGGCCTGGAACCCTGGAAGGGAGGGAGGCCCATGGGAGGCCAAGCCTGGGGGCTGGGGGCGGGTGGGCCGAGCAGCACGTGGTGGGTGGGGTGCAGCTGGTCTGGATAGATAGCTTACAGCAGTAGTGGGCTCTGGAATGCCCAAGGGAAGAGGCAAGGTGGGGCCTGCAGCCTGGACTCGGCACTCACAGCTGCTGTGCAAACTCAGGCAGATCTCCTGCCCTCTCTGAGCCTTGTCACTTGAAAAAAACAGGACCCTTTCCCTCCTTTGGGCTCCCTGGAGGTTTTTAAGCAGTACGTGCCTCCAAGTTACCTCCAGATCAGCAGGCACAGGTGGGCATTGCCAGGTATTTTCTGAGCCCCTGCGGGTTTGAGGCCTTGTTTTTAGTGCTGAGAGCCAGTTGCTGCCCTGAGAAGAGAAGACAACCTCCATCTATTTATTGCTTCCTGAGAACTGACCTGGATGCGGCCCTCTGCAGGGCCCAGTCTTCAGTCCTGTGGTCCCTGGACTGGTGGGAACCTGAACTAGGAGTCCTGGGAGAGCTGTGGTGGGAATATGGGCTGGCACTGCTGCAGGGCAAGAACATTCATGTAGGAGCCCGAGGACCAGCAGGCTGGGAATGGGGAGCAAGTCACGTCAGCTCTGTCATTCCCCACAGTTAACAAATTGGCGGGGTGGGAAGTCCTGAGTGCTCCGTCCCTCTAGCATCACTCCTGAGCTGCGGGAGAGGTGGCCCAGAGAACAGCAGAGTCAGTTACACCTGCAGCTCTTGTCTAAAGTGATTAGATGGCCACCCTCACCACTGTCCAGTCCAGCAGCAGCCTGGCTGCCTTGTCATGGCCTCCTGGGGGCAGAAGGCGATGTGGACCACGGGATTTGTAGCCAGCCAGCTCCCAGGCCAACGCCCAAAGCCCTGATGACCTGGTTCTTCTGAGGCCCTCAACCTGGCATCTTAGGGTATGGTCAGGCAACAGGGTGACCAGCTGTCCTGGTTTCCCAGGACATGGAACTTTCAATGCTAAAACTGGGACAGTACCCAGCAAGTGGGGATGGTTGGTCCCCTACCAGGAGAGGGCCTGGGGCTCTTGCTTCCCGAGAACGCCTGTGGCTTGAAGAACCTTGACTGCTTGGTCCTCAGGTATCTACCTCCCACCTTCTCCTCATCTGTGGAGCAAGCCAACTCAGTGCCCCAGACCCCACCTGATCTGCATCTTTGTTTGCATCTCCAGAGACACCTGAGGCCCCAGAGCTTGAGGCAAAGCCAGGCCGTCCAAATCCTGTGTGCCGTGGACGAGTGGCCACTTTACTACTCCTAAGGCTAAGATGTTGAGAGCTCAGACCACTGCTCAGAGCAGTAATCCCTGCTCAGAATGCTCCCAGTTCCCTCGTCCCTGCCCAGGTCTCTTGTCTCTTGGGAAGGAACTGATAGGTCGGGCCATTGTTGGGCCATCACTGAGCGCTCAGTATCTCAAGAGACTCTGTTCATTCTGCTCGTATCCCAAGGCCTGGTTGGTCAAACTCTGGGCAAAGGGTTTTCAGGATGAGGAGGTCAAGACAGGATCTCCAGAGCTACCGAGTTCATCTGTGGGTGTTGGGGGCAAGTGGGGGCTGAAGTCCTGTGCAGGCTGCGCTGGCCCCACCTGCCTTGTGCCCTGGAGTGGGGTTTCTCCTTGTTGAAGAAGAGGCATCCTTCTCTGATGTGCACAAACACAATGTATGACCAGAGCCTTGCAACTCAAAGTGTGGTCTGTGGACCAGCAGCGGCAGTGACACCTGGGAGCTTGTTAGGAATGCAGAGTCTAGGCCTCACCCTATACCTCCCGACTCAGACCCTGCATTTTAGCAAGACCCCCAGCTGATTCCTATAAGCACTTTAGAGTTTGAGAAGCAAGGACCTAGGCTGGGGATGTCCTCCGAGCAGAGGGTGAAGTTTCTCTCAGTTCTCTCCCTGCCACTTCCAGGGATCTGAGCCTGTGCTCAGCCTCCTCCCTAACCCACCCTGGGAGACACTTGGCCTGTTAGATTGTTCCAGAGTCTGCATGGCACTCCTGAAGAAGGGAGTGTGACCTGCAGTCACCAGGAGATGAGGGTTAGGTGTGCCCAGCCCTCCAGACCCGGCCTTTCTGGTTAACCCCTGCATGCCAAGCTGCCTGCTGCCCCAGGTCCTCACCTCAGGCCTTTGAAGGGGCAGCTTCTGGAAGTTGTTTTCTCCTCTGCTTGGAGAGTTTGCCCTTGTCTGTCTTGGAAAGTGTGGGCAGCCACAGATGCCCCCAAATCAGAGCTCACAGTGAGTGAGCCCCTAAGCTTCAGTCTGCAATAAAGAATGCATTGGTTTCATCTGCAAAAAAAAAAA'
#length 2554, http://www.ncbi.nlm.nih.gov/nuccore/84105548
trypsindomaincontaining1 = 'GGCGCTAGGCAGCTTCAGCCGGACCGGGTAGGGGTCCTCGCTCGCTAGCTTGCTGTTTCTCGGAGAAGCTCCCGAGTGTCCGGCCTAGAGGCCATGAGAAGGCAGTGGGGGTCTGCCATGAGGGCGGCCGAGCAGGCGGGCTGCATGGTGAGCGCCTCCCGGGCCGGACAGCCCGAGGCGGGCCCGTGGAGCTGCAGCGGGGTAATCCTGAGCCGTAGCCCGGGCCTGGTGCTTTGCCACGGGGGCATCTTCGTCCCCTTCCTGCGAGCTGGCAGCGAAGTCCTGGCCGCGGCCGGCGCCGTCTTCCTGCCTGGCGACAGTTGCAGGGACGACCTGCGCCTGCACGTGCAGTGGGCCCCAACGGCCGCGGGTCCCGGGGGCGGCGCGGAGCGGGGCCGCCCAGGGCTGTGCACGCCCCAGTGCGCGAGCCTCGAGCCCGGCCCACCTGCCCCGTCCCGCGGGCGTCCCCTGCAGCCCCGGCTTCCTGCTGAGCTGCTGCTGCTGCTGAGCTGCCCGGCCTTCTGGGCCCACTTCGCGCGCCTCTTCGGGGACGAGGCAGCGGAACAGTGGCGCTTCTCGAGCGCGGCGCGGGATGACGAAGTGTCGGAGGACGAGGAGGCGGATCAACTGAGAGCGCTGGGCTGGTTTGCGCTGCTGGGCGTGCGGCTAGGCCAGGAGGAGGTGGAGGAGGAGCGCGGGCCAGCCATGGCGGTGTCGCCTCTCGGGGCCGTGCCCAAGGGTGCGCCATTGCTGGTCTGCGGCTCCCCTTTCGGCGCCTTCTGCCCCGACATCTTTCTCAACACGCTGAGCTGCGGGGTGCTCAGCAACGTGGCCGGCCCACTGCTGCTTACCGACGCACGCTGCCTTCCCGGCACCGAGGGCGGCGGCGTGTTCACCGCGCGGCCCGCGGGGGCGCTGGTGGCGCTGGTGGTGGCGCCGCTCTGTTGGAAGGCCGGCGAATGGGTGGGCTTCACGCTGCTCTGCGCCGCCGCCCCCCTTTTCCGCGCCGCCCGCGACGCGCTTCACCGCCTGCCGCACAGCACCGCTGCCCTGGCCGCCCTTCTGCCGCCAGAGGTGGGCGTCCCGTGGGGTCTGCCCCTCCGAGACTCCGGGCCCCTGTGGGCAGCCGCGGCAGTGTTGGTGGAGTGCGGCACCGTATGGGGCTCCGGAGTGGCTGTGGCACCCCGCCTTGTAGTGACCTGTCGGCACGTGTCCCCTCGGGAAGCAGCCAGGGTCCTGGTGCGCTCCACCACCCCCAAGAGTGTGGCCATCTGGGGCCGTGTGGTATTTGCCACTCAGGAGACATGTCCCTATGACATAGCAGTGGTGAGCCTGGAGGAGGACCTGGATGATGTCCCCATCCCTGTGCCCGCTGAGCACTTCCATGAAGGCGAGGCTGTGAGTGTGGTGGGCTTTGGCGTCTTTGGCCAGTCTTGCGGGCCCTCGGTGACCTCAGGCATCCTTTCGGCTGTGGTGCAGGTGAATGGCACGCCCGTAATGCTGCAGACCACGTGTGCTGTGCACAGCGGCTCCAGTGGGGGACCCCTCTTCTCCAACCACTCAGGAAACCTCCTTGGCATAATCACCAGCAACACCCGGGACAATAATACGGGGGCCACCTACCCCCACCTGAACTTCAGCATTCCCATCACGGTGCTCCAGCCGGCCCTGCAGCAGTACAGCCAGACCCAAGACCTAGGTGGCCTCCGTGAGCTGGACCGCGCTGCTGAGCCAGTCAGGGTGGTGTGGCGGTTGCAGCGGCCCCTGGCAGAGGCCCCGCGGAGCAAGCTCTGAGGCTGTGTTACCACCTTTGGAAAGAAGAGTGACCTTTTTCTGCTGTAGGAAGTGATGTTGAGGTGACGGTGGCCTCAGGATTCAGGGCCCAGCCCCTGCAGGGGCCCAGGCTGCCTCTCATCTCCACCCACTGACTGCAGACTGGGCTTTGGGCTCTGGGGCAAACTTCTCTTCAGCCCCATGGATCCTTAACCTGGCAGCCCGTTTTGGGGTGCTTTCTTGAGCCCCCAGTTCTCTGTCCCCTAGCACTAGACTCAGCTGTATTGTTTTTCCTTCTGGGGAGCCCACTCCAACTGCACAGAAGTTCTGGGCCTGACAGGTAGATTCCAGCTGGAAGGCAGGCCCGTGCCTGGTTTTGCGTCTGTTCCCCTGAGGGCCATCGTCATCCTGGAGCTTCAATGGGGCCTTGGCTCCTGTCTGCCTCTCAGTCAGAGTCAGGGCTGACAAAGGACTCAGCTTCCTTAGCATCTCAGCAGAAACCTTGCTCTGAAGACCAGAGACAGAAGGGACAGAAACAGGAGTGCCTCCTGCTGTGCCAGGCCCATGGGCAGTGCAGGCAGATCCCTGAAGGTCAGCACTCCTGGGTCTTCATATGCCAACAGGGGCGCTCTTGACACTGTGCCTTCATTTTCCAGCCCACAGCCTGGGTCTCAGGGATCTTGAGGGGTAGAACATGTCTGGTTGGGGCTTGGGAATAAACATGATCTATTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA'
#length 2538, http://www.ncbi.nlm.nih.gov/nuccore/1004170698
mitogenactivatedproteinkinase = 'AGCGCCAGGGCAGTGCTGGTCTGGGCAGTGCGCGGGGAAGCGGGGACCCGCTGTCACTGCGCCTCCCGCTGCCGACGCCGCCTGGACGGCCGCACTCTCCCTGCCCGAGACCCGACTCTCCAGAAAGAGCAACAGTAATGGAGTACATGAGCACTGGAAGTGACAATAAAGAAGAGATTGATTTATTAATTAAACATTTAAATGTGTCTGATGTAATAGACATTATGGAAAATCTTTATGCAAGTGAAGAGCCAGCAGTTTATGAACCCAGTCTAATGACCATGTGTCAAGACAGTAATCAAAACGATGAGCGTTCTAAGTCTCTGCTGCTTAGTGGCCAAGAGGTACCATGGTTGTCATCAGTCAGATATGGAACTGTGGAGGATTTGCTTGCTTTTGCAAACCATATATCCAACACTGCAAAGCATTTTTATGGACAACGACCACAGGAATCTGGAATTTTATTAAACATGGTCATCACTCCCCAAAATGGACGTTACCAAATAGATTCCGATGTTCTCCTGATCCCCTGGAAGCTGACTTACAGGAATATTGGTTCTGATTTTATTCCTCGGGGCGCCTTTGGAAAGGTATACTTGGCACAAGATATAAAGACGAAGAAAAGAATGGCGTGTAAACTGATCCCAGTAGATCAATTTAAGCCATCTGATGTGGAAATCCAGGCTTGCTTCCGGCACGAGAACATCGCAGAGCTGTATGGCGCAGTCCTGTGGGGTGAAACTGTCCATCTCTTTATGGAAGCAGGCGAGGGAGGGTCTGTTCTGGAGAAACTGGAGAGCTGTGGACCAATGAGAGAATTTGAAATTATTTGGGTGACAAAGCATGTTCTCAAGGGACTTGATTTTCTACACTCAAAGAAAGTGATCCATCATGATATTAAACCTAGCAACATTGTTTTCATGTCCACAAAAGCTGTTTTGGTGGATTTTGGCCTAAGTGTTCAAATGACCGAAGATGTCTATTTTCCTAAGGACCTCCGAGGAACAGAGATTTACATGAGCCCAGAGGTCATCCTGTGCAGGGGCCATTCAACCAAAGCAGACATCTACAGCCTGGGGGCCACGCTCATCCACATGCAGACGGGCACCCCACCCTGGGTGAAGCGCTACCCTCGCTCAGCCTATCCCTCCTACCTGTACATAATCCACAAGCAAGCACCTCCACTGGAAGACATTGCAGATGACTGCAGTCCAGGGATGAGAGAGCTGATAGAAGCTTCCCTGGAGAGAAACCCCAATCACCGCCCAAGAGCCGCAGACCTACTAAAACATGAGGCCCTGAACCCGCCCAGAGAGGATCAGCCACGCTGTCAGAGTCTGGACTCTGCCCTCTTGGAGCGCAAGAGGCTGCTGAGTAGGAAGGAGCTGGAACTTCCTGAGAACATTGCTGATTCTTCGTGCACAGGAAGCACCGAGGAATCTGAGATGCTCAAGAGGCAACGCTCTCTCTACATCGACCTCGGCGCTCTGGCTGGCTACTTCAATCTTGTTCGGGGACCACCAACGCTTGAATATGGCTGAAGGATGCCATGTTTGCTCTAAATTAAGACAGCATTGATCTCCTGGAGGCTGGTTCTGCTGCCTCTACACAGGGGCCCTGTACAGTGAATGGTGCCATTTTCGAAGGAGCAGTGTGACCTCCTGTGACCCATGAATGTGCCTCCAAGCGGCCCTGTGTGTTTGACATGTGAAGCTATTTGATATGCACCAGGTCTCAAGGTTCTCATTTCTCAGGTGACGTGATTCTAAGGCAGGAATTTGAGAGTTCACAGAAGGATCGTGTCTGCTGACTGTTTCATTCACTGTGCACTTTGCTCAAAATTTTAAAAATACCAATCACAAGGATAATAGAGTAGCCTAAAATTACTATTCTTGGTTCTTATTTAAGTATGGAATATTCATTTTACTCAGAATAGCTGTTTTGTGTATATTGGTGTATATTATATAACTCTTTGAGCCTTTATTGGTAAATTCTGGTATACATTGAATTCATTATAATTTGGGTGACTAGAACAACTTGAAGATTGTAGCAATAAGCTGGACTAGTGTCCTAAAAATGGCTAACTGATGAATTAGAAGCCATCTGACAGCAGGCCACTAGTGACAGTTTCTTTTGTGTTCCTATGGAAACATTTTATACTGTACATGCTATGCTGAAGACATTCAAAACGTGATGTTTTGAATGTGGATAAAACTGTGTAAACCACATAATTTTTGTACATCCCAAAGGATGAGAATGTGACCTTTAAGAAAAATGAAAACTTTTGTAAATTATTGATGATTTTGTAATTCTTATGACTAAATTTTCTTTTAAGCATTTGTATATTAAAATAGCATACTGTGTATGTTTTATATCAAATGCCTTCATGAATCTTTCATACATATATATATTTGTAACATTGTAAAGTATGTGAGTAGTCTTATGTAAAGTATGTTTTTACATTATGCAAATAAAACCCAATACTTTTGTCCAATGTGGTTGGTCAAATCAACTGAATAAATTCAGTATTTTGCCTTA'
#length 4038, http://www.ncbi.nlm.nih.gov/nuccore/1002341822
fibroblastgrowthfactorreceptor = 'ACAGACTCTCCCGCAGAACTGACCCCAGCAAGAAGCCTTTGGGAGCAGTAGAGATGGAGTTTCACTATGTTGCCCAGGCTAGCCTTGAACTCCTGACCTCAGATGATCTGCCCGCGCAGGCCTCCCGAAGTGCTGGGATTACAGGCATGAGCCACCGCACCTGGCCTGCCAACTCTTGTTAAGATCTCGAAGGAAACATTTTCTTCCCCTGAAGGAAACCCAGCTATGCAGACACCAGCTGATAATCTTGCATTCCTGAAAGATGTTGCACCCCTATGGCAAGTGGCGGCTGCTGAGGCTCTGACGTGACTCCCAGGCATGAACGCTCTCAGCTGTGTTTACCTCAGCTCCTCGGGAGGGAGCCTGGGAGACTGACGCCTGAGTTTTACATCAGTGTCAAAACCCAAGCACAACCTAGGGAGGGACCTCCTGCCTAGTGTGTGTGGGTCAGGAGATAGAAAAGCTCTCACTGAGTAAACTGGACAAGGTCAATATACCTCGCTGATTGAGAAGACTTCACTCTCTCTGCAAAGAGACGTGTGTGTTTTAGAGGAAGTGGGAGCCCCAGCCGATTCTGCAAGACTTCCGAGAGTCAGATATCCAGACAGAAGATGCGGACACCTGGGTGACCAGACAGCGAAGAGGAAAGAACAAAACGAGCATGTGCCAAGCCTGTGAGGGAGAAAGGGCAACAAACCAGTGACCTTCCACAGAAATGTGTTTAAACAAAACAAAACAGCTCTTTGGCGTTGCTAAGAGACTGCCATTTTGGAGGAAAGAGCGATCGCCTCACCGGCCCATCCTCCAAGCCGGACTGCCGGCAAATGCCTCCACAGTGGTCGGAGGAGACGTAGAGTTTGTCTGCAAGGTTTACAGTGATGCCCAGCCCCACATCCAGTGGATCAAGCACGTGGAAAAGAACGGCAGTAAATACGGGCCCGACGGGCTGCCCTACCTCAAGGTTCTCAAGGCCGCCGGTGTTAACACCACGGACAAAGAGATTGAGGTTCTCTATATTCGGAATGTAACTTTTGAGGACGCTGGGGAATATACGTGCTTGGCGGGTAATTCTATTGGGATATCCTTTCACTCTGCATGGTTGACAGTTCTGCCAGCGCCTGGAAGAGAAAAGGAGATTACAGCTTCCCCAGACTACCTGGAGATAGCCATTTACTGCATAGGGGTCTTCTTAATCGCCTGTATGGTGGTAACAGTCATCCTGTGCCGAATGAAGAACACGACCAAGAAGCCAGACTTCAGCAGCCAGCCGGCTGTGCACAAGCTGACCAAACGTATCCCCCTGCGGAGACAGGTAACAGTTTCGGCTGAGTCCAGCTCCTCCATGAACTCCAACACCCCGCTGGTGAGGATAACAACACGCCTCTCTTCAACGGCAGACACCCCCATGCTGGCAGGGGTCTCCGAGTATGAACTTCCAGAGGACCCAAAATGGGAGTTTCCAAGAGATAAGCTGACACTGGGCAAGCCCCTGGGAGAAGGTTGCTTTGGGCAAGTGGTCATGGCGGAAGCAGTGGGAATTGACAAAGACAAGCCCAAGGAGGCGGTCACCGTGGCCGTGAAGATGTTGAAAGATGATGCCACAGAGAAAGACCTTTCTGATCTGGTGTCAGAGATGGAGATGATGAAGATGATTGGGAAACACAAGAATATCATAAATCTTCTTGGAGCCTGCACACAGGATGGGCCTCTCTATGTCATAGTTGAGTATGCCTCTAAAGGCAACCTCCGAGAATACCTCCGAGCCCGGAGGCCACCCGGGATGGAGTACTCCTATGACATTAACCGTGTTCCTGAGGAGCAGATGACCTTCAAGGACTTGGTGTCATGCACCTACCAGCTGGCCAGAGGCATGGAGTACTTGGCTTCCCAAAAATGTATTCATCGAGATTTAGCAGCCAGAAATGTTTTGGTAACAGAAAACAATGTGATGAAAATAGCAGACTTTGGACTCGCCAGAGATATCAACAATATAGACTATTACAAAAAGACCACCAATGGGCGGCTTCCAGTCAAGTGGATGGCTCCAGAAGCCCTGTTTGATAGAGTATACACTCATCAGAGTGATGTCTGGTCCTTCGGGGTGTTAATGTGGGAGATCTTCACTTTAGGGGGCTCGCCCTACCCAGGGATTCCCGTGGAGGAACTTTTTAAGCTGCTGAAGGAAGGACACAGAATGGATAAGCCAGCCAACTGCACCAACGAACTGTACATGATGATGAGGGACTGTTGGCATGCAGTGCCCTCCCAGAGACCAACGTTCAAGCAGTTGGTAGAAGACTTGGATCGAATTCTCACTCTCACAACCAATGAGGAATACTTGGACCTCAGCCAACCTCTCGAACAGTATTCACCTAGTTACCCTGACACAAGAAGTTCTTGTTCTTCAGGAGATGATTCTGTTTTTTCTCCAGACCCCATGCCTTACGAACCATGCCTTCCTCAGTATCCACACATAAACGGCAGTGTTAAAACATGAATGACTGTGTCTGCCTGTCCCCAAACAGGACAGCACTGGGAACCTAGCTACACTGAGCAGGGAGACCATGCCTCCCAGAGCTTGTTGTCTCCACTTGTATATATGGATCAGAGGAGTAAATAATTGGAAAAGTAATCAGCATATGTGTAAAGATTTATACAGTTGAAAACTTGTAATCTTCCCCAGGAGGAGAAGAAGGTTTCTGGAGCAGTGGACTGCCACAAGCCACCATGTAACCCCTCTCACCTGCCGTGCGTACTGGCTGTGGACCAGTAGGACTCAAGGTGGACGTGCGTTCTGCCTTCCTTGTTAATTTTGTAATAATTGGAGAAGATTTATGTCAGCACACACTTACAGAGCACAAATGCAGTATATAGGTGCTGGATGTATGTAAATATATTCAAATTATGTATAAATATATATTATATATTTACAAGGAGTTATTTTTTGTATTGATTTTAAATGGATGTCCCAATGCACCTAGAAAATTGGTCTCTCTTTTTTTAATAGCTATTTGCTAAATGCTGTTCTTACACATAATTTCTTAATTTTCACCGAGCAGAGGTGGAAAAATACTTTTGCTTTCAGGGAAAATGGTATAACGTTAATTTATTAATAAATTGGTAATATACAAAACAATTAATCATTTATAGTTTTTTTTGTAATTTAAGTGGCATTTCTATGCAGGCAGCACAGCAGACTAGTTAATCTATTGCTTGGACTTAACTAGTTATCAGATCCTTTGAAAAGAGAATATTTACAATATATGACTAATTTGGGGAAAATGAAGTTTTGATTTATTTGTGTTTAAATGCTGCTGTCAGACGATTGTTCTTAGACCTCCTAAATGCCCCATATTAAAAGAACTCATTCATAGGAAGGTGTTTCATTTTGGTGTGCAACCCTGTCATTACGTCAACGCAACGTCTAACTGGACTTCCCAAGATAAATGGTACCAGCGTCCTCTTAAAAGATGCCTTAATCCATTCCTTGAGGACAGACCTTAGTTGAAATGATAGCAGAATGTGCTTCTCTCTGGCAGCTGGCCTTCTGCTTCTGAGTTGCACATTAATCAGATTAGCCTGTATTCTCTTCAGTGAATTTTGATAATGGCTTCCAGACTCTTTGGCGTTGGAGACGCCTGTTAGGATCTTCAAGTCCCATCATAGAAAATTGAAACACAGAGTTGTTCTGCTGATAGTTTTGGGGATACGTCCATCTTTTTAAGGGATTGCTTTCATCTAATTCTGGCAGGACCTCACCAAAAGATCCAGCCTCATACCTACATCAGACAAAATATCGCCGTTGTTCCTTCTGTACTAAAGTATTGTGTTTTGCTTTGGAAACACCCACTCACTTTGCAATAGCCGTGCAAGATGAATGCAGATTACACTGATCTTATGTGTTACAAAATTGGAGAAAGTATTTAATAAAACCTGTTAATTTTTATACTGACAATAAAAATGTTTCTACAGATATTAATGTTAACAAGACAAAATAAATGTCACGCAACTTATTTTTTTAATAAAAAAAAAAAAAAA'
#length 1642, http://www.ncbi.nlm.nih.gov/nuccore/158255079
cytochromeP450 = 'CTCCCGGGCTGGCAGCAGGGCCCCAGCGGCACCATGTCTGCCCTCGGAGTCACCGTGGCCCTGCTGGTGTGGGCGGCCTTCCTCCTGCTGGTGTCCATGTGGAGGCAGGTGCACAGCAGCTGGAATCTGCCCCCAGGCCCTTTCCCGCTTCCCATCATCGGGAACCTCTTCCAGTTGGAATTGAAGAATATTCCCAAGTCCTTCACCCGGTTGGCCCAGCGCTTCGGGCCGGTGTTCACGCTGTACGTGGGCTCGCAGCGCATGGTGGTGATGCACGGCTACAAGGCGGTGAAGGAAGCGCTGCTGGACTACAAGGACGAGTTCTCGGGCAGAGGCGACCTCCCCGCGTTCCATGCGCACAGGGACAGGGGAATCATTTTTAATAATGGACCTACCTGGAAGGACATCCGGCGGTTTTCCCTGACCACCCTCCGGAACTATGGGATGGGGAAACAGGGCAATGAGAGCCGGATCCAGAGGGAGGCCCACTTCCTGCTGGAAGCACTCAGGAAGACCCAAGGCCTGCCTTTCGACCCCACCTTCCTCATCGGCTGCGCGCCCTGCAACGTCATAGCCGACATCCTCTTCCGCAAGCATTTTGACTACAATGATGAGAAGTTTCTAAGGCTGATGTATTTGTTTAATGAGAACTTCCACCTACTCAGCACTCCCTGGCTCCAGCTTTACAATAATTTTCCCAGCTTTCTACACTACTTGCCTGGAAGCCACAGAAAAGTCATAAAAAATGTGGCTGAAGTAAAAGAGTATGTGTCTGAAAGGGTGAAGGAGCACCATCAATCTCTGGACCCCAACTGTCCCCGGGACCTCACCGACTGCCTGCTCGTGGAAATGGAGAAGGAAAAGCACAGTGCAGAGCGCTTGTACACAATGGACGGTATCACCGTGACTGTGGCCGACCTGTTCTTTGCGGGGACAGAGACCACCAGCACAACTCTGAGATATGGGCTCCTGATTCTCATGAAATACCCTGAGATCGAAGAGAAGCTCCATGAAGAAATTGACAGGGTGATTGGGCCAAGCCGAATCCCTGCCATCAAGGATAGGCAAGAGATGCCCTACATGGATGCTGTGGTGCATGAGATTCAGCGGTTCATCACCCTCGTGCCCTCCAACCTGCCCCATGAAGCAACCCGAGACACCATTTTCAGAGGATACCTCATCCCCAAGGGCACAGTCGTAGTGCCAACTCTGGACTCTGTTTTGTATGACAACCAAGAATTTCCTGATCCAGAAAAGTTTAAGCCAGAACACTTCCTGAATGAAAATGGAAAGTTCAAGTACAGTGACTATTTCAAGCCATTTTCCACAGGAAAACGAGTGTGTGCTGGAGAAGGCCTGGCTCGCATGGAGTTGTTTCTTTTGTTGTGTGCCATTTTGCAGCATTTTAATTTGAAGCCTCTCGTTGACCCAAAGGATATCGACCTCAGCCCTATACATATTGGGTTTGGCTGTATCCCACCACGTTACAAACTCTGTGTCATTCCCCGCTCATGAGTGTGTGGAGGACACCCTGAACCCCCCGCTTTCAAACAAGATTTCGAATTGTTTGAGGTCAGGATTTCTCAAACTGATTCCTTTCTTTGCATATGAGTATTTGAAAATAAATATTTTCCCAGAATAT'
'''GENES THAT RESULT IN GENETIC DISORDERS/DISEASES'''
#length 1998, http://www.ncbi.nlm.nih.gov/nuccore/224831254
opsin1colorblindness ='CCCACTGGCCGGTATAAAGCACCGTGACCCTCAGGTGACGCACCAGGGCCGGCTGCCGTCGGGGACAGGGCTTTCCATAGCCATGGCCCAGCAGTGGAGCCTCCAAAGGCTCGCAGGCCGCCATCCGCAGGACAGCTATGAGGACAGCACCCAGTCCAGCATCTTCACCTACACCAACAGCAACTCCACCAGAGGCCCCTTCGAAGGCCCGAATTACCACATCGCTCCCAGATGGGTGTACCACCTCACCAGTGTCTGGATGATCTTTGTGGTCATTGCATCCGTCTTCACAAATGGGCTTGTGCTGGCGGCCACCATGAAGTTCAAGAAGCTGCGCCACCCGCTGAACTGGATCCTGGTGAACCTGGCGGTCGCTGACCTGGCAGAGACCGTCATCGCCAGCACTATCAGCGTTGTGAACCAGGTCTATGGCTACTTCGTGCTGGGCCACCCTATGTGTGTCCTGGAGGGCTACACCGTCTCCCTGTGTGGGATCACAGGTCTCTGGTCTCTGGCCATCATTTCCTGGGAGAGATGGATGGTGGTCTGCAAGCCCTTTGGCAATGTGAGATTTGATGCCAAGCTGGCCATCGTGGGCATTGCCTTCTCCTGGATCTGGGCTGCTGTGTGGACAGCCCCGCCCATCTTTGGTTGGAGCAGGTACTGGCCCCACGGCCTGAAGACTTCATGCGGCCCAGACGTGTTCAGCGGCAGCTCGTACCCCGGGGTGCAGTCTTACATGATTGTCCTCATGGTCACCTGCTGCATCACCCCACTCAGCATCATCGTGCTCTGCTACCTCCAAGTGTGGCTGGCCATCCGAGCGGTGGCAAAGCAGCAGAAAGAGTCTGAATCCACCCAGAAGGCAGAGAAGGAAGTGACGCGCATGGTGGTGGTGATGGTCCTGGCATTCTGCTTCTGCTGGGGACCATACGCCTTCTTCGCATGCTTTGCTGCTGCCAACCCTGGCTACCCCTTCCACCCTTTGATGGCTGCCCTGCCGGCCTTCTTTGCCAAAAGTGCCACTATCTACAACCCCGTTATCTATGTCTTTATGAACCGGCAGTTTCGAAACTGCATCTTGCAGCTTTTCGGGAAGAAGGTTGACGATGGCTCTGAACTCTCCAGCGCCTCCAAAACGGAGGTCTCATCTGTGTCCTCGGTATCGCCTGCATGAGGTCTGCCTCCTACCCATCCCGCCCACCGGGGCTTTGGCCACCTCTCCTTTCCCCCTCCTTCTCCATCCCTGTAAAATAAATGTAATTTATCTTTGCCAAAACCAACAAAGTCACAGAGGCTTTCACTGCAGTGTGGGACCACCTGAGCCTCTGCGTGTGCAGGCACTGGGTCTCGAGAGGGTGCAAGGGGGATAAAGAGGAGAGAGCGCTTCATAGACTTTAAGTTTTCCCGAGCCTCATGTCTACCGATGGCGTGAAAGGATCCTGGCAAAACAGAAGTGTGAGGCAGGTGGGCGTCTATATCCATTTCACCAGGCTGGTGGTTACATAATCGGCAAGCAAGAGCTGTGGAGGGGCTTGCTGGATGCCCTCAGCACCCAGGAGGAGGGAGGGAGCTAGCAAGCTAAGGCAGGTGGCCCTCCTGGCCCCTTAAGGTCCATCTGCTGGAGGCCCAGAGTCCTTGGAGTACAGTCTACACCTGGAGGGGACCCATTCCTGCCAGTCTGTGGCAGGGATGGCGCGCCACCTCTGCCAGGCCAGGACCCCAAGCCCGATCAGCATCAGCATGGTGCAGGTGCACAGGCGTGAGCTGATCAGTGACGAGGGGCAGGCACACAAGGTGGAGACAAAGACCAAGAGGACGGTTGCCAGTGAGAGGCGCGGACTCAGGAACTTGAACAACATCTGCGGGGGACGGCTTTGGAGGTGCTCCGCTGCCTCCAGTTGGGTGACTTGCTGTAGCATCTCCAGCTTGGATATTCGGCTCTTGAAGGTCTCCGTGATCTCCTGCAGGAGACGAAAATGCACGCACCAGAAGTCA'
#length 723, http://www.ncbi.nlm.nih.gov/nuccore/XM_011532205.1
sncaparkinsons = 'GGAGGAGCTTGCTTCTCCATTCTGGTGTGATCCAGGAACAGCTGTCTTCCAGCTCTGAAAGAGTGTGGTGTAAAGGAATTCATTAGCCATGGATGTATTCATGAAAGGACTTTCAAAGGCCAAGGAGGGAGTTGTGGCTGCTGCTGAGAAAACCAAACAGGGTGTGGCAGAAGCAGCAGGAAAGACAAAAGAGGGTGTTCTCTATGTAGGCTCCAAAACCAAGGAGGGAGTGGTGCATGGTGTGGCAACAGTGGCTGAGAAGACCAAAGAGCAAGTGACAAATGTTGGAGGAGCAGTGGTGACGGGTGTGACAGCAGTAGCCCAGAAGACAGTGGAGGGAGCAGGGAGCATTGCAGCAGCCACTGGCTTTGTCAAAAAGGACCAGTTGGGCAAGAAGCATCCAAAATACAAACCATCTAAGAGGCAAGAAAATGTCGTGATGTTCCTAGTGCAAGTTAAAAAGATTTGCTTTCCTCAAGTCGGAAAGCCCTTCTCATTTTTGAGGTTTTTTTCTTCTTTTTTTTTTCAAGTGAAAGCATTTTGGAGGAGTCAATATCCATCTTTAAAGGTAGCCAGGTCACATGTATACATATGTAACTAACCTGCACAATGTGCACATGTACCCTAAAACTTAAAGTATAATTTAAAAAAAAAGAATTTAAATAAAAAAAGAAAATCAGAGAGAAAAAAAAAAAGATGCATGTGCACCCTGATACTACCATC'
#length 9048, http://www.ncbi.nlm.nih.gov/nuccore/NM_000132.3
f8hemophila = 'GCTTAGTGCTGAGCACATCCAGTGGGTAAAGTTCCTTAAAATGCTCTGCAAAGAAATTGGGACTTTTCATTAAATCAGAAATTTTACTTTTTTCCCCTCCTGGGAGCTAAAGATATTTTAGAGAAGAATTAACCTTTTGCTTCTCCAGTTGAACATTTGTAGCAATAAGTCATGCAAATAGAGCTCTCCACCTGCTTCTTTCTGTGCCTTTTGCGATTCTGCTTTAGTGCCACCAGAAGATACTACCTGGGTGCAGTGGAACTGTCATGGGACTATATGCAAAGTGATCTCGGTGAGCTGCCTGTGGACGCAAGATTTCCTCCTAGAGTGCCAAAATCTTTTCCATTCAACACCTCAGTCGTGTACAAAAAGACTCTGTTTGTAGAATTCACGGATCACCTTTTCAACATCGCTAAGCCAAGGCCACCCTGGATGGGTCTGCTAGGTCCTACCATCCAGGCTGAGGTTTATGATACAGTGGTCATTACACTTAAGAACATGGCTTCCCATCCTGTCAGTCTTCATGCTGTTGGTGTATCCTACTGGAAAGCTTCTGAGGGAGCTGAATATGATGATCAGACCAGTCAAAGGGAGAAAGAAGATGATAAAGTCTTCCCTGGTGGAAGCCATACATATGTCTGGCAGGTCCTGAAAGAGAATGGTCCAATGGCCTCTGACCCACTGTGCCTTACCTACTCATATCTTTCTCATGTGGACCTGGTAAAAGACTTGAATTCAGGCCTCATTGGAGCCCTACTAGTATGTAGAGAAGGGAGTCTGGCCAAGGAAAAGACACAGACCTTGCACAAATTTATACTACTTTTTGCTGTATTTGATGAAGGGAAAAGTTGGCACTCAGAAACAAAGAACTCCTTGATGCAGGATAGGGATGCTGCATCTGCTCGGGCCTGGCCTAAAATGCACACAGTCAATGGTTATGTAAACAGGTCTCTGCCAGGTCTGATTGGATGCCACAGGAAATCAGTCTATTGGCATGTGATTGGAATGGGCACCACTCCTGAAGTGCACTCAATATTCCTCGAAGGTCACACATTTCTTGTGAGGAACCATCGCCAGGCGTCCTTGGAAATCTCGCCAATAACTTTCCTTACTGCTCAAACACTCTTGATGGACCTTGGACAGTTTCTACTGTTTTGTCATATCTCTTCCCACCAACATGATGGCATGGAAGCTTATGTCAAAGTAGACAGCTGTCCAGAGGAACCCCAACTACGAATGAAAAATAATGAAGAAGCGGAAGACTATGATGATGATCTTACTGATTCTGAAATGGATGTGGTCAGGTTTGATGATGACAACTCTCCTTCCTTTATCCAAATTCGCTCAGTTGCCAAGAAGCATCCTAAAACTTGGGTACATTACATTGCTGCTGAAGAGGAGGACTGGGACTATGCTCCCTTAGTCCTCGCCCCCGATGACAGAAGTTATAAAAGTCAATATTTGAACAATGGCCCTCAGCGGATTGGTAGGAAGTACAAAAAAGTCCGATTTATGGCATACACAGATGAAACCTTTAAGACTCGTGAAGCTATTCAGCATGAATCAGGAATCTTGGGACCTTTACTTTATGGGGAAGTTGGAGACACACTGTTGATTATATTTAAGAATCAAGCAAGCAGACCATATAACATCTACCCTCACGGAATCACTGATGTCCGTCCTTTGTATTCAAGGAGATTACCAAAAGGTGTAAAACATTTGAAGGATTTTCCAATTCTGCCAGGAGAAATATTCAAATATAAATGGACAGTGACTGTAGAAGATGGGCCAACTAAATCAGATCCTCGGTGCCTGACCCGCTATTACTCTAGTTTCGTTAATATGGAGAGAGATCTAGCTTCAGGACTCATTGGCCCTCTCCTCATCTGCTACAAAGAATCTGTAGATCAAAGAGGAAACCAGATAATGTCAGACAAGAGGAATGTCATCCTGTTTTCTGTATTTGATGAGAACCGAAGCTGGTACCTCACAGAGAATATACAACGCTTTCTCCCCAATCCAGCTGGAGTGCAGCTTGAGGATCCAGAGTTCCAAGCCTCCAACATCATGCACAGCATCAATGGCTATGTTTTTGATAGTTTGCAGTTGTCAGTTTGTTTGCATGAGGTGGCATACTGGTACATTCTAAGCATTGGAGCACAGACTGACTTCCTTTCTGTCTTCTTCTCTGGATATACCTTCAAACACAAAATGGTCTATGAAGACACACTCACCCTATTCCCATTCTCAGGAGAAACTGTCTTCATGTCGATGGAAAACCCAGGTCTATGGATTCTGGGGTGCCACAACTCAGACTTTCGGAACAGAGGCATGACCGCCTTACTGAAGGTTTCTAGTTGTGACAAGAACACTGGTGATTATTACGAGGACAGTTATGAAGATATTTCAGCATACTTGCTGAGTAAAAACAATGCCATTGAACCAAGAAGCTTCTCCCAGAATTCAAGACACCCTAGCACTAGGCAAAAGCAATTTAATGCCACCACAATTCCAGAAAATGACATAGAGAAGACTGACCCTTGGTTTGCACACAGAACACCTATGCCTAAAATACAAAATGTCTCCTCTAGTGATTTGTTGATGCTCTTGCGACAGAGTCCTACTCCACATGGGCTATCCTTATCTGATCTCCAAGAAGCCAAATATGAGACTTTTTCTGATGATCCATCACCTGGAGCAATAGACAGTAATAACAGCCTGTCTGAAATGACACACTTCAGGCCACAGCTCCATCACAGTGGGGACATGGTATTTACCCCTGAGTCAGGCCTCCAATTAAGATTAAATGAGAAACTGGGGACAACTGCAGCAACAGAGTTGAAGAAACTTGATTTCAAAGTTTCTAGTACATCAAATAATCTGATTTCAACAATTCCATCAGACAATTTGGCAGCAGGTACTGATAATACAAGTTCCTTAGGACCCCCAAGTATGCCAGTTCATTATGATAGTCAATTAGATACCACTCTATTTGGCAAAAAGTCATCTCCCCTTACTGAGTCTGGTGGACCTCTGAGCTTGAGTGAAGAAAATAATGATTCAAAGTTGTTAGAATCAGGTTTAATGAATAGCCAAGAAAGTTCATGGGGAAAAAATGTATCGTCAACAGAGAGTGGTAGGTTATTTAAAGGGAAAAGAGCTCATGGACCTGCTTTGTTGACTAAAGATAATGCCTTATTCAAAGTTAGCATCTCTTTGTTAAAGACAAACAAAACTTCCAATAATTCAGCAACTAATAGAAAGACTCACATTGATGGCCCATCATTATTAATTGAGAATAGTCCATCAGTCTGGCAAAATATATTAGAAAGTGACACTGAGTTTAAAAAAGTGACACCTTTGATTCATGACAGAATGCTTATGGACAAAAATGCTACAGCTTTGAGGCTAAATCATATGTCAAATAAAACTACTTCATCAAAAAACATGGAAATGGTCCAACAGAAAAAAGAGGGCCCCATTCCACCAGATGCACAAAATCCAGATATGTCGTTCTTTAAGATGCTATTCTTGCCAGAATCAGCAAGGTGGATACAAAGGACTCATGGAAAGAACTCTCTGAACTCTGGGCAAGGCCCCAGTCCAAAGCAATTAGTATCCTTAGGACCAGAAAAATCTGTGGAAGGTCAGAATTTCTTGTCTGAGAAAAACAAAGTGGTAGTAGGAAAGGGTGAATTTACAAAGGACGTAGGACTCAAAGAGATGGTTTTTCCAAGCAGCAGAAACCTATTTCTTACTAACTTGGATAATTTACATGAAATAATACACACAATCAAGAAAAAAAAATTCAGGAAGAAATAGAAAAGAAGGAAACATTAATCCAAGAGAATGTAGTTTTGCCTCAGATACATACAGTGACTGGCACTAAGAATTTCATGAAGAACCTTTTCTTACTGAGCACTAGGCAAAATGTAGAAGGTTCATATGACGGGGCATATGCTCCAGTACTTCAAGATTTTAGGTCATTAAATGATTCAACAAATAGAACAAAGAAACACACAGCTCATTTCTCAAAAAAAGGGGAGGAAGAAAACTTGGAAGGCTTGGGAAATCAAACCAAGCAAATTGTAGAGAAATATGCATGCACCACAAGGATATCTCCTAATACAAGCCAGCAGAATTTTGTCACGCAACGTAGTAAGAGAGCTTTGAAACAATTCAGACTCCCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGATGACACCTCAACCCAGTGGTCCAAAAACATGAAACATTTGACCCCGAGCACCCTCACACAGATAGACTACAATGAGAAGGAGAAAGGGGCCATTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTCATAGCATCCCTCAAGCAAATAGATCTCCATTACCCATTGCAAAGGTATCATCATTTCCATCTATTAGACCTATATATCTGACCAGGGTCCTATTCCAAGACAACTCTTCTCATCTTCCAGCAGCATCTTATAGAAAGAAAGATTCTGGGGTCCAAGAAAGCAGTCATTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCCATTCTAACCTTGGAGATGACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAATTCAGTCACATACAAGAAAGTTGAGAACACTGTTCTCCCGAAACCAGACTTGCCCAAAACATCTGGCAAAGTTGAATTGCTTCCAAAAGTTCACATTTATCAGAAGGACCTATTCCCTACGGAAACTAGCAATGGGTCTCCTGGCCATCTGGATCTCGTGGAAGGGAGCCTTCTTCAGGGAACAGAGGGAGCGATTAAGTGGAATGAAGCAAACAGACCTGGAAAAGTTCCCTTTCTGAGAGTAGCAACAGAAAGCTCTGCAAAGACTCCCTCCAAGCTATTGGATCCTCTTGCTTGGGATAACCACTATGGTACTCAGATACCAAAAGAAGAGTGGAAATCCCAAGAGAAGTCACCAGAAAAAACAGCTTTTAAGAAAAAGGATACCATTTTGTCCCTGAACGCTTGTGAAAGCAATCATGCAATAGCAGCAATAAATGAGGGACAAAATAAGCCCGAAATAGAAGTCACCTGGGCAAAGCAAGGTAGGACTGAAAGGCTGTGCTCTCAAAACCCACCAGTCTTGAAACGCCATCAACGGGAAATAACTCGTACTACTCTTCAGTCAGATCAAGAGGAAATTGACTATGATGATACCATATCAGTTGAAATGAAGAAGGAAGATTTTGACATTTATGATGAGGATGAAAATCAGAGCCCCCGCAGCTTTCAAAAGAAAACACGACACTATTTTATTGCTGCAGTGGAGAGGCTCTGGGATTATGGGATGAGTAGCTCCCCACATGTTCTAAGAAACAGGGCTCAGAGTGGCAGTGTCCCTCAGTTCAAGAAAGTTGTTTTCCAGGAATTTACTGATGGCTCCTTTACTCAGCCCTTATACCGTGGAGAACTAAATGAACATTTGGGACTCCTGGGGCCATATATAAGAGCAGAAGTTGAAGATAATATCATGGTAACTTTCAGAAATCAGGCCTCTCGTCCCTATTCCTTCTATTCTAGCCTTATTTCTTATGAGGAAGATCAGAGGCAAGGAGCAGAACCTAGAAAAAACTTTGTCAAGCCTAATGAAACCAAAACTTACTTTTGGAAAGTGCAACATCATATGGCACCCACTAAAGATGAGTTTGACTGCAAAGCCTGGGCTTATTTCTCTGATGTTGACCTGGAAAAAGATGTGCACTCAGGCCTGATTGGACCCCTTCTGGTCTGCCACACTAACACACTGAACCCTGCTCATGGGAGACAAGTGACAGTACAGGAATTTGCTCTGTTTTTCACCATCTTTGATGAGACCAAAAGCTGGTACTTCACTGAAAATATGGAAAGAAACTGCAGGGCTCCCTGCAATATCCAGATGGAAGATCCCACTTTTAAAGAGAATTATCGCTTCCATGCAATCAATGGCTACATAATGGATACACTACCTGGCTTAGTAATGGCTCAGGATCAAAGGATTCGATGGTATCTGCTCAGCATGGGCAGCAATGAAAACATCCATTCTATTCATTTCAGTGGACATGTGTTCACTGTACGAAAAAAAGAGGAGTATAAAATGGCACTGTACAATCTCTATCCAGGTGTTTTTGAGACAGTGGAAATGTTACCATCCAAAGCTGGAATTTGGCGGGTGGAATGCCTTATTGGCGAGCATCTACATGCTGGGATGAGCACACTTTTTCTGGTGTACAGCAATAAGTGTCAGACTCCCCTGGGAATGGCTTCTGGACACATTAGAGATTTTCAGATTACAGCTTCAGGACAATATGGACAGTGGGCCCCAAAGCTGGCCAGACTTCATTATTCCGGATCAATCAATGCCTGGAGCACCAAGGAGCCCTTTTCTTGGATCAAGGTGGATCTGTTGGCACCAATGATTATTCACGGCATCAAGACCCAGGGTGCCCGTCAGAAGTTCTCCAGCCTCTACATCTCTCAGTTTATCATCATGTATAGTCTTGATGGGAAGAAGTGGCAGACTTATCGAGGAAATTCCACTGGAACCTTAATGGTCTTCTTTGGCAATGTGGATTCATCTGGGATAAAACACAATATTTTTAACCCTCCAATTATTGCTCGATACATCCGTTTGCACCCAACTCATTATAGCATTCGCAGCACTCTTCGCATGGAGTTGATGGGCTGTGATTTAAATAGTTGCAGCATGCCATTGGGAATGGAGAGTAAAGCAATATCAGATGCACAGATTACTGCTTCATCCTACTTTACCAATATGTTTGCCACCTGGTCTCCTTCAAAAGCTCGACTTCACCTCCAAGGGAGGAGTAATGCCTGGAGACCTCAGGTGAATAATCCAAAAGAGTGGCTGCAAGTGGACTTCCAGAAGACAATGAAAGTCACAGGAGTAACTACTCAGGGAGTAAAATCTCTGCTTACCAGCATGTATGTGAAGGAGTTCCTCATCTCCAGCAGTCAAGATGGCCATCAGTGGACTCTCTTTTTTCAGAATGGCAAAGTAAAGGTTTTTCAGGGAAATCAAGACTCCTTCACACCTGTGGTGAACTCTCTAGACCCACCGTTACTGACTCGCTACCTTCGAATTCACCCCCAGAGTTGGGTGCACCAGATTGCCCTGAGGATGGAGGTTCTGGGCTGCGAGGCACAGGACCTCTACTGAGGGTGGCCACTGCAGCACCTGCCACTGCCGTCACCTCTCCCTCCTCAGCTCCAGGGCAGTGTCCCTCCCTGGCTTGCCTTCTACCTTTGTGCTAAATCCTAGCAGACACTGCCTTGAAGCCTCCTGAATTAACTATCATCAGTCCTGCATTTCTTTGGTGGGGGGCCAGGAGGGTGCATCCAATTTAACTTAACTCTTACCTATTTTCTGCAGCTGCTCCCAGATTACTCCTTCCTTCCAATATAACTAGGCAAAAAGAAGTGAGGAGAAACCTGCATGAAAGCATTCTTCCCTGAAAAGTTAGGCCTCTCAGAGTCACCACTTCCTCTGTTGTAGAAAAACTATGTGATGAAACTTTGAAAAAGATATTTATGATGTTAACATTTCAGGTTAAGCCTCATACGTTTAAAATAAAACTCTCAGTTGTTTATTATCCTGATCAAGCATGGAACAAAGCATGTTTCAGGATCAGATCAATACAATCTTGGAGTCAAAAGGCAAATCATTTGGACAATCTGCAAAATGGAGAGAATACAATAACTACTACAGTAAAGTCTGTTTCTGCTTCCTTACACATAGATATAATTATGTTATTTAGTCATTATGAGGGGCACATTCTTATCTCCAAAACTAGCATTCTTAAACTGAGAATTATAGATGGGGTTCAAGAATCCCTAAGTCCCCTGAAATTATATAAGGCATTCTGTATAAATGCAAATGTGCATTTTTCTGACGAGTGTCCATAGATATAAAGCCATTTGGTCTTAATTCTGACCAATAAAAAAATAAGTCAGGAGGATGCAATTGTTGAAAGCTTTGAAATAAAATAACAATGTCTTCTTGAAATTTGTGATGGCCAAGAAAGAAAATGATGATGACATTAGGCTTCTAAAGGACATACATTTAATATTTCTGTGGAAATATGAGGAAAATCCATGGTTATCTGAGATAGGAGATACAAACTTTGTAATTCTAATAATGCACTCAGTTTACTCTCTCCCTCTACTAATTTCCTGCTGAAAATAACACAACAAAATGTAACAGGGGAAATTATATACCGTGACTGAAAACTAGAGTCCTACTTACATAGTTGAAATATCAAGGAGGTCAGAAGAAAATTGGACTGGTGAAAACAGAAAAAACACTCCAGTCTGCCATATCACCACACAATAGGATCCCCCTTCTTGCCCTCCACCCCCATAAGATTGTGAAGGGTTTACTGCTCCTTCCATCTGCCTGACCCCTTCACTATGACTACACAGAATCTCCTGATAGTAAAGGGGGCTGGAGGCAAGGATAAGTTATAGAGCAGTTGGAGGAAGCATCCAAAGATTGCAACCCAGGGCAAATGGAAAACAGGAGATCCTAATATGAAAGAAAAATGGATCCCAATCTGAGAAAAGGCAAAAGAATGGCTACTTTTTTCTATGCTGGAGTATTTTCTAATAATCCTGCTTGACCCTTATCTGACCTCTTTGGAAACTATAACATAGCTGTCACAGTATAGTCACAATCCACAAATGATGCAGGTGCAAATGGTTTATAGCCCTGTGAAGTTCTTAAAGTTTAGAGGCTAACTTACAGAAATGAAAAGTTGTTTTGTTTTATAGCCCGGTAGAGGAGTTAACCCCAAAGGTGATATGGTTTTATTTCCTGTTATGTTTAACTTGATAATCTTATTTTGGCATTCTTTTCCCATTGACTATATACATCTCTATTTCTCAAATGTTCATGGAACTAGCTCTTTTATTTTCCTGCTGGTTTCTTCAGTAATGAGTTAAATAAAACATTGACACATACAAACAAAAAAAAAAAAAAA'
#length 6132, http://www.ncbi.nlm.nih.gov/nuccore/NM_000492.3
cftrcysticfibrosis = 'AATTGGAAGCAAATGACATCACAGCAGGTCAGAGAAAAAGGGTTGAGCGGCAGGCACCCAGAGTAGTAGGTCTTTGGCATTAGGAGCTTGAGCCCAGACGGCCCTAGCAGGGACCCCAGCGCCCGAGAGACCATGCAGAGGTCGCCTCTGGAAAAGGCCAGCGTTGTCTCCAAACTTTTTTTCAGCTGGACCAGACCAATTTTGAGGAAAGGATACAGACAGCGCCTGGAATTGTCAGACATATACCAAATCCCTTCTGTTGATTCTGCTGACAATCTATCTGAAAAATTGGAAAGAGAATGGGATAGAGAGCTGGCTTCAAAGAAAAATCCTAAACTCATTAATGCCCTTCGGCGATGTTTTTTCTGGAGATTTATGTTCTATGGAATCTTTTTATATTTAGGGGAAGTCACCAAAGCAGTACAGCCTCTCTTACTGGGAAGAATCATAGCTTCCTATGACCCGGATAACAAGGAGGAACGCTCTATCGCGATTTATCTAGGCATAGGCTTATGCCTTCTCTTTATTGTGAGGACACTGCTCCTACACCCAGCCATTTTTGGCCTTCATCACATTGGAATGCAGATGAGAATAGCTATGTTTAGTTTGATTTATAAGAAGACTTTAAAGCTGTCAAGCCGTGTTCTAGATAAAATAAGTATTGGACAACTTGTTAGTCTCCTTTCCAACAACCTGAACAAATTTGATGAAGGACTTGCATTGGCACATTTCGTGTGGATCGCTCCTTTGCAAGTGGCACTCCTCATGGGGCTAATCTGGGAGTTGTTACAGGCGTCTGCCTTCTGTGGACTTGGTTTCCTGATAGTCCTTGCCCTTTTTCAGGCTGGGCTAGGGAGAATGATGATGAAGTACAGAGATCAGAGAGCTGGGAAGATCAGTGAAAGACTTGTGATTACCTCAGAAATGATTGAAAATATCCAATCTGTTAAGGCATACTGCTGGGAAGAAGCAATGGAAAAAATGATTGAAAACTTAAGACAAACAGAACTGAAACTGACTCGGAAGGCAGCCTATGTGAGATACTTCAATAGCTCAGCCTTCTTCTTCTCAGGGTTCTTTGTGGTGTTTTTATCTGTGCTTCCCTATGCACTAATCAAAGGAATCATCCTCCGGAAAATATTCACCACCATCTCATTCTGCATTGTTCTGCGCATGGCGGTCACTCGGCAATTTCCCTGGGCTGTACAAACATGGTATGACTCTCTTGGAGCAATAAACAAAATACAGGATTTCTTACAAAAGCAAGAATATAAGACATTGGAATATAACTTAACGACTACAGAAGTAGTGATGGAGAATGTAACAGCCTTCTGGGAGGAGGGATTTGGGGAATTATTTGAGAAAGCAAAACAAAACAATAACAATAGAAAAACTTCTAATGGTGATGACAGCCTCTTCTTCAGTAATTTCTCACTTCTTGGTACTCCTGTCCTGAAAGATATTAATTTCAAGATAGAAAGAGGACAGTTGTTGGCGGTTGCTGGATCCACTGGAGCAGGCAAGACTTCACTTCTAATGGTGATTATGGGAGAACTGGAGCCTTCAGAGGGTAAAATTAAGCACAGTGGAAGAATTTCATTCTGTTCTCAGTTTTCCTGGATTATGCCTGGCACCATTAAAGAAAATATCATCTTTGGTGTTTCCTATGATGAATATAGATACAGAAGCGTCATCAAAGCATGCCAACTAGAAGAGGACATCTCCAAGTTTGCAGAGAAAGACAATATAGTTCTTGGAGAAGGTGGAATCACACTGAGTGGAGGTCAACGAGCAAGAATTTCTTTAGCAAGAGCAGTATACAAAGATGCTGATTTGTATTTATTAGACTCTCCTTTTGGATACCTAGATGTTTTAACAGAAAAAGAAATATTTGAAAGCTGTGTCTGTAAACTGATGGCTAACAAAACTAGGATTTTGGTCACTTCTAAAATGGAACATTTAAAGAAAGCTGACAAAATATTAATTTTGCATGAAGGTAGCAGCTATTTTTATGGGACATTTTCAGAACTCCAAAATCTACAGCCAGACTTTAGCTCAAAACTCATGGGATGTGATTCTTTCGACCAATTTAGTGCAGAAAGAAGAAATTCAATCCTAACTGAGACCTTACACCGTTTCTCATTAGAAGGAGATGCTCCTGTCTCCTGGACAGAAACAAAAAAACAATCTTTTAAACAGACTGGAGAGTTTGGGGAAAAAAGGAAGAATTCTATTCTCAATCCAATCAACTCTATACGAAAATTTTCCATTGTGCAAAAGACTCCCTTACAAATGAATGGCATCGAAGAGGATTCTGATGAGCCTTTAGAGAGAAGGCTGTCCTTAGTACCAGATTCTGAGCAGGGAGAGGCGATACTGCCTCGCATCAGCGTGATCAGCACTGGCCCCACGCTTCAGGCACGAAGGAGGCAGTCTGTCCTGAACCTGATGACACACTCAGTTAACCAAGGTCAGAACATTCACCGAAAGACAACAGCATCCACACGAAAAGTGTCACTGGCCCCTCAGGCAAACTTGACTGAACTGGATATATATTCAAGAAGGTTATCTCAAGAAACTGGCTTGGAAATAAGTGAAGAAATTAACGAAGAAGACTTAAAGGAGTGCTTTTTTGATGATATGGAGAGCATACCAGCAGTGACTACATGGAACACATACCTTCGATATATTACTGTCCACAAGAGCTTAATTTTTGTGCTAATTTGGTGCTTAGTAATTTTTCTGGCAGAGGTGGCTGCTTCTTTGGTTGTGCTGTGGCTCCTTGGAAACACTCCTCTTCAAGACAAAGGGAATAGTACTCATAGTAGAAATAACAGCTATGCAGTGATTATCACCAGCACCAGTTCGTATTATGTGTTTTACATTTACGTGGGAGTAGCCGACACTTTGCTTGCTATGGGATTCTTCAGAGGTCTACCACTGGTGCATACTCTAATCACAGTGTCGAAAATTTTACACCACAAAATGTTACATTCTGTTCTTCAAGCACCTATGTCAACCCTCAACACGTTGAAAGCAGGTGGGATTCTTAATAGATTCTCCAAAGATATAGCAATTTTGGATGACCTTCTGCCTCTTACCATATTTGACTTCATCCAGTTGTTATTAATTGTGATTGGAGCTATAGCAGTTGTCGCAGTTTTACAACCCTACATCTTTGTTGCAACAGTGCCAGTGATAGTGGCTTTTATTATGTTGAGAGCATATTTCCTCCAAACCTCACAGCAACTCAAACAACTGGAATCTGAAGGCAGGAGTCCAATTTTCACTCATCTTGTTACAAGCTTAAAAGGACTATGGACACTTCGTGCCTTCGGACGGCAGCCTTACTTTGAAACTCTGTTCCACAAAGCTCTGAATTTACATACTGCCAACTGGTTCTTGTACCTGTCAACACTGCGCTGGTTCCAAATGAGAATAGAAATGATTTTTGTCATCTTCTTCATTGCTGTTACCTTCATTTCCATTTTAACAACAGGAGAAGGAGAAGGAAGAGTTGGTATTATCCTGACTTTAGCCATGAATATCATGAGTACATTGCAGTGGGCTGTAAACTCCAGCATAGATGTGGATAGCTTGATGCGATCTGTGAGCCGAGTCTTTAAGTTCATTGACATGCCAACAGAAGGTAAACCTACCAAGTCAACCAAACCATACAAGAATGGCCAACTCTCGAAAGTTATGATTATTGAGAATTCACACGTGAAGAAAGATGACATCTGGCCCTCAGGGGGCCAAATGACTGTCAAAGATCTCACAGCAAAATACACAGAAGGTGGAAATGCCATATTAGAGAACATTTCCTTCTCAATAAGTCCTGGCCAGAGGGTGGGCCTCTTGGGAAGAACTGGATCAGGGAAGAGTACTTTGTTATCAGCTTTTTTGAGACTACTGAACACTGAAGGAGAAATCCAGATCGATGGTGTGTCTTGGGATTCAATAACTTTGCAACAGTGGAGGAAAGCCTTTGGAGTGATACCACAGAAAGTATTTATTTTTTCTGGAACATTTAGAAAAAACTTGGATCCCTATGAACAGTGGAGTGATCAAGAAATATGGAAAGTTGCAGATGAGGTTGGGCTCAGATCTGTGATAGAACAGTTTCCTGGGAAGCTTGACTTTGTCCTTGTGGATGGGGGCTGTGTCCTAAGCCATGGCCACAAGCAGTTGATGTGCTTGGCTAGATCTGTTCTCAGTAAGGCGAAGATCTTGCTGCTTGATGAACCCAGTGCTCATTTGGATCCAGTAACATACCAAATAATTAGAAGAACTCTAAAACAAGCATTTGCTGATTGCACAGTAATTCTCTGTGAACACAGGATAGAAGCAATGCTGGAATGCCAACAATTTTTGGTCATAGAAGAGAACAAAGTGCGGCAGTACGATTCCATCCAGAAACTGCTGAACGAGAGGAGCCTCTTCCGGCAAGCCATCAGCCCCTCCGACAGGGTGAAGCTCTTTCCCCACCGGAACTCAAGCAAGTGCAAGTCTAAGCCCCAGATTGCTGCTCTGAAAGAGGAGACAGAAGAAGAGGTGCAAGATACAAGGCTTTAGAGAGCAGCATAAATGTTGACATGGGACATTTGCTCATGGAATTGGAGCTCGTGGGACAGTCACCTCATGGAATTGGAGCTCGTGGAACAGTTACCTCTGCCTCAGAAAACAAGGATGAATTAAGTTTTTTTTTAAAAAAGAAACATTTGGTAAGGGGAATTGAGGACACTGATATGGGTCTTGATAAATGGCTTCCTGGCAATAGTCAAATTGTGTGAAAGGTACTTCAAATCCTTGAAGATTTACCACTTGTGTTTTGCAAGCCAGATTTTCCTGAAAACCCTTGCCATGTGCTAGTAATTGGAAAGGCAGCTCTAAATGTCAATCAGCCTAGTTGATCAGCTTATTGTCTAGTGAAACTCGTTAATTTGTAGTGTTGGAGAAGAACTGAAATCATACTTCTTAGGGTTATGATTAAGTAATGATAACTGGAAACTTCAGCGGTTTATATAAGCTTGTATTCCTTTTTCTCTCCTCTCCCCATGATGTTTAGAAACACAACTATATTGTTTGCTAAGCATTCCAACTATCTCATTTCCAAGCAAGTATTAGAATACCACAGGAACCACAAGACTGCACATCAAAATATGCCCCATTCAACATCTAGTGAGCAGTCAGGAAAGAGAACTTCCAGATCCTGGAAATCAGGGTTAGTATTGTCCAGGTCTACCAAAAATCTCAATATTTCAGATAATCACAATACATCCCTTACCTGGGAAAGGGCTGTTATAATCTTTCACAGGGGACAGGATGGTTCCCTTGATGAAGAAGTTGATATGCCTTTTCCCAACTCCAGAAAGTGACAAGCTCACAGACCTTTGAACTAGAGTTTAGCTGGAAAAGTATGTTAGTGCAAATTGTCACAGGACAGCCCTTCTTTCCACAGAAGCTCCAGGTAGAGGGTGTGTAAGTAGATAGGCCATGGGCACTGTGGGTAGACACACATGAAGTCCAAGCATTTAGATGTATAGGTTGATGGTGGTATGTTTTCAGGCTAGATGTATGTACTTCATGCTGTCTACACTAAGAGAGAATGAGAGACACACTGAAGAAGCACCAATCATGAATTAGTTTTATATGCTTCTGTTTTATAATTTTGTGAAGCAAAATTTTTTCTCTAGGAAATATTTATTTTAATAATGTTTCAAACATATATAACAATGCTGTATTTTAAAAGAATGATTATGAATTACATTTGTATAAAATAATTTTTATATTTGAAATATTGACTTTTTATGGCACTAGTATTTCTATGAAATATTATGTTAAAACTGGGACAGGGGAGAACCTAGGGTGATATTAACCAGGGGCCATGAATCACCTTTTGGTCTGGAGGGAAGCCTTGGGGCTGATGCAGTTGTTGCCCACAGCTGTATGATTCCCAGCCAGCACAGCCTCTTAGATGCAGTTCTGAAGAAGATGGTACCACCAGTCTGACTGTTTCCATCAAGGGTACACTGCCTTCTCAACTCCAAACTGACTCTTAAGAAGACTGCATTATATTTATTACTGTAAGAAAATATCACTTGTCAATAAAATCCATACATTTGTGTGAAA'
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment