@hyphaltip
Created January 17, 2017 22:13
Bifiguratus EF1
>ENDOGMAKER|AZ0501_00762-R0 transcript offset:0 AED:0.26 eAED:0.26 QI:0|0|0|0.81|1|1|11|0|947
ATGTTGAAGATTGGCGCACTTCCGACACTTGCTGCAGATGTCTTGGCTAGAGATCAGTAT
GACGAAGAAGACGAAGCGCCAGAGTTGACCGCAGAGGAGGAAACACTCCTGGATGAAGGG
ACAGAACACATACGAGCAATGCTGCCAGCAGACTCGGGAGTTTCTAATCAAGAGATACGC
GAGACTCTCTATTACTACTATTACGACCAAGAAGAGAGTTTAAACTATTTACTGGACAAG
ATTGCAAAGGAAGAAGCGCAAAAGCAAAAGACTGCGCAAAGAGTAAAAGTGGCATCGCAT
CCGGCAGAACGAGTGCAACCGTGGAGTCGGGGTAGGCGAAGATGTTCTGGCATGACACAC
GATACAGCAAAGTTGACAAGGTCAAGCAATACCGCCTCGTCTCCCGCTAATGCTGCTGCT
GCCGACATGTTGAAAGCACAGTCTGATGGGTCAAAACGACCGGCGTCCCTGGCATATCTG
GCCGCGGCGCGTGCTGGTGGACAAGCTGCGCGACGACCATTGACAAGTCTGCTCAGTGGC
AATGGAGACAGGAATACTTCCTCACCCCCCAAACCCAGCAATCTGACCAACACCATCACC
AGCTCAAAACCATCGCTCGCATCTCTTGCCAAGAAATCTGGCGCACCGTCTGGGTCGTTG
AGCTCTCTTGCTTATCGCAAATCACCGTCAACCAGCTCCAATCCGTCCGTCTCCCGTATC
CTTCCTAAATCGCAAACTCCGCAGACCTCCCTGCAAAACCTGGGCCATGCCGAGTTGAAA
ACAGCCACCGCGGTGCTTCCAAGGACGACCAAAGCGCCCGAGGCAGAGCAAACACAAAGT
GAACAGCCATCCTATGAACAGCGCACACCATCAGAAGCTTCCTTATCCGATATCAAGTCA
TCTTCATCTACGCCGTTGACGCCATTTTCCGCTCCTCCATCAGCAATAGCATGCTTTTTG
TTCCAATCACTTCCCTCCAGGATGCCCAATGCATCACCACAGGATGTCGAAAAATCCATT
GCCGTCCGAGATCTTGAATTCGCCGTTTCGGCGATATTTCCCAAGTCCCACCGGGGGAAT
ACATCCTCTTCAGTCACACCACCTCTTATCGAGGATCGTGAGGGGCGTGCCAAGGACAAG
GCGGACGATTCGGACGGCCCACTCTTACCCCGATTCACCTTCCACACGGCTAGTCCAGAT
GACATCGTCTTAGAGAAGCAAGGAGGCAGAGAAACACCAAAATCCATTACACCTACACCC
ACCAAATCAAAACCCGAGACCGAACCATCCAAGGCCAAGGTTGCCGATAAAGGTTTGGAG
GAAGGAGACCTGGAAGATCGAATCAAATTCGACATGGACGCGCTACATCTCAACATGGCA
CCCACAAAGAAGACCGAGTTGACGTCACATCCGTTAAGTCGTGAAAGTTCTTATGCGAAA
TCTGGAGCTTCTACACCGACAAGGAAACGGATTGATGTGCTGGAAGAGTACAATAAACGG
CAAGGCAAACCATCGCTCAATTTGGTCGTCATTGGTCATGTGGATGCGGGCAAATCAACC
TTGATGGGTCATTTGCTCTACGAGTTGGGTCAAGTGAGCGAAAGGACCTTGAAAAAGTAT
GAGCGTGAGGCTCAAAGAATTGGCAAATCATCTTTTGCGTATGCATGGATCTTGGATGAG
ACGGGTGAAGAGCGTAGTCGGTATGTCTTGAGCGGGTTTCATGTATTTTCCGTGTTGATC
AGCTTCTCTAGGGGAATCACCATGGACATTGCTGTCAACGCATTTGAAACCGAACACCGG
AAGTTGACCCTCCTCGATGCCCCCGGACATCGAGACTTTATACCAAACATGATTTCTGGT
GCTGCTCAGGCGGACGTAGCCATTCTGGTAGTAGACGCCATGACAGGCGAATTCGAAGCC
GGATTTGATGCCAATGGTCAAACCAAGGAACACGCTCTACTTGTTCGGAGTTTGGGAGTC
CAGCAGTTGATTGTGGCCATCAACAAGTTGGATCTCTTGAATTGGTCACAGACCCGCTTT
GACGAAATCGTAGCCCGTCTTGGACAGTTTCTCCAACAAGCCGGCTTCCGAAAACAAAAA
CTTTCATTCGTGCCTGTGAGCGGATTAACCGGCGAAAATTTGGTCAAGATGAATGCGGCC
CCTTTGAAGGCGTGGTATCATGGTCCCACGCTTGTCGATCTCATTGACGCTTTTGATCCA
CCCGTGCGAAATGTGGAAAAGCATTTCCGACTCGGTGTCTCAGATTTCTTCAAGGGAGGA
ATTGGCAGTGGAGGCGGTGTTTCTGTGGCTGGACGCATCGATGCTGGCACGGTTCAGATT
GGGGATCAAGTGATGTGCGTCCCAGGTGGAGAATTGGGGACCGTCAAAGCTGTGGAAGTC
AATGATCAATCGGTGAAATGGGCTGTGGCTGGAGATACGCTTCTCATGACCCTGTCCGGA
TTAGACATTTTACAACTCAGCCCTGGATGTGTCCTATGTGATCCGTTGGCGCCCGTTCCG
GTCGCTAAGCTATTCAAGGCGCAAATCGTCACCTTTGACATCAAGATACCCATTACTTTA
GGCTACCCAGTGGTGGTGCATCATCAGAGATTGGACGAACCGGCAGTCATTACAAAGCTG
GTCGCAATTTTAGATAAGGCAAGCGGAGAGGTGACGAAAAAGAACCCGAGAACTATTGGT
CGATCAGCCATCGCCACGGTCGAGATCACACTGACAAATCGCAAGATACCGTTAGAAGCA
TTCATGGACAGTAAAGAACTTGGTCGGGTCATGTTACGAAAAGGCGGAGAGACGGTTGCT
GCTGGCGTCGTGGTAGAGGTATGA
>ENDOGMAKER|AZ0501_02347-R0 transcript offset:0 AED:0.01 eAED:0.01 QI:0|-1|0|1|-1|1|1|0|465
ATGGCCACCAACAACAAGACTCACTTGTCCATCGTTATTTGCGGACACGTCGATTCCGGC
AAGTCCACCACCACTGGTCGTCTTCTTTTCGAACTCGGTGGTATCTCCGAGAGAGAAATG
GAGAAGTTGAAGCAAGAAGCTGAACGCCTCGGCAAGTCTTCTTTCGCTTTCGCTTTCTAC
ATGGATCGTCAAAAGGATGAACGTGAGCGTGGTGTCACTATTGCCTGCACCACCAAGGAG
TTCTTTACCGAGAAGTGGCACTACACCATCATTGATGCCCCTGGCCACAGAGATTTCATC
AAGAACATGATTTCCGGTGCCGCTCAAGCTGATGTCGCTTTGCTCATGGTTCCCGCTGAT
GGTAACTTCACCACTGCCATTCAAAAGGGTGATCACAAGGCTGGTGATATTCAAGGTCAA
ACTCGCCAACACGCTCGTCTCCTCAACCTTCTCGGTGTTAAGCAACTTGTTGTTGGTGTC
AACAAGATGGACTCTGATGTTGCTGGTTACAAGGAGTCTCGTTACAACGAAATCCGTGAT
GAGATGCGCAACATGTTGGTCCGTGTCGGCTGGAAGAAGGACTTCGTTGAGGGTTCCGTC
CCCGTCATCCCCATCTCTGGCTGGATGGGTGACAACTTGTTGAAGAAGTCCGACAACATG
GGCTGGTGGAAGGGTCAAGAGGTCACCAACTCTGAGGGCAAGAAGATGACTATCACCACC
CTCTTGGACGCTCTTAACGACTTCGCCACCCTCCCTCCCCGCAAGACTGATGCTGCCCTC
CGTCTTCCCGTCTCTGGTATCTACAAGATCAAGGGTGTCGGTGATGTCATTGCTGGACGT
GTCGAGCAAGGTACCGTCAAGCCCAAGGATGAAGTTGTCTTCCTTCCTACCCACACTGCT
GCCAACAAGTGCGCTGGTGTCATCTTCTCCATTGAAATGCACCACAAGCGTGTTGAGCAA
GCCGTCTCTGGCGACAACGTTGGTATGAACGTTAAGAACTTGGATAAGGCCAACATGCCC
CGTGCTGGTGATGTCATGATCCTCGCCAAGGACACCACCCTCACCGCCGTCAAGCGCTTC
ACTGCTCAAATTCAGACCCTCGATATCCCCGGAGAAGTCAAGCCCAACTACTCTCCCATT
GGTTTCGTCCGCTGTGGTCGAGCTGCCTGCAAGATTGTAGAGCTCAAGTGGAAGATTGGC
AAGGAGACTGGACGCTCCAAGATGCCTAACCCCGTTTCCTTGAAGGCCAACGAGGCTGCC
GAGGTTGTTTTCGAGCCCATCCAACCCCTCATCGTCGATACATTCCAAAATTGCGAGGGT
CTTTCCCGTATCGCTTTCTTGGATGGTAACACCGCTGTCATGTTGGGCAAGGTTACCGCC
GTCGAGCTCAAGGCTTAA
>ENDOGMAKER|AZ0501_05111-R0 transcript offset:0 AED:0.10 eAED:0.10 QI:0|0|0|1|1|1|8|0|1433
ATGTCAACGTCATCCTTTTCGAGTGACTCAAATTCGTCCGATCGTCCTTCTTGTTGCCAC
ATGTTCACTTTGCGAGTTGCGCGGTCTGCTCTACGGCCACGGCCAACGACTGCACTGCGA
CGTATACCTGTTCAACGCACGCCACGAAGGTACTTTGCTTCAGAGCAACCGAGAAAGGAT
ACAAACAAGGAGAAGCGAGATCGAGACAACAAGGAGAAGGATGATCAACCTTCCCTAATC
CCAAAGGGCTTTGAGAACTTCTTTGGAAAGAACAAGGGTGCTCAGAGTAAGTCAAAGAAT
TCAGCCGAAGGAGGTTCGGAAAACAACAAGTCGTCTTCCGGTATTCCGAAGCCTCCTCAC
GACCCAAAGAATGGACCGACCGAGATTCGAATGAACCTCAATGCGCAAACACTCTTGACG
GCTGCCTTTGCTTCATATCTGTTGTGGAAGATGGCTTCCCCGGCGGAAAACGCTAGAGAA
CTCACATGGCAAGATTTCCGCAACACGTTCCTGGACAAGGGATTGGTAGAGAAGCTCGTG
GTCGTCAATCGGAGCCGGGTACGGGTGCATTTACGCCCTGAAGCAGCCAATATGCCTGGA
GGAGGCGGTTATGTGACATATTACTTTAGCATCGGATCGGTGGATGCGTTTGAACGCAAG
ATTGACGAAGCCCAACGCGAACTAGGTATTCCGTCGAACGAGCGGATACCGGTCGCCTAT
CACGATGAGGTCTCGGTGATGAATACGCTCTTTCAATTCGCGCCCACACTTTTGCTGATG
GGCGCATTATTCTATATCACGCGGCGCGCTGGCGCCGGTGGAGGTTCTCAAGGGATCTTT
GGTGTTGGAAAATCGAAAGCAAAGATGTTTAATCAGGAAACAGATGTGAAGGTGAAATTC
AAGGATGTGGCTGGAGCGGATGAGGCCAAGGAGGAAATTATGGAATTTGTCAAGTTTCTA
AAGGATCCTGGTGCTTATGAGAAGCTCGGAGCCAAGATTCCCAAGGGTGCCATTCTTTCT
GGTCCTCCCGGAACGGGTAAAACCCTCTTGGCCAAAGCGACAGCTGGTGAAGCCGGTGTA
CCTTTCCTTAGCGTCAGCGGATCGGAATTTGTTGAAATGTTTGTCGGTGTTGGTCCCTCT
CGGGTTCGTGATTTGTTCGCAACAGCCAAGAAGCACGCACCTTGCATCATTTTTGTGGAT
GAGATCGATGCTATCGGAAAGGCTCGTGGCAAGGGTGGTCAATTTGGAGGAAATGACGAA
CGTGAATCGACGTTGAATCAACTTTTGGTGGAGATGGACGGATTCGGTACGACAGAACAC
GTCGTGGTGCTCGCTGGTACGAATCGACCGGATGTACTCGATCCCGCCTTGATGAGACCT
GGACGTTTTGACCGGCATATTGCGATTGATCGCCCCGATATCAAGGGACGGGCTCAAATC
TTCAAGGTGCATTTGAAGCCCATCAAGACAAACGTGAATCTCGAAACGCTCGCAAATAAG
CTAGCTGCGCTGACACCAGGTTTCTCTGGAGCTGACATTCATAATGTATGCAACGAAGCT
GCTCTAATTGCAGCTCGCCACCACAAGGATGAGGTGTTTGGGGAGCACTTTGAGATGGCC
ATTGAACGGGTGATTGCCGGTCTTGAAAAGAAGTCTCGGGTGCTTTCACCGGAGGAGAAG
AAGACGGTGGCCTATCATGAAGCTGGACACGCAGTCTGCGGATGGTTTTTGGAACACACC
GATCCTCTTTTGAAGGTCTCCATCATTCCACGCGGTATCGGTGCCCTCGGTTATGCGCAA
TATCTACCTAAAGATCAGTACTTGTATTCTACGCAGCAATTTCTGGACAGAATGTGCATG
ACACTGGGCGGACGAGTCTCTGAGCAAATCTTTTTCAACACCATCACCACCGGCGCACAA
GACGATCTCCAAAAGGTGACAAAGATGGCATATGCCCAAGTTTCCACCTATGGCATGAAC
GCCAATGTCGGTCCCCTCTCCTATCACAACCCTAATGATGAACCTCAATATCAAAAGCCC
TACTCGGAACAGACCGCTCAAATGATCGACCATGAAGCCAGAGACATAATATCCCAAGCC
TATAAGCGAACGCTGGCTCTATTGACGGAAAAGAAGGACGATGTGGAAAAGGTGGCTAAG
CTGTTACTGGACAAGGAGGTCCTCAATCGCGAAGACATGATACATTTGCTCGGTAATCGT
CCCTTTGTGGAGAAGACTGTCTACGATGAATATGTCAAGCCCAAAGAGACCATCACTCCA
CCGCCCTTCCCTGTCGATGAACCTCCCAACGAAGATCGTCCCATGCAAGCAATAGAAAGA
AAAGCATTGAGAGGGCTGTCCCGTTCCATCTCACTAGAACCTATCCGTCCAACGATAAGA
CGCTCTCTCTCGCTCTCTCTAGCACAGAAGTTGCCGTCCTTTGGCAAATTCTTTCGATTA
TTTTCTCCCTCTCTCTTTCTTTCTTCCATGGCCGACGACTGGGACCAGGATTACGATCAA
TTGCCTCAACAGACGAGTGGATTGAGCTTGAATCCCAACGCTTCAGAGTGGAAGCCCAAC
ACGGGAGCAAAAGAGTTTGTTCCAAGTTGGATGGGCAGCGGAGCCGGTGTTCCTAGACCT
GCTCCTCCCGCTGCGCCGTCATCCAATGGTAGGCGTCCTGCAAAGGTCCTTTCTATCGGT
GGTGGTAGCGCACCAGCCAAGGCCGTGAGCATTAGCATTGGCTCGACGCCAAAGCCGCAG
GAGAAACCTGCAGCCAATGGTGTGGCGGAGGTGAAGAGCGAGAGTCAAGCTGAAGCAGCT
CGTCCCGAATCGCCCAAGCCTGCAAAGACAGCCTCTCCGGCTCCATCGGCGTCTAAGGAG
CAAAAGAAGGCTGAAGCGAAAGCGGAGGCCAAGGCAGTGGCCGCTCAAGAGGCTGTTAAG
GAATCTGTTGATGTGCAAGCCGATATAGAAAAGCTCGTAGATGATGAAGTTGTTACTGAT
CTATTTGGAAAGGAGCATTTGAACGTTGTCTTTATGGGACATGTTGATGCTGGAAAGTCG
ACCATGGGTGGAAACATTCTCTTCCTCACTGGCATGGTAGACAAACGTACCATGGAAAAA
TACGAAAAGGATGCCAAGGAAGCCGGTCGTGAGTCTTGGTACCTTTCTTGGGCTCTTGAC
ACCAACACCGAAGAGCGTGCCAAGGGTAAAACGGTCGAATGTGGTCGGGCATCGTTCGAA
ACCGAGAAGCGTCGTTACACCATCCTAGATGCTCCAGGACACAAAAACTATGTCCCATCC
ATGATCACTGGTGCATCTCAAGCTGATATCGGTGTGCTTGTTATTTCTGCCCGTAAGGGT
GAGTTTGAGACCGGTTTTGAACGCGGTGGACAGACCCAGGAACACGCCGTGTTGGCCAAG
ACAAGTGGCGTCGGCAAACTCATTGTAGCCATCAACAAGATGGACGATGCCACGGTGAAC
TGGAACAAGGAACGATACGATGAGATTGTCTCCAAACTTACCCGCTTCCTCAAGGGTCTC
GGCTACAATCCCAAGACCGACCTTCAGTTCATGCCTGTGTCTGGTTTCACCGGTGCCAAC
ATCAAGGACCGGTACACGGAACTTGATTGGTACGATGGCCCAAGTTTGTTGGAGTATCTC
GATAACATGCAGGCGCTTGAACGCAAGATCAATGCTCCTTTGATGATGCCCATTACAGAG
AAATACAAGGACATGGGCACCATTGTTGTTGGTAAGCTTGAATCCGGAGCCGTCAAGAAG
GGACAAAACGTGATTATTATGCCAAACAAGAAGGTGTGCGAAGTGACGGCTGTATACGGT
GAATCAGAAGAGGAAATCCCGGCTGGTTTATGCGGTGACAACGTTAGAATGCGGTTGCGA
GGTATTGAAGAAGAAGAAGTATCGGTCGGCTTCGTATTGTGCTCTCCCAAGGCTCCTGTC
AAGACCACCACGGCATTCGAATGCCAATTGGCCATCCTTGAAGCCAAAAACATCATATGC
CCCGGCTACACGGCCATTCTGCATGTACACTCTGCCGTGGAGGAGATCACTATATCGGCA
TTCCTCCACTTGATTGACAAGAAGACGGGACGCCGAACCAAGCGGCCACCCCAATTCGTC
AAGCAAGGTCAAAAAGTCATTTGCCGCATCGAGACCGCCGGCCCCCTTTGCGTCGAACCT
TTTGATGACCATCCCCAACTCGGTCGCTTCACCCTTCGTGATGAAGGCAAAACCATTGCT
ATCGGCAAGATTACCAAGGTCTATGAAAATGAATCTGCGTAA
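The three transcript records above are predicted coding sequences, so a quick sanity check is to confirm that each one is a whole number of codons, starts with ATG, and ends in a stop codon. The following is a minimal sketch of such a check, not part of the original analysis; the function names `read_fasta` and `check_cds` are hypothetical, and the standard stop codons (TAA, TAG, TGA) are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}.

    The record id is the first whitespace-delimited token after '>'.
    """
    chunks = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


def check_cds(seq):
    """Return basic coding-sequence sanity checks for one nucleotide string."""
    seq = seq.upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }
```

Calling `check_cds` on each value returned by `read_fasta` gives one small report per transcript. For reference, the lengths fasta36 reports for these transcripts (2844, 1398, and 4302 nt) each equal three times the corresponding protein length (947, 465, and 1433 aa) plus one stop codon.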
# fasta36 -E 1e-10 Sacch_TEF1.fa ../pep/Bifiguratus_adelaidae_AZ0501.all.maker.proteins.aa.fasta
FASTA searches a protein or DNA sequence data bank
version 36.3.7a Jan, 2015(preload9)
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: Sacch_TEF1.fa
1>>>TEF1 YPR080W SGDID:S000006284 - 459 aa
Library: ../pep/Bifiguratus_adelaidae_AZ0501.all.maker.proteins.aa.fasta
3229420 residues in 6120 sequences
Statistics: Altschul/Gish params: n0: 459 Lambda: 0.158 K: 0.019 H: 0.100
statistics sampled from 596 (607) to 596 sequences
Algorithm: FASTA (3.8 Nov 2011) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.381), E-opt: 0.2 (0.0992), width: 16
Scan time: 0.410
The best scores are: opt bits E(6120)
ENDOGMAKER|AZ0501_05111-R0 protein AED:0.10 eAED:0 (1433) 703 166.4 3.3e-41
ENDOGMAKER|AZ0501_00762-R0 protein AED:0.26 eAED:0 ( 947) 684 162.1 4.3e-40
ENDOGMAKER|AZ0501_02347-R0 protein AED:0.01 eAED:0 ( 465) 557 133.3 1e-31
ENDOGMAKER|AZ0501_04449-R0 protein AED:0.33 eAED:0 ( 444) 308 76.5 1.2e-14
>>ENDOGMAKER|AZ0501_05111-R0 protein AED:0.10 eAED:0.10 (1433 aa)
initn: 779 init1: 533 opt: 703 Z-score: 840.3 bits: 166.4 E(6120): 3.3e-41
Smith-Waterman score: 1078; 38.6% identity (68.5% similar) in 448 aa overlap (2-445:1003-1433)
10 20 30
TEF1 MGKEKSHINVVVIGHVDSGKSTTTGHLIYKC
::: :.::: .::::.:::: :....
ENDOGM VAAQEAVKESVDVQADIEKLVDDEVVTDLFGKE--HLNVVFMGHVDAGKSTMGGNILFLT
980 990 1000 1010 1020 1030
40 50 60 70 80 90
TEF1 GGIDKRTIEKFEKEAAELGKGSFKYAWVLDKLKAERERGITIDIALWKFETPKYQVTVID
: .::::.::.::.: : :. :. .:.:: :: .: :.. . .::: : . :..:
ENDOGM GMVDKRTMEKYEKDAKEAGRESWYLSWALDTNTEERAKGKTVECGRASFETEKRRYTILD
1040 1050 1060 1070 1080 1090
100 110 120 130 140 150
TEF1 APGHRDFIKNMITGTSQADCAILIIAGGVGEFEAGISKDGQTREHALLAFTLGVRQLIVA
::::.... .::::.:::: ..:.:.. ::::.:. . :::.:::.:: : :: .::::
ENDOGM APGHKNYVPSMITGASQADIGVLVISARKGEFETGFERGGQTQEHAVLAKTSGVGKLIVA
1100 1110 1120 1130 1140 1150
160 170 180 190 200
TEF1 VNKMD--SVKWDESRFQEIVKETSNFIKKVGYNPKT-VPFVPISGWNGDNMIEATTNAPW
.:::: .:.:.. :..:::.. . :.: .:::::: . :.:.::..: :. . :. :
ENDOGM INKMDDATVNWNKERYDEIVSKLTRFLKGLGYNPKTDLQFMPVSGFTGANIKDRYTELDW
1160 1170 1180 1190 1200 1210
210 220 230 240 250 260
TEF1 YKGWEKETKAGVVKGKTLLEAIDAIEQPSRPTDKPLRLPLQDVYKIGGIGTVPVGRVETG
: : .::: .: .. : . :: .:. . :: .::. ::..:.:
ENDOGM YDG------------PSLLEYLDNMQALERKINAPLMMPITEKYK--DMGTIVVGKLESG
1220 1230 1240 1250
270 280 290 300 310 320
TEF1 VIKPGMVVTFAPAGVTTEVKSVEMH-HEQLEQGVPGDNVGFNVKNVSVKEIRRGNVCGDA
..: :. : . : . :: .: . .:.. :. :::: . .... .:. : : .
ENDOGM AVKKGQNVIIMPNKKVCEVTAVYGESEEEIPAGLCGDNVRMRLRGIEEEEVSVGFVLCSP
1260 1270 1280 1290 1300 1310
330 340 350 360 370 380
TEF1 KNDPPKGCASFNATVIVLNHPGQISAGYSPVLDCHTAHIACRFDELLEKNDRRSGKKLED
: : : ..:. . .:. . : ::. .: :.: .. .:. :...:.. .
ENDOGM KA-PVKTTTAFECQLAILEAKNIICPGYTAILHVHSAVEEITISAFLHLIDKKTGRRTKR
1320 1330 1340 1350 1360 1370
390 400 410 420 430 440
TEF1 HPKFLKSGDAALVKFVPSKPMCVEAFSEYPPLGRFAVRDMRQTVAVGVIKSVDKTEKAAK
:.:.:.:. .. .. . :.::: :...: ::::..:: .:.:.: : .: ..:.:
ENDOGM PPQFVKQGQKVICRIETAGPLCVEPFDDHPQLGRFTLRDEGKTIAIGKITKVYENESA
1380 1390 1400 1410 1420 1430
450
TEF1 VTKAAQKAAKK*
>>ENDOGMAKER|AZ0501_00762-R0 protein AED:0.26 eAED:0.26 (947 aa)
initn: 720 init1: 507 opt: 684 Z-score: 820.3 bits: 162.1 E(6120): 4.3e-40
Smith-Waterman score: 988; 38.2% identity (67.0% similar) in 463 aa overlap (5-439:503-947)
10 20 30
TEF1 MGKEKSHINVVVIGHVDSGKSTTTGHLIYKCGGI
: .:.:::::::.:::: :::.:. : .
ENDOGM SRESSYAKSGASTPTRKRIDVLEEYNKRQGKPSLNLVVIGHVDAGKSTLMGHLLYELGQV
480 490 500 510 520 530
40 50 60 70
TEF1 DKRTIEKFEKEAAELGKGSFKYAWVLDKLKAERER-----------------GITIDIAL
..::..:.:.:: ..::.:: :::.::. :: : :::.:::.
ENDOGM SERTLKKYEREAQRIGKSSFAYAWILDETGEERSRYVLSGFHVFSVLISFSRGITMDIAV
540 550 560 570 580 590
80 90 100 110 120 130
TEF1 WKFETPKYQVTVIDAPGHRDFIKNMITGTSQADCAILIIAGGVGEFEAGISKDGQTREHA
::: . ..:..::::::::: :::.:..::: :::.. . .::::::.. .:::.:::
ENDOGM NAFETEHRKLTLLDAPGHRDFIPNMISGAAQADVAILVVDAMTGEFEAGFDANGQTKEHA
600 610 620 630 640 650
140 150 160 170 180 190
TEF1 LLAFTLGVRQLIVAVNKMDSVKWDESRFQEIVKETSNFIKKVGYNPKTVPFVPISGWNGD
::. .:::.:::::.::.: ..:...::.::: . ..:....:. . . :::.:: .:.
ENDOGM LLVRSLGVQQLIVAINKLDLLNWSQTRFDEIVARLGQFLQQAGFRKQKLSFVPVSGLTGE
660 670 680 690 700 710
200 210 220 230 240 250
TEF1 NMIEATTNAPWYKGWEKETKAGVVKGKTLLEAIDAIEQPSRPTDKPLRLPLQDVYKIGGI
:... :: :.: .: ::.. :::.. : : ..: .:: ..: .: :::
ENDOGM NLVK--MNAAPLKAW--------YHGPTLVDLIDAFDPPVRNVEKHFRLGVSDFFK-GGI
720 730 740 750 760
260 270 280 290 300 310
TEF1 GT---VPV-GRVETGVIKPGMVVTFAPAGVTTEVKSVEMHHEQLEQGVPGDNVGFNVKNV
:. : : ::...:... : : .:.: ::.::.. .... .: ::.. ......
ENDOGM GSGGGVSVAGRIDAGTVQIGDQVMCVPGGELGTVKAVEVNDQSVKWAVAGDTLLMTLSGL
770 780 790 800 810 820
320 330 340 350 360 370
TEF1 SVKEIRRGNVCGDAKNDPPKGCASFNATVIVLNHPGQISAGYSPVLDCHTAHIACRFDE-
.. .. : : : : . :.: ..... :. :: ::. : :.::
ENDOGM DILQLSPGCVLCDPLAPVPVA-KLFKAQIVTFDIKIPITLGY-PVVVHHQ-----RLDEP
830 840 850 860 870
380 390 400 410 420
TEF1 -----LLEKNDRRSGKKLEDHPKFLKSGDAALVKF-VPSKPMCVEAFSEYPPLGRFAVRD
:. :. ::. . .:. . . : :.. . .. . .::: . ::: .:
ENDOGM AVITKLVAILDKASGEVTKKNPRTIGRSAIATVEITLTNRKIPLEAFMDSKELGRVMLRK
880 890 900 910 920 930
430 440 450
TEF1 MRQTVAVGVIKSVDKTEKAAKVTKAAQKAAKK*
.:::.::. :
ENDOGM GGETVAAGVVVEV
940
>>ENDOGMAKER|AZ0501_02347-R0 protein AED:0.01 eAED:0.01 (465 aa)
initn: 1062 init1: 446 opt: 557 Z-score: 670.0 bits: 133.3 E(6120): 1e-31
Smith-Waterman score: 1147; 42.4% identity (70.1% similar) in 458 aa overlap (5-440:6-462)
10 20 30 40 50
TEF1 MGKEKSHINVVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAAELGKGSFKYAWV
:.:...:. ::::::::::::.:... :::..: .::...:: .:::.:: .:.
ENDOGM MATNNKTHLSIVICGHVDSGKSTTTGRLLFELGGISEREMEKLKQEAERLGKSSFAFAFY
10 20 30 40 50 60
60 70 80 90 100 110
TEF1 LDKLKAERERGITIDIALWKFETPKYQVTVIDAPGHRDFIKNMITGTSQADCAILIIAGG
.:. : :::::.:: . .: : :.. :.::::::::::::::.:..::: :.:.. .
ENDOGM MDRQKDERERGVTIACTTKEFFTEKWHYTIIDAPGHRDFIKNMISGAAQADVALLMVPAD
70 80 90 100 110 120
120 130 140 150 160
TEF1 VGEFEAGISK--------DGQTREHALLAFTLGVRQLIVAVNKMDS--VKWDESRFQEIV
:.: ..:.: .::::.:: : :::.::.:.:::::: . . :::..::
ENDOGM -GNFTTAIQKGDHKAGDIQGQTRQHARLLNLLGVKQLVVGVNKMDSDVAGYKESRYNEIR
130 140 150 160 170
170 180 190 200 210 220
TEF1 KETSNFIKKVGYNPK----TVPFVPISGWNGDNMIEATTNAPWYKGWEKETKAGV-VKGK
: :.. .::.. .:: .::::: :::... . : :.:: : .. : .
ENDOGM DEMRNMLVRVGWKKDFVEGSVPVIPISGWMGDNLLKKSDNMGWWKGQEVTNSEGKKMTIT
180 190 200 210 220 230
230 240 250 260 270 280
TEF1 TLLEAI-DAIEQPSRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVIKPGMVVTFAPAGV
:::.:. : : : :: ::::.. .::: :.: : .:::: :..:: :.: :. .
ENDOGM TLLDALNDFATLPPRKTDAALRLPVSGIYKIKGVGDVIAGRVEQGTVKPKDEVVFLPTHT
240 250 260 270 280 290
290 300 310 320 330
TEF1 TTE-----VKSVEMHHEQLEQGVPGDNVGFNVKNVSVKEI-RRGNVCGDAKNDPPKGCAS
... . :.::::...::.: :::::.::::.. .. : :.: ::. .
ENDOGM AANKCAGVIFSIEMHHKRVEQAVSGDNVGMNVKNLDKANMPRAGDVMILAKDTTLTAVKR
300 310 320 330 340 350
340 350 360 370 380 390
TEF1 FNATVIVLNHPGQISAGYSPVLDCHTAHIACRFDELLEKNDRRSGKKLEDHPKFLKSGDA
:.: . .:. ::... .:::. . .. ::.. :: : ...:.. .: ::...:
ENDOGM FTAQIQTLDIPGEVKPNYSPIGFVRCGRAACKIVELKWKIGKETGRSKMPNPVSLKANEA
360 370 380 390 400 410
400 410 420 430 440 450
TEF1 ALVKFVPSKPMCVEAFSEYPPLGRFAVRDMRQTVAVGVIKSVDKTEKAAKVTKAAQKAAK
: : : : .:. :..:.. :.:.: : .: .: . .:.
ENDOGM AEVVFEPIQPLIVDTFQNCEGLSRIAFLDGNTAVMLGKVTAVELKA
420 430 440 450 460
TEF1 K*
# fasta36 Endogene_EF1.fa ../cds/Bifiguratus_adelaidae_AZ0501.all.maker.transcripts.fasta
FASTA searches a protein or DNA sequence data bank
version 36.3.7a Jan, 2015(preload9)
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: Endogene_EF1.fa
1>>>Endogone EF - 1263 nt
Library: ../cds/Bifiguratus_adelaidae_AZ0501.all.maker.transcripts.fasta
9706093 residues in 6120 sequences
Statistics: Altschul/Gish params: n0: 1263 Lambda: 0.192 K: 0.177 H: 0.360
statistics sampled from 417 (425) to 417 sequences
Algorithm: FASTA (3.8 Nov 2011) [optimized]
Parameters: +5/-4 matrix (5:-4), open/ext: -12/-4
ktup: 6, E-join: 0.25 (0.154), E-opt: 0.05 (0.0355), width: 16
Scan time: 0.990
The best scores are: opt bits E(6120)
ENDOGMAKER|AZ0501_02347-R0 transcript offset:0 (1398) [f] 414 117.3 6.2e-26
ENDOGMAKER|AZ0501_05111-R0 transcript offset:0 (4302) [f] 342 97.3 6.3e-20
ENDOGMAKER|AZ0501_00762-R0 transcript offset:0 (2844) [f] 324 92.3 2e-18
ENDOGMAKER|AZ0501_04449-R0 transcript offset:0 (1335) [f] 138 40.8 0.0064
ENDOGMAKER|AZ0501_04791-R0 transcript offset:0 ( 720) [f] 133 39.4 0.016
ENDOGMAKER|AZ0501_00232-R0 transcript offset:0 ( 984) [f] 123 36.7 0.11
ENDOGMAKER|AZ0501_04070-R0 transcript offset:0 (4140) [f] 118 35.2 0.3
ENDOGMAKER|AZ0501_01184-R0 transcript offset:0 ( 519) [f] 115 34.5 0.51
ENDOGMAKER|AZ0501_05950-R0 transcript offset:0 (1059) [f] 113 33.9 0.77
ENDOGMAKER|AZ0501_06048-R0 transcript offset:0 (1569) [f] 113 33.9 0.78
ENDOGMAKER|AZ0501_06082-R0 transcript offset:0 (1746) [r] 113 33.9 0.78
ENDOGMAKER|AZ0501_03617-R0 transcript offset:0 ( 375) [f] 112 33.7 0.89
ENDOGMAKER|AZ0501_04421-R0 transcript offset:0 (2736) [f] 112 33.6 0.95
ENDOGMAKER|AZ0501_03071-R0 transcript offset:0 (4449) [r] 112 33.6 0.96
ENDOGMAKER|AZ0501_04042-R0 transcript offset:0 (2385) [f] 111 33.3 1.2
>>ENDOGMAKER|AZ0501_02347-R0 transcript offset:0 AED:0.0 (1398 nt)
initn: 459 init1: 138 opt: 414 Z-score: 567.0 bits: 117.3 E(6120): 6.2e-26
banded Smith-Waterman score: 433; 56.1% identity (56.3% similar) in 506 nt overlap (178-661:361-865)
150 160 170 180 190 200
Endogo TATTTCGTCTCCGCTTTATAGGCTGACTGCGGTATTCTCATTATTGCCGCCGGTACTGGT
:::: ::: : :::: : :::
ENDOGM GATGTCGCTTTGCTCATGGTTCCCGCTGATGGTAACTTCACCACTGCCATTCAAAAGGGT
340 350 360 370 380 390
210 220 230 240 250 260
Endogo GAGTTCGAGGCTGGT-ATCTCCAAGGATGGTCAGACTCGTGAGCACGCTCTCCTCGCCTT
:: : :::::::: :: : ::::: : : :: :: ::::::: : :
ENDOGM GATCACAAGGCTGGTGATATTCAAGGTCAAACTCGCCAACACGCTCG-TCTCCTCAACCT
400 410 420 430 440
270 280 290 300 310 320
Endogo CACCCTTGGTGTGCGTCAGCTCATCGTTGCCATCAACA---AGATGGACACCACCAAGTG
: :: : :: : : : : :: :: ::: : : :: : :
ENDOGM TCTCGGTGTTAAGCAACTTGTTGTTGGTGTCAACAAGATGGACTCTGATGTTGCTGGTTA
450 460 470 480 490 500
330 340 350 360 370
Endogo GTCGCAGGATCGTTTCAACGAAAT-CGTGA----------AGGAGGTCTCTTCCT-TCAT
: :: ::::: ::::::::: ::::: : : :: : : : ::
ENDOGM CAAGGAGTCTCGTTACAACGAAATCCGTGATGAGATGCGCAACATGTTGGTCCGTGTCGG
510 520 530 540 550 560
380 390 400 410 420 430
Endogo CAAGAAGATTGGTTTCAACCCCGCAACTGTTCCGTTCGTCCCGATCTCCGGCTGGCACGG
: ::::: : ::: : : :: :: :: :::: ::::: :::::: ::
ENDOGM CTGGAAGAAGGACTTCGTTGAGGGTTCCGTCCCCGTCATCCCCATCTCTGGCTGGATGGG
570 580 590 600 610 620
440 450 460 470 480
Endogo CGACAACATGTTGGAGGAGTCCGTCAACATGACCTGGTTCAAGGG---ATGGACCAAGGA
:::::: ::::: :: :::::: ::::::: ::::: ::::: : : :: :
ENDOGM TGACAACTTGTTGAAGAAGTCCGACAACATGGGCTGGTGGAAGGGTCAAGAGGTCACCAA
630 640 650 660 670 680
490 500 510 520 530 540
Endogo GTCTAAGGCCGGYAACAAGTCTGGCAAGACACTCCTCGAGGCCATCGATGCCATTG--AC
::: ::: : :: : : :: :: :: ::: : :: :: : : : : : : ::
ENDOGM CTCTGAGGGCAAGAAGATGACTATCACCACCCTCTTGGACGCTCTTAACGACTTCGCCAC
690 700 710 720 730 740
550 560 570 580 590 600
Endogo CCT-CCGAGCCGTCCTACCGACAAGCCCCTACGTCTTCCCCTCCAGGATGTGTACAAGAT
::: :: ::: :: :: :::: ::::::::: :: : : : ::::::::
ENDOGM CCTCCCTCCCCGCAAGACTGATGCTGCCCTCCGTCTTCCCGTCTCTGGTATCTACAAGAT
750 760 770 780 790 800
610 620 630 640 650 660
Endogo CGGTGGTATTGGCACAGTTCCCGYCGGTCGTGTCGAGACTGGTATCATCAAGGCAAGTAA
: ::: : :: :: :. :: ::::::::: :::: : ::::: : :
ENDOGM CAAGGGTGTCGGTGATGTCATTGCTGGACGTGTCGAGCAAGGTACCGTCAAGCCCAAGGA
810 820 830 840 850 860
670 680 690 700 710 720
Endogo TTTCTGGGGGCTGTTACGGGGAGGGCTTTTGACCATGAAGTGGAATCGAAAAAGTTTATG
ENDOGM TGAAGTTGTCTTCCTTCCTACCCACACTGCTGCCAACAAGTGCGCTGGTGTCATCTTCTC
870 880 890 900 910 920
>--
initn: 273 init1: 132 opt: 203 Z-score: 251.2 bits: 58.8 E(6120): 2.4e-08
banded Smith-Waterman score: 203; 53.0% identity (53.0% similar) in 421 nt overlap (821-1233:928-1343)
800 810 820 830 840 850
Endogo GCTCCCGCTGGTGTCTCCACTGAAGTGAAGTCCGTCGAAATGCACCACGAACAGCTCACC
::: : :::::::::::: : : :
ENDOGM GCTGCCAACAAGTGCGCTGGTGTCATCTTCTCCATTGAAATGCACCACAAGCGTGTTGAG
900 910 920 930 940 950
860 870 880 890 900
Endogo GAGGGTGTCCCCGGCGATAATGTCGGCTTCAACGTCAAGAAC--GTATCA-GTCAATGAA
: : ::: : ::::: :: :: :: : ::::: :::::: : :: : : :::
ENDOGM CAAGCCGTCTCTGGCGACAACGTTGGTATGAACGTTAAGAACTTGGATAAGGCCAACATG
960 970 980 990 1000 1010
910 920 930 940 950 960
Endogo ATCCGACGTGGTTTCGTC-TGCTCCGACTCCAGGAACGACCCCGCCAAGGAATCCGCCTC
::: :::: ::: :: ::: : ::: : :: : :: :: : :
ENDOGM CCCCGTGCTGGTGATGTCATGATCC-TCGCCAAGGACACCACCCTCACCGCCGTCAAGCG
1020 1030 1040 1050 1060 1070
970 980 990 1000 1010 1020
Endogo CTTCCTCGCTCAGGTTATCGTCCTCAACCACCCCGGTCAGATCGGCGCTGGTTACGCACC
:::: ::::: :: :::: : :::::: : :: : ::: : ::
ENDOGM CTTCACTGCTCAAATTCAGACCCTCGATATCCCCGGAGAAGTCAAGCCCAACTACTCTCC
1080 1090 1100 1110 1120 1130
1030 1040 1050 1060 1070 1080
Endogo TGTGCTCGATT--GCCACACCGCCCATATTGCCTGCAAGTTCGCCGAGCTTGTCGAGAAG
: : : :: :: : : : :::::::::: : : ::::: ::::
ENDOGM CAT--TGGTTTCGTCCGCTGTGGTCGAGCTGCCTGCAAGATTGTAGAGCTCAAGTGGAAG
1140 1150 1160 1170 1180 1190
1090 1100 1110 1120 1130 1140
Endogo ATCGATCGTCGTTCCGGCAAGAAGCTTGAGGAC-AACCCCAAGTTCGTCAAATCCGGTGA
:: : : : : : : : : :: : : :::::: : : : :: :: ::
ENDOGM ATTG-GCAAGGAGACTGGACGCTCCAAGATGCCTAACCCCGTTTCCTTGAAGGCCAACGA
1200 1210 1220 1230 1240 1250
1150 1160 1170 1180 1190 1200
Endogo CTCTGCCATCGTCAAGAT-GATTCCGTCTAAGCCTATGTGCGTTGAATCCTACACCGAGT
::::: :: : :: :: :: :: :: : ::: :: : : : : :
ENDOGM GGCTGCCGAGGTTGTTTTCGAGCCCATCCAA-CCCCTCATCGTCGATACATTCCAAAATT
1260 1270 1280 1290 1300 1310
1210 1220 1230 1240 1250 1260
Endogo TCCCCCCGCTTGGTCGCTTCGCTGTCTGGGACATGAGGCAAACCGTCGCCGTTGGTGTCA
: ::: :: ::::: ::: :::
ENDOGM GCGAGGGTCTTTCCCGTATCGCTTTCTTGGATGGTAACACCGCTGTCATGTTGGGCAAGG
1320 1330 1340 1350 1360 1370
Endogo T
ENDOGM TTACCGCCGTCGAGCTCAAGGCTTAA
1380 1390
>TEF1 YPR080W SGDID:S000006284
MGKEKSHINVVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAAELGKGSFKYAWVL
DKLKAERERGITIDIALWKFETPKYQVTVIDAPGHRDFIKNMITGTSQADCAILIIAGGV
GEFEAGISKDGQTREHALLAFTLGVRQLIVAVNKMDSVKWDESRFQEIVKETSNFIKKVG
YNPKTVPFVPISGWNGDNMIEATTNAPWYKGWEKETKAGVVKGKTLLEAIDAIEQPSRPT
DKPLRLPLQDVYKIGGIGTVPVGRVETGVIKPGMVVTFAPAGVTTEVKSVEMHHEQLEQG
VPGDNVGFNVKNVSVKEIRRGNVCGDAKNDPPKGCASFNATVIVLNHPGQISAGYSPVLD
CHTAHIACRFDELLEKNDRRSGKKLEDHPKFLKSGDAALVKFVPSKPMCVEAFSEYPPLG
RFAVRDMRQTVAVGVIKSVDKTEKAAKVTKAAQKAAKK*
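
The "The best scores are:" tables in the two fasta36 reports above can be pulled into structured records for filtering or comparison. The following is a rough sketch under stated assumptions, not a general fasta36 parser: the regular expression is fitted only to the row layout seen in this output (sequence id, free-text description, length in parentheses, an optional `[f]`/`[r]` strand flag, then opt score, bit score, and E-value), and the names `HIT_ROW` and `parse_best_scores` are hypothetical.

```python
import re

# One row of a "The best scores are:" table, for example:
#   ENDOGMAKER|AZ0501_05111-R0 protein AED:0.10 eAED:0 (1433) 703 166.4 3.3e-41
#   ENDOGMAKER|AZ0501_02347-R0 transcript offset:0 (1398) [f] 414 117.3 6.2e-26
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s.*\(\s*(?P<length>\d+)\)\s+"
    r"(?:\[(?P<strand>[fr])\]\s+)?"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_best_scores(text, max_evalue=None):
    """Collect hit rows from fasta36 report text.

    Lines that do not look like a best-scores row are skipped. If
    max_evalue is given, only hits with E-value <= max_evalue are kept.
    """
    hits = []
    for line in text.splitlines():
        match = HIT_ROW.match(line.strip())
        if match is None:
            continue
        try:
            evalue = float(match.group("evalue"))
        except ValueError:
            continue  # last column was not a number, so not a hit row
        if max_evalue is not None and evalue > max_evalue:
            continue
        hits.append({
            "id": match.group("id"),
            "length": int(match.group("length")),
            "strand": match.group("strand"),  # None for protein searches
            "opt": int(match.group("opt")),
            "bits": float(match.group("bits")),
            "evalue": evalue,
        })
    return hits
```

Passing the full report text with `max_evalue` set applies a post-hoc E-value cutoff, similar in spirit to the `-E 1e-10` option used in the first search. Because the pattern requires a parenthesised integer followed by numeric score columns, the per-hit `>>` header lines and the statistics lines are not picked up.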