Text S1: Protein sequences and alignments of all proteins found in ...
Text S1: Protein sequences and alignments of all proteins found in ...
Text S1: Protein sequences and alignments of all proteins found in ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
AT1G07920<br />
MGKEKFHINIVVIGHVDSGKSTTTGHLIYKLGGIDKRVIERFEKEAAEMNKRSFKYAWVLDKLKAERERGITIDIALWKFETTKYYCTVIDAPGHRDFIK<br />
NMITGTSQADCAVLIIDSTTGGFEAGISKDGQTREHALLAFTLGVKQMICCCNKMDATTPKYSKARYDEIIKEVSSYLKKVGYNPDKIPFVPISGFEGDN<br />
MIERSTNLDWYKGPTLLEALDQINEPKRPSDKPLRLPLQDVYKIGGIGTVPVGRVETGMIKPGMVVTFAPTGLTTEVKSVEMHHESLLEALPGDNVGFNV<br />
KNVAVKDLKRGYVASNSKDDPAKGAANFTSQVIIMNHPGQIGNGYAPVLDCHTSHIAVKFSEILTKIDRRSGKEIEKEPKFLKNGDAGMVKMTPTKPMVV<br />
ETFSEYPPLGRFAVRDMRQTVAVGVIKSVDKKDPTGAKVTKAAVKKGAK<br />
GENE ID: 1915 EEF1A1 | eukaryotic translation elongation factor 1 alpha 1<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.<br />
Identities = 347/457 (75%), Positives = 391/457 (85%), Gaps = 12/457 (2%)<br />
Query 1 MGKEKFHINIVVIGHVDSGKSTTTGHLIYKLGGIDKRVIERFEKEAAEMNKRSFKYAWVL 60<br />
MGKEK HINIVVIGHVDSGKSTTTGHLIYK GGIDKR IE+FEKEAAEM K SFKYAWVL<br />
Sbjct 1 MGKEKTHINIVVIGHVDSGKSTTTGHLIYKFGGIDKRTIEKFEKEAAEMGKGSFKYAWVL 60<br />
Query 61 DKLKAERERGITIDIALWKFETTKYYCTVIDAPGHRDFIKNMITGTSQADCAVLIIDSTT 120<br />
DKLKAERERGITIDI+LWKFET+KYY T+IDAPGHRDFIKNMITGTSQADCAVLI+ +<br />
Sbjct 61 DKLKAERERGITIDISLWKFETSKYYVTIIDAPGHRDFIKNMITGTSQADCAVLIVAAGV 120<br />
Query 121 GGFEAGISKDGQTREHALLAFTLGVKQMICCCNKMDATTPKYSKARYDEIIKEVSSYLKK 180<br />
G FEAGISK+GQTREHALLA+TLGVKQ+I NKMD+T P YS+ RY+EI+KEVS+Y+KK<br />
Sbjct 121 GEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVSTYIKK 180<br />
Query 181 VGYNPDKIPFVPISGFEGDNMIERSTNLDWYKG------------PTLLEALDQINEPKR 228<br />
+GYNPD + FVPISG+ GDNM+E S N+ W+KG TLLEALD I P R<br />
Sbjct 181 IGYNPDTVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEALDCILPPTR 240<br />
Query 229 PSDKPLRLPLQDVYKIGGIGTVPVGRVETGMIKPGMVVTFAPTGLTTEVKSVEMHHESLL 288<br />
P+DKPLRLPLQDVYKIGGIGTVPVGRVETG++KPGMVVTFAP +TTEVKSVEMHHE+L<br />
Sbjct 241 PTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFAPVNVTTEVKSVEMHHEALS 300<br />
Query 289 EALPGDNVGFNVKNVAVKDLKRGYVASNSKDDPAKGAANFTSQVIIMNHPGQIGNGYAPV 348<br />
EALPGDNVGFNVKNV+VKD++RG VA +SK+DP AA FT+QVII+NHPGQI GYAPV<br />
Sbjct 301 EALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPMEAAGFTAQVIILNHPGQISAGYAPV 360<br />
Query 349 LDCHTSHIAVKFSEILTKIDRRSGKEIEKEPKFLKNGDAGMVKMTPTKPMVVETFSEYPP 408<br />
LDCHT+HIA KF+E+ KIDRRSGK++E PKFLK+GDA +V M P KPM VE+FS+YPP<br />
Sbjct 361 LDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 420<br />
Query 409 LGRFAVRDMRQTVAVGVIKSVDKKDPTGAKVTKAAVK 445<br />
LGRFAVRDMRQTVAVGVIK+VDKK KVTK+A K<br />
Sbjct 421 LGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQK 457<br />
>AT1G09200<br />
MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRFRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVAALQEAAEAY<br />
LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA<br />
GENE ID: 126961 HIST2H3C | histone cluster 2, H3c [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 267 bits (683), Expect = 2e-71, Method: Compositional matrix adjust.<br />
Identities = 132/136 (97%), Positives = 135/136 (99%), Gaps = 0/136 (0%)<br />
Query 1 MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRFRPGTVALREIRKYQKSTE 60<br />
MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHR+RPGTVALREIR+YQKSTE<br />
Sbjct 1 MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTE 60<br />
Query 61 LLIRKLPFQRLVREIAQDFKTDLRFQSSAVAALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120<br />
LLIRKLPFQRLVREIAQDFKTDLRFQSSAV ALQEA+EAYLVGLFEDTNLCAIHAKRVTI<br />
Sbjct 61 LLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTI 120<br />
Query 121 MPKDIQLARRIRGERA 136<br />
MPKDIQLARRIRGERA<br />
Sbjct 121 MPKDIQLARRIRGERA 136<br />
>AT1G09300<br />
MQFLARNLVRRVSRTQVVSRNAYSTQTVRDIGQPTPASHPHLMAEGEVTPGIRIEEYIGRRKKLVELLPENSLAIISSAPVKMMTDVVPYTFRQDADYLY<br />
LTGCQQPGGVAVLSDERGLCMFMPESTPKDIAWEGEVAGVDAASEVFKADQAYPISKLPEILSDMIRHSSKVFHNVQSASQRYTNLDDFQNSASLGKVKT<br />
LSSLTHELRLIKSPAELKLMRESASIACQGLLKTMLHSKGFPDEGILSAQVEYECRVRGAQRMAFNPVVGGGSNASVIHYSRNDQRIKDGDLVLMDMGCE<br />
LHGYVSDLTRTWPPCGKFSSVQEELYDLILQTNKECIKQCKPGTTIRQLNTYSTELLCDGLMKMGILKSRRLYHQLNPTSIGHYLGMDVHDSSAVGYDRP<br />
LQPGFVITIEPGVYIPSSFDCPERFQGIGIRIEDDVLITETGYEVLTGSMPKEIKHIETLLNNHCHDNSARTSPVSLCKVKGLHTNRNPRRLF<br />
GENE ID: 63929 XPNPEP3 | X-prolyl am<strong>in</strong>opeptidase (am<strong>in</strong>opeptidase P) 3, putative<br />
[Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 347 bits (891), Expect = 2e-95, Method: Compositional matrix adjust.<br />
Identities = 185/486 (38%), Positives = 280/486 (57%), Gaps = 34/486 (6%)<br />
Query 9 VRRVSRTQVVSRNAYSTQTV-------RDIGQPTPASHPHLMAEGEVTPGIRIEEYIGRR 61<br />
VR +S + S+ YS Q V R +GQP+P +HPHL+ GEVTPG+ EY RR<br />
Sbjct 17 VRGLSGCMLCSQRRYSLQPVPERRIPNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRR 76<br />
Query 62 KKLVELLPE--------NSLAIISSAPVKMMTDVVPYTFRQDADYLYLTGCQQPGGVAVL 113<br />
KL+ L+ + + ++ S P M++ +PYTF QD ++LYL G Q+P + VL<br />
Sbjct 77 HKLMSLIQKEAQGQSGTDQTVVVLSNPTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVL 136<br />
Query 114 SDERG-------LCMFMPESTPKDIAWEGEVAGVDAASEVFKADQAYPISKLPEILSDMI 166<br />
G +F+P P W+G +G D A + D+AY + + +L M<br />
Sbjct 137 QSLPGKQLPSHKAILFVPRRDPSRELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMK 196<br />
Query 167 RHSSKVFHNVQSASQRYTNLDDFQ-----NSASLGKVKTLSSLTHELRLIKSPAELKLMR 221<br />
++ V+++ S + D Q + S KV+ + L LRLIKSPAE++ M+<br />
Sbjct 197 AETNMVWYDWMRPSHAQLHSDYMQPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQ 256<br />
Query 222 ESASIACQGLLKTMLHSKGFPDEGILSAQVEYECRVRGAQRMAFNPVVGGGSNASVIHYS 281<br />
+ + Q ++TM SK +E L A+ E+ECR RGA +A+ PVV GG+ ++ +HY<br />
Sbjct 257 IAGKLTSQAFIETMFTSKAPVEEAFLYAKFEFECRARGADILAYPPVVAGGNRSNTLHYV 316