22.02.2013 Views

Text S1: Protein sequences and alignments of all proteins found in ...

Text S1: Protein sequences and alignments of all proteins found in ...

Text S1: Protein sequences and alignments of all proteins found in ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

AT1G07920<br />

MGKEKFHINIVVIGHVDSGKSTTTGHLIYKLGGIDKRVIERFEKEAAEMNKRSFKYAWVLDKLKAERERGITIDIALWKFETTKYYCTVIDAPGHRDFIK<br />

NMITGTSQADCAVLIIDSTTGGFEAGISKDGQTREHALLAFTLGVKQMICCCNKMDATTPKYSKARYDEIIKEVSSYLKKVGYNPDKIPFVPISGFEGDN<br />

MIERSTNLDWYKGPTLLEALDQINEPKRPSDKPLRLPLQDVYKIGGIGTVPVGRVETGMIKPGMVVTFAPTGLTTEVKSVEMHHESLLEALPGDNVGFNV<br />

KNVAVKDLKRGYVASNSKDDPAKGAANFTSQVIIMNHPGQIGNGYAPVLDCHTSHIAVKFSEILTKIDRRSGKEIEKEPKFLKNGDAGMVKMTPTKPMVV<br />

ETFSEYPPLGRFAVRDMRQTVAVGVIKSVDKKDPTGAKVTKAAVKKGAK<br />

GENE ID: 1915 EEF1A1 | eukaryotic translation elongation factor 1 alpha 1<br />

[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />

Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.<br />

Identities = 347/457 (75%), Positives = 391/457 (85%), Gaps = 12/457 (2%)<br />

Query 1 MGKEKFHINIVVIGHVDSGKSTTTGHLIYKLGGIDKRVIERFEKEAAEMNKRSFKYAWVL 60<br />

MGKEK HINIVVIGHVDSGKSTTTGHLIYK GGIDKR IE+FEKEAAEM K SFKYAWVL<br />

Sbjct 1 MGKEKTHINIVVIGHVDSGKSTTTGHLIYKFGGIDKRTIEKFEKEAAEMGKGSFKYAWVL 60<br />

Query 61 DKLKAERERGITIDIALWKFETTKYYCTVIDAPGHRDFIKNMITGTSQADCAVLIIDSTT 120<br />

DKLKAERERGITIDI+LWKFET+KYY T+IDAPGHRDFIKNMITGTSQADCAVLI+ +<br />

Sbjct 61 DKLKAERERGITIDISLWKFETSKYYVTIIDAPGHRDFIKNMITGTSQADCAVLIVAAGV 120<br />

Query 121 GGFEAGISKDGQTREHALLAFTLGVKQMICCCNKMDATTPKYSKARYDEIIKEVSSYLKK 180<br />

G FEAGISK+GQTREHALLA+TLGVKQ+I NKMD+T P YS+ RY+EI+KEVS+Y+KK<br />

Sbjct 121 GEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVSTYIKK 180<br />

Query 181 VGYNPDKIPFVPISGFEGDNMIERSTNLDWYKG------------PTLLEALDQINEPKR 228<br />

+GYNPD + FVPISG+ GDNM+E S N+ W+KG TLLEALD I P R<br />

Sbjct 181 IGYNPDTVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEALDCILPPTR 240<br />

Query 229 PSDKPLRLPLQDVYKIGGIGTVPVGRVETGMIKPGMVVTFAPTGLTTEVKSVEMHHESLL 288<br />

P+DKPLRLPLQDVYKIGGIGTVPVGRVETG++KPGMVVTFAP +TTEVKSVEMHHE+L<br />

Sbjct 241 PTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFAPVNVTTEVKSVEMHHEALS 300<br />

Query 289 EALPGDNVGFNVKNVAVKDLKRGYVASNSKDDPAKGAANFTSQVIIMNHPGQIGNGYAPV 348<br />

EALPGDNVGFNVKNV+VKD++RG VA +SK+DP AA FT+QVII+NHPGQI GYAPV<br />

Sbjct 301 EALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPMEAAGFTAQVIILNHPGQISAGYAPV 360<br />

Query 349 LDCHTSHIAVKFSEILTKIDRRSGKEIEKEPKFLKNGDAGMVKMTPTKPMVVETFSEYPP 408<br />

LDCHT+HIA KF+E+ KIDRRSGK++E PKFLK+GDA +V M P KPM VE+FS+YPP<br />

Sbjct 361 LDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 420<br />

Query 409 LGRFAVRDMRQTVAVGVIKSVDKKDPTGAKVTKAAVK 445<br />

LGRFAVRDMRQTVAVGVIK+VDKK KVTK+A K<br />

Sbjct 421 LGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQK 457<br />

>AT1G09200<br />

MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRFRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVAALQEAAEAY<br />

LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA<br />

GENE ID: 126961 HIST2H3C | histone cluster 2, H3c [Homo sapiens]<br />

(Over 10 PubMed l<strong>in</strong>ks)<br />

Score = 267 bits (683), Expect = 2e-71, Method: Compositional matrix adjust.<br />

Identities = 132/136 (97%), Positives = 135/136 (99%), Gaps = 0/136 (0%)<br />

Query 1 MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRFRPGTVALREIRKYQKSTE 60<br />

MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHR+RPGTVALREIR+YQKSTE<br />

Sbjct 1 MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTE 60<br />

Query 61 LLIRKLPFQRLVREIAQDFKTDLRFQSSAVAALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120<br />

LLIRKLPFQRLVREIAQDFKTDLRFQSSAV ALQEA+EAYLVGLFEDTNLCAIHAKRVTI<br />

Sbjct 61 LLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTI 120<br />

Query 121 MPKDIQLARRIRGERA 136<br />

MPKDIQLARRIRGERA<br />

Sbjct 121 MPKDIQLARRIRGERA 136<br />

>AT1G09300<br />

MQFLARNLVRRVSRTQVVSRNAYSTQTVRDIGQPTPASHPHLMAEGEVTPGIRIEEYIGRRKKLVELLPENSLAIISSAPVKMMTDVVPYTFRQDADYLY<br />

LTGCQQPGGVAVLSDERGLCMFMPESTPKDIAWEGEVAGVDAASEVFKADQAYPISKLPEILSDMIRHSSKVFHNVQSASQRYTNLDDFQNSASLGKVKT<br />

LSSLTHELRLIKSPAELKLMRESASIACQGLLKTMLHSKGFPDEGILSAQVEYECRVRGAQRMAFNPVVGGGSNASVIHYSRNDQRIKDGDLVLMDMGCE<br />

LHGYVSDLTRTWPPCGKFSSVQEELYDLILQTNKECIKQCKPGTTIRQLNTYSTELLCDGLMKMGILKSRRLYHQLNPTSIGHYLGMDVHDSSAVGYDRP<br />

LQPGFVITIEPGVYIPSSFDCPERFQGIGIRIEDDVLITETGYEVLTGSMPKEIKHIETLLNNHCHDNSARTSPVSLCKVKGLHTNRNPRRLF<br />

GENE ID: 63929 XPNPEP3 | X-prolyl am<strong>in</strong>opeptidase (am<strong>in</strong>opeptidase P) 3, putative<br />

[Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />

Score = 347 bits (891), Expect = 2e-95, Method: Compositional matrix adjust.<br />

Identities = 185/486 (38%), Positives = 280/486 (57%), Gaps = 34/486 (6%)<br />

Query 9 VRRVSRTQVVSRNAYSTQTV-------RDIGQPTPASHPHLMAEGEVTPGIRIEEYIGRR 61<br />

VR +S + S+ YS Q V R +GQP+P +HPHL+ GEVTPG+ EY RR<br />

Sbjct 17 VRGLSGCMLCSQRRYSLQPVPERRIPNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRR 76<br />

Query 62 KKLVELLPE--------NSLAIISSAPVKMMTDVVPYTFRQDADYLYLTGCQQPGGVAVL 113<br />

KL+ L+ + + ++ S P M++ +PYTF QD ++LYL G Q+P + VL<br />

Sbjct 77 HKLMSLIQKEAQGQSGTDQTVVVLSNPTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVL 136<br />

Query 114 SDERG-------LCMFMPESTPKDIAWEGEVAGVDAASEVFKADQAYPISKLPEILSDMI 166<br />

G +F+P P W+G +G D A + D+AY + + +L M<br />

Sbjct 137 QSLPGKQLPSHKAILFVPRRDPSRELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMK 196<br />

Query 167 RHSSKVFHNVQSASQRYTNLDDFQ-----NSASLGKVKTLSSLTHELRLIKSPAELKLMR 221<br />

++ V+++ S + D Q + S KV+ + L LRLIKSPAE++ M+<br />

Sbjct 197 AETNMVWYDWMRPSHAQLHSDYMQPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQ 256<br />

Query 222 ESASIACQGLLKTMLHSKGFPDEGILSAQVEYECRVRGAQRMAFNPVVGGGSNASVIHYS 281<br />

+ + Q ++TM SK +E L A+ E+ECR RGA +A+ PVV GG+ ++ +HY<br />

Sbjct 257 IAGKLTSQAFIETMFTSKAPVEEAFLYAKFEFECRARGADILAYPPVVAGGNRSNTLHYV 316

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!