Text S1: Protein sequences and alignments of all proteins found in ...
Text S1: Protein sequences and alignments of all proteins found in ...
Text S1: Protein sequences and alignments of all proteins found in ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
AT1G63660<br />
METPTMKPDTVLILDYGSQYTHLITRRIRSLNVFSLVISGTSSLKSITSYNPRVVILSGGPHSVHALDAPSFPEGFIEWAESNGVSVLGICYGLQLIVQK<br />
LGGVVVEGESKEYGKMEIEVKGKSEIFGSESGGEKQMVWMSHGDEAVKLPEGFEVVAQSAQGAVAALESRKKKIYGLQYHPEVTHSPKGMETLRHFLFDV<br />
CGVSADWKMEDLMEEEIKVINKTVASDEHVICALSGGVDSTVAATLVHKAIGDRLHCIFVDNGLLRYKEQERVMDTFERDLHLPVTCVDASERFLSELKG<br />
VVDPETKRKIIGREFINIFDQFAQELEKKHGKKPAFLVQGTLYPDVIESCPPPGTDRTHSHTIKSHHNVGGLPKDMKLKLIEPLKLLFKDEVRELGRILN<br />
VPVGFLKRHPFPGPGLAVRVLGDVTQGNALEVLRQVDEIFIQSIRDAGLYDSIWQAFAVFLPVRSVGVQGDKRTHSHVVALRAVTSQDGMTADWFNFEHK<br />
FLDDVSRKICNSVQGVNRVVLDITSKPPSTIEWE<br />
GENE ID: 8833 GMPS | guan<strong>in</strong>e monphosphate synthetase [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 263 bits (673), Expect = 4e-70, Method: Compositional matrix adjust.<br />
Identities = 187/570 (32%), Positives = 300/570 (52%), Gaps = 71/570 (12%)<br />
Query 11 VLILDYGSQYTHLITRRIRSLNVFSLVISGTSSLKSITSYNPRVVILSGGPHSVHALDAP 70<br />
V+ILD G+QY +I RR+R L V S + + +I R +I+SGGP+SV+A DAP<br />
Sbjct 28 VVILDAGAQYGKVIDRRVRELFVQSEIFPLETPAFAIKEQGFRAIIISGGPNSVYAEDAP 87<br />
Query 71 SFPEGFIEWAESNGVSVLGICYGLQLIVQKLGGVVVEGESKEYGKMEIEVKGKSEIFGSE 130<br />
F + G VLGICYG+Q++ + GG V + +E G I V +F<br />
Sbjct 88 WFDPAIF----TIGKPVLGICYGMQMMNKVFGGTVHKKSVREDGVFNISVDNTCSLFRGL 143<br />
Query 131 SGGEKQMVWMSHGDEAVKLPEGFEVVAQSAQGAVAALESRKKKIYGLQYHPEVTHSPKGM 190<br />
++++V ++HGD K+ +GF+VVA+S VA + + KK+YG Q+HPEV + G<br />
Sbjct 144 Q--KEEVVLLTHGDSVDKVADGFKVVARSGN-IVAGIANESKKLYGAQFHPEVGLTENGK 200<br />
Query 191 ETLRHFLFDVCGVSADWKMEDLMEEEIKVINKTVASDEHVICALSGGVDSTVAATLVHKA 250<br />
L++FL+D+ G S + +++ E I+ I + V + + V+ LSGGVDSTV L+++A<br />
Sbjct 201 VILKNFLYDIAGCSGTFTVQNRELECIREIKERVGTSK-VLVLLSGGVDSTVCTALLNRA 259<br />
Query 251 IG-DRLHCIFVDNGLLRYKEQERVMDTFERDLHLPVTCVDASERFLS------------- 296<br />
+ +++ + +DNG +R +E + V + ++ L + V ++A+ F +<br />
Sbjct 260 LNQEQVIAVHIDNGFMRKRESQSVEEALKK-LGIQVKVINAAHSFYNGTTTLPISDEDRT 318<br />
Query 297 -------ELKGVVDPETKRKIIGREFINIFDQFAQELEKKHGKKPAFLVQGTLYPDVIES 349<br />
L PE KRKIIG F+ I ++ E+ K + FL QGTL PD+IES<br />
Sbjct 319 PRKRISKTLNMTTSPEEKRKIIGDTFVKIANEVIGEMNLK--PEEVFLAQGTLRPDLIES 376<br />
Query 350 CPPPGTDRTHSHTIKSHHNVGGLPKDMKL--KLIEPLKLLFKDEVRELGRILNVPVGFLK 407<br />
+ + + IK+HHN L + ++ K+IEPLK KDEVR LGR L +P +<br />
Sbjct 377 ASLVASGK--AELIKTHHNDTELIRKLREEGKVIEPLKDFHKDEVRILGRELGLPEELVS 434<br />
Query 408 RHPFPGPGLAVRVL--------GDVTQGNALEVLRQVDEI---------FIQSIRDAGLY 450<br />
RHPFPGPGLA+RV+ D + N +L+ V + +Q ++<br />
Sbjct 435 RHPFPGPGLAIRVICAEEPYICKDFPETN--NILKIVADFSASVKKPHTLLQRVKACTTE 492<br />
Query 451 D---------SIWQAFAVFLPVRSVGVQGDKRTHSHVVALRAVTSQDGMTADWFNFEHKF 501<br />
+ S+ A LP+++VGVQGD R++S+V ++S+D DW + F<br />
Sbjct 493 EDQEKLMQITSLHSLNAFLLPIKTVGVQGDCRSYSYVC---GISSKD--EPDWESL--IF 545<br />
Query 502 LDDVSRKICNSVQGVNRVVLDITSKPPSTI 531<br />
L + ++C++V V + +PP+ +<br />
Sbjct 546 LARLIPRMCHNVNRVVYIFGPPVKEPPTDV 575<br />
Score = 40.0 bits (92), Expect = 0.010, Method: Compositional matrix adjust.<br />
Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 6/110 (5%)<br />
Query 430 LEVLRQVDEIFIQSIRDAGLYDSIWQAFAVFLPVR--SVGVQGDKRTHSHVVALRAVTSQ 487<br />
L LRQ D +R++G I Q + P+ +Q VV +R +<br />
Sbjct 585 LSTLRQADFEAHNILRESGYAGKISQMPVILTPLHFDRDPLQKQPSCQRSVV-IRTFITS 643<br />
Query 488 DGMTADWFNFEHKFLDDVSRKICNSVQ---GVNRVVLDITSKPPSTIEWE 534<br />
D MT ++ +V K+ ++ G++R++ D+TSKPP T EWE<br />
Sbjct 644 DFMTGIPATPGNEIPVEVVLKMVTEIKKIPGISRIMYDLTSKPPGTTEWE 693<br />
>AT1G67090<br />
MASSMLSSATMVASPAQATMVAPFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQVWPPIGKKKFETLSYLPDLTDSELAKEVDYLIRNKWIPCVEFE<br />
LEHGFVYREHGNSPGYYDGRYWTMWKLPLFGCTDSAQVLKEVEECKKEYPNAFIRIIGFDNTRQVQCISFIAYKPPSFTG<br />
GENE ID: 84284 C1orf57 | chromosome 1 open read<strong>in</strong>g frame 57 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 30.0 bits (66), Expect = 8.0, Method: Compositional matrix adjust.<br />
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 9/86 (10%)<br />
Query 22 APFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQVWPPIGKKKFETLSYLPDLTDSE- 80<br />
P +G + R+ D+ +++ G ++ + + PP GK++ Y+ DLT E<br />
Sbjct 31 VPVDGFYTEEVRQGGRRIGFDVVTLSGTRGPLSRVGLEPPPGKRECRVGQYVVDLTSFEQ 90<br />
Query 81 ----LAKEVDYLIRNKWIP----CVE 98<br />
+ + V RN +P CV+<br />
Sbjct 91 LALPVLRNVTKENRNHLLPDIVTCVQ 116<br />
>AT1g73430<br />
1 MATKAASSSS LPKSGAISKG YNFASTWEQS APLTEQQQAA IVSLSHAVAE<br />
51 RPFPANLVHE HVHRPENGLS VSVEDTHLGD SGAIEAVLVN TNQFYKWFTD<br />
101 LESAMKSETE EKYRHYVSTL TERIQTCDNI LHQVDETLDL FNELQLQHQG<br />
151 VTTKTKTLHD ACDRLLMEKQ KLMEFAEALR SKLNYFDELE NVSSNFYSPN<br />
201 MNVSNSNFLP LLKRLDECIS YIEDNPQYAE SSVYLLKFRQ LQSRALGMIR<br />
251 TYILAVLKTA ASQVQAAFRG TGGNKTSVSE GVEASVIYVR FKAAANELKP