22.02.2013 Views

Text S1: Protein sequences and alignments of all proteins found in ...

Text S1: Protein sequences and alignments of all proteins found in ...

Text S1: Protein sequences and alignments of all proteins found in ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

AT1G63660<br />

METPTMKPDTVLILDYGSQYTHLITRRIRSLNVFSLVISGTSSLKSITSYNPRVVILSGGPHSVHALDAPSFPEGFIEWAESNGVSVLGICYGLQLIVQK<br />

LGGVVVEGESKEYGKMEIEVKGKSEIFGSESGGEKQMVWMSHGDEAVKLPEGFEVVAQSAQGAVAALESRKKKIYGLQYHPEVTHSPKGMETLRHFLFDV<br />

CGVSADWKMEDLMEEEIKVINKTVASDEHVICALSGGVDSTVAATLVHKAIGDRLHCIFVDNGLLRYKEQERVMDTFERDLHLPVTCVDASERFLSELKG<br />

VVDPETKRKIIGREFINIFDQFAQELEKKHGKKPAFLVQGTLYPDVIESCPPPGTDRTHSHTIKSHHNVGGLPKDMKLKLIEPLKLLFKDEVRELGRILN<br />

VPVGFLKRHPFPGPGLAVRVLGDVTQGNALEVLRQVDEIFIQSIRDAGLYDSIWQAFAVFLPVRSVGVQGDKRTHSHVVALRAVTSQDGMTADWFNFEHK<br />

FLDDVSRKICNSVQGVNRVVLDITSKPPSTIEWE<br />

GENE ID: 8833 GMPS | guan<strong>in</strong>e monphosphate synthetase [Homo sapiens]<br />

(Over 10 PubMed l<strong>in</strong>ks)<br />

Score = 263 bits (673), Expect = 4e-70, Method: Compositional matrix adjust.<br />

Identities = 187/570 (32%), Positives = 300/570 (52%), Gaps = 71/570 (12%)<br />

Query 11 VLILDYGSQYTHLITRRIRSLNVFSLVISGTSSLKSITSYNPRVVILSGGPHSVHALDAP 70<br />

V+ILD G+QY +I RR+R L V S + + +I R +I+SGGP+SV+A DAP<br />

Sbjct 28 VVILDAGAQYGKVIDRRVRELFVQSEIFPLETPAFAIKEQGFRAIIISGGPNSVYAEDAP 87<br />

Query 71 SFPEGFIEWAESNGVSVLGICYGLQLIVQKLGGVVVEGESKEYGKMEIEVKGKSEIFGSE 130<br />

F + G VLGICYG+Q++ + GG V + +E G I V +F<br />

Sbjct 88 WFDPAIF----TIGKPVLGICYGMQMMNKVFGGTVHKKSVREDGVFNISVDNTCSLFRGL 143<br />

Query 131 SGGEKQMVWMSHGDEAVKLPEGFEVVAQSAQGAVAALESRKKKIYGLQYHPEVTHSPKGM 190<br />

++++V ++HGD K+ +GF+VVA+S VA + + KK+YG Q+HPEV + G<br />

Sbjct 144 Q--KEEVVLLTHGDSVDKVADGFKVVARSGN-IVAGIANESKKLYGAQFHPEVGLTENGK 200<br />

Query 191 ETLRHFLFDVCGVSADWKMEDLMEEEIKVINKTVASDEHVICALSGGVDSTVAATLVHKA 250<br />

L++FL+D+ G S + +++ E I+ I + V + + V+ LSGGVDSTV L+++A<br />

Sbjct 201 VILKNFLYDIAGCSGTFTVQNRELECIREIKERVGTSK-VLVLLSGGVDSTVCTALLNRA 259<br />

Query 251 IG-DRLHCIFVDNGLLRYKEQERVMDTFERDLHLPVTCVDASERFLS------------- 296<br />

+ +++ + +DNG +R +E + V + ++ L + V ++A+ F +<br />

Sbjct 260 LNQEQVIAVHIDNGFMRKRESQSVEEALKK-LGIQVKVINAAHSFYNGTTTLPISDEDRT 318<br />

Query 297 -------ELKGVVDPETKRKIIGREFINIFDQFAQELEKKHGKKPAFLVQGTLYPDVIES 349<br />

L PE KRKIIG F+ I ++ E+ K + FL QGTL PD+IES<br />

Sbjct 319 PRKRISKTLNMTTSPEEKRKIIGDTFVKIANEVIGEMNLK--PEEVFLAQGTLRPDLIES 376<br />

Query 350 CPPPGTDRTHSHTIKSHHNVGGLPKDMKL--KLIEPLKLLFKDEVRELGRILNVPVGFLK 407<br />

+ + + IK+HHN L + ++ K+IEPLK KDEVR LGR L +P +<br />

Sbjct 377 ASLVASGK--AELIKTHHNDTELIRKLREEGKVIEPLKDFHKDEVRILGRELGLPEELVS 434<br />

Query 408 RHPFPGPGLAVRVL--------GDVTQGNALEVLRQVDEI---------FIQSIRDAGLY 450<br />

RHPFPGPGLA+RV+ D + N +L+ V + +Q ++<br />

Sbjct 435 RHPFPGPGLAIRVICAEEPYICKDFPETN--NILKIVADFSASVKKPHTLLQRVKACTTE 492<br />

Query 451 D---------SIWQAFAVFLPVRSVGVQGDKRTHSHVVALRAVTSQDGMTADWFNFEHKF 501<br />

+ S+ A LP+++VGVQGD R++S+V ++S+D DW + F<br />

Sbjct 493 EDQEKLMQITSLHSLNAFLLPIKTVGVQGDCRSYSYVC---GISSKD--EPDWESL--IF 545<br />

Query 502 LDDVSRKICNSVQGVNRVVLDITSKPPSTI 531<br />

L + ++C++V V + +PP+ +<br />

Sbjct 546 LARLIPRMCHNVNRVVYIFGPPVKEPPTDV 575<br />

Score = 40.0 bits (92), Expect = 0.010, Method: Compositional matrix adjust.<br />

Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 6/110 (5%)<br />

Query 430 LEVLRQVDEIFIQSIRDAGLYDSIWQAFAVFLPVR--SVGVQGDKRTHSHVVALRAVTSQ 487<br />

L LRQ D +R++G I Q + P+ +Q VV +R +<br />

Sbjct 585 LSTLRQADFEAHNILRESGYAGKISQMPVILTPLHFDRDPLQKQPSCQRSVV-IRTFITS 643<br />

Query 488 DGMTADWFNFEHKFLDDVSRKICNSVQ---GVNRVVLDITSKPPSTIEWE 534<br />

D MT ++ +V K+ ++ G++R++ D+TSKPP T EWE<br />

Sbjct 644 DFMTGIPATPGNEIPVEVVLKMVTEIKKIPGISRIMYDLTSKPPGTTEWE 693<br />

>AT1G67090<br />

MASSMLSSATMVASPAQATMVAPFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQVWPPIGKKKFETLSYLPDLTDSELAKEVDYLIRNKWIPCVEFE<br />

LEHGFVYREHGNSPGYYDGRYWTMWKLPLFGCTDSAQVLKEVEECKKEYPNAFIRIIGFDNTRQVQCISFIAYKPPSFTG<br />

GENE ID: 84284 C1orf57 | chromosome 1 open read<strong>in</strong>g frame 57 [Homo sapiens]<br />

(10 or fewer PubMed l<strong>in</strong>ks)<br />

Score = 30.0 bits (66), Expect = 8.0, Method: Compositional matrix adjust.<br />

Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 9/86 (10%)<br />

Query 22 APFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQVWPPIGKKKFETLSYLPDLTDSE- 80<br />

P +G + R+ D+ +++ G ++ + + PP GK++ Y+ DLT E<br />

Sbjct 31 VPVDGFYTEEVRQGGRRIGFDVVTLSGTRGPLSRVGLEPPPGKRECRVGQYVVDLTSFEQ 90<br />

Query 81 ----LAKEVDYLIRNKWIP----CVE 98<br />

+ + V RN +P CV+<br />

Sbjct 91 LALPVLRNVTKENRNHLLPDIVTCVQ 116<br />

>AT1g73430<br />

1 MATKAASSSS LPKSGAISKG YNFASTWEQS APLTEQQQAA IVSLSHAVAE<br />

51 RPFPANLVHE HVHRPENGLS VSVEDTHLGD SGAIEAVLVN TNQFYKWFTD<br />

101 LESAMKSETE EKYRHYVSTL TERIQTCDNI LHQVDETLDL FNELQLQHQG<br />

151 VTTKTKTLHD ACDRLLMEKQ KLMEFAEALR SKLNYFDELE NVSSNFYSPN<br />

201 MNVSNSNFLP LLKRLDECIS YIEDNPQYAE SSVYLLKFRQ LQSRALGMIR<br />

251 TYILAVLKTA ASQVQAAFRG TGGNKTSVSE GVEASVIYVR FKAAANELKP

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!