Text S1: Protein sequences and alignments of all proteins found in ...
Text S1: Protein sequences and alignments of all proteins found in ...
Text S1: Protein sequences and alignments of all proteins found in ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Text</strong> <strong>S1</strong>: <strong>Prote<strong>in</strong></strong> <strong>sequences</strong> <strong>and</strong> <strong>alignments</strong> <strong>of</strong> <strong>all</strong> <strong>prote<strong>in</strong>s</strong> <strong>found</strong> <strong>in</strong> this study.<br />
(A) Alignment <strong>of</strong> Rubisco <strong>sequences</strong> from Arabidopsis thaliana, Brassica oleacera,<br />
Chlamydomonas rhe<strong>in</strong>hardtii <strong>and</strong> Synechococcus elongatus (B) Alignment <strong>of</strong><br />
Arabidopsis GAPA-2 <strong>and</strong> GAPC-2 <strong>prote<strong>in</strong>s</strong>. Conserved lys<strong>in</strong>e residues are<br />
highlighted <strong>in</strong> black. Acetylated lys<strong>in</strong>e residues are red. (C) <strong>Prote<strong>in</strong></strong> <strong>sequences</strong> <strong>and</strong><br />
<strong>alignments</strong> <strong>of</strong> detected <strong>prote<strong>in</strong>s</strong>. Lys<strong>in</strong>e acetylated peptides are highlighted <strong>in</strong> yellow<br />
<strong>and</strong> acetylated lys<strong>in</strong>e residues are highlighted <strong>in</strong> red.<br />
(A)<br />
A.thal<strong>in</strong>a -MSPQTETKASVGFKAGVKEYKLTYYTPEYETKDTDILAAFRVTPQPGVPPEEAGAAVAA 59<br />
B.oleacera -MSPQTETKASVGFKAGVKEYKLNYYTPEYETKDTDILAAFRVTPQPGVPPEEAGAAVAA 59<br />
C.rhe<strong>in</strong>hardtii -MVPQTETKAGAGFKAGVKDYRLTYYTPDYVVRDTDILAAFRMTPQPGVPPEECGAAVAA 59<br />
Synechococcus MSYSQTQSKSGAGYDAGVQDYRLTYYAPDYTPRDTDILAAFRMTPQPGVPPEECAAAVAA 60<br />
.**::*:..*:.***::*:*.**:*:* :*********:**********..*****<br />
A.thal<strong>in</strong>a ESSTGTWTTVWTDGLTSLDRYKGRCYHIEPVPGEETQFIAYVAYPLDLFEEGSVTNMFTS 119<br />
B.oleacera ESSTGTWTTVWTDGLTSLDRYKGRCYHIEPVPGEETQFIAYVAYPLDLFEEGSVTNMFTS 119<br />
C.rhe<strong>in</strong>hardtii ESSTGTWTTVWTDGLTSLDRYKGRCYDIEPVPGEDNQYIAYVAYPIDLFEEGSVTNMFTS 119<br />
Synechococcus ESSTGTWTTVWTDLLTDMDRYRGRCYDIEPVPGEDNQYIAYVAYPLDLFEEGSVTNLLTS 120<br />
************* **.:***:****.*******:.*:*******:**********::**<br />
Catalytic residue<br />
A.thal<strong>in</strong>a IVGNVFGFKALAALRLEDLRIPPAYTKTFQGPPHGIQVERDKLNKYGRPLLGCTIKPKLG 179<br />
B.oleacera IVGNVFGFKALAALRLEDLRIPPAYTKTFQGPPHGIQVERDKLNKYGRPLLGCTIKPKLG 179<br />
C.rhe<strong>in</strong>hardtii IVGNVFGFKALRALRLEDLRIPPAYVKTFVGPPHGIQVERDKLNKYGRGLLGCTIKPKLG 179<br />
Synechococcus LVGNVFGFKALRALRLEDLRIPVAYVKTFQGPPHGIQVERDRINKYGRPLLGCTIKPKLG 180<br />
:********** ********** **.*** ***********::***** ***********<br />
(Carbamylation site)<br />
A.thal<strong>in</strong>a LSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRDRFLFCAEAIYKSQAETGEIKGHY 239<br />
B.oleacera LSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRDRFLFCAEAIYKSQAETGEIKGHY 239<br />
C.rhe<strong>in</strong>hardtii LSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRDRFLFVAEAIYKAQAETGEVKGHY 239<br />
Synechococcus LSAKNYGRAVYECLRGGLDFTKDDENINSQPFQRWRDRFLFVADAIHKSQAETGEIKGHY 240<br />
**************************:***** ******** *:**:*:******:****<br />
A.thal<strong>in</strong>a LNATAGTCEEMIKRAVFARELGVPIVMHDYLTGGFTANTSLSHYCRDNGLLLHIHRAMHA 299<br />
B.oleacera LNATAGTCEEMMKRAIFARELGVPIVMHDYLTGGFTANTSLAHYCRDNGLLLHIHRAMHA 299<br />
C.rhe<strong>in</strong>hardtii LNATAGTCEEMMKRAVCAKELGVPIIMHDYLTGGFTANTSLAIYCRDNGLLLHIHRAMHA 299<br />
Synechococcus LNVTAATCEEMMKRAAYAKELEMPIVMHDFLTGGFTANTTLAHWCRDNGILLHIHRAMHA 300<br />
**.**.*****:*** *:** :**:***:*********:*: :*****:**********<br />
Catalytic residue<br />
A.thal<strong>in</strong>a VIDRQKNHGMHFRVLAKALRLSGGDHIHAGTVVGKLEGDRESTLGFVDLLRDDYVEKDRS 359<br />
B.oleacera VIDRQKNHGMHFRVLAKALRLSGGDHVHAGTVVGKLEGDRESTLGFVDLLRDDYVEKDRS 359<br />
C.rhe<strong>in</strong>hardtii VIDRQRNHGIHFRVLAKALRMSGGDHLHSGTVVGKLEGEREVTLGFVDLMRDDYVEKDRS 359<br />
Synechococcus VIDRQKNHGIHFRVLAKCLRMSGGDHIHTGTVVGKLEGDRAGTLGFVDLLRENYIEQDKS 360<br />
*****:***:*******.**:*****:*:*********:* *******:*::*:*:*:*<br />
A.thal<strong>in</strong>a RGIFFTQDWVSLPGVLPVASGGIHVWHMPALTEIFGDDSVLQFGGGTLGHPWGNAPGAVA 419<br />
B.oleacera RGIFFTQDWVSLPGVLPVASGGIHVWHMPALTEIFGDDSVLQFGGGTLGHPWGNAPGAVA 419<br />
C.rhe<strong>in</strong>hardtii RGIYFTQDWCSMPGVMPVASGGIHVWHMPALVEIFGDDACLQFGGGTLGHPWGNAPGAAA 419<br />
Synechococcus RGVYFTQDWASMPGVMAVASGGIHVWHMPALVEIFGDDSVLQFGGGTLGHPWGNAPGATA 420<br />
**::***** *:***:.**************.******: ******************.*<br />
A.thal<strong>in</strong>a NRVALEACVQARNEGRDLAVEGNEIIREACKWSPELAAACEVWKEITFNFPTIDKLDGQE 479<br />
B.oleacera NRVALEACVQARNEGRDLAVEGNEIIREACKWSPELAAACEVWKEITFNFPTIDKLDGQD 479<br />
C.rhe<strong>in</strong>hardtii NRVALEACTQARNEGRDLAREGGDVIRSACKWSPELAAACEVWKEIKFEFDTIDKL---- 475<br />
Synechococcus NRVALEACVQARNEGRNLAREGGDIIREACKWSPELAAACELWKEIKFEFDTVDTI---- 476<br />
********.*******:** **.::**.*************:****.*:* *:*.:
(B)<br />
GAPC2 -----------------------------------------------------------M 1<br />
GAPA2 MASATFSVAKPSLQGFSEFSGLRNSSALPFAKRSSSDEFVSFVSFQTSAMRSNGGYRKGV 60<br />
:<br />
GAPC2 ADKKIRIGINGFGRIGRLVARVVLQRDD--VELVAVNDPFITTEYMTYMFKYDSVHGQWK 59<br />
GAPA2 TEAKIKVAINGFGRIGRNFLRCWHGRKDSPLDVVVINDTGG-VKQASHLLKYDSTLGIFD 119<br />
:: **::.********* . * *.* :::*.:**. .: ::::****. * :.<br />
GAPC2 HHELKVKDDKTLLFGEKPVTVFGIRNPEDIPWGEAGADFVVESTGVFTDKDKAAAHLKGG 119<br />
GAPA2 -ADVKPSGDSALSVDGKIIKIVSDRNPSNLPWGELGIDLVIEGTGVFVDRDGAGKHLQAG 178<br />
::* ..*.:* .. * :.:.. ***.::**** * *:*:*.****.*:* *. **:.*<br />
GAPC2 AKKVVISAPSK-DAPMFVVGVNEHEYKSDLDIVSNASCTTNCLAPLAKVINDRFGIVEGL 178<br />
GAPA2 AKKVLITAPGKGDIPTYVVGVNAELYSHEDTIISNASCTTNCLAPFVKVLDQKFGIIKGT 238<br />
****:*:**.* * * :***** . *. : *:************:.**::::***::*<br />
GAPC2 MTTVHSITATQKTVDGPSMKDWRGGRAASFNIIPSSTGAAKAVGKVLPSLNGKLTGMSFR 238<br />
GAPA2 MTTTHSYTGDQRLLD-ASHRDLRRARAAALNIVPTSTGAAKAVALVLPNLKGKLNGIALR 297<br />
***.** *. *: :* .* :* * .***::**:*:********. ***.*:***.*:::*<br />
GAPC2 VPTVDVSVVDLTVRLEKAATYDEIKKAIKEESEGKMKGILGYTEDDVVSTDFVGDNRSSI 298<br />
GAPA2 VPTPNVSVVDLVVQVSKKTFAEEVNAAFRDAAEKELKGILDVCDEPLVSVDFRCSDVSST 357<br />
*** :******.*::.* : :*:: *::: :* ::****. :: :**.** .: **<br />
GAPC2 FDAKAGIALSDKFVKLVSWYDNEWGYSSRVVDLIVHMSKA-- 338<br />
GAPA2 IDSSLTMVMGDDMVKVIAWYDNEWGYSQRVVDLADIVANNWK 399<br />
:*:. :.:.*.:**:::*********.***** :::<br />
(C)<br />
>AT1G03860<br />
MSFNKVPNIPGAPALSALLKVSVIGGLGVYALTNSLYNVDGGHRAVMFNRLTGIKEKVYPEGTHFMVPWFERPIIYDVRARPYLVESTTGSHDLQMVKIG<br />
LRVLTRPMGDRLPQIYRTLGENYSERVLPSIIHETLKAVVAQYNASQLITQREAVSREIRKILTERASNFDIALDDVSITTLTFGKEFTAAIEAKQVAAQ<br />
EAERAKFIVEKAEQDRRSAVIRAQGEAKSAQLIGQAIANNQAFITLRKIEAAREIAQTIAQSANKVYLSSNDLLLNLQEMNLEPKK<br />
GENE ID: 11331 PHB2 | prohibit<strong>in</strong> 2 [Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 310 bits (794), Expect = 4e-84, Method: Compositional matrix adjust.<br />
Identities = 145/273 (53%), Positives = 210/273 (76%), Gaps = 2/273 (0%)<br />
Query 9 IPGAP-ALSALLKVSVIGGLGVYALTNSLYNVDGGHRAVMFNRLTGIKEK-VYPEGTHFM 66<br />
+P P + LK+ + G Y + S++ V+GGHRA+ FNR+ G+++ + EG HF<br />
Sbjct 12 LPAGPRGMGTALKLLLGAGAVAYGVRESVFTVEGGHRAIFFNRIGGVQQDTILAEGLHFR 71<br />
Query 67 VPWFERPIIYDVRARPYLVESTTGSHDLQMVKIGLRVLTRPMGDRLPQIYRTLGENYSER 126<br />
+PWF+ PIIYD+RARP + S TGS DLQMV I LRVL+RP LP +Y+ LG +Y ER<br />
Sbjct 72 IPWFQYPIIYDIRARPRKISSPTGSKDLQMVNISLRVLSRPNAQELPSMYQRLGLDYEER 131<br />
Query 127 VLPSIIHETLKAVVAQYNASQLITQREAVSREIRKILTERASNFDIALDDVSITTLTFGK 186<br />
VLPSI++E LK+VVA++NASQLITQR VS IR+ LTERA +F + LDDV+IT L+F +<br />
Sbjct 132 VLPSIVNEVLKSVVAKFNASQLITQRAQVSLLIRRELTERAKDFSLILDDVAITELSFSR 191<br />
Query 187 EFTAAIEAKQVAAQEAERAKFIVEKAEQDRRSAVIRAQGEAKSAQLIGQAIANNQAFITL 246<br />
E+TAA+EAKQVA QEA+RA+F+VEKA+Q++R +++A+GEA++A+++G+A++ N +I L<br />
Sbjct 192 EYTAAVEAKQVAQQEAQRAQFLVEKAKQEQRQKIVQAEGEAEAAKMLGEALSKNPGYIKL 251<br />
Query 247 RKIEAAREIAQTIAQSANKVYLSSNDLLLNLQE 279<br />
RKI AA+ I++TIA S N++YL++++L+LNLQ+<br />
Sbjct 252 RKIRAAQNISKTIATSQNRIYLTADNLVLNLQD 284<br />
>AT1g03910<br />
MGSHGKGKRDRSGRQKKRRDESESGSESESYTSDSDGSDDLSPPRSSRRKKGSSSRRTRRRSSSDDSSDSDGGRKSKKRSSSKDYSEEKVTEYMSKKAQK<br />
KALRAAKKLKTQSVSGYSNDSNPFGDSNLTETFVWRKKIEKDVHRGVPLEEFSVKAEKRRHRERMTEVEKVKKRREERAVEKARHEEEMALLARERARAE<br />
FHDWEKKEEEFHFDQSKVRSEIRLREGRLKPIDVLCKHLDGSDDLDIELSEPYMVFKKKKVRIGIWLNFQLSITNVYVEAEYKNDSACLLLRSRVDILLN<br />
KGLTVKDMEELRDDIKMYLDLDRATPTRVQYWEALIVVCDWELAEARKRDALDRARVRGEEPPAELLAQERGLHAGVEADVRKLLDGKTHAELVELQLDI<br />
ESQLRSGSAKVVEYWEAVLKRLEIYKAKACLKEIHAEMLRRHLHRLEQLSEGEDDVEVNPGLTRVVEENEEEINDTNLSDAEEAFSPEPVAEEEEADEAA<br />
EAAGSFSPELMHGDDREEAIDPEEDKKLLQMKRMIVLEKQKKRLKEAMDSKPAPVEDNLELKAMKAMGAMEEGDAIFGSNAEVNLDSEVYWWHDKYRPRK<br />
PKYFNRVHTGYEWNKYNQTHYDHDNPPPKIVQGYKFNIFYPDLVDKIKAPIYTIEKDGTSAETCMIRFHAGPPYEDIAFRIVNKEWEYSHKKGFKCTFER<br />
GILHLYFNFKRHRYRR<br />
GENE ID: 58509 C19orf29 | chromosome 19 open read<strong>in</strong>g frame 29 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 394 bits (1012), Expect = 6e-109, Method: Compositional matrix adjust.<br />
Identities = 242/644 (38%), Positives = 354/644 (55%), Gaps = 121/644 (18%)<br />
Query 116 GYSNDSNPFGDSNLTETFVWRKKIEKDVHRGVP-LEEFSVKAEKRRHRE-RMTEVEKVKK 173<br />
GY+N NPFGD+NL TF+W K +EK +G+ LEE +K +R +E E++KVK+<br />
Sbjct 193 GYTNTDNPFGDNNLLGTFIWNKALEK---KGISHLEEKELKERNKRIQEDNRLELQKVKQ 249<br />
Query 174 RREERAVEKARHEEEMALLARERARAEFHDWEKKEEEFHFDQSKVRSEIRLREGRLKPID 233<br />
R ER EKA E+E+ +L RE+ F WE++E+ FH Q+K+RS+IR+R+GR KPID<br />
Sbjct 250 LRLEREREKAMREQELEMLQREKEAEHFKTWEEQEDNFHLQQAKLRSKIRIRDGRAKPID 309<br />
Query 234 VLCKHLDG-SDDLDIELSEPYMVFKKKKVRIGIWLNFQLSITNVYVEAEYKNDSACLLLR 292<br />
+L K++ DDL +E+ EPY<br />
Sbjct 310 LLAKYISAEDDDLAVEMHEPY--------------------------------------- 330
Query 293<br />
Sbjct 331<br />
Query 353<br />
Sbjct 384<br />
Query 412<br />
Sbjct 434<br />
Query 472<br />
Sbjct 490<br />
Query 503<br />
Sbjct 549<br />
Query 553<br />
Sbjct 602<br />
Query 613<br />
Sbjct 655<br />
Query 673<br />
Sbjct 715<br />
>AT1G04410<br />
MAKEPVRVLVTTGAAGQIGYALVPMIAARGIMLGADQPVILHM<br />
MLDIPPAAEALNGVKM KMELIDAAFPLLKGVV VATTDAVEGCTGVNVA AVMVGGFPRKEGMERKK<br />
DVMSKNVSIYKKSQAAALEKHAAPNCKKVLVVANPANTNALIL<br />
LKEFAPSIPEKNISCL CLTRLDHNRALGQISE ERLSVPVSDVKNVIIW WGNHSSSQYPDVNHAKK<br />
VQTSSGEKPVRRELVKDDAWLDGEFISSTVQQRGAAIIKARKL<br />
LSSALSAASSACDHIRRDWVLGTPEGTFVSM<br />
MGVYSDGSYSVPSGLIYSFPVTCRNGDWSIVV<br />
QGLPIDEVSRKKKMDLTAEELKEEKDLLAYSCLS<br />
GENE ID: 41190<br />
MDH1 | malatte<br />
dehydrogenase e 1, NAD (solublle)<br />
[Homo sapiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 4418<br />
bits (1075), , Expect = 1e-1 116, Method: Commpositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 204/330 (61% %), Positives = 249/330 (75%), Gaps = 1/330 ( 0%)<br />
Query 4<br />
Sbjct 3<br />
Query 64<br />
Sbjct 63<br />
Query 124<br />
Sbjct 123<br />
Query 184<br />
Sbjct 183<br />
Query 244<br />
Sbjct 243<br />
Query 303<br />
Sbjct 303<br />
>AT1G07660<br />
MSGRGKGGKGLLGKGGAKRHRKVLRDNNIQGITKPAIRRLARR<br />
RGGVKRISGLIYEETR TRGVLKIFLENVIRDA AVTYTEHARRKTVTAM MDVVYALKRQGRTLYGG<br />
FGG<br />
> gb|EEAW55528.1|<br />
Length=129<br />
GENE ID: 88364<br />
HIST1H4C | histone cluster r 1, H4c [Homo ssapiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 2200<br />
bits (508), Expect = 5e-51 1, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 101/103 (98% %), Positives = 103/103 (100%), , Gaps = 0/103 (0%)<br />
Query 1<br />
Sbjct 27<br />
Query 61<br />
Sbjct 87<br />
SRVDILLNKGLTVKKDMEELRDDIKMYLDL<br />
LDRATPTRVQYWEALIIVVCDWELAEARKRD<br />
DAL 352<br />
LN GLTV DME+L +DI++Y++L L++ +W + + + E+++ RK + A<br />
----TFLN-GLTVAADMEDLLEDIQVYMEL<br />
LEQGK--NADFWRDMT MTTITEDEISKLRKLEAS<br />
383<br />
DRARVRGEEPPAELLLAQERGLHAGVEADV<br />
VRKLLDGKTHAELVELLQLDIESQLRSGSAK<br />
KV- 411<br />
+ P E + G++A V +DV V+ + GKT+ +L + IE ++R+G +<br />
GKG-------PGE----RREGVNASVSSDV<br />
VQSVFKGKTYNQLQVI VIFQGIEGKIRAGGPN NLD 433<br />
VEYWEAVLKRLEIYYKAKACLKEIHAEMLR<br />
RRHLHRLEQLSEGEDD DDVEVNPGLTRVVEEN NEE 471<br />
+ YWE++L++L + A+A L+E H ++LR R+ L++L+Q E VE P + +E +<br />
MGYWESLLQQLRAHHMARARLRERHQDVLR<br />
RQKLYKLKQ----EQG QGVESEPLFPILKQEP PQS 489<br />
EINDTNLSDAEEAFFSPEPVAE----EEEA<br />
ADEAAEA--------------------------<br />
502<br />
DA P P +E E E D A<br />
PSRSLEPEDAAPT--PPGPSSEGGPAEAEV<br />
VDGATPTEGDGDGDGE GEGEGEGEAVLMEEDL LIQ 548<br />
-------AGSFSPEELMHGDDRE---EAID<br />
DPEEDKKLLQMKRMIVVLEKQKKRLKEAMDSKP<br />
552<br />
AG +SP L+ + ++ +P+ED + LQ+ R +Q + +A +S<br />
QSLDDYDAGRYSPRRLLTAHELPLDAHVLE<br />
EPDEDLQRLQLSR------QQLQVTGDASES--<br />
601<br />
APVEDNLELKAMKAAMGAMEEGDAIFGSNA<br />
AEVNLDSEVYWWHDKY KYRPRKPKYFNRVHTG GYE 612<br />
ED +A + MG + +A F + E+ L + Y W DKY KYRPRKP++FNRVHTG G+E<br />
--AEDIFFRRAKEGGMG---QDEAQF--SV<br />
VEMPLTGKAYLWADKY KYRPRKPRFFNRVHTG GFE 654<br />
WNKYNQTHYDHDNPPPPKIVQGYKFNIFYP<br />
PDLVDKIKAPIYTIEKKDGTSAETCMIRFHA<br />
AGP 672<br />
WNKYNQTHYD DNPPPPKIVQGYKFNIFYP<br />
PDL+DK P Y +E + + ++RFHA AGP<br />
WNKYNQTHYDFDNPPPPKIVQGYKFNIFYP<br />
PDLIDKRSTPEYFLEAACADNKDFAILRFHA<br />
AGP 714<br />
PYEDIAFRIVNKEWWEYSHKKGFKCTFERG<br />
GILHLYFNFKRHRYRR RR 716<br />
PYEDIAF+IVN+EWWEYSH+<br />
GF+C F GI G L+F+FKR+RYRR RR<br />
PYEDIAFKIVNREWWEYSHRHGFRCQFANG<br />
GIFQLWFHFKRYRYRR RR 758<br />
EPVRVLVTGAAGQIIGYALVPMIARGIMLG<br />
GADQPVILHMLDIPPAAAEALNGVKMELIDA<br />
AAF 63<br />
EP+RVLVTGAAGQII<br />
Y+L+ I G + G DQP+IL +LDI P L+GV MEL D A<br />
EPIRVLVTGAAGQIIAYSLLYSIGNGSVFG<br />
GKDQPIILVLLDITPMMMGVLDGVLMELQDC<br />
CAL 62<br />
PLLKGVVATTDAVEEGCTGVNVAVMVGGFP<br />
PRKEGMERKDVMSKNV NVSIYKSQAAALEKHA AAP 123<br />
PLLK V+AT<br />
++VA++VG PR+EGMERKD++ P<br />
NV I+KSQ AAL+K+A A<br />
PLLKDVIATDKEDVVAFKDLDVAILVGSMP<br />
PRREGMERKDLLKANV NVKIFKSQGAALDKYA AKK 122<br />
NCKVLVVANPANTNNALILKEFAPSIPEKN<br />
NISCLTRLDHNRALGQ GQISERLSVPVSDVKN NVI 183<br />
+ KV+VV NPANTNN<br />
L + APSIP++N N SCLTRLDHNRA QQI+<br />
+L V +DVKN NVI<br />
SVKVIVVGNPANTNNCLTASKSAPSIPKEN<br />
NFSCLTRLDHNRAKAQ AQIALKLGVTANDVKN NVI 182<br />
IWGNHSSSQYPDVNNHAKVQTSSGEKPVRE<br />
ELVKDDAWLDGEFISTTVQQRGAAIIKARKL<br />
LSS 243<br />
IWGNHSS+QYPDVNNHAKV+<br />
E V E +KDD+WL GEF++TTVQQRGAA+IKARKL<br />
LSS<br />
IWGNHSSTQYPDVNNHAKVKLQGKEVGVYE<br />
EALKDDSWLKGEFVTTTVQQRGAAVIKARKL<br />
LSS 242<br />
ALSAASSACDHIRDDWVLGTPEGTFVSMGV<br />
VYSDG-SYSVPSGLIYYSFPVTCRNGDWSIV<br />
VQG 302<br />
A+SAA + CDH+RDD<br />
GTPEG FVSMGV V SDG SY VP L+YYSFPV<br />
+N W V+G V<br />
AMSAAKAICDHVRDDIWFGTPEGEFVSMGV<br />
VISDGNSYGVPDDLLYYSFPVVIKNKTWKFV<br />
VEG 302<br />
LPIDEVSRKKMDLTTAEELKEEKDLAYSCL<br />
LS 332<br />
LPI++ SR+KMDLTTA+EL<br />
EEK+ A+ LS L<br />
LPINDFSREKMDLTTAKELTEEKESAFEFL<br />
LS 332<br />
histone 1, H4c [Homo sapiens]<br />
MSGRGKGGKGLGKGGGAKRHRKVLRDNIQG<br />
GITKPAIRRLARRGGV GVKRISGLIYEETRGV VLK 60<br />
MSGRGKGGKGLGKGGGAKRHRKVLRDNIQG<br />
GITKPAIRRLARRGGV GVKRISGLIYEETRGV VLK<br />
MSGRGKGGKGLGKGGGAKRHRKVLRDNIQG<br />
GITKPAIRRLARRGGV GVKRISGLIYEETRGV VLK 86<br />
IFLENVIRDAVTYTTEHARRKTVTAMDVVY<br />
YALKRQGRTLYGFGG G 103<br />
+FLENVIRDAVTYTTEHA+RKTVTAMDVVY<br />
YALKRQGRTLYGFGGG<br />
VFLENVIRDAVTYTTEHAKRKTVTAMDVVY<br />
YALKRQGRTLYGFGG G 129
AT1G07920<br />
MGKEKFHINIVVIGHVDSGKSTTTGHLIYKLGGIDKRVIERFEKEAAEMNKRSFKYAWVLDKLKAERERGITIDIALWKFETTKYYCTVIDAPGHRDFIK<br />
NMITGTSQADCAVLIIDSTTGGFEAGISKDGQTREHALLAFTLGVKQMICCCNKMDATTPKYSKARYDEIIKEVSSYLKKVGYNPDKIPFVPISGFEGDN<br />
MIERSTNLDWYKGPTLLEALDQINEPKRPSDKPLRLPLQDVYKIGGIGTVPVGRVETGMIKPGMVVTFAPTGLTTEVKSVEMHHESLLEALPGDNVGFNV<br />
KNVAVKDLKRGYVASNSKDDPAKGAANFTSQVIIMNHPGQIGNGYAPVLDCHTSHIAVKFSEILTKIDRRSGKEIEKEPKFLKNGDAGMVKMTPTKPMVV<br />
ETFSEYPPLGRFAVRDMRQTVAVGVIKSVDKKDPTGAKVTKAAVKKGAK<br />
GENE ID: 1915 EEF1A1 | eukaryotic translation elongation factor 1 alpha 1<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.<br />
Identities = 347/457 (75%), Positives = 391/457 (85%), Gaps = 12/457 (2%)<br />
Query 1 MGKEKFHINIVVIGHVDSGKSTTTGHLIYKLGGIDKRVIERFEKEAAEMNKRSFKYAWVL 60<br />
MGKEK HINIVVIGHVDSGKSTTTGHLIYK GGIDKR IE+FEKEAAEM K SFKYAWVL<br />
Sbjct 1 MGKEKTHINIVVIGHVDSGKSTTTGHLIYKFGGIDKRTIEKFEKEAAEMGKGSFKYAWVL 60<br />
Query 61 DKLKAERERGITIDIALWKFETTKYYCTVIDAPGHRDFIKNMITGTSQADCAVLIIDSTT 120<br />
DKLKAERERGITIDI+LWKFET+KYY T+IDAPGHRDFIKNMITGTSQADCAVLI+ +<br />
Sbjct 61 DKLKAERERGITIDISLWKFETSKYYVTIIDAPGHRDFIKNMITGTSQADCAVLIVAAGV 120<br />
Query 121 GGFEAGISKDGQTREHALLAFTLGVKQMICCCNKMDATTPKYSKARYDEIIKEVSSYLKK 180<br />
G FEAGISK+GQTREHALLA+TLGVKQ+I NKMD+T P YS+ RY+EI+KEVS+Y+KK<br />
Sbjct 121 GEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVSTYIKK 180<br />
Query 181 VGYNPDKIPFVPISGFEGDNMIERSTNLDWYKG------------PTLLEALDQINEPKR 228<br />
+GYNPD + FVPISG+ GDNM+E S N+ W+KG TLLEALD I P R<br />
Sbjct 181 IGYNPDTVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEALDCILPPTR 240<br />
Query 229 PSDKPLRLPLQDVYKIGGIGTVPVGRVETGMIKPGMVVTFAPTGLTTEVKSVEMHHESLL 288<br />
P+DKPLRLPLQDVYKIGGIGTVPVGRVETG++KPGMVVTFAP +TTEVKSVEMHHE+L<br />
Sbjct 241 PTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFAPVNVTTEVKSVEMHHEALS 300<br />
Query 289 EALPGDNVGFNVKNVAVKDLKRGYVASNSKDDPAKGAANFTSQVIIMNHPGQIGNGYAPV 348<br />
EALPGDNVGFNVKNV+VKD++RG VA +SK+DP AA FT+QVII+NHPGQI GYAPV<br />
Sbjct 301 EALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPMEAAGFTAQVIILNHPGQISAGYAPV 360<br />
Query 349 LDCHTSHIAVKFSEILTKIDRRSGKEIEKEPKFLKNGDAGMVKMTPTKPMVVETFSEYPP 408<br />
LDCHT+HIA KF+E+ KIDRRSGK++E PKFLK+GDA +V M P KPM VE+FS+YPP<br />
Sbjct 361 LDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 420<br />
Query 409 LGRFAVRDMRQTVAVGVIKSVDKKDPTGAKVTKAAVK 445<br />
LGRFAVRDMRQTVAVGVIK+VDKK KVTK+A K<br />
Sbjct 421 LGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQK 457<br />
>AT1G09200<br />
MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRFRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVAALQEAAEAY<br />
LVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA<br />
GENE ID: 126961 HIST2H3C | histone cluster 2, H3c [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 267 bits (683), Expect = 2e-71, Method: Compositional matrix adjust.<br />
Identities = 132/136 (97%), Positives = 135/136 (99%), Gaps = 0/136 (0%)<br />
Query 1 MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRFRPGTVALREIRKYQKSTE 60<br />
MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHR+RPGTVALREIR+YQKSTE<br />
Sbjct 1 MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTE 60<br />
Query 61 LLIRKLPFQRLVREIAQDFKTDLRFQSSAVAALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120<br />
LLIRKLPFQRLVREIAQDFKTDLRFQSSAV ALQEA+EAYLVGLFEDTNLCAIHAKRVTI<br />
Sbjct 61 LLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTI 120<br />
Query 121 MPKDIQLARRIRGERA 136<br />
MPKDIQLARRIRGERA<br />
Sbjct 121 MPKDIQLARRIRGERA 136<br />
>AT1G09300<br />
MQFLARNLVRRVSRTQVVSRNAYSTQTVRDIGQPTPASHPHLMAEGEVTPGIRIEEYIGRRKKLVELLPENSLAIISSAPVKMMTDVVPYTFRQDADYLY<br />
LTGCQQPGGVAVLSDERGLCMFMPESTPKDIAWEGEVAGVDAASEVFKADQAYPISKLPEILSDMIRHSSKVFHNVQSASQRYTNLDDFQNSASLGKVKT<br />
LSSLTHELRLIKSPAELKLMRESASIACQGLLKTMLHSKGFPDEGILSAQVEYECRVRGAQRMAFNPVVGGGSNASVIHYSRNDQRIKDGDLVLMDMGCE<br />
LHGYVSDLTRTWPPCGKFSSVQEELYDLILQTNKECIKQCKPGTTIRQLNTYSTELLCDGLMKMGILKSRRLYHQLNPTSIGHYLGMDVHDSSAVGYDRP<br />
LQPGFVITIEPGVYIPSSFDCPERFQGIGIRIEDDVLITETGYEVLTGSMPKEIKHIETLLNNHCHDNSARTSPVSLCKVKGLHTNRNPRRLF<br />
GENE ID: 63929 XPNPEP3 | X-prolyl am<strong>in</strong>opeptidase (am<strong>in</strong>opeptidase P) 3, putative<br />
[Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 347 bits (891), Expect = 2e-95, Method: Compositional matrix adjust.<br />
Identities = 185/486 (38%), Positives = 280/486 (57%), Gaps = 34/486 (6%)<br />
Query 9 VRRVSRTQVVSRNAYSTQTV-------RDIGQPTPASHPHLMAEGEVTPGIRIEEYIGRR 61<br />
VR +S + S+ YS Q V R +GQP+P +HPHL+ GEVTPG+ EY RR<br />
Sbjct 17 VRGLSGCMLCSQRRYSLQPVPERRIPNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRR 76<br />
Query 62 KKLVELLPE--------NSLAIISSAPVKMMTDVVPYTFRQDADYLYLTGCQQPGGVAVL 113<br />
KL+ L+ + + ++ S P M++ +PYTF QD ++LYL G Q+P + VL<br />
Sbjct 77 HKLMSLIQKEAQGQSGTDQTVVVLSNPTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVL 136<br />
Query 114 SDERG-------LCMFMPESTPKDIAWEGEVAGVDAASEVFKADQAYPISKLPEILSDMI 166<br />
G +F+P P W+G +G D A + D+AY + + +L M<br />
Sbjct 137 QSLPGKQLPSHKAILFVPRRDPSRELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMK 196<br />
Query 167 RHSSKVFHNVQSASQRYTNLDDFQ-----NSASLGKVKTLSSLTHELRLIKSPAELKLMR 221<br />
++ V+++ S + D Q + S KV+ + L LRLIKSPAE++ M+<br />
Sbjct 197 AETNMVWYDWMRPSHAQLHSDYMQPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQ 256<br />
Query 222 ESASIACQGLLKTMLHSKGFPDEGILSAQVEYECRVRGAQRMAFNPVVGGGSNASVIHYS 281<br />
+ + Q ++TM SK +E L A+ E+ECR RGA +A+ PVV GG+ ++ +HY<br />
Sbjct 257 IAGKLTSQAFIETMFTSKAPVEEAFLYAKFEFECRARGADILAYPPVVAGGNRSNTLHYV 316
Query 282<br />
Sbjct 317<br />
Query 342<br />
Sbjct 377<br />
Query 396<br />
Sbjct 437<br />
Query 455<br />
Sbjct 497<br />
>AT1G12900 (GAPA2)<br />
MASATFSVAKPPSLQGFSEFSGLRNSSSALPFAKRSSSDEFVS<br />
SFVSFQTSAMRSNGGY GYRKGVTEAKIKVAIN N<br />
GFGRIGRNFLRRCWHGRKDSPLDVVVIINDTGGVKQASHLLKY<br />
YDSTLGIFDADVKPSGGDSALSVDGKIIKIV<br />
V<br />
SDRNPSNLPWGGELGIDLVIEGTGVFVVDRDGAGKHLQAGAKK<br />
KVLITAPGKGDIPTYV YVVGVNAELYSHEDTI<br />
ISNASCTTNCLLAPFVKVLDQKFGIIKKGTMTTTHSYTGDQRL<br />
LLDASHRDLRRARAAA AALNIVPTSTGAAKAV V<br />
ALVLPNLKGKLLNGIALRVPTPNVSVVVDLVVQVSKKTFAEEV<br />
VNAAFRDAAEKELKGI GILDVCDEPLVSVDFR R<br />
CSDVSSTIDSSSLTMVMGDDMVKVIAWWYDNEWGYSQRVVDLA<br />
ADIVANNWK<br />
> pdb| |1ZNQ|O Chai<strong>in</strong><br />
O, Crsytal St tructure Of Huma man Liver Gapdh<br />
Length=338<br />
Score = 2289<br />
bits (740), Expect = 1e-77 7, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 156/334 (46% %), Positives = 204/334 (61%), Gaps = 6/334 ( 1%)<br />
Query 59<br />
Sbjct 1<br />
Query 118<br />
Sbjct 59<br />
Query 178<br />
Sbjct 118<br />
Query 238<br />
Sbjct 177<br />
Query 297<br />
Sbjct 237<br />
Query 357<br />
Sbjct 297<br />
>AT1G22840<br />
MASFDEAPPGNNAKAGEKIFRTKCAQCCHTVEAGAGHKQGPNL<br />
LNGLFGRQSGTTAGYS YSYSAANKNKAVEWEE EKALYDYLLNPKKYIP PGTKMVFPGLKKPQDRR<br />
ADLIAYLKESTTAPK<br />
GENE ID: 544205<br />
CYCS | cytoochrome<br />
c, somat tic [Homo sapienns]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 1145<br />
bits (367), Expect = 1e-34 4, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 66/102 (64%) ), Positives = 82/102 8 (80%), Ga Gaps = 0/102 (0% )<br />
Query 10<br />
Sbjct 2<br />
Query 70 EKALYDYLLNPKKYYIPGTKMVFPGLKKPQ<br />
QDRADLIAYLKEST 111<br />
E L +YL NPKKYYIPGTKM+F<br />
G+KK ++RADLIAYLK++T<br />
+<br />
Sbjct 62 EDTLMEYLENPKKYYIPGTKMIFVGIKKKE<br />
EERADLIAYLKKAT 103<br />
>AT1G29910 chlorophyll a/bb-b<strong>in</strong>d<strong>in</strong>g<br />
protei <strong>in</strong><br />
MAASTMALSSPPAFAGKAVNLSPAASEEVLGSGRVTMRKTVAK<br />
KPKGPSGSPWYGSDRV RVKYLGPFSGESPSYL LTGEFPGDYGWDTAGL LSADPETFARNRELEVV<br />
IHSRWAMLGALLGCVFPELLARNGVKFFGEAVWFKAGSQIFSD<br />
DGGLDYLGNPSLVHAQ AQSILAIWATQVILMG GAVEGYRVAGNGPLGEAEDLLYPGGSFDPLGG<br />
LATDPEAFAELLKVKELKNGRLAMFSMMFGFFVQAIVTGKGPI<br />
IENLADHLADPVNNNA NAWAFATNFVPGK<br />
GENE ID: 557589<br />
KIAA1432 | KIAA1432 [Homo o sapiens]<br />
Score = 333.1<br />
bits (74), Expect = 0.98, Method: Composiition-based<br />
sta ats.<br />
Identitiess<br />
= 14/38 (36%), , Positives = 25 5/38 (65%), Gapss<br />
= 1/38 (2%)<br />
Query 13<br />
Sbjct 895<br />
>AT1G35190<br />
MENHTTMKVSSSLNCIDLANDDLNHSVVVSLKQACLDCGFFYV<br />
VINHGISEEFMDDVFE FEQSKKLFALPLEEKM MKVLRNEKHRGYTPVL LDELLDPKNQINGDHKK<br />
EGYYIGIEVPKKDDPHWDKPFYGPNPWWPDADVLPGWRETMEK<br />
KYHQEALRVSMAIARL RLLALALDLDVGYFDR RTEMLGKPIATMRLLR RYQGISDPSKGIYACGG<br />
AHSDFGMMTLLLATDGVMGLQICKDKNNAMPQKWEYVPPIKGA<br />
AFIVNLGDMLERWSNG NGFFKSTLHRVLGNGQ QERYSIPFFVEPNHDCLVECLPTCKSESELPP<br />
KYPPIKCSTYLLTQRYEETHANLSIYHHQQT<br />
No significcant<br />
homologies<br />
>AT1G41880<br />
RNDQRIKDGDLVLMMDMGCELHGYVSDLTR<br />
RTWPPCGKFSSVQEELLYDLILQTNKECIKQ<br />
QCK 341<br />
+N+Q IKDG++VL+ +D GCE YVSD+TR RTWP G+F++ Q ELLY+<br />
+L+ ++C+ C<br />
KNNQLIKDGEMVLLLDGGCESSCYVSDITR<br />
RTWPVNGRFTAPQAELLYEAVLEIQRDCLAL<br />
LCF 376<br />
PGTTIRQLNTYSTEELLCDGLMKMGILKSR<br />
RRLYHQLN------PTTSIGHYLGMDVHDSSAV<br />
395<br />
PGT++ + + L+ L +GI+K+ + + P +GHYLGMDVHD+ +<br />
PGTSLENIYSMMLTTLIGQKLKDLGIMKNI<br />
IKENNAFKAARKYCPHHHVGHYLGMDVHDTP<br />
PDM 436<br />
GYDRPLQPGFVITIIEPGVYIPS-SFDCPE<br />
ERFQGIGIRIEDDVLIITETGYEVLTGSMPK<br />
KEI 454<br />
PLQPG VITIIEPG+YIP<br />
D PE E+F+G+G+RIEDDV++ +T+ +L+ PK KE+<br />
PRSLPLQPGMVITIIEPGIYIPEDDKDAPE<br />
EKFRGLGVRIEDDVVV VVTQDSPLILSADCPK KEM 496<br />
KHIETL 460<br />
IE +<br />
NDIEQI 502<br />
GVTEAKIKVAINGFFGRIGRNFLRCWHGRK<br />
KDSPLDVVVINDTG-GGVKQASHLLKYDSTL<br />
LGI 117<br />
G K+KV +NGFFGRIGR<br />
R<br />
+D+V IND + ++ +YDST G<br />
GSHMGKVKVGVNGFFGRIGRLVTRA--AFN<br />
NSGKVDIVAINDPFIDDLNYMVYMFQYDSTH<br />
HGK 58<br />
FDADVKPSGDSALSSVDGKIIKIVSDRNPS<br />
SNLPWGELGIDLVIEGGTGVFVDRDGAGKHL<br />
LQA 177<br />
F VK + L ++G I I +R+PS S + WG+ G + V+E TGVF + AG HL LQ<br />
FHGTVKAE-NGKLVVINGNPITIFQERDPS<br />
SKIKWGDAGAEYVVESSTGVFTTMEKAGAHL<br />
LQG 117<br />
GAKKVLITAPGKGDDIPTYVVGVNAELYSH<br />
HEDTIISNASCTTNCLLAPFVKVLDQKFGIIKG<br />
237<br />
GAK+V+I+AP D P +V+GVN E Y + IISNASCTTNCLLAP<br />
KV+ FGI+ +G<br />
GAKRVIISAP-SADDAPMFVMGVNHEKYDN<br />
NSLKIISNASCTTNCLLAPLAKVIHDNFGIV<br />
VEG 176<br />
TMTTTHSYTGDQRLLLDASHRDL-RRARAA<br />
AALNIVPTSTGAAKAV AVALVLPNLKGKLNGIAL<br />
296<br />
MTT H+ T Q+ +D L R R A NI+P STGAAKAV AV V+P L GKL G+ A<br />
LMTTVHAITATQKTTVDGPSGKLWRDGRGA<br />
ALQNIIPASTGAAKAV AVGKVIPELNGKLTGM MAF 236<br />
RVPTPNVSVVDLVVVQVSKKTFAEEVNAAF<br />
FRDAAEKELKGILDVC VCDEPLVSVDFRCSDV VSS 356<br />
RVPT NVSVVDL ++ K +++ + A+E LKGIL + +VS DF SS<br />
RVPTANVSVVDLTCCRLEKPAKYDDIKKVV<br />
VKQASEGPLKGILGYT YTEHQVVSSDFNSDTH HSS 296<br />
TIDSSLTMVMGDDMMVKVIAWYDNEWGYSQ<br />
QRVVDL 390<br />
T D+ + + D VK+I+WYDNE+GYS RVVDL<br />
TFDAGAGIALNDHFFVKLISWYDNEFGYSN<br />
NRVVDL 330<br />
GNAKAGEKIFRTKCCAQCHTVEAGAGHKQG<br />
GPNLNGLFGRQSGTTAAGYSYSAANKNKAVEWE<br />
69<br />
G+ + G+KIF KCC+QCHTVE<br />
G HK GPNL+GLFGR++G<br />
G<br />
GYSY+AANKNK + W<br />
GDVEKGKKIFIMKCCSQCHTVEKGGKHKTG<br />
GPNLHGLFGRKTGQAP APGYSYTAANKNKGIIWG<br />
61<br />
FAGKAVNLSPAASEEVLGSGRVTMRKTVAK<br />
KPKGPSGSPW 50<br />
F ++++LS +A V S + +++KT++ P GPSG W<br />
FRNRSISLSQSAENNVPAS-KFSLQKTLSM<br />
MPSGPSGKRW 931
MKGRQGERVRLLYVRGTVLGYKRSKSNNQYPNTSLIQIEGVNT<br />
TQEEVNWYKGKRLAYI YIYKAKTKKNGSHYRC CIWGKVTRPHGNSGVV VRSKFTSNLPPKSMGAA<br />
RVRVFMYPSNII<br />
GENE ID: 61165<br />
RPL35A | ribbosomal<br />
prote<strong>in</strong> L35a [Homo sapiiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 1108<br />
bits (270), Expect = 2e-23 3, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 57/109 (52%) ), Positives = 74/109 7 (67%), Ga Gaps = 9/109 (8% )<br />
Query 10<br />
Sbjct 4<br />
Query 63<br />
Sbjct 64<br />
>AT1G44120<br />
MTSEMDDPEKAAAVTITRLIEQLHAKKKSSAQEKELSTARLLG<br />
GLAKGKKECRKIISQN QNVNAMPAFISLLRSG GTLLAKLNSASVLTVL LCKDKNVRSKILIGGCC<br />
IPPLLSLLKSDDSVDAKRVVAEAIYEVVSLCGMDGDNVGTKIF<br />
FVTEGVVPSLWDQLKT KTGKKQDKTVEGHLVG GALRNLCGDKDGFWAL LTLEDGGVDIILKLLQQ<br />
SSNPVSQSNAAASLLARLIRIFTSSISSKVEESGAVQVLVQLL<br />
LGEENSVFVRASVVNA NALEAITSKSEEAITV VARDLDGIHLLISAVV VASSKESVEEETERVLL<br />
QSYGTQALANLLCGGMSGLIVYLGGLSSLSPRLTEPIADILGA<br />
ALAYALRKFQLSCGDT DTREAFDPTLTEGILV VKLLKPRDTQLIHERILEAMESLFGNVDLSKK<br />
LLNNVDAKRVLLVCLTILATDGPRERMMITCLSNLCKHGDVWD<br />
DAIGKREGIQILIPYL YLGLSSEQHQELSVEF FLAILTDNVEESRWAV VTSAGGIPPLLQILETT<br />
GVSQKAKDDAVVRVILNLCCHSEEIRLLCVEKAGAIPALLGLL<br />
LKNGGPKSQESSANTL TLLKLIKTADPSVIEQ QVQALFLGDAPKSKTH HLIRVLGHVLASASLEE<br />
EFVTKGSAANNNGLRSLVQRLASSNEKKMKENAASVLADLFSS<br />
SRKDLCGGLGFDEDDN DNPCTKLLSGNTHAVA ATQLAHALGSLSNPTK KKKTATKKLSGPEVEVV<br />
IKPLIKSAKTNNPIESTENPMSTLANLLLSDPNVAAEALNDDV<br />
VVSALTRVLREGTLQG QGKRNASHALHQLLKH HFQVSDVFKGNEQCRF FAVSELIDLLNATDLNN<br />
NSAFIDVLEVLLSLLAKAKYGANLSHNNPFSAFGEVPSNLDSL<br />
LVRGLAEGHPLVQDKA KAIEILSRFCKTQFIL LLGRLLVTQSKSISSL LANRTINSSSPEIKVGG<br />
GAILLVCAAKNNDITLWAEAVEQSGYLLKTLVNTLLDMSKQNS<br />
SKSASYGIEIQRPRSFFITSNLCLRMDDSEM<br />
MVDPVTILGSTASMWL LLSIICSSHPSNRLVVV<br />
MEGNGLEIIAEENLQRNKSNTQENSSDDSEEKWIAMSFLAVMS<br />
SQEPKVVSSPATENILLQTLAPFMQSEQMID<br />
DGYFTAQVLAALVRHK KNDKTISEIMNSDIVEE<br />
TTINLVGCEESSDTRSLCALAEELSLVVQNPYEATLEVLFENE<br />
ERVRSGSFTKKCIPLL LLVNLLKPYADKVGGIPVAIRLLRRIADNDDLSKLLIAEAGALDALL<br />
AKYLSLSPQDSSTEITVSELLESLFRSSPEITRHKTAISSMKQ<br />
QLIGILHLASRSTRYN YNAARVLCELFSSEHIRDSELAWKALSPLIEMLNTTLESERVAALTT<br />
ALVKLTMGINPPRPDILTSLEGNPLDNNIYKILSLDSSSLESK<br />
KTSAARICRFLFTNEG EGLRTSTSAACCIVSL LISLIRTGKSTAIEAG GMFALDRLLDIKRFVEE<br />
VAEEHDCVNLFFYGYVASENYLISEAAAISCLTKMAKDNTPRK<br />
KMDLIKMGIIEKCISQQLSKSPPSSLCSVIA<br />
ADLFRVLTNVGVIARSQDAIKMVQPLLLILLL<br />
RQDLDFQGQLGGGLQAIANILEKPMVLLESLKIASSTIIMPLI<br />
IPLLESESIAVKNATT TTILLTSLLEMQRFQE EEITTKNLIAPLVKLV VGIRVRNLQEIALMGLL<br />
ERSSVTWPKEVVADTGGIQELSKVIIDDEDPQLPVYLWESAAF<br />
FILCNILRINPEHYYF YFTVTIPVLSKMLFST TAESTVILAIDALIIR RENQDSSSVQEMAESSS<br />
ALDALLDLLRSSHHCEELSARLLELILLRNPKVRETKICQFVL<br />
LTPLSEYILDPDTISEESAKILIAMALGDIS<br />
SQHEGLAKATDSPVACRALISLLEDEPSEEMM<br />
QMVVMRALENFFAMHSRTSRKAMAEAGGGVYWVQEMLRSSNPQ<br />
QVSTQAALIIKSLFSNNHTLQEYVSGEIIKS<br />
SLTNAMEREFWTTTAINVEIVRTLNTILTTFF<br />
PKLRSSEAATAACIPHLIGALKSGEQEEARDSAMDTIYTLRQS<br />
SWTTMPTETARSQAVL VLAADAIPVLQLMMKS SKLKSPAPSSFHERGN NSLLNCLPGSLTVAIKK<br />
RGDNLKRSNAFFCRLIIDNCPTKKTKVVVKRSSSPVWKESFTW<br />
WDFAAPPRGQFLEIVC VCKSNNIFRNKNLGKV VRIPIDKVLSEGSYSG GIFKLNDESKKDNSSDD<br />
RSLEIEIVWSNNQSF<br />
GENE ID: 82291<br />
DYSF | dysfeerl<strong>in</strong>,<br />
limb gird dle muscular dys ystrophy 2B (autosomal<br />
recessive) [Homo sapiens] (Over 10 PubMed d l<strong>in</strong>ks)<br />
Score = 588.2<br />
bits (139), Expect = 3e-08 8, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 34/106 (32%) ), Positives = 56/106 5 (52%), Ga Gaps = 4/106 (3% )<br />
Query 20088<br />
SNAFCRLIIDNCPPTKKTKVVKRSSSPVW<br />
WKESFTWDFAAPP--RRGQFLEIVCKSNNIF<br />
F-RN 2064<br />
S+A+C + K+TKV+K S +PVW W E F WD P + +G L +V K + RN<br />
Sbjct 20 SDAYCSAVFAGV--KKRTKVIKNSVNPVW<br />
WNEGFEWDLKGIPLDQ DQGSELHVVVKDHETM MGRN 78<br />
Query 20655<br />
KNLGKVRIPIDKVVLSEGSYSGIFKLNDE<br />
ESKKDNSSDRSLEIEIIVWS<br />
2110<br />
+ LG+ ++P+ +VVL+<br />
S S F K + SL +++ + ++<br />
Sbjct 79 RFLGEAKVPLREVVLATPSLSASFNAPLL<br />
LDTKKQPTGASLVLQV QVSYT 124<br />
>AT1G52230<br />
MASFATIAAVQQPSAAVKGLGGSSLAGGAKLFIKPSRQSFKTK<br />
KSTRAGAVVAKYGDKS KSVYFDLEDLGNTTGQ QWDVYGSDAPSPYNPL LQSKFFETFAAPFTKRR<br />
GLLLKFLILGGGGSLLTYVSANSTGDVVLPIKRGPQEPPKLGP<br />
PRGKL<br />
No significcant<br />
homologies<br />
>AT1G53720<br />
MSVLIVTSLGDDIVIDLHSDKCPLTCKKNFLKLCKIKYYNGCL<br />
LFHTVQKDFTAQTGDP DPTGTGAGGDSIYKFL LYGEQARFYKDEIHLDLKHSKTGTVAMASGGG<br />
ENLNASQFYFTTLRDDLDYLDGKHTVFFGQIAEGFDTLTRINE<br />
EAYVDPKNRPYKNIRI RIKHTHILDDPFDDPP PQLAEMMPDASPEGKP PKEEVKDDVRLEDDWVV<br />
PMDEELGAQELLEEVIREKAAHSSAVVVLESIGDIPEAEVKPP<br />
PDNVLFVCKLNPVTED EDEDLHTIFSRFGTVV VSADVIRDFKTGDSLCYAFIEFENKESCEQAA<br />
YFKMDNALIDDDRRIHVDFSQSVSKLWWSQFRQKDSQKGKGNG<br />
GCFKCGSTDHIAKDCV CVGGPSSKFIVKDQNR RQHGGGEGYEMVFEGDVHETPKHNSHERERSS<br />
EKIQRRSPHGNNGEGKRQHRDERDDGRRRQHDREDARELERKH<br />
HRERKERESREDEDRR RRRRRRREESRDKESR RRERDEDDHRSHRDYK KERRRERDDRHGREARR<br />
HERRDR<br />
> emb| |CAD97776.1|<br />
Length=492<br />
GENE ID: 885313<br />
PPIL4 | peeptidylprolyl<br />
is somerase (cyclopphil<strong>in</strong>)-like<br />
4<br />
[Homo sapieens]<br />
(10 or feweer<br />
PubMed l<strong>in</strong>ks) )<br />
Score = 3369<br />
bits (946), Expect = 7e-10 02, Method: Comp mpositional matr rix adjust.<br />
Identitiess<br />
= 187/329 (56% %), Positives = 239/329 (72%), Gaps = 15/329 (4%)<br />
Query 1<br />
Sbjct 1<br />
Query 61<br />
Sbjct 61<br />
Query 121<br />
Sbjct 121<br />
Query 181<br />
Sbjct 179<br />
Query 235<br />
Sbjct 232<br />
Query 295<br />
RLYVRGTVLGYKRSSKSNQYPNTSLIQIEG<br />
GVNTQEEVNWYKGKRL RLAYIYKAKT-------K<br />
62<br />
RL+ + GYKR NQ +T+L++IEG GV ++E +Y GKR R AY+YKAK K<br />
RLWSKAIFAGYKRGGLRNQREHTALLKIEG<br />
GVYARDETEFYLGKRC RCAYVYKAKNNTVTPG GGK 63<br />
KNGSHYRCIWGKVTTRPHGNSGVVRSKFTS<br />
SNLPPKSMGARVRVFM FMYPSNI 111<br />
N + R IWGKVTTR<br />
HGNSG+VR+KF SNLP S K++G R+RV + +YPS I<br />
PNKT--RVIWGKVTTRAHGNSGMVRAKFRS<br />
SNLPAKAIGHRIRVML MLYPSRI 110<br />
hypothetical prote<strong>in</strong> p [Homo saapiens]<br />
MSVLIVTSLGDIVIIDLHSDKCPLTCKNFL<br />
LKLCKIKYYNGCLFHT HTVQKDFTAQTGDPTG GTG 60<br />
M+VL+ T+LGD+VIIDL++++<br />
P C NFL LKLCKIKYYN CL H VQ+DF QTGDPTG GTG<br />
MAVLLETTLGDVVIIDLYTEERPRACLNFL<br />
LKLCKIKYYNYCLIHN HNVQRDFIIQTGDPTG GTG 60<br />
AGGDSIYKFLYGEQQARFYKDEIHLDLKHS<br />
SKTGTVAMASGGENLNNASQFYFTLRDDLDY<br />
YLD 120<br />
GG+SI+ LYG+QQA<br />
F++ E +KH K GTV+M + G + + SQF T ++LDY YLD<br />
RGGESIFGQLYGDQQASFFEAEKVPRIKHK<br />
KKKGTVSMVNNGSDQH QHGSQFLITTGENLDY YLD 120<br />
GKHTVFGQIAEGFDDTLTRINEAYVDPKNR<br />
RPYKNIRIKHTHILDD DDPFDDPPQLAEMMPD DAS 180<br />
G HTVFG++ EG D + +INE +VD PY++IRI HT ILDD DD ++PD D S<br />
GVHTVFGEVTEGMDDIIKKINETFVDKDFV<br />
VPYQDIRINHTVILDD DD--PFDDPPDLLIPD DRS 178<br />
PEGKPKEEVKDDVRRLEDDWVPMDEEL---<br />
----GAQELEEVIREKKAAHSSAVVLESIGD<br />
DIP 234<br />
PE P E D R + DEE+ A+E+EE+ EKK<br />
A + A++LE +GD D+P<br />
PE--PTREQLDSGRR-----IGADEEIDDF<br />
FKGRSAEEVEEIKAEKKEAKTQAILLEMVGD<br />
DLP 231<br />
EAEVKPPDNVLFVCCKLNPVTEDEDLHTIF<br />
FSRFGTVVSADVIRDF DFKTGDSLCYAFIEFENK<br />
294<br />
+A++KPP+NVLFVCCKLNPVT<br />
DEDL IF FSRFG + S +VIRD+ D+KTG+SLCYAFIEFE +<br />
DADIKPPENVLFVCCKLNPVTTDEDLEIIF<br />
FSRFGPIRSCEVIRDW DWKTGESLCYAFIEFEKE<br />
291<br />
ESCEQAYFKMDNALLIDDRRIHVDFSQSVS<br />
S 323
E CE+A+FKMDN LIDDRRIHVDFSQSV+<br />
Sbjct 292 EDCEKAFFKMDNVLIDDRRIHVDFSQSVA 320<br />
>AT1G55130<br />
MAIRIRISGTLLLSFLFFSTLHAFYLPGVAPRDFQKGDPLYVKVNKLSSTKTQLPYDFYYLNYCKPPKILNTGENLGEVLRGDRIENSVYTFEMLEDQPC<br />
RVGCRVRVDAESAKNFREKIDYEYRANMILDNLPVAVLRQRKDGIQSTTYEHGYRVGFKGSYEGSKEKKYFIHNHLSFRVMYHRDQESESSRIVGFEVTP<br />
NSVLHEYKEWDENNPQLTTCNKDTKNLIQSNTVPQEVEEGKEIVFTYDVAFKESVIKWASRWDTYLLMNDDQIHWFSIINSLMIVLFLSGMVAMIMMRTL<br />
YKDISNYNQLETQDEAQEETGWKLVHGDVFRTPMNSGLLCVYVGTGVQIFGMTLVTMIFALLGFLSPSNRGGLTTAMVLLWVFMGIFAGYSSSRLHKMFK<br />
GNEWKRITLKTAFMFPGILFAIFFVLNTLIWGERSSGAIPFSTMFALVCLWFGISVPLVFIGSYLGHKKPAIEDPVKTNKIPRQVPEQPWYMKPGFSILI<br />
GGILPFGAVFIELFFILTSIWLNQFYYIFGFLFIVFLILIVTCAEITIVLCYFQLCSEDYNWCWRAYLTSGSSSLYLFLYSVFYFFTKLEISKLVSGVLY<br />
FGYMIIISYSFFVLTGSIGFYACLWFVRKIYSSVKID<br />
GENE ID: 9777 TM9SF4 | transmembrane 9 superfamily prote<strong>in</strong> member 4<br />
[Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.<br />
Identities = 318/645 (49%), Positives = 443/645 (68%), Gaps = 33/645 (5%)<br />
Query 12 LLSFLFFSTLHAFYLPGVAPRDFQKGDPLYVKVNKLSSTKTQLPYDFYYLNYCKPPKILN 71<br />
LL F AFY+PGVAP +F + DP+ +K KL+S++TQLPY++Y L +C+P KI<br />
Sbjct 12 LLLFSLMCETSAFYVPGVAPINFHQNDPVEIKAVKLTSSRTQLPYEYYSLPFCQPSKITY 71<br />
Query 72 TGENLGEVLRGDRIENSVYTFEMLEDQPCRVGCR-----VRVDAESAKNFREKIDYEYRA 126<br />
ENLGEVLRGDRI N+ + M ++ C V C V + E ++ E+I +Y<br />
Sbjct 72 KAENLGEVLRGDRIVNTPFQVLMNSEKKCEVLCSQSNKPVTLTVEQSRLVAERITEDYYV 131<br />
Query 127 NMILDNLPVAVLRQRKDG--------IQSTTYEHGYRVGFKGSYEGSKEKKYFIHNHLSF 178<br />
++I DNLPVA + + +EHGYR+GF + K ++HNHLSF<br />
Sbjct 132 HLIADNLPVATRLELYSNRDSDDKKKEKDVQFEHGYRLGF------TDVNKIYLHNHLSF 185<br />
Query 179 RVMYHRDQESESS----RIVGFEVTPNSVLHEYKEWDENNPQLTTCNKDTKNLIQSNTVP 234<br />
+ YHR+ E R+V FEV P S+ E + DE ++C +N+ P<br />
Sbjct 186 ILYYHREDMEEDQEHTYRVVRFEVIPQSIRLEDLKADEK----SSCTLPEG----TNSSP 237<br />
Query 235 QEVEEGKE--IVFTYDVAFKESVIKWASRWDTYLLMNDDQIHWFSIINSLMIVLFLSGMV 292<br />
QE++ KE + FTY V ++ES IKWASRWDTYL M+D QIHWFSIINS+++V FLSG++<br />
Sbjct 238 QEIDPTKENQLYFTYSVHWEESDIKWASRWDTYLTMSDVQIHWFSIINSVVVVFFLSGIL 297<br />
Query 293 AMIMMRTLYKDISNYNQLETQDEAQEETGWKLVHGDVFRTPMNSGLLCVYVGTGVQIFGM 352<br />
+MI++RTL KDI+NYN+ + ++ EE+GWKLVHGDVFR P +L +G+G+Q+F M<br />
Sbjct 298 SMIIIRTLRKDIANYNKEDDIEDTMEESGWKLVHGDVFRPPQYPMILSSLLGSGIQLFCM 357<br />
Query 353 TLVTMIFALLGFLSPSNRGGLTTAMVLLWVFMGIFAGYSSSRLHKMFKGNEWKRITLKTA 412<br />
L+ + A+LG LSPS+RG L T L++FMG+F G+S+ RL++ KG+ WK+ TA<br />
Sbjct 358 ILIVIFVAMLGMLSPSSRGALMTTACFLFMFMGVFGGFSAGRLYRTLKGHRWKKGAFCTA 417<br />
Query 413 FMFPGILFAIFFVLNTLIWGERSSGAIPFSTMFALVCLWFGISVPLVFIGSYLGHKKPAI 472<br />
++PG++F I FVLN IWG+ SSGA+PF TM AL+C+WFGIS+PLV++G Y G +K<br />
Sbjct 418 TLYPGVVFGICFVLNCFIWGKHSSGAVPFPTMVALLCMWFGISLPLVYLGYYFGFRKQPY 477<br />
Query 473 EDPVKTNKIPRQVPEQPWYMKPGFSILIGGILPFGAVFIELFFILTSIWLNQFYYIFGFL 532<br />
++PV+TN+IPRQ+PEQ WYM IL+ GILPFGA+FIELFFI ++IW NQFYY+FGFL<br />
Sbjct 478 DNPVRTNQIPRQIPEQRWYMNRFVGILMAGILPFGAMFIELFFIFSAIWENQFYYLFGFL 537<br />
Query 533 FIVFLILIVTCAEITIVLCYFQLCSEDYNWCWRAYLTSGSSSLYLFLYSVFYFFTKLEIS 592<br />
F+VF+IL+V+C++I+IV+ YFQLC+EDY W WR +L SG S+ Y+ +Y++FYF KL+I<br />
Sbjct 538 FLVFIILVVSCSQISIVMVYFQLCAEDYRWWWRNFLVSGGSAFYVLVYAIFYFVNKLDIV 597<br />
Query 593 KLVSGVLYFGYMIIISYSFFVLTGSIGFYACLWFVRKIYSSVKID 637<br />
+ + +LYFGY ++ SF++LTG+IGFYA FVRKIY++VKID<br />
Sbjct 598 EFIPSLLYFGYTALMVLSFWLLTGTIGFYAAYMFVRKIYAAVKID 642<br />
>AT1G56190<br />
MASTAATAALSIIKSTGGAAVTRSSRASFGHIPSTSVSARRLGFSAVVDSRFSVHVASKVHSVRGKGARGVITMAKKSVGDLNSVDLKGKKVFVRADLNV<br />
PLDDNQNITDDTRIRAAIPTIKFLIENGAKVILSTHLGRPKGVTPKFSLAPLVPRLSELLGIEVVKADDCIGPEVETLVASLPEGGVLLLENVRFYKEEE<br />
KNEPDFAKKLASLADLYVNDAFGTAHRAHASTEGVTKFLKPSVAGFLLQKELDYLVGAVSNPKRPFAAIVGGSKVSSKIGVIESLLEKCDILLLGGGMIF<br />
TFYKAQGLSVGSSLVEEDKLELATTLLAKAKARGVSLLLPTDVVIADKFAPDANSKIVPASAIPDGWMGLDIGPDSVKTFNEALDTTQTVIWNGPMGVFE<br />
FEKFAKGTEAVANKLAELSKKGVTTIIGGGDSVAAVEKVGVAGVMSHISTGGGASLELLEGKVLPGVVALDEATPVTV<br />
GENE ID: 5230 PGK1 | phosphoglycerate k<strong>in</strong>ase 1 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 381 bits (978), Expect = 2e-105, Method: Compositional matrix adjust.<br />
Identities = 199/409 (48%), Positives = 278/409 (67%), Gaps = 23/409 (5%)<br />
Query 82 LNSVDLKGKKVFVRADLNVPLDDNQNITDDTRIRAAIPTIKFLIENGAK-VILSTHLGRP 140<br />
L+ +D+KGK+V +R D NVP+ +NQ IT++ RI+AA+P+IKF ++NGAK V+L +HLGRP<br />
Sbjct 9 LDKLDVKGKRVVMRVDFNVPMKNNQ-ITNNQRIKAAVPSIKFCLDNGAKSVVLMSHLGRP 67<br />
Query 141 KGVT--PKFSLAPLVPRLSELLGIEVVKADDCIGPEVETLVASLPEGGVLLLENVRFYKE 198<br />
GV K+SL P+ L LLG +V+ DC+GPEVE A+ G V+LLEN+RF+ E<br />
Sbjct 68 DGVPMPDKYSLEPVAVELKSLLGKDVLFLKDCVGPEVEKACANPAAGSVILLENLRFHVE 127<br />
Query 199 EE-----------KNEPD----FAKKLASLADLYVNDAFGTAHRAHASTEGVTKFLKPSV 243<br />
EE K EP F L+ L D+YVNDAFGTAHRAH+S GV L<br />
Sbjct 128 EEGKGKDASGNKVKAEPAKIEAFRASLSKLGDVYVNDAFGTAHRAHSSMVGVN--LPQKA 185<br />
Query 244 AGFLLQKELDYLVGAVSNPKRPFAAIVGGSKVSSKIGVIESLLEKCDILLLGGGMIFTFY 303<br />
GFL++KEL+Y A+ +P+RPF AI+GG+KV+ KI +I ++L+K + +++GGGM FTF<br />
Sbjct 186 GGFLMKKELNYFAKALESPERPFLAILGGAKVADKIQLINNMLDKVNEMIIGGGMAFTFL 245<br />
Query 304 KA-QGLSVGSSLVEEDKLELATTLLAKAKARGVSLLLPTDVVIADKFAPDANS-KIVPAS 361<br />
K + +G+SL +E+ ++ L++KA+ GV + LP D V ADKF +A + + AS<br />
Sbjct 246 KVLNNMEIGTSLFDEEGAKIVKDLMSKAEKNGVKITLPVDFVTADKFDENAKTGQATVAS 305<br />
Query 362 AIPDGWMGLDIGPDSVKTFNEALDTTQTVIWNGPMGVFEFEKFAKGTEAVANKLAELSKK 421<br />
IP GWMGLD GP+S K + EA+ + ++WNGP+GVFE+E FA+GT+A+ +++ + + +<br />
Sbjct 306 GIPAGWMGLDCGPESSKKYAEAVTRAKQIVWNGPVGVFEWEAFARGTKALMDEVVKATSR 365<br />
Query 422 GVTTIIGGGDSVAAVEKVGVAGVMSHISTGGGASLELLEGKVLPGVVAL 470<br />
G TIIGGGD+ K +SH+STGGGASLELLEGKVLPGV AL<br />
Sbjct 366 GCITIIGGGDTATCCAKWNTEDKVSHVSTGGGASLELLEGKVLPGVDAL 414
AT1G63660<br />
METPTMKPDTVLILDYGSQYTHLITRRIRSLNVFSLVISGTSSLKSITSYNPRVVILSGGPHSVHALDAPSFPEGFIEWAESNGVSVLGICYGLQLIVQK<br />
LGGVVVEGESKEYGKMEIEVKGKSEIFGSESGGEKQMVWMSHGDEAVKLPEGFEVVAQSAQGAVAALESRKKKIYGLQYHPEVTHSPKGMETLRHFLFDV<br />
CGVSADWKMEDLMEEEIKVINKTVASDEHVICALSGGVDSTVAATLVHKAIGDRLHCIFVDNGLLRYKEQERVMDTFERDLHLPVTCVDASERFLSELKG<br />
VVDPETKRKIIGREFINIFDQFAQELEKKHGKKPAFLVQGTLYPDVIESCPPPGTDRTHSHTIKSHHNVGGLPKDMKLKLIEPLKLLFKDEVRELGRILN<br />
VPVGFLKRHPFPGPGLAVRVLGDVTQGNALEVLRQVDEIFIQSIRDAGLYDSIWQAFAVFLPVRSVGVQGDKRTHSHVVALRAVTSQDGMTADWFNFEHK<br />
FLDDVSRKICNSVQGVNRVVLDITSKPPSTIEWE<br />
GENE ID: 8833 GMPS | guan<strong>in</strong>e monphosphate synthetase [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 263 bits (673), Expect = 4e-70, Method: Compositional matrix adjust.<br />
Identities = 187/570 (32%), Positives = 300/570 (52%), Gaps = 71/570 (12%)<br />
Query 11 VLILDYGSQYTHLITRRIRSLNVFSLVISGTSSLKSITSYNPRVVILSGGPHSVHALDAP 70<br />
V+ILD G+QY +I RR+R L V S + + +I R +I+SGGP+SV+A DAP<br />
Sbjct 28 VVILDAGAQYGKVIDRRVRELFVQSEIFPLETPAFAIKEQGFRAIIISGGPNSVYAEDAP 87<br />
Query 71 SFPEGFIEWAESNGVSVLGICYGLQLIVQKLGGVVVEGESKEYGKMEIEVKGKSEIFGSE 130<br />
F + G VLGICYG+Q++ + GG V + +E G I V +F<br />
Sbjct 88 WFDPAIF----TIGKPVLGICYGMQMMNKVFGGTVHKKSVREDGVFNISVDNTCSLFRGL 143<br />
Query 131 SGGEKQMVWMSHGDEAVKLPEGFEVVAQSAQGAVAALESRKKKIYGLQYHPEVTHSPKGM 190<br />
++++V ++HGD K+ +GF+VVA+S VA + + KK+YG Q+HPEV + G<br />
Sbjct 144 Q--KEEVVLLTHGDSVDKVADGFKVVARSGN-IVAGIANESKKLYGAQFHPEVGLTENGK 200<br />
Query 191 ETLRHFLFDVCGVSADWKMEDLMEEEIKVINKTVASDEHVICALSGGVDSTVAATLVHKA 250<br />
L++FL+D+ G S + +++ E I+ I + V + + V+ LSGGVDSTV L+++A<br />
Sbjct 201 VILKNFLYDIAGCSGTFTVQNRELECIREIKERVGTSK-VLVLLSGGVDSTVCTALLNRA 259<br />
Query 251 IG-DRLHCIFVDNGLLRYKEQERVMDTFERDLHLPVTCVDASERFLS------------- 296<br />
+ +++ + +DNG +R +E + V + ++ L + V ++A+ F +<br />
Sbjct 260 LNQEQVIAVHIDNGFMRKRESQSVEEALKK-LGIQVKVINAAHSFYNGTTTLPISDEDRT 318<br />
Query 297 -------ELKGVVDPETKRKIIGREFINIFDQFAQELEKKHGKKPAFLVQGTLYPDVIES 349<br />
L PE KRKIIG F+ I ++ E+ K + FL QGTL PD+IES<br />
Sbjct 319 PRKRISKTLNMTTSPEEKRKIIGDTFVKIANEVIGEMNLK--PEEVFLAQGTLRPDLIES 376<br />
Query 350 CPPPGTDRTHSHTIKSHHNVGGLPKDMKL--KLIEPLKLLFKDEVRELGRILNVPVGFLK 407<br />
+ + + IK+HHN L + ++ K+IEPLK KDEVR LGR L +P +<br />
Sbjct 377 ASLVASGK--AELIKTHHNDTELIRKLREEGKVIEPLKDFHKDEVRILGRELGLPEELVS 434<br />
Query 408 RHPFPGPGLAVRVL--------GDVTQGNALEVLRQVDEI---------FIQSIRDAGLY 450<br />
RHPFPGPGLA+RV+ D + N +L+ V + +Q ++<br />
Sbjct 435 RHPFPGPGLAIRVICAEEPYICKDFPETN--NILKIVADFSASVKKPHTLLQRVKACTTE 492<br />
Query 451 D---------SIWQAFAVFLPVRSVGVQGDKRTHSHVVALRAVTSQDGMTADWFNFEHKF 501<br />
+ S+ A LP+++VGVQGD R++S+V ++S+D DW + F<br />
Sbjct 493 EDQEKLMQITSLHSLNAFLLPIKTVGVQGDCRSYSYVC---GISSKD--EPDWESL--IF 545<br />
Query 502 LDDVSRKICNSVQGVNRVVLDITSKPPSTI 531<br />
L + ++C++V V + +PP+ +<br />
Sbjct 546 LARLIPRMCHNVNRVVYIFGPPVKEPPTDV 575<br />
Score = 40.0 bits (92), Expect = 0.010, Method: Compositional matrix adjust.<br />
Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 6/110 (5%)<br />
Query 430 LEVLRQVDEIFIQSIRDAGLYDSIWQAFAVFLPVR--SVGVQGDKRTHSHVVALRAVTSQ 487<br />
L LRQ D +R++G I Q + P+ +Q VV +R +<br />
Sbjct 585 LSTLRQADFEAHNILRESGYAGKISQMPVILTPLHFDRDPLQKQPSCQRSVV-IRTFITS 643<br />
Query 488 DGMTADWFNFEHKFLDDVSRKICNSVQ---GVNRVVLDITSKPPSTIEWE 534<br />
D MT ++ +V K+ ++ G++R++ D+TSKPP T EWE<br />
Sbjct 644 DFMTGIPATPGNEIPVEVVLKMVTEIKKIPGISRIMYDLTSKPPGTTEWE 693<br />
>AT1G67090<br />
MASSMLSSATMVASPAQATMVAPFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQVWPPIGKKKFETLSYLPDLTDSELAKEVDYLIRNKWIPCVEFE<br />
LEHGFVYREHGNSPGYYDGRYWTMWKLPLFGCTDSAQVLKEVEECKKEYPNAFIRIIGFDNTRQVQCISFIAYKPPSFTG<br />
GENE ID: 84284 C1orf57 | chromosome 1 open read<strong>in</strong>g frame 57 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 30.0 bits (66), Expect = 8.0, Method: Compositional matrix adjust.<br />
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 9/86 (10%)<br />
Query 22 APFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQVWPPIGKKKFETLSYLPDLTDSE- 80<br />
P +G + R+ D+ +++ G ++ + + PP GK++ Y+ DLT E<br />
Sbjct 31 VPVDGFYTEEVRQGGRRIGFDVVTLSGTRGPLSRVGLEPPPGKRECRVGQYVVDLTSFEQ 90<br />
Query 81 ----LAKEVDYLIRNKWIP----CVE 98<br />
+ + V RN +P CV+<br />
Sbjct 91 LALPVLRNVTKENRNHLLPDIVTCVQ 116<br />
>AT1g73430<br />
1 MATKAASSSS LPKSGAISKG YNFASTWEQS APLTEQQQAA IVSLSHAVAE<br />
51 RPFPANLVHE HVHRPENGLS VSVEDTHLGD SGAIEAVLVN TNQFYKWFTD<br />
101 LESAMKSETE EKYRHYVSTL TERIQTCDNI LHQVDETLDL FNELQLQHQG<br />
151 VTTKTKTLHD ACDRLLMEKQ KLMEFAEALR SKLNYFDELE NVSSNFYSPN<br />
201 MNVSNSNFLP LLKRLDECIS YIEDNPQYAE SSVYLLKFRQ LQSRALGMIR<br />
251 TYILAVLKTA ASQVQAAFRG TGGNKTSVSE GVEASVIYVR FKAAANELKP
301 VLEEIESSRSA<br />
RKEYVQILAE CHRLYCEQRL SLVK KGIVHQR VSDFAKKE KEAL<br />
351 PSLTRSGGCAY<br />
LMQVCHMEHQ LFTHFFPASS EEVS SSLAPLV DPLSTYLYYDI<br />
401 LRPKLIHHEAN<br />
IDLLCELVHI LKVEVLGDQS ARQS SEPLAGL RPTLQRILLAD<br />
451 VNERLTFFRAR<br />
TYIRDEIANY TPSDEDLDYP AKLE EGSPNTT SETDLRDD DDEN<br />
501 ADVFKTWWYPP<br />
LEKTLSCLSK LYRCLEQAVF TGLA AQEAVEV CSLSIQKA KASK<br />
551 LIIKRSTTTMD<br />
GQLFLIKHLL ILREQIAPFD IEFS SVTHKEL DFSHLLEHHLR<br />
601 RILRGQAASLF<br />
DWSRSTSLAR TLSPRVLESQ IDAK KKELEKC LKTTCEEFFIM<br />
651 SVTKLVVVDPM<br />
LSFVTKVTAI KVALSSGTQN HKVD DSVMAKP LKEQAFAT ATPD<br />
701 KVVELVQQKVY<br />
AAIQQELLPI LAKMKLYLQN PSTR RTILFKP IKTNIVEAAHT<br />
751 QVESLLKKAEY<br />
SAEEQANINM ISIQDLQTQL DNFL L<br />
> ref| |NP_113619.1|<br />
Score = 4462<br />
bits (1189), , Expect = 8e-1 128, Method: Commpositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 286/813 (35% %), Positives = 439/813 (53%), Gaps = 80/813 (9%)<br />
Query 27<br />
Sbjct 25<br />
Query 77<br />
Sbjct 85<br />
Query 131<br />
Sbjct 145<br />
Query 191<br />
Sbjct 205<br />
Query 251<br />
Sbjct 265<br />
Query 310<br />
Sbjct 319<br />
Query 367<br />
Sbjct 379<br />
Query 427<br />
Sbjct 436<br />
Query 484<br />
Sbjct 496<br />
Query 509<br />
Sbjct 556<br />
Query 569<br />
Sbjct 616<br />
Query 624<br />
Sbjct 676<br />
Query 684<br />
Sbjct 735<br />
Query 744<br />
Sbjct 788<br />
>AT1G73390<br />
MGCFASRPNDTTGGNRRKPTSIGDVSVVYVPGLRIPKPVEFSQ<br />
QSLGDQLPKTLVERLT LTALRTRIVVMANQEG GPTITRTRRKTQHGGSTLADLHHALEDYIPVV<br />
LLGLTKDGSHLLQFKVQFNWVNQEDEEEEETAMSNVWYEILSV<br />
VLHLMAMLQMSQANLL LLLLPRGSSDGYHPKISEENRRASIDIFLKA<br />
AAGYLDCAVKHVLPHFF<br />
STEQRRSLPIDDLAEGALRALCLQALGGQGVDIQLGMAIDSAK<br />
KATLAVKRRLSCEMVK VKYWQQAQDNLMNLPL LANGWGEKHMLFVKWK KYVEAKAAAYYYHGLII<br />
LDEGNTEKSHGGMAVAALQAADECLKEESKKASEAFNTSSPTS<br />
SRTPSLFGTMKYLSEK EKIPKETSSKVRINRD DLYSYEKIMETAPTLP PDFALALKPDEYQLPSS<br />
VDASWSEASLRRTKNTSNHI<br />
> gb|AAAF24980.1|AF1500882_1<br />
volta age-gated sodiumm<br />
channel alpha subunit, alternate<br />
splice<br />
variant SCNN12A-s<br />
[Homo sappiens]<br />
Length=14444<br />
Score = 311.2<br />
bits (69), Expect = 3.6, Method: M Composittional<br />
matrix adjust. a<br />
Identitiess<br />
= 30/147 (20%) ), Positives = 63/147 6 (42%), Ga Gaps = 19/147 (12%)<br />
Query 65<br />
Sbjct 735<br />
Query 125<br />
Sbjct 780<br />
Query 185<br />
Sbjct 838<br />
WEQ----SAPLTEQQQQAAIVSLSHAVAER<br />
RPFPANLVHEHV-------HRPENGLSVSVEDT<br />
76<br />
W++ +APLT++ +Q +++ L A P PA L E + P SV E T<br />
WDRRPDTTAPLTDRRQTDSVLELKAAAENL<br />
LPVPAELPIEDLCSLTTSQSLPIELTSVVPEST<br />
84<br />
H------LGDSGAIIEAVLVNTNQFYKWFT<br />
TDLESAMKSETEEKYR YRHYVSTLTERIQTCD DNI 130<br />
G E + QF+ WF L++ M + KYR YR L+ + CD D I<br />
EDILLKGFTSLGMEEEERIETAQQFFSWFA<br />
AKLQTQMDQDEGTKYR YRQMRDYLSGFQEQCD DAI 144<br />
LHQVDETLDLFNELLQLQHQGVTTKTKTLH<br />
HDACDRLLMEKQKLME MEFAEALRSKLNYFDELE<br />
190<br />
L+ V+ L LLQ<br />
Q+ V+ KT TLH H+AC++LL E+ +L++ + AE ++ KL+YF+ELE<br />
LNDVNSALQHLESLLQKQYLFVSNKTGTLH<br />
HEACEQLLKEQSELVD VDLAENIQQKLSYFNELE<br />
204<br />
NVSSNFYSPNMNVSSNSNFLPLLKRLDECI<br />
ISYIEDNPQYAESSVY VYLLKFRQLQSRALGM MIR 250<br />
+++ SP ++V+ ++ F+P+L +LD+CI I+YI +P + + +YYLLKF+Q<br />
S+AL + ++<br />
TINTKLNSPTLSVNNSDGFIPMLAKLDDCI<br />
ITYISSHPNFKDYPIYYLLKFKQCLSKALHL<br />
LMK 264<br />
TYILAVLKTAASQVVQAAFRGTGGNKTSVS<br />
SEGVEA-SVIYVRFKA KAAANELKPVLEEIESRS<br />
309<br />
TY + L+T SQ+ + + +SV A ++ YV+F+AAAA<br />
+++ ++E+IE RS<br />
TYTVNTLQTLTSQLL------LKRDPSSVP<br />
PNADNAFTLFYVKFRA RAAAPKVRTLIEQIEL LRS 318<br />
AR-KEYVQILAECHHRLYCEQRLSLVKGIV<br />
VHQRVSDFAKKEALP---SLTRSGCAYLMQV<br />
VCH 366<br />
+ EY Q+L + HH+<br />
Y +QR L+ + V++ + +L RSGCA+++ VC V<br />
EKIPEYQQLLNDIHHQCYLDQRELLLGPSI<br />
IACTVAELTSQNNRDH DHCALVRSGCAFMVHV VCQ 378<br />
MEHQLFTHFFPASSSEEVSSLAPLVDPLST<br />
TYLYDILRPKLIHEAN ANIDLLCELVHILKVEVL<br />
426<br />
EHQL+ FF + ++ S L L++ L LYD+ RP +IH + +++ L EL ILK EVL<br />
DEHQLYNEFF---TTKPTSKLDELLEKLCV<br />
VSLYDVFRPLIIHVIHHLETLSELCGILKNEVL<br />
435<br />
GDQSARQSEPLAGLLRPTLQRILADVNERL<br />
LTFRARTYIRDEIANY NYTPSDEDLDYPAKL---<br />
483<br />
D +E L ++++L DV ERL L +R YI+ +I Y P+ DL YP KL<br />
EDHVQNNAEQLGAFFAAGVKQMLEDVQERL<br />
LVYRTHIYIQTDITGY GYKPAPGDLAYPDKLV VMM 495<br />
---------------------------EGS<br />
SPNTTSETDLRDDEN- N---------ADVFKTWY<br />
508<br />
EG N+ +++ + N AD+ WY<br />
EQIAQSLKDEQKKVVPSEASFSDVHLEEGE<br />
ESNSLTKSGSTESLNP NPRPQTTISPADLHGM MWY 555<br />
PPLEKTLSCLSKLYYRCLEQAVFTGLAQEA<br />
AVEVCSLSIQKASKLIIIKRSTTMDGQLFLIKH<br />
568<br />
P + +TL CLSKLYYRC+++AVF<br />
GL+QEA A+ C S+ AS+ I K T +DGQLFLIKH<br />
PTVRRTLVCLSKLYYRCIDRAVFQGLSQEA<br />
ALSACIQSLLGASESIISKNKTQIDGQLFLIKH<br />
615<br />
LLILREQIAPFDIEEFSVTHKELDFSHLLE<br />
EHLRRILRGQA--SLFFDWSRSTSLARTL---S<br />
623<br />
LLILREQIAPF EEF++<br />
LD + +IL F + + +L L +<br />
LLILREQIAPFHTEEFTIKEISLDLKKTRD<br />
DAAFKILNPMTVPRFF FFRLNSNNALIEFLLEGT<br />
675<br />
PRVLESQIDAKKELLEKCLKTTCEEFIMSV<br />
VTKLVVDPMLSFVTKV KVTAIKVALSSGTQNH HKV 683<br />
P + E +D+KK++ +++ LK+ CE+FI TKL V+ + F+TKV KV+A+K S G +<br />
PEIREHYLDSKKDVVDRHLKSACEQFIQQQ<br />
QTKLFVEQLEEFMTKV KVSALKTMASQGGPKY YT- 734<br />
DSVMAKPLKEQAFAATPDKVVELVQKVYAA<br />
AIQQELLPILAKMKLYYLQNPSTRTILFKPIKT<br />
743<br />
L +Q +AA<br />
P KV +L Y I+ +L L M LYYL<br />
N T ILFKP+ +<br />
-------LSQQPWAAQPAKVSDLAATAYKT<br />
TIKTKLPVTLRSMSLYYLSNKDTEFILFKPV<br />
VRN 787<br />
NIVEAHTQVESLLKKAEYSAEEQANINMIS<br />
SIQDL 776<br />
NI + + +LLKK<br />
E+S E+ I S++ S L<br />
NIQQVFQKFHALLKKEEFSPEDIQIIACPS<br />
SMEQL 820<br />
MANQEGPTITRTRRRKTQHGGSTLADLHHA<br />
ALEDYIPVLLGLTKDG DGSHLQFKVQFNWVNQ QED 124<br />
+ N GPT++ R H G D H+ + +L G +<br />
W ++<br />
LCNPTGPTVSCLRHH--WHMG----DFWHS<br />
SFLVVFRILCGEWIENNM---------WECM<br />
MQE 779<br />
EEEETAMSNVWYEIILSVLHLMAMLQMSQA<br />
ANLLLLPRGSSDGYHP HPKISEENRRASIDIF FLK 184<br />
+++ + + + +++V+ + +L + A LLL S++ + + E R+ + + L<br />
ANASSSLCVIVFILLITVIGKLVVLNLFIA<br />
A--LLLNSFSNEERNG NGNLEGEARKTKVQLA ALD 837<br />
AAGYLDCAVKHVLPPHFSTE--QRRSLP<br />
C V+H L HF + ++++LP<br />
RFRRAFCFVRHTLEEHFCHKWCRKQNLP<br />
conserved d oligomeric Gollgi<br />
complex subu unit 3 [Homo sap piens]<br />
209<br />
864<br />
>AT1G73430<br />
MATKAASSSSLLPKSGAISKGYNFASTTWEQSAPLTEQQQAAI<br />
IVSLSHAVAERPFPAN ANLVHEHVHRPENGLS SVSVEDTHLGDSGAIEAVLVNTNQFYKWFTDD
LESAMKSETEEKYRHYVSTLTERIQTCDNILHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALRSKLNYFDELENVSSNFYSPN<br />
MNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQLQSRALGMIRTYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEASVIYVRFKAAANELKP<br />
VLEEIESRSARKEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALPSLTRSGCAYLMQVCHMEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDI<br />
LRPKLIHEANIDLLCELVHILKVEVLGDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYPAKLEGSPNTTSETDLRDDEN<br />
ADVFKTWYPPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEVCSLSIQKASKLIIKRSTTMDGQLFLIKHLLILREQIAPFDIEFSVTHKELDFSHLLEHLR<br />
RILRGQASLFDWSRSTSLARTLSPRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPMLSFVTKVTAIKVALSSGTQNHKVDSVMAKPLKEQAFATPD<br />
KVVELVQKVYAAIQQELLPILAKMKLYLQNPSTRTILFKPIKTNIVEAHTQVESLLKAEYSAEEQANINMISIQDLQTQLDNFL<br />
GENE ID: 83548 COG3 | component <strong>of</strong> oligomeric golgi complex 3 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 462 bits (1189), Expect = 6e-130, Method: Compositional matrix adjust.<br />
Identities = 286/813 (35%), Positives = 439/813 (53%), Gaps = 80/813 (9%)<br />
Query 27 WEQ----SAPLTEQQQAAIVSLSHAVAERPFPANLVHEHV------HRPENGLSVSVEDT 76<br />
W++ +APLT++Q +++ L A P PA L E + P SV E T<br />
Sbjct 25 WDRRPDTTAPLTDRQTDSVLELKAAAENLPVPAELPIEDLCSLTSQSLPIELTSVVPEST 84<br />
Query 77 H------LGDSGAIEAVLVNTNQFYKWFTDLESAMKSETEEKYRHYVSTLTERIQTCDNI 130<br />
G E + QF+ WF L++ M + KYR L+ + CD I<br />
Sbjct 85 EDILLKGFTSLGMEEERIETAQQFFSWFAKLQTQMDQDEGTKYRQMRDYLSGFQEQCDAI 144<br />
Query 131 LHQVDETLDLFNELQLQHQGVTTKTKTLHDACDRLLMEKQKLMEFAEALRSKLNYFDELE 190<br />
L+ V+ L LQ Q+ V+ KT TLH+AC++LL E+ +L++ AE ++ KL+YF+ELE<br />
Sbjct 145 LNDVNSALQHLESLQKQYLFVSNKTGTLHEACEQLLKEQSELVDLAENIQQKLSYFNELE 204<br />
Query 191 NVSSNFYSPNMNVSNSNFLPLLKRLDECISYIEDNPQYAESSVYLLKFRQLQSRALGMIR 250<br />
+++ SP ++V++ F+P+L +LD+CI+YI +P + + +YLLKF+Q S+AL +++<br />
Sbjct 205 TINTKLNSPTLSVNSDGFIPMLAKLDDCITYISSHPNFKDYPIYLLKFKQCLSKALHLMK 264<br />
Query 251 TYILAVLKTAASQVQAAFRGTGGNKTSVSEGVEA-SVIYVRFKAAANELKPVLEEIESRS 309<br />
TY + L+T SQ+ + +SV A ++ YV+F+AAA +++ ++E+IE RS<br />
Sbjct 265 TYTVNTLQTLTSQL------LKRDPSSVPNADNAFTLFYVKFRAAAPKVRTLIEQIELRS 318<br />
Query 310 AR-KEYVQILAECHRLYCEQRLSLVKGIVHQRVSDFAKKEALP--SLTRSGCAYLMQVCH 366<br />
+ EY Q+L + H+ Y +QR L+ + V++ + +L RSGCA+++ VC<br />
Sbjct 319 EKIPEYQQLLNDIHQCYLDQRELLLGPSIACTVAELTSQNNRDHCALVRSGCAFMVHVCQ 378<br />
Query 367 MEHQLFTHFFPASSEEVSSLAPLVDPLSTYLYDILRPKLIHEANIDLLCELVHILKVEVL 426<br />
EHQL+ FF ++ S L L++ L LYD+ RP +IH +++ L EL ILK EVL<br />
Sbjct 379 DEHQLYNEFF---TKPTSKLDELLEKLCVSLYDVFRPLIIHVIHLETLSELCGILKNEVL 435<br />
Query 427 GDQSARQSEPLAGLRPTLQRILADVNERLTFRARTYIRDEIANYTPSDEDLDYPAKL--- 483<br />
D +E L ++++L DV ERL +R YI+ +I Y P+ DL YP KL<br />
Sbjct 436 EDHVQNNAEQLGAFAAGVKQMLEDVQERLVYRTHIYIQTDITGYKPAPGDLAYPDKLVMM 495<br />
Query 484 --------------------------EGSPNTTSETDLRDDEN---------ADVFKTWY 508<br />
EG N+ +++ + N AD+ WY<br />
Sbjct 496 EQIAQSLKDEQKKVPSEASFSDVHLEEGESNSLTKSGSTESLNPRPQTTISPADLHGMWY 555<br />
Query 509 PPLEKTLSCLSKLYRCLEQAVFTGLAQEAVEVCSLSIQKASKLIIKRSTTMDGQLFLIKH 568<br />
P + +TL CLSKLYRC+++AVF GL+QEA+ C S+ AS+ I K T +DGQLFLIKH<br />
Sbjct 556 PTVRRTLVCLSKLYRCIDRAVFQGLSQEALSACIQSLLGASESISKNKTQIDGQLFLIKH 615<br />
Query 569 LLILREQIAPFDIEFSVTHKELDFSHLLEHLRRILRGQA--SLFDWSRSTSLARTL---S 623<br />
LLILREQIAPF EF++ LD + +IL F + + +L L +<br />
Sbjct 616 LLILREQIAPFHTEFTIKEISLDLKKTRDAAFKILNPMTVPRFFRLNSNNALIEFLLEGT 675<br />
Query 624 PRVLESQIDAKKELEKCLKTTCEEFIMSVTKLVVDPMLSFVTKVTAIKVALSSGTQNHKV 683<br />
P + E +D+KK++++ LK+ CE+FI TKL V+ + F+TKV+A+K S G +<br />
Sbjct 676 PEIREHYLDSKKDVDRHLKSACEQFIQQQTKLFVEQLEEFMTKVSALKTMASQGGPKYT- 734<br />
Query 684 DSVMAKPLKEQAFATPDKVVELVQKVYAAIQQELLPILAKMKLYLQNPSTRTILFKPIKT 743<br />
L +Q +A P KV +L Y I+ +L L M LYL N T ILFKP++<br />
Sbjct 735 -------LSQQPWAQPAKVSDLAATAYKTIKTKLPVTLRSMSLYLSNKDTEFILFKPVRN 787<br />
Query 744 NIVEAHTQVESLLKAEYSAEEQANINMISIQDL 776<br />
NI + + +LLK E+S E+ I S++ L<br />
Sbjct 788 NIQQVFQKFHALLKEEFSPEDIQIIACPSMEQL 820<br />
>AT1G76180<br />
MAEEIKNVPEQEVPKVATEESSAEVTDRGLFDFLGKKKDETKPEETPIASEFEQKVHISEPEPEVKHESLLEKLHRSDSSSSSSSEEEGSDGEKRKKKKE<br />
KKKPTTEVEVKEEEKKGFMEKLKEKLPGHKKPEDGSAVAAAPVVVPPPVEEAHPVEKKGILEKIKEKLPGYHPKTTVEEEKKDKE<br />
No significant homologies<br />
>AT1G78060<br />
MAKQLLLLLLLFIVHGVESAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYEWWSEALHGVAYAGPGIRFN<br />
GTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKTLSN<br />
HLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIY<br />
DAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVL<br />
LKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAIDQAVAIAKNADHVVLIMGLDQTQEKEDF<br />
DRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMRSA<br />
TGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGG<br />
EDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSELPLIVNV<br />
GENE ID: 84503 ZNF527 | z<strong>in</strong>c f<strong>in</strong>ger prote<strong>in</strong> 527 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 38.1 bits (87), Expect = 0.035, Method: Compositional matrix adjust.<br />
Identities = 35/155 (22%), Positives = 62/155 (40%), Gaps = 22/155 (14%)<br />
Query 231 VSLADLAETYQPPFKKCIEEGRASGIMCAYNRVNGIPSCADP-------------NLLTR 277<br />
++L T + PFK C E G+ G N+ I + P + L R<br />
Sbjct 377 LTLHQRIHTGEKPFK-CSECGKTFGYRSHLNQHQRIHTGEKPYECIKCGKFFRTDSQLNR 435<br />
Query 278 TARGQWAFRGYITSDC-----DAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKH 332<br />
R R + S C DA+ +I+ + +A + + K G +CGSYL +H<br />
Sbjct 436 HHRIHTGERPFECSKCGKAFSDALVLIHHKRSHAG---EKPYECNKCGKAFSCGSYLNQH 492<br />
Query 333 TKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGD 367<br />
+ +K ++ +A + S+R+ + G+<br />
Sbjct 493 QRIHTGEKPYECSECGKAFHQILSLRLHQRIHAGE 527
AT2G21330<br />
MASSTATMLKASPVKSDWVKGQSLLLRQPSSVSAIRSHVAPSALTVRAASAYADELVKTAKTIASPGHGIMAMDESNATCGKRLASIGLENTEANRQAYR<br />
TLLVSAPGLGQYISGAILFEETLYQSTTDGKKMVDVLVEQNIVPGIKVDKGLVPLVGSYDESWCQGLDGLASRTAAYYQQGARFAKWRTVVSIPNGPSAL<br />
AVKEAAWGLARYAAISQDSGLVPIVEPEIMLDGEHGIDRTYDVAEKVWAEVFFYLAQNNVMFEGILLKPSMVTPGAEATDRATPEQVASYTLKLLRNRIP<br />
PAVPGIMFLSGGQSELEATLNLNAMNQAPNPWHVSFSYARALQNTCLKTWGGKEENVKAAQDILLARAKANSLAQLGKYTGEGESEEAKEGMFVKGYT<br />
GENE ID: 226 ALDOA | aldolase A, fructose-bisphosphate [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 343 bits (880), Expect = 4e-94, Method: Compositional matrix adjust.<br />
Identities = 183/348 (52%), Positives = 234/348 (67%), Gaps = 5/348 (1%)<br />
Query 55 ELVKTAKTIASPGHGIMAMDESNATCGKRLASIGLENTEANRQAYRTLLVSAPG-LGQYI 113<br />
EL A I +PG GI+A DES + KRL SIG ENTE NR+ YR LL++A + I<br />
Sbjct 69 ELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQLLLTADDRVNPCI 128<br />
Query 114 SGAILFEETLYQSTTDGKKMVDVLVEQNIVPGIKVDKGLVPLVGSYDESWCQGLDGLASR 173<br />
G ILF ETLYQ DG+ V+ + V GIKVDKG+VPL G+ E+ QGLDGL+ R<br />
Sbjct 129 GGVILFHETLYQKADDGRPFPQVIKSKGGVVGIKVDKGVVPLAGTNGETTTQGLDGLSER 188<br />
Query 174 TAAYYQQGARFAKWRTVVSI-PNGPSALAVKEAAWGLARYAAISQDSGLVPIVEPEIMLD 232<br />
A Y + GA FAKWR V+ I + PSALA+ E A LARYA+I Q +G+VPIVEPEI+ D<br />
Sbjct 189 CAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLARYASICQQNGIVPIVEPEILPD 248<br />
Query 233 GEHGIDRTYDVAEKVWAEVFFYLAQNNVMFEGILLKPSMVTPGAEATDRATPEQVASYTL 292<br />
G+H + R V EKV A V+ L+ +++ EG LLKP+MVTPG T + + E++A T+<br />
Sbjct 249 GDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACTQKFSHEEIAMATV 308<br />
Query 293 KLLRNRIPPAVPGIMFLSGGQSELEATLNLNAMNQAP--NPWHVSFSYARALQNTCLKTW 350<br />
LR +PPAV GI FLSGGQSE EA++NLNA+N+ P PW ++FSY RALQ + LK W<br />
Sbjct 309 TALRRTVPPAVTGITFLSGGQSEEEASINLNAINKCPLLKPWALTFSYGRALQASALKAW 368<br />
Query 351 GGKEENVKAAQDILLARAKANSLAQLGKYTGEGES-EEAKEGMFVKGY 397<br />
GGK+EN+KAAQ+ + RA ANSLA GKYT G++ A E +FV +<br />
Sbjct 369 GGKKENLKAAQEEYVKRALANSLACQGKYTPSGQAGAAASESLFVSNH 416<br />
>AT2G24500<br />
MSGLACNSCNKDFEDDAEQKFHYKSEWHRYNLKRKIAGVPGVTEALFEARQAAIAQEKVKAVEAPMLYSCGICNKGYRSSKAHEQHLKSKSHVLKASTST<br />
GEEDKAIIKQLPPRRVEKNNTAQLKGSIEEEESEDEWIEVDSDEDLDAEMNEDGEEEDMDEDGIEFELDPACCLMCDKKHKTIEKCMVHMHKFHGFFIPD<br />
IEYLKDPKGFLTYLGLKVKRDFVCLYCNELCHPFSSLEAVRKHMDAKGHCKVHYGDGGDEEDAELEEFYDYSSSYVNGDENQMVVSGESVNTVELFGGSE<br />
LVITKRTDNKVTSRTLGSREFMRYYKQKPAPSSQKHIVNSLTSRYKMMGLATVQSKEAIVRMKVMREMNKRGAKSSVRLGMKSNVIRNLPNNVTY<br />
GENE ID: 90441 ZNF622 | z<strong>in</strong>c f<strong>in</strong>ger prote<strong>in</strong> 622 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 124 bits (312), Expect = 3e-28, Method: Compositional matrix adjust.<br />
Identities = 95/269 (35%), Positives = 137/269 (50%), Gaps = 31/269 (11%)<br />
Query 140 VDSDEDL---DAEMNEDGEEEDMDED----GIEFELDPAC-CLMCDKKHKTIEKCMVHMH 191<br />
+DSDE+L D E +D E+D +E+ G P CL C ++ K + HM<br />
Sbjct 213 IDSDEELECEDTEAMDDVVEQDAEEEEAEEGPPLGAIPITDCLFCSHHSSSLMKNVAHMT 272<br />
Query 192 KFHGFFIPDIEYLKDPKGFLTYLGLKVKRDFVCLYCNELCHPFSSLEAVRKHMDAKGHCK 251<br />
K H FFIPDIEYL D KG + YLG KV +CL+CNE F S EAV+ HM+ K HCK<br />
Sbjct 273 KDHSFFIPDIEYLSDIKGLIKYLGEKVGVGKICLWCNEKGKSFYSTEAVQAHMNDKSHCK 332<br />
Query 252 VHYGDGGDEEDAELE--EFYDYSSSYVNGDENQMVVSGESVNTV-ELFGGSELVITKRTD 308<br />
+ + DG DA LE +FYD+ SSY + E GE N EL L T<br />
Sbjct 333 L-FTDG----DAALEFADFYDFRSSYPDHKE------GEDPNKAEELPSEKNLEYDDETM 381<br />
Query 309 NKV--TSRTLGSREFMRYYKQK------PAPSSQKHIVNSLTSRYKMMGLATVQSKEAIV 360<br />
+ + +G R MRYYKQ+ A + + V + +Y+ +G T + A++<br />
Sbjct 382 ELILPSGARVGHRSLMRYYKQRFGLSRAVAVAKNRKAVGRVLQQYRALGW-TGSTGAALM 440<br />
Query 361 RMKVMREMNKRGAKSSVRLGMKSNVIRNL 389<br />
R + M+ + + +K ++ GMK+N + +<br />
Sbjct 441 RERDMQYVQRMKSKWMLKTGMKNNATKQM 469<br />
Score = 76.3 bits (186), Expect = 1e-13, Method: Compositional matrix adjust.<br />
Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 7/96 (7%)<br />
Query 1 MSGLACNSCNKDFEDDAEQKFHYKSEWHRYNLKRKIAGVPGVTEALFEAR---QAAIAQE 57<br />
M+ C +C F D Q+ HYK++WHRYNL+RK+A + VT F+ R Q A+A+E<br />
Sbjct 1 MATYTCITCRVAFRDADMQRAHYKTDWHRYNLRRKVASMAPVTAEGFQERVRAQRAVAEE 60<br />
Query 58 KVKAVEAPMLYSCGICNKGYRSSKAHEQHLKSKSHV 93<br />
+ K C +C+K + S A+E HLKS+ HV<br />
Sbjct 61 ESKGSAT----YCTVCSKKFASFNAYENHLKSRRHV 92<br />
>AT2G27280<br />
MEEARLSTLPFSASFNPSNPLGFLENVLDFIGKESNFLRKDTAEKEITDAVTTAKERLRETEKKTESMDVEKVRPSTLPFNASFDPSDPLGFLEKVFEFV<br />
GKKSNFLVKDKAVNAIITAVTDAKERLKEEEKESVKQATVKIKKYGLQIRAPSQKKQSSSRPLLRTASIFGEDDEENDVEKEISRQASKTKSLKKIEKQH<br />
KKAIEEDPSAFAYDEVYDDIKHEAALPRMQDREEHKSRYIQHIMKQAERREKEHEIVYERKLAKERAKDEHLYSDKEKFVTGPFKRKLEEQKKWLEEERL<br />
RELREERDDVTKKNDLSEFYINIGKNVAFGARDIEAREAGRLKELRKVDRLEELRKEETRKEKKRKSPEKEVSPDSGDFGLSSKKSVKPQDASIKEEAKE<br />
TQKATREDAIATAKERFLSRKKAKIEK<br />
GENE ID: 84081 CCDC55 | coiled-coil doma<strong>in</strong> conta<strong>in</strong><strong>in</strong>g 55 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 89.4 bits (220), Expect = 1e-17, Method: Compositional matrix adjust.<br />
Identities = 68/197 (34%), Positives = 119/197 (60%), Gaps = 10/197 (5%)<br />
Query 143 KKYGLQIRAPSQKKQSSSRPLLRTASIFG---EDDEENDVEKEISRQASKTKSLKKIEKQ 199<br />
++YGL + P KK P+L+ S+FG +DD+E V + + R+A+K +++K+ + +<br />
Sbjct 6 RQYGLIL--P--KKTQQLHPVLQKPSVFGNDSDDDDETSVSESLQREAAKKQAMKQTKLE 61<br />
Query 200 HKKAIEEDPSAFAYDEVYDDI--KHEAALPRMQDREEHKSRYIQHIMKQAERREKEHEIV 257<br />
+KA+ ED + + YD +YD++ K E P++ ++ K +YI +++K E R+KE E
Sbjct 62 IQKALAEDATVYEYDSIYDEMQKKKEENNPKLLLGKDRKPKYIHNLLKAVEIRKKEQEKR 121<br />
Query 258 YERKLAKERAKDEHLYSDKEKFVTGPFKRKLEEQKKWLEEERLRELREERDDVTKKNDLS 317<br />
E+K+ +ER ++ + DKE FVT +K+KL+E+ + E E+ E DVTK+ DLS<br />
Sbjct 122 MEKKIQREREMEKGEFDDKEAFVTSAYKKKLQERAEEEEREKRAAALEACLDVTKQKDLS 181<br />
Query 318 EFYINIGKNVAFGARDI 334<br />
FY ++ N A G ++<br />
Sbjct 182 GFYRHLL-NQAVGEEEV 197<br />
>AT2G28470<br />
MEIAAKMVKVRKMEMILLLILVIVVAATAANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRY<br />
DLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAA<br />
KSYIKWSASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHG<br />
GTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKSDATVTFNGKSYN<br />
LPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLD<br />
EGSKAVLHIESLGQVVYAFINGKLAGSGHGKQKISLDIPINLVTGTNTIDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGL<br />
KGEDTGLATVDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQ<br />
TLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS<br />
FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGEPCRGVVKSLAVEASCS<br />
> gb|EAW67839.1| hCG1729998, is<strong>of</strong>orm CRA_d [Homo sapiens]<br />
Length=653<br />
Score = 164 bits (416), Expect = 2e-40, Method: Compositional matrix adjust.<br />
Identities = 104/300 (34%), Positives = 153/300 (51%), Gaps = 14/300 (4%)<br />
Query 39 LVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPEKNKYNFEG 98<br />
++G + ++ GSIHY R E W + + K K G + + TYV W+ HEPE+ K++F G<br />
Sbjct 80 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 139<br />
Query 99 RYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQR 158<br />
DL FV +AA+ GL+V LR GPY+C+E + GG P WL P + RT N+ F E +++<br />
Sbjct 140 NLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEK 199<br />
Query 159 FTTKIVDLMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AAKSYIKWSASMALSLD 215<br />
+ ++ + L Q GP+I Q+ENEYG N D Y K+ ++ L<br />
Sbjct 200 YFDHLIP--RVIPLQYRQAGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTS 257<br />
Query 216 TGVPWNMCQQTDAPDPMIN--TCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPSPYR 273<br />
G + T IN + +Q +KP + E W GWF +GD +<br />
Sbjct 258 DGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVK 317<br />
Query 274 PVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPL------ISTSYDYDAPIDEYG 327<br />
+++ AV+ F + +F N YM+HGGTNF +G I TSYDYDA + E G<br />
Sbjct 318 DAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAG 376<br />
Score = 38.9 bits (89), Expect = 0.018, Method: Compositional matrix adjust.<br />
Identities = 51/216 (23%), Positives = 81/216 (37%), Gaps = 55/216 (25%)<br />
Query 522 GKLAGSGHGKQKISLDIPINLVTGTNTIDL----------LSVTV---GLANYGAFFDLV 568<br />
G+L H ++ LD + + N DL L + V G N+<br />
Sbjct 465 GRLRAHAHDMAQVFLDETMIGILNENNKDLHIPELRDCRYLRILVENQGRVNFSWQIQNE 524<br />
Query 569 GAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWVSKSPLP-TKQPL 627<br />
GITG V SI+ +S + L+ + + + S+ W P+P + Q<br />
Sbjct 525 QKGITGSV---------SINNSSLEGFTIYSLEMKMSFFERLRSATW---KPVPDSHQGP 572<br />
Query 628 IWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYR 687<br />
+Y T A + + G ++NG+++GRYW<br />
Sbjct 573 AFYCGTLKAGPSPKDTFLSLLNWNYGFVFINGRNLGRYW--------------------- 611<br />
Query 688 ANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEM 723<br />
N G P +TLY +P WL P N ++LFE+M<br />
Sbjct 612 ------NIG-PQKTLY-LPGVWLHPEDNEVILFEKM 639<br />
>AT2G28620<br />
MDSNNSKKGSSVKSPCQTPRSTEKSNRDFRVDSNSNSNPVSKNEKEKGVNIQVIVRCRPFNSEETRLQTPAVLTCNDRKKEVAVAQNIAGKQIDKTFLFD<br />
KVFGPTSQQKDLYHQAVSPIVFEVLDGYNCTIFAYGQTGTGKTYTMEGGARKKNGEIPSDAGVIPRAVKQIFDILEAQSAAEYSLKVSFLELYNEELTDL<br />
LAPEETKFADDKSKKPLALMEDGKGGVFVRGLEEEIVSTADEIYKVLEKGSAKRRTAETLLNKQSSRSHSIFSVTIHIKECTPEGEEIVKSGKLNLVDLA<br />
GSENISRSGAREGRAREAGEINKSLLTLGRVINALVEHSGHIPYRESKLTRLLRDSLGGKTKTCVIATVSPSVHCLEETLSTLDYAHRAKHIKNKPEVNQ<br />
KMMKSAIMKDLYSEIERLKQEVYAAREKNGIYIPKERYTQEEAEKKAMADKIEQMEVEGEAKDKQIIDLQELYNSEQLVTAGLREKLDKTEKKLYETEQA<br />
LLDLEEKHRQAVATIKEKEYLISNLLKSEKTLVDRAVELQAELANAASDVSNLFAKIGRKDKIEDSNRSLIQDFQSQLLRQLELLNNSVAGSVSQQEKQL<br />
QDMENVMVSFVSAKTKATETLRGSLAQLKEKYNTGIKSLDDIAGNLDKDSQSTLNDLNSEVTKHSCALEDMFKGFTSEAYTLLEGLQGSLHNQEEKLSAF<br />
TQQQRDLHSRSMDSAKSVSTVMLDFFKTLDTHANKLTKLAEDAQNVNEQKLSAFTKKFEESIANEEKQMLEKVAELLASSNARKKELVQIAVQDIRQGSS<br />
SQTGALQQEMSAMQDSASSIKVQWNSHIVQAESHHLDNISAVEVAKEDMQKMHLKCLENSKTGTQQWKTAQESLVDLEKRNVATADSIIRGAIENNEKLR<br />
TQFSSAVSTTLSDVDSSNREIISSIDNSLQLDKDASTDVNSTIVPCSENLKELRTHHDDNVVEIKQNTGKCLGHEYKVTRFDPFLYNHHIYMIELDKIVN<br />
RKLNSLKTSTQVDEATSSTPRKREYNIPTVGSIEELKTPSFEELLKAFHDCKSPKQMQNGEAKHVSNGRPPLTAIN<br />
GENE ID: 3832 KIF11 | k<strong>in</strong>es<strong>in</strong> family member 11 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 487 bits (1254), Expect = 1e-137, Method: Compositional matrix adjust.<br />
Identities = 293/627 (46%), Positives = 399/627 (63%), Gaps = 48/627 (7%)<br />
Query 35 SNSNPVSKNEKEKGVNIQVIVRCRPFNSEETRLQTPAVLTCNDRKKEVAV-AQNIAGKQI 93<br />
S N +K ++EKG NIQV+VRCRPFN E + +++ C+ +KEV+V +A K<br />
Sbjct 3 SQPNSSAKKKEEKGKNIQVVVRCRPFNLAERKASAHSIVECDPVRKEVSVRTGGLADKSS 62<br />
Query 94 DKTFLFDKVFGPTSQQKDLYHQAVSPIVFEVLDGYNCTIFAYGQTGTGKTYTMEGGARKK 153<br />
KT+ FD VFG +++Q D+Y V PI+ EV+ GYNCTIFAYGQTGTGKT+TMEG R<br />
Sbjct 63 RKTYTFDMVFGASTKQIDVYRSVVCPILDEVIMGYNCTIFAYGQTGTGKTFTMEG-ERSP 121<br />
Query 154 NGEIPSD----AGVIPRAVKQIFDILEAQSAAEYSLKVSFLELYNEELTDLLAPEETKFA 209<br />
N E + AG+IPR + QIF+ L + E+S+KVS LE+YNEEL DLL P +<br />
Sbjct 122 NEEYTWEEDPLAGIIPRTLHQIFEKL-TDNGTEFSVKVSLLEIYNEELFDLLNP-----S 175
Query 210 DDKSKKPLALMED--GKGGVFVRGLEEEIVSTADEIYKVLEKGSAKRRTAETLLNKQSSR 267<br />
D S++ L + +D K GV ++GLEE V DE+Y++LEKG+AKR TA TL+N SSR<br />
Sbjct 176 SDVSER-LQMFDDPRNKRGVIIKGLEEITVHNKDEVYQILEKGAAKRTTAATLMNAYSSR 234<br />
Query 268 SHSIFSVTIHIKECTPEGEEIVKSGKLNLVDLAGSENISRSGAREGRAREAGEINKSLLT 327<br />
SHS+FSVTIH+KE T +GEE+VK GKLNLVDLAGSENI RSGA + RAREAG IN+SLLT<br />
Sbjct 235 SHSVFSVTIHMKETTIDGEELVKIGKLNLVDLAGSENIGRSGAVDKRAREAGNINQSLLT 294<br />
Query 328 LGRVINALVEHSGHIPYRESKLTRLLRDSLGGKTKTCVIATVSPSVHCLEETLSTLDYAH 387<br />
LGRVI ALVE + H+PYRESKLTR+L+DSLGG+T+T +IAT+SP+ LEETLSTL+YAH<br />
Sbjct 295 LGRVITALVERTPHVPYRESKLTRILQDSLGGRTRTSIIATISPASLNLEETLSTLEYAH 354<br />
Query 388 RAKHIKNKPEVNQKMMKSAIMKDLYSEIERLKQEVYAAREKNGIYIPKERYTQEEAEKKA 447<br />
RAK+I NKPEVNQK+ K A++K+ EIERLK+++ AAREKNG+YI +E + +<br />
Sbjct 355 RAKNILNKPEVNQKLTKKALIKEYTEEIERLKRDLAAAREKNGVYISEENF-------RV 407<br />
Query 448 MADKIEQMEVEGEAKDKQIIDL--------QELYNSEQLVTAGLREKLDKTEKKLYETEQ 499<br />
M+ K+ +++QI++L +EL +L + +LD+ + L Q<br />
Sbjct 408 MSGKL-------TVQEEQIVELIEKIGAVEEELNRVTELFMDN-KNELDQCKSDLQNKTQ 459<br />
Query 500 ALLDLEEKHRQ--AVATIKEKEYLISNLLKSEKTLVDRAVELQAELANAASDVSNLFAKI 557<br />
L+ +KH Q + +KE EY+ S L +E+ L D A +L + DVS L +K+<br />
Sbjct 460 E-LETTQKHLQETKLQLVKE-EYITSALESTEEKLHDAASKLLNTVEETTKDVSGLHSKL 517<br />
Query 558 GRKDKIEDSNRSLIQDFQSQLLRQL-----ELLNNSVAGSVSQQEKQLQDMENVMVSFVS 612<br />
RK + D + + QD + L L EL+ + + + E N++ S VS<br />
Sbjct 518 DRKKAV-DQHNAEAQDIFGKNLNSLFNNMEELIKDGSSKQKAMLEVHKTLFGNLLSSSVS 576<br />
Query 613 AKTKATETLRGSLAQLKEKYNTGIKSL 639<br />
A T GSL + E +T + +<br />
Sbjct 577 ALDTITTVALGSLTSIPENVSTHVSQI 603<br />
>AT2G31320<br />
MASPHKPWRAEYAKSSRSSCKTCKSVINKENFRLGKLVQSTHFDGIMPMWNHASCILKKTKQIKSVDDVEGIESLRWEDQQKIRKYVESGAGSNTSTSTG<br />
TSTSSTANNAKLEYGIEVSQTSRAGCRKCSEKILKGEVRIFSKPEGPGNKGLMWHHAKCFLEMSSSTELESLSGWRSIPDSDQEALLPLVKKALPAAKTE<br />
TAEARQTNSRAGTKRKNDSVDNEKSKLAKSSFDMSTSGALQPCSKEKEMEAQTKELWDLKDDLKKYVTSAELREMLEVNEQSTRGSELDLRDKCADGMMF<br />
GPLALCPMCSGHLSFSGGLYRCHGYISEWSKCSHSTLDPDRIKGKWKIPDETENQFLLKWNKSQKSVKPKRILRPVLSGETSQGQGSKDATDSSRSERLA<br />
DLKVSIAGNTKERQPWKKRIEEAGAEFHANVKKGTSCLVVCGLTDIRDAEMRKARRMKVAIVREDYLVDCFKKQRKLPFDKYKIEDTSESLVTVKVKGRS<br />
AVHEASGLQEHCHILEDGNSIYNTTLSMSDLSTGINSYYILQIIQEDKGSDCYVFRKWGRVGNEKIGGNKVEEMSKSDAVHEFKRLFLEKTGNTWESWEQ<br />
KTNFQKQPGKFLPLDIDYGVNKQVAKKEPFQTSSNLAPSLIELMKMLFDVETYRSAMMEFEINMSEMPLGKLSKHNIQKGFEALTEIQRLLTESDPQPTM<br />
KESLLVDASNRFFTMIPSIHPHIIRDEDDFKSKVKMLEALQDIEIASRIVGFDVDSTESLDDKYKKLHCDISPLPHDSEDYRLIEKYLNTTHAPTHTEWS<br />
LELEEVFALEREGEFDKYAPHREKLGNKMLLWHGSRLTNFVGILNQGLRIAPPEAPATGYMFGKGIYFADLVSKSAQYCYTCKKNPVGLMLLSEVALGEI<br />
HELTKAKYMDKPPRGKHSTKGLGKKVPQDSEFAKWRGDVTVPCGKPVSSKVKASELMYNEYIVYDTAQVKLQFLLKVRFKHKR<br />
GENE ID: 142 PARP1 | poly (ADP-ribose) polymerase 1 [Homo sapiens]<br />
(Over 100 PubMed l<strong>in</strong>ks)<br />
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.<br />
Identities = 398/1045 (38%), Positives = 581/1045 (55%), Gaps = 103/1045 (9%)<br />
Query 3 SPHKPWRAEYAKSSRSSCKTCKSVINKENFRLGKLVQSTHFDGIMPMWNHASCILKKTKQ 62<br />
S K +R EYAKS R+SCK C I K++ R+ +VQS FDG +P W H SC K<br />
Sbjct 4 SSDKLYRVEYAKSGRASCKKCSESIPKDSLRMAIMVQSPMFDGKVPHWYHFSCFWKVGHS 63<br />
Query 63 IKSVD-DVEGIESLRWEDQQKIRKYVESGAGSNTSTSTGTSTSSTANNAKLEYGIEVSQT 121<br />
I+ D +V+G LRW+DQQK++K E+G + S A ++ E +++<br />
Sbjct 64 IRHPDVEVDGFSELRWDDQQKVKKTAEAGGVTGKGQD---GIGSKAEKTLGDFAAEYAKS 120<br />
Query 122 SRAGCRKCSEKILKGEVRIFSK---PEGPGNKGLM--WHHAKCFL----EMSSSTELES- 171<br />
+R+ C+ C EKI KG+VR+ K PE P G++ W+H CF+ E+ E +<br />
Sbjct 121 NRSTCKGCMEKIEKGQVRLSKKMVDPEKP-QLGMIDRWYHPGCFVKNREELGFRPEYSAS 179<br />
Query 172 -LSGWRSIPDSDQEALLPLVKKALPAAKTETAEARQTNSRAGTKRKNDSVDNEKSKLAKS 230<br />
L G+ + D+EAL KK LP K+E KRK D VD + +<br />
Sbjct 180 QLKGFSLLATEDKEAL----KKQLPGVKSEG------------KRKGDEVDG----VDEV 219<br />
Query 231 SFDMSTSGALQPCSKEKEMEAQTKELWDLKDDLKKYVTSAELREMLEVNEQSTRGSELDL 290<br />
+ S + EK ++AQ +W++KD+LKK ++ +L+E+L N+Q E +<br />
Sbjct 220 AKKKSKKEKDKDSKLEKALKAQNDLIWNIKDELKKVCSTNDLKELLIFNKQQVPSGESAI 279<br />
Query 291 RDKCADGMMFGPLALCPMCSGHLSFSGGLYRCHGYISEWSKCSHSTLDPDRIKGKWKIPD 350<br />
D+ ADGM+FG L C CSG L F Y C G ++ W+KC T P+R +W P<br />
Sbjct 280 LDRVADGMVFGALLPCEECSGQLVFKSDAYYCTGDVTAWTKCMVKTQTPNR--KEWVTPK 337<br />
Query 351 E-TENQFLLKWN-KSQKSVKPKRILRPVLSGETSQGQGSKDATDSSRS--ERLADLKVSI 406<br />
E E +L K K Q + P V + + A +SS S + L+++K+<br />
Sbjct 338 EFREISYLKKLKVKKQDRIFPPETSASVAATPPPSTASAPAAVNSSASADKPLSNMKILT 397<br />
Query 407 AGN-TKERQPWKKRIEEAGAEFHANVKKGTSCLVVCGLTDIRDAEMRKARRMKVAIVRED 465<br />
G ++ + K IE+ G + K + C+ + + +M + + + +V ED<br />
Sbjct 398 LGKLSRNKDEVKAMIEKLGGKLTGTANKASLCISTKKEVEKMNKKMEEVKEANIRVVSED 457<br />
Query 466 YLVDCFKKQRKL-------------------PFD--------------------KYKIED 486<br />
+L D + L P + K + +<br />
Sbjct 458 FLQDVSASTKSLQELFLAHILSPWGAEVKAEPVEVVAPRGKSGAALSKKSKGQVKEEGIN 517<br />
Query 487 TSESLVTVKVKGRSAVHEASGLQEHCHILEDGNSIYNTTLSMSDLSTGINSYYILQIIQE 546<br />
SE + + +KG +AV SGL+ H+LE G +++ TL + D+ G NSYY LQ++++<br />
Sbjct 518 KSEKRMKLTLKGGAAVDPDSGLEHSAHVLEKGGKVFSATLGLVDIVKGTNSYYKLQLLED 577<br />
Query 547 DKGSDCYVFRKWGRVGNEKIGGNKVEEM-SKSDAVHEFKRLFLEKTGNTWESWEQKTNFQ 605<br />
DK + ++FR WGRVG IG NK+E+M SK DA+ F +L+ EKTGN W S NF<br />
Sbjct 578 DKENRYWIFRSWGRVGTV-IGSNKLEQMPSKEDAIEHFMKLYEEKTGNAWHS----KNFT 632<br />
Query 606 KQPGKFLPLDIDYGVNKQVAKKEPFQ--TSSNLAPSLIELMKMLFDVETYRSAMMEFEIN 663<br />
K P KF PL+IDYG +++ KK T S L + +L+KM+FDVE+ + AM+E+EI+<br />
Sbjct 633 KYPKKFYPLEIDYGQDEEAVKKLTVNPGTKSKLPKPVQDLIKMIFDVESMKKAMVEYEID 692<br />
Query 664 MSEMPLGKLSKHNIQKGFEALTEIQRLLTESDPQPTMKESLLVDASNRFFTMIPSIH--- 720<br />
+ +MPLGKLSK IQ + L+E+Q+ +++ +S ++D SNRF+T+IP<br />
Sbjct 693 LQKMPLGKLSKRQIQAAYSILSEVQQAVSQGS-----SDSQILDLSNRFYTLIPHDFGMK 747<br />
Query 721 -PHIIRDEDDFKSKVKMLEALQDIEIASRIV--GFDVDSTESLDDKYKKLHCDISPLPHD 777
P ++ + D ++KV+ML+ L DIE+A ++ G D S + +D Y+KL DI + D<br />
Sbjct 748 KPPLLNNADSVQAKVEMLDNLLDIEVAYSLLRGGSDDSSKDPIDVNYEKLKTDIKVVDRD 807<br />
Query 778 SEDYRLIEKYLNTTHAPTHTEWSLELEEVFALEREGEFDKYAPHREKLGNKMLLWHGSRL 837<br />
SE+ +I KY+ THA TH + LE+ ++F +EREGE +Y P ++ L N+ LLWHGSR<br />
Sbjct 808 SEEAEIIRKYVKNTHATTHNAYDLEVIDIFKIEREGECQRYKPFKQ-LHNRRLLWHGSRT 866<br />
Query 838 TNFVGILNQGLRIAPPEAPATGYMFGKGIYFADLVSKSAQYCYTCKKNPVGLMLLSEVAL 897<br />
TNF GIL+QGLRIAPPEAP TGYMFGKGIYFAD+VSKSA YC+T + +P+GL+LL EVAL<br />
Sbjct 867 TNFAGILSQGLRIAPPEAPVTGYMFGKGIYFADMVSKSANYCHTSQGDPIGLILLGEVAL 926<br />
Query 898 GEIHELTKAKYMDKPPRGKHSTKGLGKKVPQDSEFAKWRGDVTVPCGKPVSSKVKASELM 957<br />
G ++EL A ++ K P+GKHS KGLGK P S G V VP G +SS V + L+<br />
Sbjct 927 GNMYELKHASHISKLPKGKHSVKGLGKTTPDPSANISLDG-VDVPLGTGISSGVNDTSLL 985<br />
Query 958 YNEYIVYDTAQVKLQFLLKVRFKHK 982<br />
YNEYIVYD AQV L++LLK++F K<br />
Sbjct 986 YNEYIVYDIAQVNLKYLLKLKFNFK 1010<br />
>AT2G35630<br />
MSTEDEKLLKEAKKLPWEDRLGHKNWKVRNEANVDLASVFDSITDPKDPRLRDFGHLFRKTVADSNAPVQEKALDALIAFLRAADSDAGRYAKEVCDAIA<br />
LKCLTGRKNTVDKAQAAFLLWVELEAVDVFLDTMEKAIKNKVAKAVVPAVDVMFQALSEFGSKVIPPKRILKMLPELFDHQDQNVRASAKGVTLELCRWI<br />
GKDPVKSILFEKMRDTMKKELEAELANVTAGAKPTRKIRSEQDKEPEAEASSDVVGDGPSEEAVADAPQEIDEYDLMDPVDILTPLEKSGFWDGVKATKW<br />
SERKEAVAELTKLASTKKIAPGDFSEICRTLKKLITDVNLAVAVEAIQAIGNLACGLRTHFSASSRFMLPVLLEKLKEKKQSVTDPLTQTLQTMYKAGCL<br />
NLVDVIEDVKTAVKNKVPLVRSSTLTWLTFCLETSNKALILKAHKEYVPLCMECLNDGTPDVRDAAFSALAAIAKSVGMRPLERSLEKLDDVRKKKLSEM<br />
IAGSGGGDQAGTSSVTVQSSVGSTATGNSDASFVRKSAASMLSGKRPAPSAQASKKVGTGKPGGGKKDGSVRNEGSKSVEPPEDVEPAEMGLEEIENRLG<br />
SLVKPETVSQLKSSVWKERLEATLALKEEIEGLQELDKSVEILVRLLCAVPGWNEKNVQVQQQVIEIITYISSTAAKFPKKCVVLCITGTSERVADIKTR<br />
ASAMKCLTAFCEAVGPGFVFERLFKIMKEHKNPKVLSEGLLWMVSAVDDFGVSLLKLKDLIDFCKDVGLQSSTAATRNATIKLLGALHKFVGPDIKGFLN<br />
DVKPALLSALDTEYEKNPFEGTAAPKRVVKTSVSTSTSSGGLDSLPREDISTKITPNLLKGFESPDWKMRLESIEAVNKILEEANKRIQPTGTGELFGGL<br />
RGRLLDSNKNLVMQTLTTIGGVAAAMGPAVEKASKGILSDVLKCLGDNKKHMRECTLAALDLWLGAVHLDKMIPYIIIALTDGKMGAEGRKDLFDWLTKQ<br />
LTGLSDFVDAIHLLKPASTAMTDKSADVRKAAEGCISEILRVSGQEMIEKNLKDIQGPALALVLEKVRPGFVQEPFESSKAMAGPVSKGVTKISKSTSNG<br />
TLKQGNRSRAVPTKGSSQITSVHDIAIQSQALLNTKDSNKEDRERVVVRRIKFEELRPEQIQDLENDMMKFFREDLQKRLLSPDFKKQVDGLEILQKALP<br />
SVSKEIIEVLDVLLRWFVLQFCKSNTTCLLKVLEFLPELFNTLRDEEYCMTEAEAAIFLPCLAEKLGHNIEKVREKMRELMKQIIQAYSVGKTYPYILEG<br />
LRSKNNRTRIECTDLIGYLLETCGTEIGGLLKYLNIVASLTAERDGELRKAALNTMATGYQILGADIWKYVGKLTDAQKSMIDDRFKWKAKDMEKRREGK<br />
PGEARAALRRSVRDSGPEVAEQSGDISQTVPGPLFPRQSYGISEQMLERTPVPRTIAGVNGPTDWNEALDIIMFGSPEQSVEGMKVVCHELAQASNDPEE<br />
SAIDELVKDADGLVSCLANKVAKTFDVSLMGASSRSCKYVLNTLMQTFQNKKLAHAVKEGTLESLITELLLWLLDERVPRMEDGSQLLKALNVLMLKILD<br />
NADRTSSFVVLISLLRPLDPSRWPSPATAEVYAVRNQKFSDLVVKCLIKLTKLLQSTIYEVDLDRLLQSIHVYLQDLGMEEIRRRAGADDKPLRMVKTVL<br />
HELVKLRGAAIKGHLSLVPIDMRPQPIILAYIDLNLETLAAARMLTATGPVGQTHWTDSTANNPSPPANSADVQLKQELGAIFKKIGDKQTSTIGLYDLY<br />
HITKSYPKVDIFSQLQNASEAFRTYIRDGLAQVEKNAAAGRTPSSLPLSTPPPSSLALPSPDIPSLSSLDVKPLMNPRSDLYTDDIRASNMNPGVMTGTL<br />
DAIRERMKNMQLASSEPVSKPLMPTNDNLSMNQQSVPPSQMGQETVHTHPVVLPMDEKALSGLQARMERLKGGSLEHM<br />
GENE ID: 9793 CKAP5 | cytoskeleton associated prote<strong>in</strong> 5 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.<br />
Identities = 577/1985 (29%), Positives = 960/1985 (48%), Gaps = 156/1985 (7%)<br />
Query 14 KLPWEDRLGHKNWKVRNEANVDLASVFDSITDPKDPRLRDFGHLFRKTVADSNAPVQEKA 73<br />
KLP + + HK WK R + +F I D K P F L +K V DSNA VQ K<br />
Sbjct 9 KLPVDQKCEHKLWKARLSGYEEALKIFQKIKDEKSPEWSKFLGLIKKFVTDSNAVVQLKG 68<br />
Query 74 LDALIAFLRAADSDAGRYAKEVCDAIALKCLTGRKNTVDKAQAAF-LLWVELEAVDVFLD 132<br />
L+A + ++ A AG+ EV + K K + L+++E+E + +<br />
Sbjct 69 LEAALVYVENAHV-AGKTTGEVVSGVVSKVFNQPKAKAKELGIEICLMYIEIEKGEAVQE 127<br />
Query 133 TMEKAIKNKVAKAVVPAVDVMFQALSEFGSKVIPPKRILKMLPELFDHQDQNVRASAKGV 192<br />
+ K + NK K +V ++ + +ALSEFGSK+I K I+K+LP+LF+ +++ VR AK +<br />
Sbjct 128 ELLKGLDNKNPKIIVACIETLRKALSEFGSKIILLKPIIKVLPKLFESREKAVRDEAKLI 187<br />
Query 193 TLELCRWIGKDPVKSILFEKMRDTMKKELEAELANV-TAGAKPTRKIRSEQDKEPEAEAS 251<br />
+E+ RWI +D ++ L + + KELE E + T+ +PTR +RS+Q+ E + E<br />
Sbjct 188 AVEIYRWI-RDALRPPL-QNINSVQLKELEEEWVKLPTSAPRPTRFLRSQQELEAKLEQQ 245<br />
Query 252 SDVVGDGPSEEAVADAPQEIDEYDLMDPVDILTPLEKSGFWDGVKATKWSERKEAVAELT 311<br />
GD D +ID Y+L++ V+IL+ L K F+D ++A KW ERKEA+ +<br />
Sbjct 246 QSAGGDAEGGGDDGDEVPQIDAYELLEAVEILSKLPKD-FYDKIEAKKWQERKEALESVE 304<br />
Query 312 KLASTKKIAPGDFSEICRTLKKLI-TDVNLAVAVEAIQAIGNLACGLRTHFSASSRFMLP 370<br />
L K+ GD++++ + LKK++ D N+ + A + + LA GLR F + ++P<br />
Sbjct 305 VLIKNPKLEAGDYADLVKALKKVVGKDTNVMLVALAAKCLTGLAVGLRKKFGQYAGHVVP 364<br />
Query 371 VLLEKLKEKKQSVTDPLTQTLQTMYKAGCLNLVDVIEDVKTAVKNKVPLVRSSTLTWLTF 430<br />
+LEK KEKK V L + + ++ L ++ EDV + NK P ++ T ++<br />
Sbjct 365 TILEKFKEKKPQVVQALQEAIDAIFLTTTLQ--NISEDVLAVMDNKNPTIKQQTSLFIAR 422<br />
Query 431 CLETSNKALILKAH-KEYVPLCMECLNDGTPDVRDAAFSALAAIAKSVGMRPLERSLEKL 489<br />
+ + K+ K + ++ +ND P+VRDAAF AL K VG + ++ L +<br />
Sbjct 423 SFRHCTASTLPKSLLKPFCAALLKHINDSAPEVRDAAFEALGTALKVVGEKAVKPFLADV 482<br />
Query 490 DDVRKKKLSE------MIAGSGGGDQAGTSSVTV--QSSVGSTATGNSDASFVRKSAASM 541<br />
D ++ K+ E +I G G A + S A G+ D +<br />
Sbjct 483 DKLKLDKIKECSEKVELIHGKKAGLAADKKEFKPLPGRTAASGAAGDKDTKDISAPKPGP 542<br />
Query 542 LSGKRPAPSAQASKKVGTGKPGGGKKDGSVRNEGSKSVEPPEDVEPAEMGLEEIENRLGS 601<br />
L + AP+A+A GKP G+ + K +E E VEP E+ +E E + +<br />
Sbjct 543 L---KKAPAAKAGGPPKKGKPAAPGGAGNTGTKNKKGLETKEIVEP-ELSIEVCEEKASA 598<br />
Query 602 LVKPETVSQLKSSVWKERLEATLALKEEIEGLQELDKSVEILVRLLCAVPGWNEKNVQVQ 661<br />
++ P + L SS WKERL ++ +E + + + LVR+L PGW E N QV<br />
Sbjct 599 VLPPTCIQLLDSSNWKERLACMEEFQKAVELMDRTEMPCQALVRMLAKKPGWKETNFQVM 658<br />
Query 662 QQVIEIITYISSTAAKFPKKCVVLCITGTSERVADIKTRASAMKCLTAFCEAVGPGFVFE 721<br />
Q + I+ I+ F K + + G +++ D+K +A + +TA EA + E<br />
Sbjct 659 QMKLHIVALIAQKG-NFSKTSAQVVLDGLVDKIGDVKCGNNAKEAMTAIAEACMLPWTAE 717<br />
Query 722 RLFKIMKEHKNPKVLSEGLLWMVSAVDDFGVSLLKLKDLIDFCKDVGLQSSTAATRNATI 781<br />
++ + KNPK SE L W+ +A+ +FG S L +K I K L ++ A R A I<br />
Sbjct 718 QVVSMAFSQKNPKNQSETLNWLSNAIKEFGFSGLNVKAFISNVK-TALAATNPAVRTAAI 776<br />
Query 782 KLLGALHKFVGPDIKGFLNDVKPALLSALDTEYEKNPFEGTAAPKRVVKTSVSTSTSSGG 841<br />
LLG ++ +VGP ++ F D KPALLS +D E+EK + AP R + ++ T G<br />
Sbjct 777 TLLGVMYLYVGPSLRMFFEDEKPALLSQIDAEFEKMQGQSPPAPTRGISKHSTSGTDEGE 836
Query 842 ------------LDSLPREDISTKITPNLLKGFESPDWKMRLESIEAVNKILEEANKRIQ 889<br />
+D LPR +IS KIT L+ +WK+R E ++ V I+ +A K IQ<br />
Sbjct 837 DGDEPDDGSNDVVDLLPRTEISDKITSELVSKIGDKNWKIRKEGLDEVAGIINDA-KFIQ 895<br />
Query 890 PTGTGELFGGLRGRLLDSNKNLVMQTLTTIGGVAAAMGPAVEKASKGILSDVLKCLGDNK 949<br />
P GEL L+GRL DSNK LV QTL + +A AMGP +++ K + ++ LGD+K<br />
Sbjct 896 PN-IGELPTALKGRLNDSNKILVQQTLNILQQLAVAMGPNIKQHVKNLGIPIITVLGDSK 954<br />
Query 950 KHMRECTLAALDLWLGAVHLDKMIPYIIIALTDGKMGAEGRKDLFDWLTKQLTGL-SDFV 1008<br />
++R LA ++ W + + + ++ K R++L WL ++L L S<br />
Sbjct 955 NNVRAAALATVNAWAEQTGMKEWLEGEDLSEELKKENPFLRQELLGWLAEKLPTLRSTPT 1014<br />
Query 1009 DAIHLLKPASTAMTDKSADVRKAAEGCISEILRVSGQEMIEK---NLKDIQGPALALVLE 1065<br />
D I + + + D++ DVRK A+ + + G E + K LK + +LE<br />
Sbjct 1015 DLILCVPHLYSCLEDRNGDVRKKAQDALPFFMMHLGYEKMAKATGKLKPTSKDQVLAMLE 1074<br />
Query 1066 KVRPGFVQEPFESSKAMAGPVSKGVT---KISKSTSNGTLKQGNRSRAVPTKG-----SS 1117<br />
K + +P +KA + P+ + + + + + + P K SS<br />
Sbjct 1075 KAKVNMPAKPAPPTKATSKPMGGSAPAKFQPASAPAEDCISSSTEPKPDPKKAKAPGLSS 1134<br />
Query 1118 QITSVHDIAIQSQALL---------------NTKDSNKEDRERVVVRRIKFEELRPEQIQ 1162<br />
+ S + S+ L N K+ +D + + V + F R E I+<br />
Sbjct 1135 KAKSAQGKKMPSKTSLKEDEDKSGPIFIVVPNGKEQRMKDEKGLKVLKWNFTTPRDEYIE 1194<br />
Query 1163 DLENDMMKFFREDLQKRLLSPDFKKQVDGLEILQKALPSVSKEIIEVLDVLLRWFVLQFC 1222<br />
L+ M + LQ + DF+ L ++ L S + +I LD++L+W L+F<br />
Sbjct 1195 QLKTQMSSCVAKWLQDEMFHSDFQHHNKALAVMVDHLESEKEGVIGCLDLILKWLTLRFF 1254<br />
Query 1223 KSNTTCLLKVLEFLPELFNTLRDEEYCMTEAEAAIFLPCLAEKLGHNIEKVREKMRELMK 1282<br />
+NT+ L+K LE+L LF L +EEY +TE EA+ F+P L K+G + +R+ +R ++<br />
Sbjct 1255 DTNTSVLMKALEYLKLLFTLLSEEEYHLTENEASSFIPYLVVKVGEPKDVIRKDVRAILN 1314<br />
Query 1283 QIIQAYSVGKTYPYILEGLRSKNNRTRIECTDLIGYLLETCGTEIGGLL--KYLNIVASL 1340<br />
++ Y K +P+I+EG +SKN++ R EC + +G L+E+ G + K L +A<br />
Sbjct 1315 RMCLVYPASKMFPFIMEGTKSKNSKQRAECLEELGCLVESYGMNVCQPTPGKALKEIAVH 1374<br />
Query 1341 TAERDGELRKAALNTMATGYQILGADIWKYVGKLTDAQKSMIDDRFKWKAKDME----KR 1396<br />
+RD +R AALNT+ T Y + G ++K +G L++ SM+++R K AK K+<br />
Sbjct 1375 IGDRDNAVRNAALNTIVTVYNVHGDQVFKLIGNLSEKDMSMLEERIKRSAKRPSAAPIKQ 1434<br />
Query 1397 REGKPGEAR-AALRRSVRDSGPEVAEQSGDISQTVPGPLFPRQSYGISE--QMLERTPVP 1453<br />
E KP A+ + ++ GP + S ++Q R G E QM+ R<br />
Sbjct 1435 VEEKPQRAQNISSNANMLRKGP-AEDMSSKLNQA-------RSMSGHPEAAQMVRR---- 1482<br />
Query 1454 RTIAGVNGPTDWNEALDIIMFGSPEQSVEGMKVVCHELAQASN-----DPEESAIDELVK 1508<br />
++ LD I + E ++V H+L +P+ A+<br />
Sbjct 1483 ----------EFQLDLDEIENDNGTVRCEMPELVQHKLDDIFEPVLIPEPKIRAVSPHFD 1532<br />
Query 1509 DADGLVSCLANKVAKTFDVSLMGASSRSCKYVLNTLMQTFQNKKLAHAVKEGTLESLITE 1568<br />
D + + A T + + +S + L Q FQ + LA G L+ L+<br />
Sbjct 1533 D-------MHSNTASTINFIISQVASGDINTSIQALTQLFQIESLAREASTGVLKDLMHG 1585<br />
Query 1569 LLLWLLDERVPRMEDGSQLLKALNVLMLKILDNADRTSSFVVLISLLRPLDPSRWPSPAT 1628<br />
L+ +LD R+ +E+G Q+++++N+L++K+L+ +D+T+ L+ LL+ + SP<br />
Sbjct 1586 LITLMLDSRIEDLEEGQQVIRSVNLLVVKVLEKSDQTNILSALLVLLQDSLLATASSP-- 1643<br />
Query 1629 AEVYAVRNQKFSDLVVKCLIKLTKLLQSTIYEVDLDRLLQSIHVYLQDLGMEEIRRRAGA 1688<br />
KFS+LV+KCL ++ +LL TI ++LDR+L IH++++ E++++<br />
Sbjct 1644 ---------KFSELVMKCLWRMVRLLPDTINSINLDRILLDIHIFMKVFPKEKLKQ--CK 1692<br />
Query 1689 DDKPLRMVKTVLHELVKLRGAAIKGHLSLVPIDMRPQPIILAYIDLNLETLAAARMLTAT 1748<br />
+ P+R +KT+LH L KL+G I HL++ ID + + + A++ RM+ +<br />
Sbjct 1693 SEFPIRTLKTLLHTLCKLKGPKILDHLTM--IDNKNESELEAHL---------CRMMKHS 1741<br />
Query 1749 GPVGQTHWTDSTANNPSP-PANSADVQLKQELGAIFKKIGDKQTSTIGLYDLYHITKSYP 1807<br />
+ TA S A S+ ++ L IFKKIG K+ + GL +LY K Y<br />
Sbjct 1742 MDQTGSKSDKETAKGASRIDAKSSKAKVNDFLAEIFKKIGSKENTKEGLAELYEYKKKYS 1801<br />
Query 1808 KVDIFSQLQNASEAFRTYIRDGLAQVE-KNAAAGRTPSSLPLSTPPPSSLALPSPDIPSL 1866<br />
DI L+N+S+ F++Y+ GL +E + GR +S +S P +P+P ++<br />
Sbjct 1802 DADIEPFLKNSSQFFQSYVERGLRVIEMEREGKGRISTSTGIS-PQMEVTCVPTP-TSTV 1859<br />
Query 1867 SSLDVKPLMNPRSDLYTDDIRASNMNPGVMTGTLDAIRER--MKNMQLASSEP----VSK 1920<br />
SS+ + + P V L +R+R + N + P +SK<br />
Sbjct 1860 SSI--------------GNTNGEEVGPSVYLERLKILRQRCGLDNTKQDDRPPLTSLLSK 1905<br />
Query 1921 PLMPT 1925<br />
P +PT<br />
Sbjct 1906 PAVPT 1910<br />
>AT2G36090<br />
MANSSSFSPSTTVTDLISTVHDDIIESHILTRLDGATLASVSCASSHLHHLASNEILWSKICRSTWPSCSGGSRSFFSDAYSMVETAGTVSDLDRPFPEL<br />
ISAVDLHYRGKLIFSRVVKTETTTAWFKSSPLRIDLVDTKDTVATPIKRRQRTEDTCRDLEKDLTLSWIVIDPIGKRAANISSHRPVSVQRNWISGEVEA<br />
QFATVVGAVECVITVVTCGEEEMHVREVSLKVEKMEGTHLNGRDSLVILRSVMEGKRVNGSRREVESKKRHEEFMEKKREMKEKKMRVESVFDILTVAFG<br />
ILGFVLLVVFCLWRTSI<br />
GENE ID: 26269 FBXO8 | F-box prote<strong>in</strong> 8 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 37.7 bits (86), Expect = 0.049, Method: Compositional matrix adjust.<br />
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 3/42 (7%)<br />
Query 29 ILTRLDGATLASVSCASSHLHHLASNEILWSKICRSTWPSCS 70<br />
IL+ L+ L SC LA++E+LW +C+STW CS<br />
Sbjct 82 ILSYLNATDLCLASCV---WQDLANDELLWQGLCKSTWGHCS 120<br />
>AT2G37660<br />
MAMMTTTTTTFFHPLLPANTYKSGAVASSFVSVPRSSSLQFRSLVSDSTSICGPSKFTGKNRRVSVTVSAAATTEPLTVLVTGAGGRTGQIVYKKLKERS<br />
EQFVARGLVRTKESKEKINGEDEVFIGDIRDTASIAPAVEGIDALVILTSAVPQMKPGFDPSKGGRPEFFFDDGAYPEQVDWIGQKNQIDAAKAAGVKQI<br />
VLVGSMGGTNINHPLNSIGNANILVWKRKAEQYLADSGIPYTIIRAGGLQDKDGGIRELLVGKDDELLETETRTIARADVAEVCVQALQLEEAKFKALDL<br />
ASKPEGTGTPTKDFKALFTQVTTKF<br />
GENE ID: 50814 NSDHL | NAD(P) dependent steroid dehydrogenase-like
[Homo sapieens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 433.5<br />
bits (101), Expect = 8e-04 4, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 37/153 (24%) ), Positives = 64/153 6 (41%), Ga Gaps = 25/153 (16%)<br />
Query 55<br />
Sbjct 16<br />
Query 114<br />
Sbjct 69<br />
Query 173<br />
Sbjct 119<br />
>AT2G39730 Rubisco Activasse<br />
MAAAVSTVGAIINRAPLSLNGSGSGAVVSAPASTFLGKKVVTV<br />
VSRFAQSNKKSNGSFK FKVLAVKEDKQTDGDR RWRGLAYDTSDDQQDITRGKGMVDSVFQAPMM<br />
GTGTHHAVLSSSYEYVSQGLRQYNLDNNMMDGFYIAPAFMDKL<br />
LVVHITKNFLTLPNIKKVPLILGIWGGKGQG<br />
GKSFQCELVMAKMGIN NPIMMSAGELESGNAGG<br />
EPAKLIRQRYRREAADLIKKGKMCCLFFINDLDAGAGRMGGTT<br />
TQYTVNNQMVNATLMN MNIADNPTNVQLPGMY YNKEENARVPIICTGN NDFSTLYAPLIRDGRMM<br />
EKFYWAPTREDDRIGVCKGIFRTDKIKKDEDIVTLVDQFPGQS<br />
SIDFFGALRARVYDDE DEVRKFVESLGVEKIG GKRLVNSREGPPVFEQ QPEMTYEKLMEYGNMLL<br />
VMEQENVKRVQQLAETYLSQAALGDANNADAIGRGTFYGKGAQ<br />
QQVNLPVPEGCTDPVA VAENFDPTARSDDGTC CVYNF<br />
> GENE ID: 5706 PSMC6 | prroteasome<br />
(proso ome, macropa<strong>in</strong>) 26S subunit, ATPase, A 6<br />
[Homo sapieens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 511.6<br />
bits (122), Expect = 6e-06 6, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 48/186 (25%) ), Positives = 87/186 8 (46%), Ga Gaps = 20/186 (10%)<br />
Query 163 IWGGKGQGKSFQCEELVMAKMGINPIMMSA<br />
AGELESGNAGEPAKLIIRQRYREAADLIKKG<br />
GKM 222<br />
++G G GK+ V +++ N + + + + GE A+LIIR+<br />
+ A D +<br />
Sbjct 172 LYGPPGTGKTLLARRAVASQLDCNFLKVVS<br />
SSSIVDKYIGESARLIIREMFNYARD----H<br />
HQP 227<br />
Query 223<br />
Sbjct 228<br />
Query 283<br />
Sbjct 278<br />
Query 337<br />
Sbjct 338<br />
>AT2G45990<br />
MGDLYALDFDGGVLCDSCGESSLSAVKKAAKVRWPDLFEGVDS<br />
SALEEWIVDQMHIVRP RPVVETGYENLLLVRL LLLETKIPSIRKSSVA AEGLTVDGILESWAKFF<br />
KPVIMEAWDEDDRDALVDLFGKVRDDWWINKDLTTWIGANRFY<br />
YPGVSDALKFASSKIYYIVTTKQGRFAEALL<br />
LREIAGVIIPSERIYG GLGSGPKVEVLKLLQDD<br />
KPEHQGLTLHFFVEDRLATLKNVIKEPPELDKWSLYLGTWGYN<br />
NTEKERAEAAGIPRIQQVIELSTFSNKLK<br />
GENE ID: 855459<br />
KIAA1731 | KIAA1731 [Homo sapiens] (10 orr<br />
fewer PubMed l<strong>in</strong>ks)<br />
Score = 333.9<br />
bits (76), Expect = 0.56, Method: Composiitional<br />
matrix adjust.<br />
Identitiess<br />
= 21/72 (29%), , Positives = 36 6/72 (50%), Gapss<br />
= 7/72 (9%)<br />
Query 44<br />
Sbjct 595<br />
Query 102 PV-IMEAWDEDR 112<br />
P I E WD+D+<br />
Sbjct 651 PTAISEHWDQDK 662<br />
>AT3G04290<br />
MNINCSPLGFLLISLFFIVTFLAPQVKKSRAFFVFGDSLVDNG<br />
GNNDYLVTTARADNYP YPYGIDYPTRRPTGRF FSNGLNIPDIISEAIG GMPSTLPYLSPHLTGEE<br />
NLLVGANFASAAGIGILNDTGIQFVNIIIRISKQMEYFEQYQL<br />
LRVSALIGPEATQQLV LVNQALVLITLGGNDF FVNNYYLIPFSARSRQ QYALPDYVVYLISEYGG<br />
KILRKLYELGAARRVLVTGTGAMGCAPPAELAQHSRNGECYGA<br />
ALQTAAALFNPQLVDL DLIASVNAEIGQDVFV VAANAYQMNMDYLSNP PEQFGFVTSKVACCGQQ<br />
GPYNGIGLCTPPVSNLCPNRDLYAFWDDAFHPTEKANRIIVNQ<br />
QILTGSSKYMHPMNLS LSTAMLLDSSKI<br />
GENE ID: 255981<br />
DNAH1 | dynne<strong>in</strong>,<br />
axonemal, heavy cha<strong>in</strong> 1 [ [Homo sapiens]<br />
(10 or feweer<br />
PubMed l<strong>in</strong>ks) )<br />
Score = 322.3<br />
bits (72), Expect = 1.7, Method: M Composittional<br />
matrix adjust. a<br />
Identitiess<br />
= 35/151 (23%) ), Positives = 64/151 6 (42%), Ga Gaps = 18/151 (11%)<br />
Query 222<br />
Sbjct 395<br />
Query 278<br />
Sbjct 453<br />
Query 336<br />
Sbjct 503<br />
SKFTGKNRRVSVTVVSAAATTEPLTVLVTG<br />
GAGGRTGQIVYKKLKE KERSEQFVARGL-VRTKE<br />
113<br />
+ T +V+ + + V G G GQ<br />
EQ +ARG V +<br />
THLTEDTPKVNADIIEKVNQNQAKRCTVIG<br />
GGSGFLGQ-------HHMVEQLLARGYAVNV<br />
VFD 68<br />
SKEKI-NGEDEVFIIGDIRDTASIAPAVEG<br />
GIDALVILTSAVPQMK MKPGFDPSKGGRPEFF FFD 172<br />
++ N + F+ +GD+ + PA++G G++ + A P P E F+ F<br />
IQQGFDNPQVRFFLLGDLCSRQDLYPALKG<br />
GVNT--VFHCASP--------PPSSNNKELF<br />
FY- 118<br />
DGAYPEQVDWIGQKKNQIDAAKAAGVKQIV<br />
VLVGS 205<br />
+V++IG KKN<br />
I+ K AGV++++ +L S<br />
------RVNYIGTKKNVIETCKEAGVQKLI<br />
ILTSS 145<br />
CCLFINDLDAGAGRRMGGTTQYTVNNQMVN<br />
NATLMNIADNPTNVQL QLPGMYNKEENARVPIIC<br />
282<br />
C +F++++DA GRR<br />
++ T ++ + TLM + + Q+ G + RV + I<br />
CIIFMDEIDAIGGRRR--FSEGTSADREIQ<br />
QRTLMELLN-----QM QMDGF---DTLHRVKM MIM 277<br />
TGNDFSTLYAPLIRRDGRMEKFYWA--PTR<br />
REDRIGVCK----GIFFRTDKIKDEDIVTLV<br />
VDQ 336<br />
N TL L+RR<br />
GR+++ P + R+ + K I + +I E IV L D<br />
ATNRPDTLDPALLRRPGRLDRKIHIDLPNE<br />
EQARLDILKIHAGPITTKHGEIDYEAIVKLSDG<br />
337<br />
FPGQSI 342<br />
F G +<br />
FNGADL 343<br />
EEWIVDQMHIVRPVVVETGYENLLLVRLLL<br />
LETKIPSIRKSSVAEGGLTVDGILE--SWAK<br />
KFK 101<br />
+ ++ Q + R VET + LL + +L L+ + PS+ A L D ++ SW +<br />
QHQLLQQNRLHRQSSVETARKQLLEYQTML<br />
LKGRCPSV----SAPSSLITDSVISVPSWKSER<br />
650<br />
MGCAPAE----LAQQHSRNGECYGALQTAA<br />
AALFNPQLVDLIASVN VNAEIGQDVFVAANAY YQM 277<br />
+ C P++ +++ + S + AL T P +++ ++S+ E+ D + N ++<br />
VDCMPSDGQHVISEEQSLSKIKQWALSTPR<br />
RMRKGPSVLEHLSSLAAREVSLDYERSMN--KI<br />
452<br />
NMDYL--SNPEQFGGFVTSKVACCGQGPYN<br />
NGIGLCTPVSNLCPNR NRDLYAFWDAFHPTEK KAN 335<br />
N D++ S PE F +VT Q P G+ + P<br />
Y FW+ +<br />
NFDHVVSSKPETFSSYVTLPKKEEEQVPER<br />
RGL-VSVPK----------YHFWEQKEDFTF<br />
FVS 502<br />
RIIVNQILTGSSKYYMHPMNLSTAMLLDSS<br />
SKI 366<br />
+ +++T SK N TAM L S +<br />
LLTRPEVITALSKVVRAECNKVTAMSLFHS<br />
SSL 533<br />
>AT3G06340<br />
MSINRDEALRAAKDLAEGLMKKTDFTAAARKLAMKAQKMDSSL<br />
LENISRMIMVCDVHCA CAATEKLFGTEMDWYG GILQVEQIANDVIIKK KQYKRLALLLHPDKNKK<br />
LPGAESAFKLIIGEAQRILLDREKRTLLHDNKRKTWRKPAAPP<br />
PYKAQQMPNYHTQPHF HFRASVNTRNIFTELR RPEIRHPFQKAQAQPA AAFTHLKTFGTSCVFCC<br />
RVRYEYDRAHVVNKEVTCETCKKRFTAAFEEPLQSAPQAKGPS<br />
SQTTYCFPQQSKFPDQ DQRACSEPHKRPENPP PTVSSSKASFPMPGSTAKHNGKRKRKNVAECC<br />
SESSDSESSSEESEDDVNNDTTAAQDSSGSNGGEQPRRSVRSK<br />
KQKVSYNENLSDDDVD VDLVNDNGEGSGKNID DTEREKETEEEKQTNENHSSTESIDMNGKIEE<br />
VDQVETPSGASSDSEEDLSSGSAEKPNNLINYDDPDFNDFDKL<br />
LREKSCFQAGQIWAVY VYDEEEGMPRFYALIK KKVTTPDFMLRYVWFEVDQDQENETPNLPVSS<br />
VGKFVVGNIEEETNLCSIFSHFVYSTTTKIRTRKFTVFPKKGE<br />
EIWALFKNWDINCSAD ADSVSPMKYEYEFVEILSDHAEGATVSVGFL<br />
LSKVQGFNCVFCPMPKK<br />
DESNTCEIPPHHEFCRFSHSIPSFRLTTGTEGRGITKGWYELD<br />
DPAALPASVSQNLSGE GEEAAQDRDRQSPPSG GSAS<br />
> pdb| |2CTP|A Chai<strong>in</strong><br />
A, Solution Structure S<br />
Of J-DDoma<strong>in</strong><br />
From Hum man Dnaj Subfamily<br />
B Menber 122<br />
Length=78<br />
Score = 699.3<br />
bits (168),<br />
Expect = 1e-11 1, Method: Compoositional<br />
matrix<br />
adjust.
Identities = 34/66 (51%), Positives = 43/66 (65%), Gaps = 0/66 (0%)<br />
Query 63 GTEMDWYGILQVEQIANDVIIKKQYKRLALLLHPDKNKLPGAESAFKLIGEAQRILLDRE 122<br />
G+ D+Y IL V + A+D +KK Y+RLAL HPDKN PGA AFK IG A +L + E<br />
Sbjct 4 GSSGDYYEILGVSRGASDEDLKKAYRRLALKFHPDKNHAPGATEAFKAIGTAYAVLSNPE 63<br />
Query 123 KRTLHD 128<br />
KR +D<br />
Sbjct 64 KRKQYD 69<br />
>AT3G08580<br />
MVDQVQHPTIAQKAAGQFMRSSVSKDVQVGYQRPSMYQRHATYGNYSNAAFQFPPTSRMLATTASPVFVQTPGEKGFTNFALDFLMGGVSAAVSKTAAAP<br />
IERVKLLIQNQDEMIKAGRLSEPYKGIGDCFGRTIKDEGFGSLWRGNTANVIRYFPTQALNFAFKDYFKRLFNFKKDRDGYWKWFAGNLASGGAAGASSL<br />
LFVYSLDYARTRLANDAKAAKKGGGGRQFDGLVDVYRKTLKTDGIAGLYRGFNISCVGIIVYRGLYFGLYDSVKPVLLTGDLQDSFFASFALGWVITNGA<br />
GLASYPIDTVRRRMMMTSGEAVKYKSSLDAFKQILKNEGAKSLFKGAGANILRAVAGAGVLSGYDKLQLIVFGKKYGSGGA<br />
GENE ID: 291 SLC25A4 | solute carrier family 25 (mitochondrial carrier; aden<strong>in</strong>e<br />
nucleotide translocator), member 4 [Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 296 bits (757), Expect = 8e-80, Method: Compositional matrix adjust.<br />
Identities = 163/294 (55%), Positives = 202/294 (68%), Gaps = 9/294 (3%)<br />
Query 79 NFALDFLMGGVSAAVSKTAAAPIERVKLLIQNQDEMIKAGRLSEPYKGIGDCFGRTIKDE 138<br />
+F DFL GGV+AAVSKTA APIERVKLL+Q Q K + YKGI DC R K++<br />
Sbjct 7 SFLKDFLAGGVAAAVSKTAVAPIERVKLLLQVQHAS-KQISAEKQYKGIIDCVVRIPKEQ 65<br />
Query 139 GFGSLWRGNTANVIRYFPTQALNFAFKDYFKRLFNFKKDRDG-YWKWFAGNLASGGAAGA 197<br />
GF S WRGN ANVIRYFPTQALNFAFKD +K+LF DR +W++FAGNLASGGAAGA<br />
Sbjct 66 GFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGGVDRHKQFWRYFAGNLASGGAAGA 125<br />
Query 198 SSLLFVYSLDYARTRLANDAKAAKKGGGGRQFDGLVDVYRKTLKTDGIAGLYRGFNISCV 257<br />
+SL FVY LD+ARTRLA D KG R+F GL D K K+DG+ GLY+GFN+S<br />
Sbjct 126 TSLCFVYPLDFARTRLAAD---VGKGAAQREFHGLGDCIIKIFKSDGLRGLYQGFNVSVQ 182<br />
Query 258 GIIVYRGLYFGLYDSVKPVLLTGDLQDSFFASFALGWVITNGAGLASYPIDTVRRRMMMT 317<br />
GII+YR YFG+YD+ K +L F S+ + +T AGL SYP DTVRRRMMM<br />
Sbjct 183 GIIIYRAAYFGVYDTAKG-MLPDPKNVHIFVSWMIAQSVTAVAGLVSYPFDTVRRRMMMQ 241<br />
Query 318 SGEA---VKYKSSLDAFKQILKNEGAKSLFKGAGANILRAVAGAGVLSGYDKLQ 368<br />
SG + Y ++D +++I K+EGAK+ FKGA +N+LR + GA VL YD+++<br />
Sbjct 242 SGRKGADIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGMGGAFVLVLYDEIK 295<br />
Transmembrane alpha helices (green) predicted by TmConsens prediction<br />
1 mvdqvqhpti aqkaagqfmr ssvskdvqvg yqrpsmyqrh atygnysnaa fqfpptsrml<br />
61 attaspvfvq tpgekgftnf ALDFLMGGVS AAVSKTAAAP Iervklliqn qdemikagrl<br />
121 sepykgigdc fgrtikdegf gslwrgntan viryfptqal nfafkdyfkr lfnfkkdrdg<br />
181 ywkwFAGNLA SGGAAGASSL LFVYSldyar trl<strong>and</strong>akaa kkggggrqfd glvdvyrktl<br />
241 ktdgiaGLYR GFNISCVGII VYRGLYFgly dsvkpvlltg dlqdSFFASF ALGWVITNGA<br />
301 GLASYpidtv rrrmmmtsge avkyksslda fkqilknega kslfkGAGAN ILRAVAGAGV<br />
361 LSGYDKlqli vfgkkygsgg a<br />
>AT3G11710<br />
MEGAADQTTKALSELAMDSSTTLNAAESSAGDGAGPRSKNALKKEQKMKQKEEEKRRKDEEKAEKAKQAPKASSQKAVAADDEEMDATQYYENRLKYLAA<br />
EKAKGENPYPHKFAVSMSIPKYIETYGSLNNGDHVENAEESLAGRIMSKRSSSSKLFFYDLHGDDFKVQVMADASKSGLDEAEFLKLHSNAKRGDIVGVI<br />
GFPGKTKRGELSIFPRSFILLSHCLHMMPRKADNVNAKKPEIWVPGQTRNPEAYVLKDQESRYRQRHLDMILNVEVRQIFRTRAKIISYVRRFLDNKNFL<br />
EVETPMMNMIAGGAAARPFVTHHNDLDMRLYMRIAPELYLKQLIVGGLERVYEIGKQFRNEGIDLTHNPEFTTCEFYMAFADYNDLMEMTEVMLSGMVKE<br />
LTGGYKIKYNANGYDKDPIEIDFTPPFRRIEMIGELEKVAKLNIPKDLASEEANKYLIDACARFDVKCPPPQTTARLLDKLVGEFLEPTCVNPTFIINQP<br />
EIMSPLAKWHRSKSGLTERFELFINKHELCNAYTELNDPVVQRQRFADQLKDRQSGDDEAMALDETFCNALEYGLAPTGGWGLGIDRLSMLLTDSLNIKE<br />
VLFFPAMRPPQEESAAAQAPLTEEKK<br />
GENE ID: 3735 KARS | lysyl-tRNA synthetase [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.<br />
Identities = 337/603 (55%), Positives = 428/603 (70%), Gaps = 31/603 (5%)<br />
Query 23 LNAAESSAGDGAGPR-SKNALKKEQKMKQKEEEKRRKDEEKAEK-----AKQAPKASSQK 76<br />
+ AAE DG+ P+ SKN LK+ K ++K EK K +E +EK A ++<br />
Sbjct 4 VQAAEVKV-DGSEPKLSKNELKRRLKAEKKVAEKEAKQKELSEKQLSQATAAATNHTTDN 62<br />
Query 77 AVAADDEEMDATQYYENRLKYLAAEKAKGENPYPHKFAVSMSIPKYIETYGSLNNGDHVE 136<br />
V ++E +D QYY+ R + + K GE+PYPHKF V +S+ +I+ Y L GDH+<br />
Sbjct 63 GVGPEEESVDPNQYYKIRSQAIHQLKVNGEDPYPHKFHVDISLTDFIQKYSHLQPGDHLT 122<br />
Query 137 NAEESLAGRIMSKRSSSSKLFFYDLHGDDFKVQVMADASKSGLDEAEFLKLHSNAKRGDI 196<br />
+ +AGRI +KR+S KL FYDL G+ K+QVMA+ S++ E EF+ +++ +RGDI<br />
Sbjct 123 DITLKVAGRIHAKRASGGKLIFYDLRGEGVKLQVMAN-SRNYKSEEEFIHINNKLRRGDI 181<br />
Query 197 VGVIGFPGKTKRGELSIFPRSFILLSHCLHMMPRKADNVNAKKPEIWVPGQTRNPEAYVL 256<br />
+GV G PGKTK+GELSI P LLS CLHM+P + L<br />
Sbjct 182 IGVQGNPGKTKKGELSIIPYEITLLSPCLHMLPHLH---------------------FGL 220<br />
Query 257 KDQESRYRQRHLDMILNVEVRQIFRTRAKIISYVRRFLDNKNFLEVETPMMNMIAGGAAA 316<br />
KD+E+RYRQR+LD+ILN VRQ F R+KII+Y+R FLD FLE+ETPMMN+I GGA A<br />
Sbjct 221 KDKETRYRQRYLDLILNDFVRQKFIIRSKIITYIRSFLDELGFLEIETPMMNIIPGGAVA 280<br />
Query 317 RPFVTHHNDLDMRLYMRIAPELYLKQLIVGGLERVYEIGKQFRNEGIDLTHNPEFTTCEF 376<br />
+PF+T+HN+LDM LYMRIAPELY K L+VGG++RVYEIG+QFRNEGIDLTHNPEFTTCEF<br />
Sbjct 281 KPFITYHNELDMNLYMRIAPELYHKMLVVGGIDRVYEIGRQFRNEGIDLTHNPEFTTCEF 340<br />
Query 377 YMAFADYNDLMEMTEVMLSGMVKELTGGYKIKYNANGYDKDPIEIDFTPPFRRIEMIGEL 436<br />
YMA+ADY+DLME+TE M+SGMVK +TG YK+ Y+ +G + ++DFTPPFRRI M+ EL<br />
Sbjct 341 YMAYADYHDLMEITEKMVSGMVKHITGSYKVTYHPDGPEGQAYDVDFTPPFRRINMVEEL 400<br />
Query 437 EKVAKLNIPKD--LASEEANKYLIDACARFDVKCPPPQTTARLLDKLVGEFLEPTCVNPT 494<br />
EK + +P+ +EE K L D C V+CPPP+TTARLLDKLVGEFLE TC+NPT<br />
Sbjct 401 EKALGMKLPETNLFETEETRKILDDICVAKAVECPPPRTTARLLDKLVGEFLEVTCINPT 460<br />
Query 495 FIINQPEIMSPLAKWHRSKSGLTERFELFINKHELCNAYTELNDPVVQRQRFADQLKDRQ 554<br />
FI + P+IMSPLAKWHRSK GLTERFELF+ K E+CNAYTELNDP+ QRQ F +Q K +
Sbjct 461<br />
Query 555<br />
Sbjct 521<br />
Query 615<br />
Sbjct 581<br />
>AT3G12780<br />
MASAAASSAFSSLLKSTGAVASSAGTRRARASLLPIPSTSVSA<br />
ARPLGFSATLDSRRFS FSLHVASKVESVRGKG GSRGVVSMAKKSVGDL LTSADLKGKKVFVRADD<br />
LNVPLDDNQTIITDDTRIRAAIPTIKYYLIENGAKVILSTHLG<br />
GRPKGVTPKFSLAPLV LVPRLSELLGIEVTKA ADDCIGPEVESLVASL LPEGGVLLLENVRFYKK<br />
EEEKNDPEFAKKKLASLADLYVNDAFGGTAHRAHASTEGVTKF<br />
FLKPSVAGFLLQKELD LDYLVGAVSNPKRPFA AAIVGGSKVSSKIGVIESLLEKCDILLLGGGG<br />
MIFTFYKAQGLLSVGSSLVEEDKLELAATELLAKAKAKGVSLL<br />
LLPTDVVVADKFAPDA DANSKIVPASGIEDGW WMGLDIGPDSIKTFNEALDTTQTVIWNGPMGG<br />
VFEMEKFAAGTTEAIANKLAELSEKGVVTTIIGGGDSVAAVEK<br />
KVGVAGVMSHISTGGG GGASLELLEGKVLPGV VIALDEAIPVTV<br />
GENE ID: 52230<br />
PGK1 | phospphoglycerate<br />
k<strong>in</strong> nase 1 [Homo sappiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 3374<br />
bits (961), Expect = 2e-10 03, Method: Comp mpositional matr rix adjust.<br />
Identitiess<br />
= 197/409 (48% %), Positives = 274/409 (66%), Gaps = 23/409 (5%)<br />
Query 85<br />
Sbjct 9<br />
Query 144<br />
Sbjct 68<br />
Query 202<br />
Sbjct 128<br />
Query 247<br />
Sbjct 186<br />
Query 307<br />
Sbjct 246<br />
Query 365<br />
Sbjct 306<br />
Query 425<br />
Sbjct 366<br />
>AT3G16040<br />
MSSKQGGKLKPPLKQPKSGKKEYDEHDDMELMQKKKDEEKALK<br />
KELRAKASQKGSFGGS GSGLKKSGKK<br />
> gb|EEAW99900.1|<br />
hCGG1644435,<br />
is<strong>of</strong>or rm CRA_a [Homo ssapiens]<br />
Length=51<br />
Score = 322.3<br />
bits (72), Expect = 1.6, Method: M Composittional<br />
matrix adjust. a<br />
Identitiess<br />
= 25/64 (39%), , Positives = 33 3/64 (51%), Gapss<br />
= 13/64 (20%)<br />
Query 1<br />
Sbjct 1<br />
Query 61<br />
Sbjct 48<br />
>AT3G16640<br />
MLVYQDLLTGDDELLSDSFPYKEIENGGILWEVEGKWVTVGAV<br />
VDVNIGANPSAEEGGE GEDEGVDDSTQKVVDIVDTFRLQEQPTYDKK<br />
KGFIAYIKKYIKLLTPP<br />
KLSEEDQAVFKKKGIEGATKFLLPRLSSDFQFFVGEGMHDDST<br />
TLVFAYYKEGSTNPTF TFLYFAHGLKEVKC<br />
> gb|AAAQ01550.1|<br />
Length=172<br />
GENE ID: 77178<br />
TPT1 | tumoor<br />
prote<strong>in</strong>, tran nslation<strong>all</strong>y-conntrolled<br />
1 [Hom mo sapiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 1108<br />
bits (270), Expect = 2e-23 3, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 67/174 (38%) ), Positives = 98/174 9 (56%), Ga Gaps = 8/174 (4% )<br />
Query 1<br />
Sbjct 1<br />
Query 58<br />
Sbjct 60<br />
Query 118<br />
Sbjct 119<br />
>AT3G16890<br />
FICDHPQIMSPLAKKWHRSKEGLTERFELF<br />
FVMKKEICNAYTELND NDPMRQRQLFEEQAKA AKA 520<br />
SGDDEAMALDETFCCNALEYGLAPTGGWGL<br />
LGIDRLSMLLTDSLNI NIKEVLFFPAMRPPQEES<br />
614<br />
+GDDEAM +DE FCC<br />
ALEYGL PT GWG+ +GIDR++M LTDS NI NIKEVL FPAM+P + +<br />
AGDDEAMFIDENFCCTALEYGLPPTAGWGM<br />
MGIDRVAMFLTDSNNI NIKEVLLFPAMKPEDK KKE 580<br />
AAA 617<br />
A<br />
NVA 583<br />
LTSADLKGKKVFVRRADLNVPLDDNQTITD<br />
DDTRIRAAIPTIKYLIIENGAK-VILSTHLG<br />
GRP 143<br />
L D+KGK+V +RR<br />
D NVP+ +NQ IT+ ++ RI+AA+P+IK+ + ++NGAK V+L +HLG GRP<br />
LDKLDVKGKRVVMRRVDFNVPMKNNQ-ITN<br />
NNQRIKAAVPSIKFCLLDNGAKSVVLMSHLG<br />
GRP 67<br />
KGVT--PKFSLAPLLVPRLSELLGIEVTKA<br />
ADDCIGPEVESLVASLLPEGGVLLLENVRFY<br />
YKE 201<br />
GV K+SL P+ + L LLG +V DC+GPEVE A+ G V+LLEN+RF+ E<br />
DGVPMPDKYSLEPVVAVELKSLLGKDVLFL<br />
LKDCVGPEVEKACANP NPAAGSVILLENLRFH HVE 127<br />
EE-----------KKNDPE----FAKKLAS<br />
SLADLYVNDAFGTAHR HRAHASTEGVTKFLKP PSV 246<br />
EE K +P F L+ L D+YVNDAFGTAHR HRAH+S GV L<br />
EEGKGKDASGNKVKKAEPAKIEAFRASLSK<br />
KLGDVYVNDAFGTAHR HRAHSSMVGVN--LPQ QKA 185<br />
AGFLLQKELDYLVGGAVSNPKRPFAAIVGG<br />
GSKVSSKIGVIESLLEEKCDILLLGGGMIFTFY<br />
306<br />
GFL++KEL+Y A+ +P+RPF AI+GG G+KV+ KI +I ++L+ +K + +++GGGM FTF<br />
GGFLMKKELNYFAKKALESPERPFLAILGG<br />
GAKVADKIQLINNMLDDKVNEMIIGGGMAFTFL<br />
245<br />
KA-QGLSVGSSLVEEEDKLELATELLAKAK<br />
KAKGVSLLLPTDVVVA VADKFAPDANS-KIVP PAS 364<br />
K + +G+SL + +E+ ++ +L++KA+ + GV + LP D V AADKF<br />
+A + + AS<br />
KVLNNMEIGTSLFDDEEGAKIVKDLMSKAE<br />
EKNGVKITLPVDFVTAADKFDENAKTGQATV<br />
VAS 305<br />
GIEDGWMGLDIGPDDSIKTFNEALDTTQTV<br />
VIWNGPMGVFEMEKFA FAAGTEAIANKLAELSEK<br />
424<br />
GI GWMGLD GP+ +S K + EA+ + ++WNGP+GVFE +<br />
E FA GT+A+ +++ + + +<br />
GIPAGWMGLDCGPEESSKKYAEAVTRAKQI<br />
IVWNGPVGVFEWEAFA FARGTKALMDEVVKATSR<br />
365<br />
GVTTIIGGGDSVAAAVEKVGVAGVMSHIST<br />
TGGGASLELLEGKVLPPGVIAL<br />
473<br />
G TIIGGGD+ K +SH+ST TGGGASLELLEGKVLPPGV<br />
AL<br />
GCITIIGGGDTATCCCAKWNTEDKVSHVST<br />
TGGGASLELLEGKVLPPGVDAL<br />
414<br />
MSSKQGGKLKPLKQPPKSGKKEYDEHDMELM<br />
MQKKKDEEKALKELRA RAKASQKGSFGGSGLK KK 60<br />
MS +GGK +PLKQ K KE D+ D+ QK+ + E<br />
G+ G+K KK<br />
MSGHKGGKKQPLKQHHKEQAKEMDKEDVAFK<br />
KQKQTEAE--------------GALDTGGVK<br />
KK 47<br />
SGKK 64<br />
SGKK<br />
SGKK 51<br />
TCTP [Homo sapi iens]<br />
MLVYQDLLTGDELLLSDSFPYKEIENGILW<br />
WEVEGKWV--TVGAVD VDVN-IGANPSAEEGG GED 57<br />
M++Y+DL++ DE+ SD + +EI +G+ EVEGK V T G +DD<br />
+ IG N SAE G E<br />
MIIYRDLISHDEMFFSDIYKIREIADGLCL<br />
LEVEGKMVSRTEGNIDDDSLIGGNASAE-GP<br />
PEG 59<br />
EGVDDSTQKVVDIVVDTFRLQEQPTYDKKG<br />
GFIAYIKKYIKLLTPKKLSEEDQAVFKKGIEGA<br />
117<br />
EG + + VDIVV<br />
LQE ++ K+ + YIK Y+K + KKL<br />
E+ K + GA<br />
EGTESTVITGVDIVVMNHHLQET-SFTKEA<br />
AYKKYIKDYMKSIKGK GKLEEQRPERVKPFMTGA<br />
118<br />
T---KFLLPRLSDFFQFFVGEGMHDDSTLV<br />
VFAYYKEGSTNPTFLYYFAHGLKEVKC<br />
168<br />
K +L ++ +QFF+GE M+ D + Y+E P ++ +F GLK KC<br />
AEQIKHILANFKNYYQFFIGENMNPDGMVA<br />
ALLDYREDGVTPYMIFFFKDGLKMEKC<br />
172
MRGFASSASRIIATAAAASKSLNASTSSVNPKLSKTLNSSGKP<br />
PTNPLNQRYISQVIER ERKDWFLILNQEFTTH HRIGLNTRFVISVLQN NQDNPLHSLRFYLWVSS<br />
NFDPVYAKDQSSLKSVLGNALFRKGPLLLLSMELLKEIRDSGY<br />
YRISDELMCVLIGSWG WGRLGLAKYCNDVFAQ QISFLGMKPSTRLYNA AVIDALVKSNSLDLAYY<br />
LKFQQMRSDGCCKPDRFTYNILIHGVCCKKGVVDEAIRLVKQM<br />
MEQEGNRPNVFTYTILLIDGFLIAGRVDEAL<br />
LKQLEMMRVRKLNPNEATIRTFVHGIFRCLPP<br />
PCKAFEVLVGFFMEKDSNLQRVGYDAVVLYCLSNNSMAKETGQ<br />
QFLRKIGERGYIPDSSSTFNAAMSCLLKGHD<br />
DLVETCRIFDGFVSRG GVKPGFNGYLVLVQALL<br />
LNAQRFSEGDRRYLKQMGVDGLLSSVYYSYNAVIDCLCKARRI<br />
IENAAMFLTEMQDRGI GISPNLVTFNTFLSGY YSVRGDVKKVHGVLEK KLLVHGFKPDVITFSLL<br />
IINCLCRAKEIIKDAFDCFKEMLEWGIIEPNEITYNILIRSCC<br />
CSTGDTDRSVKLFAKM KMKENGLSPDLYAYNA ATIQSFCKMRKVKKAEELLKTMLRIGLKPDNN<br />
FTYSTLIKALSSESGRESEAREMFSSIIERHGCVPDSYTKRLV<br />
VEELDLRKSGLSRETV TVSAS<br />
> gb|AAAH26034.1|<br />
Length=531<br />
GENE ID: 110128<br />
LRPPRC | lleuc<strong>in</strong>e-rich<br />
PPR R-motif conta<strong>in</strong>i<strong>in</strong>g<br />
[Homo sapiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 600.8<br />
bits (146), Expect = 5e-09 9, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 37/150 (24%) ), Positives = 67/150 6 (44%), Ga Gaps = 0/150 (0% )<br />
Query 479<br />
Sbjct 146<br />
Query 539<br />
Sbjct 206<br />
Query 599<br />
Sbjct 266<br />
Score = 577.4<br />
bits (137), Expect = 5e-08 8, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 35/142 (24%) ), Positives = 65/142 6 (45%), Ga Gaps = 0/142 (0% )<br />
Query 420<br />
Sbjct 157<br />
Query 480<br />
Sbjct 217<br />
Query 540<br />
Sbjct 277<br />
Score = 499.7<br />
bits (117), Expect = 1e-05 5, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 37/154 (24%) ), Positives = 64/154 6 (41%), Ga Gaps = 4/154 (2% )<br />
Query 337<br />
Sbjct 140<br />
Query 393<br />
Sbjct 200<br />
Query 453<br />
Sbjct 260<br />
Score = 433.9<br />
bits (102), Expect = 7e-04 4, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 76/405 (18%) ), Positives = 158/405 1 (39%), GGaps<br />
= 47/405 ( 11%)<br />
Query 162<br />
Sbjct 144<br />
Query 221<br />
Sbjct 203<br />
Query 281<br />
Sbjct 263<br />
Query 341<br />
Sbjct 323<br />
Query 400<br />
Sbjct 358<br />
Query 460<br />
Sbjct 406<br />
Query 520<br />
Sbjct 464<br />
Score = 366.6<br />
bits (83), Expect = 0.11, Method: Composiitional<br />
matrix adjust.<br />
Identitiess<br />
= 24/106 (22%) ), Positives = 43/106 4 (40%), Ga Gaps = 2/106 (1% )<br />
Query 533<br />
Sbjct 165<br />
LRPPRC prote<strong>in</strong> [Homo sapiens]<br />
VHGVLEKLLVHGFKKPDVITFSLIINCLCR<br />
RAKEIKDAFDCFKEML MLEWGIEPNEITYNIL LIR 538<br />
H + + L G DV ++ ++ + + D +M M E I+PN +TY LI L<br />
AHRIWDTLQKLGAVVYDVSHYNALLKVYLQ<br />
QNEYKFSPTDFLAKME MEEANIQPNRVTYQRL LIA 205<br />
SCCSTGDTDRSVKLLFAKMKENGLSPDLYA<br />
AYNATIQSFCKMRKVK VKKAEELLKTMLRIGL LKP 598<br />
S C+ GD + + K+ + MK L ++A + + ++ + AE +L M G+ +P<br />
SYCNVGDIEGASKIILGFMKTKDLPVTEAV<br />
VFSALVTGHARAGDME MENAENILTVMRDAGIEP<br />
265<br />
DNFTYSTLIKALSEESGRESEAREMFSSIE<br />
ER 628<br />
TY L+ A +EE<br />
G ++ +E E+<br />
GPDTYLALLNAYAEEKGDIDHVKQTLEKVE<br />
EK 295<br />
GLLSSVYSYNAVIDDCLCKARRIENAAMFL<br />
LTEMQDRGISPNLVTFFNTFLSGYSVRGDVK<br />
KKV 479<br />
G + V YNA++ + + FL L +M++ I PN VT+ + ++ Y GD++<br />
GAVYDVSHYNALLKKVYLQNEYKFSPTDFL<br />
LAKMEEANIQPNRVTYYQRLIASYCNVGDIEGA<br />
216<br />
HGVLEKLLVHGFKPPDVITFSLIINCLCRA<br />
AKEIKDAFDCFKEMLEEWGIEPNEITYNILIRS<br />
539<br />
+L +<br />
FS ++ RA A ++++A + M + GIEP TY L+ +<br />
SKILGFMKTKDLPVVTEAVFSALVTGHARA<br />
AGDMENAENILTVMRD RDAGIEPGPDTYLALL LNA 276<br />
CCSTGDTDRSVKLFFAKMKENGL<br />
561<br />
GD D + K++++ L<br />
YAEKGDIDHVKQTLLEKVEKSEL<br />
298<br />
KETGQFLRKIGER-----GYIPDSSTFNAA<br />
AMSCLLKGHDLVETCRRIFDGFVSRGVKPGF<br />
FNG 392<br />
+E +F +I + G + D S +NA + L+<br />
++P<br />
EERTEFAHRIWDTLLQKLGAVYDVSHYNAL<br />
LLKVYLQNEYKFSPTDDFLAKMEEANIQPNR<br />
RVT 199<br />
YLVLVQALLNAQRFFSEGDRYLKQMGVDGL<br />
LLSSVYSYNAVIDCLCCKARRIENAAMFLTEMQ<br />
452<br />
Y L+ + N + L M L + ++A++ +A +ENA LT M+<br />
YQRLIASYCNVGDIIEGASKILGFMKTKDL<br />
LPVTEAVFSALVTGHA HARAGDMENAENILTV VMR 259<br />
DRGISPNLVTFNTFFLSGYSVRGDVKKVHG<br />
GVLEKL 486<br />
D GI P T+ L+ Y+ +GD+ V LEK+<br />
DAGIEPGPDTYLALLLNAYAEKGDIDHVKQ<br />
QTLEKV 293<br />
KYCNDVFAQISFLGGMKPSTRLYNAVIDAL<br />
LVKSNSLDLAYLKF-QQQMRSDGCKPDRFTY<br />
YNI 220<br />
++ + ++ + LGG<br />
YNA++ ++ N + F +M +P+R TY Y<br />
EFAHRIWDTLQKLGGAVYDVSHYNALLKVY<br />
YLQ-NEYKFSPTDFLAAKMEEANIQPNRVTY<br />
YQR 202<br />
LIHGVCKKGVVDEAAIRLVKQMEQEGNRPN<br />
NVFTYTILIDGFLIAG AGRVDEALKQLEMMRV VRK 280<br />
LI C G ++ A +++ M+ + ++ L+ G AG ++ A L +MR<br />
LIASYCNVGDIEGAASKILGFMKTKDLPVT<br />
TEAVFSALVTGHARAG AGDMENAENILTVMRD DAG 262<br />
LNPNEATIRTFVHGGIFRCLPPCKAFEVLV<br />
VGFMEKDSNLQRVGYD YDAVLYCLSNNSMAKETG<br />
340<br />
+ P T ++<br />
+ L + + +L +++ S +<br />
IEPGPDTYLALLNAAYAEKGDIDHVKQTLE<br />
EKVEKSELHLMDRDLLLQIIFSFSKAGYPQY<br />
YVS 322<br />
QFLRKIG-ERGYIPPDSSTFNAAMSCLLKG<br />
GHDLVETCRIFDGFVS VSRGVKPGFNGYLVLV VQA 399<br />
+ L K+ ER YIPPD<br />
AM+ +L L+ T ++ D<br />
V + Q<br />
EILEKVTCERRYIPPD------AMNLIL--<br />
---LLVTEKLED----------------VAL<br />
LQI 357<br />
LLNAQRFSEGDRYLLKQMGVDGLLSSVYSY<br />
YNAVIDCLCKARRIENNAAMFLTEMQDRGISPN<br />
459<br />
LL E<br />
DG SV+ + C+ +E + ++++ +<br />
LLACPVSKE-----------DG--PSVFGS<br />
SFFLQHCVTMNTPVEKKLTDYCKKLKEVQMH<br />
HSF 405<br />
LVTFNTFLSGYSVRRGDVKKVHGVLEKLLV<br />
VHGFKPDVITFSLIINNCLCRAKEIKDAFDC<br />
CFK 519<br />
+ F + + + D+ K +++ + GF F ++ + K ++ + K<br />
PLQFTLHCALLANKKTDLAK--ALMKAVKE<br />
EEGFPIRPHYFWPLLVVGRRKEKNVQGIIEILK<br />
463<br />
EMLEWGIEPNEITYYNILIRSCCSTGDTDR<br />
RSVKLFAKMKENGLSPPD<br />
564<br />
M E G+ P++ TYY<br />
+ C + ++ R++ R ++ENG D<br />
GMQELGVHPDQETYYTDYVIPCFDSVNSAR<br />
RAI-----LQENGCLSSD<br />
503<br />
YNILIRSCCSTGDTTDRSVKLFAKMKENGL<br />
LSPDLYAYNATIQSFC FCKMRKVKKAEELLKTML<br />
592<br />
YN L++<br />
AKM+E + P+ Y I S+CC<br />
+ ++ A ++L M<br />
YNALLKVYLQNEYKKFSPTDFLAKMEEANI<br />
IQPNRVTYQRLIASYC YCNVGDIEGASKILGF FMK 224
Query 593 RIGLKPDNFTYSTLIKALSESGRESEAREMFSSIERHGCV--PDSY 636<br />
L +S L+ + +G A + + + G PD+Y<br />
Sbjct 225 TKDLPVTEAVFSALVTGHARAGDMENAENILTVMRDAGIEPGPDTY 270<br />
Score = 33.1 bits (74), Expect = 1.1, Method: Compositional matrix adjust.<br />
Identities = 29/143 (20%), Positives = 58/143 (40%), Gaps = 7/143 (4%)<br />
Query 493 PDVITFSLIINCLCRAKEIKDAFDCFKEMLEWGIEPNEITYNILIRSCCSTGDTDRSVKL 552<br />
P V + +C+ ++ D K++ E ++ + + TD + L<br />
Sbjct 369 PSVFGSFFLQHCVTMNTPVEKLTDYCKKLKE--VQMHSFPLQFTLHCALLANKTDLAKAL 426<br />
Query 553 FAKMKENGLSPDLYAYNATIQSFCKMRKVKKAEELLKTMLRIGLKPDNFTYSTLIKALSE 612<br />
+KE G + + + K + V+ E+LK M +G+ PD TY+ + +<br />
Sbjct 427 MKAVKEEGFPIRPHYFWPLLVGRRKEKNVQGIIEILKGMQELGVHPDQETYTDYVIPCFD 486<br />
Query 613 SGRESEAREMFSSIERHGCVPDS 635<br />
S + A ++ +GC+ DS<br />
Sbjct 487 SVNSARA-----ILQENGCLSDS 504<br />
>AT3G17820<br />
MSLLSDLVNLNLTDATGKIIAEYIWIGGSGMDIRSKARTLPGPVTDPSKLPKWNYDGSSTGQAAGEDSEVILYPQAIFKDPFRKGNNILVMCDAYTPAGD<br />
PIPTNKRHNAAKIFSHPDVAKEEPWYGIEQEYTLMQKDVNWPIGWPVGGYPGPQGPYYCGVGADKAIGRDIVDAHYKACLYAGIGISGINGEVMPGQWEF<br />
QVGPVEGISSGDQVWVARYLLERITEISGVIVSFDPKPVPGDWNGAGAHCNYSTKTMRNDGGLEVIKKAIGKLQLKHKEHIAAYGEGNERRLTGKHETAD<br />
INTFSWGVANRGASVRVGRDTEKEGKGYFEDRRPASNMDPYVVTSMIAETTILG<br />
GENE ID: 2752 GLUL | glutamate-ammonia ligase [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 405 bits (1041), Expect = 8e-113, Method: Compositional matrix adjust.<br />
Identities = 191/340 (56%), Positives = 243/340 (71%), Gaps = 7/340 (2%)<br />
Query 18 KIIAEYIWIGGSGMDIRSKARTLPGPVTDPSKLPKWNYDGSSTGQAAGEDSEVILYPQAI 77<br />
K+ A YIWI G+G +R K RTL +LP+WN+DGSST Q+ G +S++ L P A+<br />
Sbjct 25 KVQAMYIWIDGTGEGLRCKTRTLDSEPKCVEELPEWNFDGSSTLQSEGSNSDMYLVPAAM 84<br />
Query 78 FKDPFRKGNNILVMCDAYTPAGDPIPTNKRHNAAKIFSHPDVAKEEPWYGIEQEYTLMQK 137<br />
F+DPFRK N LV+C+ + P TN RH +I V+ + PW+G+EQEYTLM<br />
Sbjct 85 FRDPFRKDPNKLVLCEVFKYNRRPAETNLRHTCKRIMDM--VSNQHPWFGMEQEYTLMGT 142<br />
Query 138 DVNWPIGWPVGGYPGPQGPYYCGVGADKAIGRDIVDAHYKACLYAGIGISGINGEVMPGQ 197<br />
D + P GWP G+PGPQGPYYCGVGAD+A GRDIV+AHY+ACLYAG+ I+G N EVMP Q<br />
Sbjct 143 DGH-PFGWPSNGFPGPQGPYYCGVGADRAYGRDIVEAHYRACLYAGVKIAGTNAEVMPAQ 201<br />
Query 198 WEFQVGPVEGISSGDQVWVARYLLERITEISGVIVSFDPKPVPGDWNGAGAHCNYSTKTM 257<br />
WEFQ+GP EGIS GD +WVAR++L R+ E GVI +FDPKP+PG+WNGAG H N+STK M<br />
Sbjct 202 WEFQIGPCEGISMGDHLWVARFILHRVCEDFGVIATFDPKPIPGNWNGAGCHTNFSTKAM 261<br />
Query 258 RNDGGLEVIKKAIGKLQLKHKEHIAAY----GEGNERRLTGKHETADINTFSWGVANRGA 313<br />
R + GL+ I++AI KL +H+ HI AY G N RRLTG HET++IN FS GVANR A<br />
Sbjct 262 REENGLKYIEEAIEKLSKRHQYHIRAYDPKGGLDNARRLTGFHETSNINDFSAGVANRSA 321<br />
Query 314 SVRVGRDTEKEGKGYFEDRRPASNMDPYVVTSMIAETTIL 353<br />
S+R+ R +E KGYFEDRRP++N DP+ VT + T +L<br />
Sbjct 322 SIRIPRTVGQEKKGYFEDRRPSANCDPFSVTEALIRTCLL 361<br />
>AT3G23150<br />
MVKEIASWLLILSMVVFVSPVLAINGGGYPRCNCEDEGNSFWSTENILETQRVSDFLIAVAYFSIPIELLYFVSCSNVPFKWVLFEFIAFIVLCGMTHLL<br />
HGWTYSAHPFRLMMAFTVFKMLTALVSCATAITLITLIPLLLKVKVREFMLKKKAHELGREVGLILIKKETGFHVRMLTQEIRKSLDRHTILYTTLVELS<br />
KTLGLQNCAVWMPNDGGTEMDLTHELRGRGGYGGCSVSMEDLDVVRIRESDEVNVLSVDSSIARASGGGGDVSEIGAVAAIRMPMLRVSDFNGELSYAIL<br />
VCVLPGGTPRDWTYQEIEIVKVVADQVTVALDHAAVLEESQLMREKLAEQNRALQMAKRDALRASQARNAFQKTMSEGMRRPMHSILGLLSMIQDEKLSD<br />
EQKMIVDTMVKTGNVMSNLVGDSMDVPDGRFGTEMKPFSLHRTIHEAACMARCLCLCNGIRFLVDAEKSLPDNVVGDERRVFQVILHIVGSLVKPRKRQE<br />
GSSLMFKVLKERGSLDRSDHRWAAWRSPASSADGDVYIRFEMNVENDDSSSQSFASVSSRDQEVGDVRFSGGYGLGQDLSFGVCKKVVQLIHGNISVVPG<br />
SDGSPETMSLLLRFRRRPSISVHGSSESPAPDHHAHPHSNSLLRGLQVLLVDTNDSNRAVTRKLLEKLGCDVTAVSSGFDCLTAIAPGSSSPSTSFQVVV<br />
LDLQMAEMDGYEVAMRIRSRSWPLIVATTVSLDEEMWDKCAQIGINGVVRKPVVLRAMESELRRVLLQADQLL<br />
GENE ID: 6197 RPS6KA3 | ribosomal prote<strong>in</strong> S6 k<strong>in</strong>ase, 90kDa, polypeptide 3<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 32.7 bits (73), Expect = 1.3, Method: Compositional matrix adjust.<br />
Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 9/84 (10%)<br />
Query 546 NDDSSSQSFASVSSRDQEV--GDVRFSGGYGLGQDL---SFGVCKKVVQL---IHGNISV 597<br />
+D+S + V S Q++ ++F+ GY L +D+ S+ VCK+ + + + +<br />
Sbjct 392 DDESQAMQTVGVHSIVQQLHRNSIQFTDGYELKEDIGVGSYSVCKRCIHKATNMEFAVKI 451<br />
Query 598 VPGSDGSP-ETMSLLLRFRRRPSI 620<br />
+ S P E + +LLR+ + P+I<br />
Sbjct 452 IDKSKRDPTEEIEILLRYGQHPNI 475<br />
>AT3G25520<br />
MVFVKSTKSNAYFKRYQVKFRRRRDGKTDYRARIRLINQDKNKYNTPKYRFVVRFTNKDIVAQIVSASIAGDIVKASAYAHELPQYGLTVGLTNYAAAYC<br />
TGLLLARRVLKMLEMDDEYEGNVEATGEDFSVEPTDSRRPFRALLDVGLIRTTTGNRVFGALKGALDGGLDIPHSDKRFAGFHKENKQLDAEIHRNYIYG<br />
GHVSNYMKLLGEDEPEKLQTHFSAYIKKGVEAESIEELYKKVHAAIRADPNPKKTVKPAPKQHKRYNLKKLTYEERKNKLIERVKALNGAGGDDDDEDDE<br />
E<br />
GENE ID: 6125 RPL5 | ribosomal prote<strong>in</strong> L5 [Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 308 bits (789), Expect = 1e-83, Method: Compositional matrix adjust.<br />
Identities = 159/284 (55%), Positives = 208/284 (73%), Gaps = 2/284 (0%)<br />
Query 1 MVFVKSTKSNAYFKRYQVKFRRRRDGKTDYRARIRLINQDKNKYNTPKYRFVVRFTNKDI 60<br />
M FVK K+ AYFKRYQVKFRRRR+GKTDY AR RL+ QDKNKYNTPKYR +VR TN+DI<br />
Sbjct 1 MGFVKVVKNKAYFKRYQVKFRRRREGKTDYYARKRLVIQDKNKYNTPKYRMIVRVTNRDI 60<br />
Query 61 VAQIVSASIAGDIVKASAYAHELPQYGLTVGLTNYAAAYCTGLLLARRVLKMLEMDDEYE 120
+ QI A I GD++ +AYAHELP+YG+ VGLTNYAAAYCTGLLLARR+L MD YE<br />
Sbjct 61 ICQIAYARIEGDMIVCAAYAHELPKYGVKVGLTNYAAAYCTGLLLARRLLNRFGMDKIYE 120<br />
Query 121 GNVEATGEDFSVEPTDSRR-PFRALLDVGLIRTTTGNRVFGALKGALDGGLDIPHSDKRF 179<br />
G VE TG++++VE D + F LD GL RTTTGN+VFGALKGA+DGGL IPHS KRF<br />
Sbjct 121 GQVEVTGDEYNVESIDGQPGAFTCYLDAGLARTTTGNKVFGALKGAVDGGLSIPHSTKRF 180<br />
Query 180 AGFHKENKQLDAEIHRNYIYGGHVSNYMKLLGEDEPEKLQTHFSAYIKKGVEAESIEELY 239<br />
G+ E+K+ +AE+HR +I G +V++YM+ L E++ + + FS YIK V + +EE+Y<br />
Sbjct 181 PGYDSESKEFNAEVHRKHIMGQNVADYMRCLMEEDEDAYKKQFSQYIKNSVTPDMMEEMY 240<br />
Query 240 KKVHAAIRADPNPKKTVKPAPKQHKRYNLKKLTYEERKNKLIER 283<br />
KK HAAIR +P + + KR+N K++ ++K+++ ++<br />
Sbjct 241 KKAHAAIRENP-VYEKKPKKEVKKKRWNRPKMSLAQKKDRVAQK 283<br />
>AT3G46000<br />
MANAASGMAVHDDCKLKFMELKAKRTFRTIVYKIEDKQVIVEKLGEPEQSYDDFAASLPADDCRYCIYDFDFVTAENCQKSKIFFIAWSPDTAKVRDKMI<br />
YASSKDRFKRELDGIQVELQATDPTEMGLDVFKSRTN<br />
GENE ID: 1073 CFL2 | c<strong>of</strong>il<strong>in</strong> 2 (muscle) [Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 79.7 bits (195), Expect = 1e-14, Method: Compositional matrix adjust.<br />
Identities = 52/154 (33%), Positives = 81/154 (52%), Gaps = 28/154 (18%)<br />
Query 5 ASGMAVHDDCKLKFMELKAKRTF--------------------RTIVYKIEDKQVIVEKL 44<br />
ASG+ V+D+ F ++K +++ R I+ + E KQ++V +<br />
Sbjct 2 ASGVTVNDEVIKVFNDMKVRKSSTQEEIKKRKKAVLFCLSDDKRQIIVE-EAKQILVGDI 60<br />
Query 45 GEP-EQSYDDFAASLPADDCRYCIYDFDFVTAENCQKSKIFFIAWSPDTAKVRDKMIYAS 103<br />
G+ E Y F LP +DCRY +YD + T E+ +K + FI W+P++A ++ KMIYAS<br />
Sbjct 61 GDTVEDPYTSFVKLLPLNDCRYALYDATYETKES-KKEDLVFIFWAPESAPLKSKMIYAS 119<br />
Query 104 SKDRFKRELDGIQVELQATDPTEMGLDVFKSRTN 137<br />
SKD K++ GI+ E Q GLD K R+<br />
Sbjct 120 SKDAIKKKFTGIKHEWQVN-----GLDDIKDRST 148<br />
>AT3G46030<br />
MAPKAEKKPAEKKPVEEKSKAEKAPAEKKPKAGKKLPKEAGAGGDKKKKMKKKSVETYKIYIFKVLKQVHPDIGISSKAMGIMNSFINDIFEKLASESSK<br />
LARYNKKPTITSREIQTAVRLVLPGELAKHAVSEGTKAVTKFTSS<br />
GENE ID: 8340 HIST1H2BL | histone cluster 1, H2bl [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 170 bits (430), Expect = 5e-42, Method: Compositional matrix adjust.<br />
Identities = 87/127 (68%), Positives = 104/127 (81%), Gaps = 8/127 (6%)<br />
Query 21 AEKAPAEKKPKAGKK--LPKEAGAGGDKKKKMKKKSVETYKIYIFKVLKQVHPDIGISSK 78<br />
A+ APA PK G K + K G K+K+ +K E+Y +Y++KVLKQVHPD GISSK<br />
Sbjct 5 AKSAPA---PKKGSKKAVTKAQKKDGKKRKRSRK---ESYSVYVYKVLKQVHPDTGISSK 58<br />
Query 79 AMGIMNSFINDIFEKLASESSKLARYNKKPTITSREIQTAVRLVLPGELAKHAVSEGTKA 138<br />
AMGIMNSF+NDIFE++ASE+S+LA YNK+ TITSREIQTAVRL+LPGELAKHAVSEGTKA<br />
Sbjct 59 AMGIMNSFVNDIFERIASEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKA 118<br />
Query 139 VTKFTSS 145<br />
VTK+TSS<br />
Sbjct 119 VTKYTSS 125<br />
>AT3G46440<br />
MASSDKQTSPKPPPSPSPLRNSKFCQSNMRILISGGAGFIGSHLVDKLMENEKNEVIVADNYFTGSKDNLKKWIGHPRFELIRHDVTEPLLIEVDQIYHL<br />
ACPASPIFYKYNPVKTIKTNVIGTLNMLGLAKRVGARILLTSTSEVYGDPLIHPQPESYWGNVNPIGVRSCYDEGKRVAETLMFDYHRQHGIEIRIARIF<br />
NTYGPRMNIDDGRVVSNFIAQALRGEALTVQKPGTQTRSFCYVSDMVDGLMRLMEGDDTGPINIGNPGEFTMVELAETVKELINPSIEIKMVENTPDDPR<br />
QRKPDITKAKEVLGWEPKVKLREGLPLMEEDFRLRLGVHKN<br />
GENE ID: 80146 UX<strong>S1</strong> | UDP-glucuronate decarboxylase 1 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 418 bits (1075), Expect = 1e-116, Method: Compositional matrix adjust.<br />
Identities = 199/312 (63%), Positives = 242/312 (77%), Gaps = 1/312 (0%)<br />
Query 30 RILISGGAGFIGSHLVDKLMENEKNEVIVADNYFTGSKDNLKKWIGHPRFELIRHDVTEP 89<br />
RILI+GGAGF+GSHL DKLM + +EV V DN+FTG K N++ WIGH FELI HDV EP<br />
Sbjct 95 RILITGGAGFVGSHLTDKLMM-DGHEVTVVDNFFTGRKRNVEHWIGHENFELINHDVVEP 153<br />
Query 90 LLIEVDQIYHLACPASPIFYKYNPVKTIKTNVIGTLNMLGLAKRVGARILLTSTSEVYGD 149<br />
L IEVDQIYHLA PASP Y YNP+KT+KTN IGTLNMLGLAKRVGAR+LL STSEVYGD<br />
Sbjct 154 LYIEVDQIYHLASPASPPNYMYNPIKTLKTNTIGTLNMLGLAKRVGARLLLASTSEVYGD 213<br />
Query 150 PLIHPQPESYWGNVNPIGVRSCYDEGKRVAETLMFDYHRQHGIEIRIARIFNTYGPRMNI 209<br />
P +HPQ E YWG+VNPIG R+CYDEGKRVAET+ + Y +Q G+E+R+ARIFNT+GPRM++<br />
Sbjct 214 PEVHPQSEDYWGHVNPIGPRACYDEGKRVAETMCYAYMKQEGVEVRVARIFNTFGPRMHM 273<br />
Query 210 DDGRVVSNFIAQALRGEALTVQKPGTQTRSFCYVSDMVDGLMRLMEGDDTGPINIGNPGE 269<br />
+DGRVVSNFI QAL+GE LTV G+QTR+F YVSD+V+GL+ LM + + P+N+GNP E<br />
Sbjct 274 NDGRVVSNFILQALQGEPLTVYGSGSQTRAFQYVSDLVNGLVALMNSNVSSPVNLGNPEE 333<br />
Query 270 FTMVELAETVKELINPSIEIKMVENTPDDPRQRKPDITKAKEVLGWEPKVKLREGLPLME 329<br />
T++E A+ +K L+ EI+ + DDP++RKPDI KAK +LGWEP V L EGL<br />
Sbjct 334 HTILEFAQLIKNLVGSGSEIQFLSEAQDDPQKRKPDIKKAKLMLGWEPVVPLEEGLNKAI 393<br />
Query 330 EDFRLRLGVHKN 341<br />
FR L N<br />
Sbjct 394 HYFRKELEYQAN 405<br />
>AT3G49890<br />
MAKRELSGGDSSSEDEDPKWRAAINSIATTTVYGASATKPAATQSHNYGDFRLKPKKLTHGQIKVKNLLNEMVEKTLDFVEDPVNIPEDKPENDCGVRLF<br />
KRCATGIVFDHVDEIRGPKKKPNLRPDKGVEGSSKEFKKRVKSIAVDGSDILTAAVEAAKKASARLDAKEVAAKDKAKKEEERIAELKKVRGEKWLPSIE<br />
RAMKKEMKRIKHTAWKSAMS
No significant homologies<br />
>AT3G49950<br />
MTKTRILNPTRFPSPKPLRGCGDANFMEQLLLHCATAIDSNDAALTHQILWVLNNIAPPDGDSTQRLTSAFLRALLSRAVSKTPTLSSTISFLPQADELH<br />
RFSVVELAAFVDLTPWHRFGFIAANAAILTAVEGYSTVHIVDLSLTHCMQIPTLIDAMASRLNKPPPLLKLTVVSSSDHFPPFINISYEELGSKLVNFAT<br />
TRNITMEFTIVPSTYSDGFSSLLQQLRIYPSSFNEALVVNCHMMLRYIPEEPLTSSSSSLRTVFLKQLRSLNPRIVTLIEEDVDLTSENLVNRLKSAFNY<br />
FWIPFDTTDTFMSEQRRWYEAEISWKIENVVAKEGAERVERTETKRRWIERMREAEFGGVRVKEDAVADVKAMLEEHAVGWGMKKEDDDESLVLTWKGHS<br />
VVFATVWVPI<br />
GENE ID: 5819 PVRL2 | poliovirus receptor-related 2 (herpesvirus entry mediator<br />
B) [Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 32.0 bits (71), Expect = 2.4, Method: Compositional matrix adjust.<br />
Identities = 27/101 (26%), Positives = 45/101 (44%), Gaps = 13/101 (12%)<br />
Query 150 QIPTLIDAMASRLNKPPPLLKLTVVSSSDHFPPFINISYEELGSKLVNFATTRNITMEFT 209<br />
Q PT + S+ +PP +++ +SS D +S A T +T FT<br />
Sbjct 175 QDPTTVALCISKEGRPP--ARISWLSSLDWEAKETQVSG--------TLAGTVTVTSRFT 224<br />
Query 210 IVPSTYSDGFSSLLQQLRIYPSSFNEALVVNCHMMLRYIPE 250<br />
+VPS +DG + ++ SF E ++ + +RY PE<br />
Sbjct 225 LVPSGRADGVTVT---CKVEHESFEEPALIPVTLSVRYPPE 262<br />
>AT3G50960<br />
MDPDAVKSTLSNLAFGNVMAAAARNYQKEVLANEKAQGSNPVNEEVDLDELMDDPELERLHADRIAALKREVEKRESFKRQGHGEYREVSEGDFLGEVTR<br />
SEKVICHFYHKEFYRCKIMDKHLKTLAPRHVDTKFIKVDAENAPFFVTKLAIKTLPCVVLFSKGVAMDRLVGFQDLGTKDDFTTNKLENVLLKKGMLSKK<br />
KKEEDDEDAEYQESIRRSVRSSENLDSDSD<br />
GENE ID: 10190 TXNDC9 | thioredox<strong>in</strong> doma<strong>in</strong> conta<strong>in</strong><strong>in</strong>g 9 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 158 bits (399), Expect = 2e-38, Method: Compositional matrix adjust.<br />
Identities = 81/194 (41%), Positives = 121/194 (62%), Gaps = 7/194 (3%)<br />
Query 1 MDPDAVKSTLSNLAFGNVMAAAARNYQKEVLANEKAQGSNPVNEEVD-----LDELMDDP 55<br />
M ++ L+++ FG + A A+ + +VL ++ Q + V E +D LD+ MD+<br />
Sbjct 1 MSQKSLAPRLNSVPFGRMEADASVDMFSKVLEHQLLQTTKLVEEHLDSEIQKLDQ-MDED 59<br />
Query 56 ELERLHADRIAALKREVEKRESFKRQGHGEYREV-SEGDFLGEVTRSEKVICHFYHKEFY 114<br />
ELERL R+ AL++ ++++ + +GHGEYRE+ SE DF EV SE V+CHFY +<br />
Sbjct 60 ELERLKEKRLQALRKAQQQKQEWLSKGHGEYREIPSERDFFQEVKESENVVCHFYRDSTF 119<br />
Query 115 RCKIMDKHLKTLAPRHVDTKFIKVDAENAPFFVTKLAIKTLPCVVLFSKGVAMDRLVGFQ 174<br />
RCKI+D+HL L+ +H++TKF+K++ E APF +L IK +P + L G D +VGF<br />
Sbjct 120 RCKILDRHLAILSKKHLETKFLKLNVEKAPFLCERLHIKVIPTLALLKDGKTQDYVVGFT 179<br />
Query 175 DLGTKDDFTTNKLE 188<br />
DLG DDFTT LE<br />
Sbjct 180 DLGNTDDFTTETLE 193<br />
>AT3G51310<br />
MIADDDEKWLAAAIAAVKQNAFYMQRAIDSNNLKDALKFSAQMLSELRTSKLSPHKYYELYMRVFNELGTLEIFFKEETGRGCSIAELYELVQHAGNILP<br />
RLYLLCTIGSVYIKSKDVTATDILKDLVEMCRAVQHPLRGLFLRSYLAQVTRDKLPSIGSDLEGDGDAHMNALEFVLQNFTEMNKLWVRMQHQGPSREKE<br />
KREKERNELRDLVGKNLHVLSQLEGVDLGIYRDTVLPRILEQVVNCKDELAQCYLMDCIIQVFPDDFHLQTLDVLLGACPQLQPSVDIKTVLSGLMERLS<br />
NYAASSVEALPNFLQVEAFSKLNYAIGKVVEAQADLPAAASVTLYLFLLKFTLHVYSDRLDYVDQVLGSCVTQLSATGKLCDDKAAKQIVAFLSAPLEKY<br />
NNVVTILKLTNYPLVMEYLDRETNKAMAIILVQSVFKNNTHIATADEVDALFELAKGLMKDFDGTIDDEIDEEDFQEEQNLVARLVNKLYIDDPEEMSKI<br />
IFTVRKHIVAGGPKRLPLTIPPLVFSALKLIRRLRGGDENPFGDDASATPKRILQLLSETVEVLSDVSAPDLALRLYLQCAQAANNCELETVAYEFFTKA<br />
YLLYEEEISDSKAQVTALRLIIGTLQRMRVFNVENRDTLTHKATGYSARLLRKPDQCRAVYECAHLFWADECENLKDGERVVLCLKRAQRIADAVQQMAN<br />
ASRGTSSTGSVSLYVELLNKYLYFLEKGNQQVTGDTIKSLAELIKSETKKVESGAEPFINSTLRYIEFQRQQEDGGMNEKYEKIKMEWFE<br />
GENE ID: 55737 VPS35 | vacuolar prote<strong>in</strong> sort<strong>in</strong>g 35 homolog (S. cerevisiae)<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 612 bits (1578), Expect = 4e-175, Method: Compositional matrix adjust.<br />
Identities = 352/788 (44%), Positives = 500/788 (63%), Gaps = 35/788 (4%)<br />
Query 4 DDDEKWLAAAIAAVKQNAFYMQRAIDSNNLKDALKFSAQMLSELRTSKLSPHKYYELYMR 63<br />
D+ EK L AI AVK +F M+R +D N L DALK ++ ML ELRTS LSP YYELYM<br />
Sbjct 10 DEQEKLLDEAIQAVKVQSFQMKRCLDKNKLMDALKHASNMLGELRTSMLSPKSYYELYMA 69<br />
Query 64 VFNELGTLEIFFKEETGRGCSIAELYELVQHAGNILPRLYLLCTIGSVYIKSKDVTATDI 123<br />
+ +EL LE++ +E +G +A+LYELVQ+AGNI+PRLYLL T+G VY+KS + DI<br />
Sbjct 70 ISDELHYLEVYLTDEFAKGRKVADLYELVQYAGNIIPRLYLLITVGVVYVKSFPQSRKDI 129<br />
Query 124 LKDLVEMCRAVQHPLRGLFLRSYLAQVTRDKLPSIG--SDLEGDGDAHMNALEFVLQNFT 181<br />
LKDLVEMCR VQHPLRGLFLR+YL Q TR+ LP G +D E GD ++++FVL NF<br />
Sbjct 130 LKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPDEGEPTDEETTGDIS-DSMDFVLLNFA 188<br />
Query 182 EMNKLWVRMQHQGPSREKEKREKERNELRDLVGKNLHVLSQLEGVDLGIYRDTVLPRILE 241<br />
EMNKLWVRMQHQG SR++EKRE+ER ELR LVG NL LSQLEGV++ Y+ VL ILE<br />
Sbjct 189 EMNKLWVRMQHQGHSRDREKRERERQELRILVGTNLVRLSQLEGVNVERYKQIVLTGILE 248<br />
Query 242 QVVNCKDELAQCYLMDCIIQVFPDDFHLQTLDVLLGACPQLQPSVDIKTVLSGLMERLSN 301<br />
QVVNC+D LAQ YLM+CIIQVFPD+FHLQTL+ L AC +L +V++K ++ L++RL+<br />
Sbjct 249 QVVNCRDALAQEYLMECIIQVFPDEFHLQTLNPFLRACAELHQNVNVKNIIIALIDRLAL 308<br />
Query 302 YAASSVEALPNF-LQVEAFSKLNYAIGKVVEAQADLPAAASVTLYLFLLKFTLHVYSDRL 360<br />
+A E P ++ F + + V++++ D+P+ V+L + L+ + Y DR+
Sbjct 309 FAHR--EDGPGIPADIKLFDIFSQQVATVIQSRQDMPSEDVVSLQVSLINLAMKCYPDRV 366<br />
Query 361 DYVDQVLGSCV---TQLSATGKLCDDKAAKQIVAFLSAPLEKYNNVVTILKLTNYPLVME 417<br />
DYVD+VL + V +L+ +K++ L P++ YNN++T+LKL ++ + E<br />
Sbjct 367 DYVDKVLETTVEIFNKLNLEHIATSSAVSKELTRLLKIPVDTYNNILTVLKLKHFHPLFE 426<br />
Query 418 YLDRETNKAMAIILVQSVFKNNTHIATADEVDALFELAKGLMKDFDGTIDDEIDEEDFQE 477<br />
Y D E+ K+M+ ++ +V NT I + D+VD++ L L++D ++ D EDF +<br />
Sbjct 427 YFDYESRKSMSCYVLSNVLDYNTEIVSQDQVDSIMNLVSTLIQDQPDQPVEDPDPEDFAD 486<br />
Query 478 EQNLVARLVNKLYIDDPEEMSKIIFTVRKHIVAGGPKRLPLTIPPLVFSALKLIRRLRGG 537<br />
EQ+LV R ++ L +DP++ I+ T RKH AGG +R+ T+PPLVF+A +L R +<br />
Sbjct 487 EQSLVGRFIHLLRSEDPDQQYLILNTARKHFGAGGNQRIGFTLPPLVFAAYQLAFRYK-- 544<br />
Query 538 DENPFGDDA-SATPKRILQLLSETVEVLSDVSAPDLALRLYLQCAQAANNCEL---ETVA 593<br />
EN DD ++I +T+ L +L LRL+LQ A AA ETVA<br />
Sbjct 545 -ENSKVDDKWEKKCQKIFSFAHQTISALIKAELAELPLRLFLQGALAAGEIGFENHETVA 603<br />
Query 594 YEFFTKAYLLYEEEISDSKAQVTALRLIIGTLQRMRVFNVENRDTLTHKATGYSARLLRK 653<br />
YEF ++A+ LYE+EISDSKAQ+ A+ LIIGT +RM+ F+ EN + L + +++LL+K<br />
Sbjct 604 YEFMSQAFSLYEDEISDSKAQLAAITLIIGTFERMKCFSEENHEPLRTQCALAASKLLKK 663<br />
Query 654 PDQCRAVYECAHLFWA-----DECENLKDGERVVLCLKRAQRIADAVQQMANASRGTSST 708<br />
PDQ RAV CAHLFW+ E L GERV+ CLK+A +IA+ + +<br />
Sbjct 664 PDQGRAVSTCAHLFWSGRNTDKNGEELHGGERVMECLKKALKIAN---------QCMDPS 714<br />
Query 709 GSVSLYVELLNKYLYFLEKGNQQVTGDTIKSLAELIKSETKKVESGAEP-----FINSTL 763<br />
V L++E+LN+Y+YF EK N VT + L + I+ + +ES E ++TL<br />
Sbjct 715 LQVQLFIEILNRYIYFYEKENDAVTIQVLNQLIQKIREDLPNLESSEETEQINKHFHNTL 774<br />
Query 764 RYIEFQRQ 771<br />
++ +R+<br />
Sbjct 775 EHLRLRRE 782<br />
>AT3G52930<br />
MSAFTSKFADELIANAAYIGTPGKGILAADESTGTIGKRLASINVENVETNRRNLRELLFTAPGALPCLSGVILFEETLYQKSSDGKLFVDILKEGGVLP<br />
GIKVDKGTVELAGTDGETTTQGLDGLGDRCKKYYEAGARFAKWRAVLKIGENEPSEHSIHENAYGLARYAVICQENGLVPIVEPEILVDGSHDIQKCAAV<br />
TERVLAACYKALSDHHVLLEGTLLKPNMVTPGSDSPKVSPEVIAEHTVRALQRTVPAAVPAIVFLSGGQSEEEATRNLNAMNQLKTKKPWSLSFSFGRAL<br />
QQSTLKTWAGKEENVKAAQEALYVRCKANSEATLGTYKGDAKLGDGAAESLHVKDYKY<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 423 bits (1088), Expect = 3e-118, Method: Compositional matrix adjust.<br />
Identities = 222/358 (62%), Positives = 260/358 (72%), Gaps = 2/358 (0%)<br />
Query 3 AFTSKFADELIANAAYIGTPGKGILAADESTGTIGKRLASINVENVETNRRNLRELLFTA 62<br />
A T + EL A I PGKGILAADESTG+I KRL SI EN E NRR R+LL TA<br />
Sbjct 61 ALTPEQKKELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQLLLTA 120<br />
Query 63 PGAL-PCLSGVILFEETLYQKSSDGKLFVDILKEGGVLPGIKVDKGTVELAGTDGETTTQ 121<br />
+ PC+ GVILF ETLYQK+ DG+ F ++K G + GIKVDKG V LAGT+GETTTQ<br />
Sbjct 121 DDRVNPCIGGVILFHETLYQKADDGRPFPQVIKSKGGVVGIKVDKGVVPLAGTNGETTTQ 180<br />
Query 122 GLDGLGDRCKKYYEAGARFAKWRAVLKIGENEPSEHSIHENAYGLARYAVICQENGLVPI 181<br />
GLDGL +RC +Y + GA FAKWR VLKIGE+ PS +I ENA LARYA ICQ+NG+VPI<br />
Sbjct 181 GLDGLSERCAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLARYASICQQNGIVPI 240<br />
Query 182 VEPEILVDGSHDIQKCAAVTERVLAACYKALSDHHVLLEGTLLKPNMVTPG-SDSPKVSP 240<br />
VEPEIL DG HD+++C VTE+VLAA YKALSDHH+ LEGTLLKPNMVTPG + + K S<br />
Sbjct 241 VEPEILPDGDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACTQKFSH 300<br />
Query 241 EVIAEHTVRALQRTVPAAVPAIVFLSGGQSEEEATRNLNAMNQLKTKKPWSLSFSFGRAL 300<br />
E IA TV AL+RTVP AV I FLSGGQSEEEA+ NLNA+N+ KPW+L+FS+GRAL<br />
Sbjct 301 EEIAMATVTALRRTVPPAVTGITFLSGGQSEEEASINLNAINKCPLLKPWALTFSYGRAL 360<br />
Query 301 QQSTLKTWAGKEENVKAAQEALYVRCKANSEATLGTYKGDAKLGDGAAESLHVKDYKY 358<br />
Q S LK W GK+EN+KAAQE R ANS A G Y + G A+ESL V ++ Y<br />
Sbjct 361 QASALKAWGGKKENLKAAQEEYVKRALANSLACQGKYTPSGQAGAAASESLFVSNHAY 418<br />
>AT3G54870<br />
MSSSNSSSAVRSSAKHAAERIQQHLPPNSNHAVSLSSSSLNLPARTSIVAPGIAHSSRLKDRPSASSSSSSSSVSASSPSTRRSGTPVRRSQSKDFDDDN<br />
DPGRVRVSVRVRPRNGEELISDADFADLVELQPEIKRLKLRKNNWNSESYKFDEVFTDTASQKRVYEGVAKPVVEGVLSGYNGTIMAYGQTGTGKTYTVG<br />
KIGKDDAAERGIMVRALEDILLNASSASISVEISYLQLYMETIQDLLAPEKNNISINEDAKTGEVSVPGATVVNIQDLDHFLQVLQVGETNRHAANTKMN<br />
TESSRSHAILTVYVRRAMNEKTEKAKPESLGDKAIPRVRKSKLLIVDLAGSERINKSGTDGHMIEEAKFINLSLTSLGKCINALAEGSSHIPTRDSKLTR<br />
LLRDSFGGSARTSLIITIGPSARYHAETTSTIMFGQRAMKIVNMVKLKEEFDYESLCRKLETQVDHLTAEVERQNKLRNSEKHELEKRLRECENSFAEAE<br />
KNAVTRSKFLEKENTRLELSMKELLKDLQLQKDQCDLMHDKAIQLEMKLKNTKQQQLENSAYEAKLADTSQVYEKKIAELVQRVEDEQARSTNAEHQLTE<br />
MKNILSKQQKSIHEQEKGNYQYQRELAETTHTYESKIAELQKKLEGENARSNAAEDQLRQMKRLISDRQVISQENEEANELKIKLEELSQMYESTVDELQ<br />
TVKLDYDDLLQQKEKLGEEVRDMKERLLLEEKQRKQMESELSKLKKNLRESENVVEEKRYMKEDLSKGSAESGAQTGSQRSQGLKKSLSGQRATMARLCE<br />
EVGIQKILQLIKSEDLEVQIQAVKVVANLAAEEANQVKIVEEGGVEALLMLVQSSQNSTILRVASGAIANLAMNEKSQDLIMNKGGAQLLAKMVTKTDDP<br />
QTLRMVAGALANLCGNGKHKIKNFASDDFQYSLYNLCVKIY<br />
GENE ID: 3799 KIF5B | k<strong>in</strong>es<strong>in</strong> family member 5B [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 245 bits (625), Expect = 1e-64, Method: Compositional matrix adjust.<br />
Identities = 140/352 (39%), Positives = 207/352 (58%), Gaps = 42/352 (11%)<br />
Query 105 VRVSVRVRPRN------GEELISDADFADLVELQPEIKRLKLRKNNWNSESYKFDEVFTD 158<br />
++V R RP N G++ I+ D V + S+ Y FD VF<br />
Sbjct 9 IKVMCRFRPLNESEVNRGDKYIAKFQGEDTVVIA--------------SKPYAFDRVFQS 54<br />
Query 159 TASQKRVYEGVAKPVVEGVLSGYNGTIMAYGQTGTGKTYTV-GKIGKDDAAERGIMVRAL 217<br />
+ SQ++VY AK +V+ VL GYNGTI AYGQT +GKT+T+ GK+ D GI+ R +<br />
Sbjct 55 STSQEQVYNDCAKKIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLH--DPEGMGIIPRIV 112<br />
Query 218 EDILLNASSAS----ISVEISYLQLYMETIQDLLAPEKNNISINEDAKTGEVSVPGATVV 273<br />
+DI S +++SY ++Y++ I+DLL K N+S++ED K V G T<br />
Sbjct 113 QDIFNYIYSMDENLEFHIKVSYFEIYLDKIRDLLDVSKTNLSVHED-KNRVPYVKGCTER 171<br />
Query 274 NIQDLDHFLQVLQVGETNRHAANTKMNTESSRSHAILTVYVRRAMNEKTEKAKPESLGDK 333<br />
+ D + + G++NRH A T MN SSRSH+I + V++ N +TE+<br />
Sbjct 172 FVCSPDEVMDTIDEGKSNRHVAVTNMNEHSSRSHSIFLINVKQE-NTQTEQKLS------ 224
Query 334 AIPRVRKSKLLIVDLAGSERINKSGTDGHMIEEAKFINLSLTSLGKCINALAEGSSHIPT 393<br />
KL +VDLAGSE+++K+G +G +++EAK IN SL++LG I+ALAEGS+++P<br />
Sbjct 225 -------GKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGSTYVPY 277<br />
Query 394 RDSKLTRLLRDSFGGSARTSLIITIGPSARYHAETTSTIMFGQRAMKIVNMV 445<br />
RDSK+TR+L+DS GG+ RT+++I PS+ +ET ST++FGQRA I N V<br />
Sbjct 278 RDSKMTRILQDSLGGNCRTTIVICCSPSSYNESETKSTLLFGQRAKTIKNTV 329<br />
>AT3G59820<br />
MASRAIVRRKNIISDYLNVYARSIQSFQYIGNSSQTVHSHAYHSGINRPPVETKPVTEHKSFTRRDGLLLLSRNGYFNRSFHGFHSSGFGYGSSEVGPSL<br />
GMRYMSLSIRNATTVAAKKPEEEDKKVDELAKNRKEASPEECDQAVESLSSVKAKAKAKRLQESKKVARSIVQRAWAIVLKIGPAIKAVASMNRADWAKK<br />
LTHWKHEFVSTLKHYWLGTKLLWADTRISSRLLLKLAGGKSLSRRERQQLTRTTADIFRLVPFAVFILVPFMEFLLPVFLKLFPNMLPSTFQDKMKEEEA<br />
LKRKLLARIEYAKFLQETAREMAKEVKHSRTGEVKQTAEDLDEFLDKVRRGQIVHNDELLGFAKLFNDELTLDNISRPRLVSMCKYMGISPYGTDAYLRY<br />
MLRKRLRSIKEDDKLIRAEGVDSLSEAELREDCRERGMLGLVSVEEMRQQLRDWMDLSLNHSVPSSLLILSRAFTVAGRVKAEDAVRATLSSLPDEVVDT<br />
VGITSLPSEDPVSERRRKLEYLEMQEELIKEEEEKEEEELTRIKDVKGGDEDKALQEMTIPTASEAQEQARARVLEQQDDLCKLSRALGVLASASSVCRE<br />
REEFLRLVKKEVEFYNTMVEREDVDGEKAAMKAYKAARVDIDQADEVAEADEVSSALMEKVDGLIQNLEKEIDDVDIKIGKGWQLLDRDRDGKVTPDEVA<br />
AAAMYLKDTLANDGLQQLISSLSKDKGKNYGGRHCKVGEIGKQARRKCNGRRIKLKEIIL<br />
GENE ID: 3954 LETM1 | leuc<strong>in</strong>e zipper-EF-h<strong>and</strong> conta<strong>in</strong><strong>in</strong>g transmembrane prote<strong>in</strong> 1<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 271 bits (694), Expect = 1e-72, Method: Compositional matrix adjust.<br />
Identities = 157/329 (47%), Positives = 223/329 (67%), Gaps = 7/329 (2%)<br />
Query 212 LKHYWLGTKLLWADTRISSRLLLKLAGGKSLSRRERQQLTRTTADIFRLVPFAVFILVPF 271<br />
LKHY+ G +LLW DT+I++R+L ++ G SL+RRER+Q R AD+FRLVPF VF++VPF<br />
Sbjct 161 LKHYYHGFRLLWIDTKIAARMLWRILNGHSLTRRERRQFLRICADLFRLVPFLVFVVVPF 220<br />
Query 272 MEFLLPVFLKLFPNMLPSTFQDKMKEEEALKRKLLARIEYAKFLQETAREMAKEVKHSRT 331<br />
MEFLLPV +KLFPNMLPSTF+ + +EE LK++L ++E AKFLQ+T EMA + K ++<br />
Sbjct 221 MEFLLPVAVKLFPNMLPSTFETQSLKEERLKKELRVKLELAKFLQDTIEEMALKNKAAKG 280<br />
Query 332 GEVKQTAEDLDEFLDKVRR-GQIVHNDELLGFAKLFNDELTLDNISRPRLVSMCKYMGIS 390<br />
K D F K+R G+ N+E++ F+KLF DELTLDN++RP+LV++CK + +<br />
Sbjct 281 SATK----DFSVFFQKIRETGERPSNEEIMRFSKLFEDELTLDNLTRPQLVALCKLLELQ 336<br />
Query 391 PYGTDAYLRYMLRKRLRSIKEDDKLIRAEGVDSLSEAELREDCRERGMLGL-VSVEEMRQ 449<br />
GT+ +LR+ L RLRSIK DDKLI EGVDSL+ EL+ CR RGM L V+ + +R<br />
Sbjct 337 SIGTNNFLRFQLTMRLRSIKADDKLIAEEGVDSLNVKELQAACRARGMRALGVTEDRLRG 396<br />
Query 450 QLRDWMDLSLNHSVPSSLLILSRAFTVAGRVKAEDAVRATLSSLPDEVVDTVGITSLPSE 509<br />
QL+ W+DL L+ +P+SLLILSRA + + D +++TL +LP+ V + E<br />
Sbjct 397 QLKQWLDLHLHQEIPTSLLILSRAMYLPDTLSPADQLKSTLQTLPEIVAKEAQVKVAEVE 456<br />
Query 510 DPVSERRRKLEYLEMQEELIKEEEEKEEE 538<br />
+ + KLE +QEE ++E +E+E<br />
Sbjct 457 GEQVDNKAKLEA-TLQEEAAIQQEHREKE 484<br />
>AT3G63140<br />
MAALSSSSLFFSSKTTSPISNLLIPPSLHRFSLPSSSSSFSSLSSSSSSSSSLLTFSLRTSRRLSPQKFTVKASSVGEKKNVLIVNTNSGGHAVIGFYFA<br />
KELLSAGHAVTILTVGDESSEKMKKPPFNRFSEIVSGGGKTVWGNPANVANVVGGETFDVVLDNNGKDLDTVRPVVDWAKSSGVKQFLFISSAGIYKSTE<br />
QPPHVEGDAVKADAGHVVVEKYLAETFGNWASFRPQYMIGSGNNKDCEEWFFDRIVRDRAVPIPGSGLQLTNISHVRDLSSMLTSAVANPEAASGNIFNC<br />
VSDRAVTLDGMAKLCAAAAGKTVEIVHYDPKAIGVDAKKAFLFRNMHFYAEPRAAKDLLGWESKTNLPEDLKERFEEYVKIGRDKKEIKFELDDKILEAL<br />
KTPVAA<br />
GENE ID: 64375 IKZF4 | IKAROS family z<strong>in</strong>c f<strong>in</strong>ger 4 (Eos) [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 34.7 bits (78), Expect = 0.35, Method: Compositional matrix adjust.<br />
Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 13/102 (12%)<br />
Query 41 SSLSSSSSSSSSL--LTFSLRTSRRLSPQKFTVKASSVGEKK-----NVLIVNTNSGGHA 93<br />
S L SSS + + L SL +R +PQKF VGEK+ + L + NSGG+<br />
Sbjct 199 SMLHSSSERPTFIDRLANSLTKRKRSTPQKF------VGEKQMRFSLSDLPYDVNSGGYE 252<br />
Query 94 VIGFYFAKELLSAGHAVTILTVGDESSEKMKKPPFNRFSEIV 135<br />
A L G ++ VG E ++ PP N SE+<br />
Sbjct 253 KDVELVAHHSLEPGFGSSLAFVGAEHLRPLRLPPTNCISELT 294<br />
>AT4G02380<br />
MLSSGKRGYAATAAQGSVSSGGRSGAVASAVMKKKGVEESTQKISWVPDPKTGYYRPETGSNEIDAAELRAALLNNKQ<br />
No significant homologies<br />
>AT4G23630<br />
MAEEHKHDESVIAPEPAVEVVERESLMDKISEKIHHGGDSSSSSSSSDDEDEKKKTKKPSSPSSSMKSKVYRLFGREQPVHKVLGGGKPADIFMWKNKKM<br />
SGGVLGGATAAWVVFELMEYHLLTLLCHVMIVVLAVLFLWSNATMFINKSPPKIPEVHIPEEPILQLASGLRIEINRGFSSLREIASGRDLKKFLIAIAG<br />
LWVLSILGGCFNFLTLAYIALVLLFTVPLAYDKYEDKVDPLGEKAMIELKKQYAVLDEKVLSKIPLGPLKNKKKD<br />
GENE ID: 57142 RTN4 | reticulon 4 [Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 62.8 bits (151), Expect = 1e-09, Method: Compositional matrix adjust.<br />
Identities = 40/159 (25%), Positives = 79/159 (49%), Gaps = 12/159 (7%)<br />
Query 91 DIFMWKNKKMSGGVLGGATAAWVVFELMEYHLLTLLCHVMIVVLAVLF---LWSNATMFI 147<br />
D+ W++ K +G V G + +++ L + ++++ ++ + +L+V ++ I<br />
Sbjct 775 DLLYWRDIKKTGVVFGASL--FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 832<br />
Query 148 NKSPPKIP-------EVHIPEEPILQLASGLRIEINRGFSSLREIASGRDLKKFLIAIAG 200<br />
KS P EV I EE + + ++ +N LR + DL L<br />
Sbjct 833 QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVL 892<br />
Query 201 LWVLSILGGCFNFLTLAYIALVLLFTVPLAYDKYEDKVD 239
+WV + +G FN LTL +AL+ LF+VP+ Y++++ ++D<br />
Sbjct 893 MWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQID 931<br />
>AT4G26110<br />
MSNDKDSFNVSDLTAALKDEDRAGLVNALKNKLQNLAGQRSDVLENLTPNVRKRVDALRDIQSQHDELEAKFREERAILEAKYQTLYQPLYVKRYEIVNG<br />
TTEVELAPEDDTKVDQGEEKTAEEKGVPSFWLTALKNNDVISEEVTERDEGALKYLKDIKWCKIEEPKGFKLEFFFDTNPYFKNTVLTKSYHMIDEDEPL<br />
LEKAMGTEIDWYPGKCLTQKILKKKPKKGSKNTKPITKLEDCESFFNFFSPPEVPDEDEDIDEERAEDLQNLMEQDYDIGSTIREKIIPRAVSWFTGEAM<br />
EAEDFEIDDDEEDDIDEDEDEEDEEDEEDDDDEDEEESKTKKKPSIGNKKGGRSQIVGEGKQDERPPECKQQ<br />
GENE ID: 4673 NAP1L1 | nucleosome assembly prote<strong>in</strong> 1-like 1 [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 187 bits (474), Expect = 4e-47, Method: Compositional matrix adjust.<br />
Identities = 119/300 (39%), Positives = 175/300 (58%), Gaps = 33/300 (11%)<br />
Query 25 LVNALKNKLQNLAGQRSDVLENLTPNVRKRVDALRDIQSQHDELEAKFREERAILEAKYQ 84<br />
++ AL+ +L L + +E+L V++RV+AL+++Q + ++EAKF EE LE KY<br />
Sbjct 7 ILAALQERLDGLVETPTGYIESLPRVVKRRVNALKNLQVKCAQIEAKFYEEVHDLERKYA 66<br />
Query 85 TLYQPLYVKRYEIVNGTTE-----VELAPEDDTKVDQGEEKT-----------AEEKGVP 128<br />
LYQPL+ KR+EI+N E E P+++ ++ + +EK + KG+P<br />
Sbjct 67 VLYQPLFDKRFEIINAIYEPTEEECEWKPDEEDEISELKEKAKIEDEKKDEEKEDPKGIP 126<br />
Query 129 SFWLTALKNNDVISEEVTERDEGALKYLKDIK--WCKIEEPKGFKLEFFFDTNPYFKNTV 186<br />
FWLT KN D++S+ V E DE LK+LKDIK + +P F LEF F+ N YF N V<br />
Sbjct 127 EFWLTVFKNVDLLSDMVQEHDEPILKHLKDIKVKFSDAGQPMSFVLEFHFEPNEYFTNEV 186<br />
Query 187 LTKSYHMIDE---------DEPLLEKAMGTEIDWYPGKCLT-QKILKKKPKKGSKNTKPI 236<br />
LTK+Y M E D P + G +IDW GK +T + I KK+ KG + +<br />
Sbjct 187 LTKTYRMRSEPDDSDPFSFDGPEIMGCTGCQIDWKKGKNVTLKTIKKKQKHKGRGTVRTV 246<br />
Query 237 TKLEDCESFFNFFSPPEVPDEDEDIDEERAEDLQNLMEQDYDIGSTIREKIIPRAVSWFT 296<br />
TK +SFFNFF+PPEVP E D+D+ D + ++ D++IG +RE+IIPR+V +FT<br />
Sbjct 247 TKTVSNDSFFNFFAPPEVP-ESGDLDD----DAEAILAADFEIGHFLRERIIPRSVLYFT 301<br />
>AT4G34490<br />
MEEDLIKRLEAAVTRLEGISSNGGGVVSLSRGGDFSSAAGIDIASSDPSILAYEDLISQCVGRALTAAEKIGGPVLDVTKIVAEAFASQKELLVRIKQTQ<br />
KPDLAGLAGFLKPLNDVTMKANAMTEGKRSDFFNHLKAACDSLSALAWIAFTGKDCGMSMPIAHVEESWQMAEFYNNKVLVEYRNKDADHVEWAKALKEL<br />
YLPGLREYVKSHYPLGPVWNASGKPASAPAKGPPGAPAPPPAPLFSAESSKPSSSSNQKQGMSAVFQQLSSGAVTSGLRKVTDDMKTKNRADRSGAVSAV<br />
EKETRTSKPAFSKTGPPKMELQMGRKWAVENQIGKKDLVISECDSKQSVYIYGCKDSVLQIQGKVNNITIDKCTKVGVVFTDVVAAFEIVNCNNVEVQCQ<br />
GSAPTVSVDNTTGCQLYLNKDSLETAITTAKSSEINVMVPGATPDGDWVEHALPQQYNHVFTEGKFETTPVSHSGA<br />
GENE ID: 10486 CAP2 | CAP, adenylate cyclase-associated prote<strong>in</strong>, 2 (yeast)<br />
[Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 275 bits (703), Expect = 1e-73, Method: Compositional matrix adjust.<br />
Identities = 179/487 (36%), Positives = 271/487 (55%), Gaps = 42/487 (8%)<br />
Query 5 LIKRLEAAVTRLEGISSNGGGVVSLSRGGDFSSAAGIDIASSDPSILAYEDLISQCVGRA 64<br />
L++RLE AV+RLE +S+ S G+ G+ IA PS+ A++ L+ V<br />
Sbjct 7 LVERLERAVSRLESLSAE-----SHRPPGNCGEVNGV-IAGVAPSVEAFDKLMDSMVAEF 60<br />
Query 65 LTAAEKIGGPVLDVTKIVAEAFASQKELLVRIKQTQKPDLAGLAGFLKPLNDVTMKANAM 124<br />
L + + G V ++V AF +Q+ L+ Q Q+P +A LKP+++ +<br />
Sbjct 61 LKNSRILAGDVETHAEMVHSAFQAQRAFLLMASQYQQPHENDVAALLKPISEKIQEIQTF 120<br />
Query 125 TEGKR-SDFFNHLKAACDSLSALAWIAFTGKDCGMSMPIAHVEESWQMAEFYNNKVLVEY 183<br />
E R S+ FNHL A +S+ AL WIA + K P +V+E A FY N+VL +Y<br />
Sbjct 121 RERNRGSNMFNHLSAVSESIPALGWIAVSPK------PGPYVKEMNDAATFYTNRVLKDY 174<br />
Query 184 RNKDADHVEWAKALKELYLPGLREYVKSHYPLGPVWNASGKPASAPA------------K 231<br />
++ D HV+W K+ ++ L+ Y+K H+ G W+ +G AS +<br />
Sbjct 175 KHSDLRHVDWVKSYLNIW-SELQAYIKEHHTTGLTWSKTGPVASTVSAFSVLSSGPGLPP 233<br />
Query 232 GPPGAPAPPPAPLFSAESSKPSSSSNQKQGMSAVFQQLSSG-AVTSGLRKVTDDMKT-KN 289<br />
PP P P P PLF E K SS ++ SA+F QL+ G A+T GLR VTDD KT KN<br />
Sbjct 234 PPPPLPPPGPPPLFENEGKKEESSPSR----SALFAQLNQGEAITKGLRHVTDDQKTYKN 289<br />
Query 290 RADRS-GAVSAVEKETRTSKPAFSKTGP-----PKMELQMGRKWAVENQIGKKDLVISEC 343<br />
+ R+ G + ++ T P K+ P P +EL+ G+KW VE Q + DLVISE<br />
Sbjct 290 PSLRAQGGQTQSPTKSHTPSPTSPKSYPSQKHAPVLELE-GKKWRVEYQEDRNDLVISET 348<br />
Query 344 DSKQSVYIYGCKDSVLQIQGKVNNITIDKCTKVGVVFTDVVAAFEIVNCNNVEVQCQGSA 403<br />
+ KQ YI+ C+ S +QI+GKVN+I ID C K+G+VF +VV E++N ++++Q G<br />
Sbjct 349 ELKQVAYIFKCEKSTIQIKGKVNSIIIDNCKKLGLVFDNVVGIVEVINSQDIQIQVMGRV 408<br />
Query 404 PTVSVDNTTGCQLYLNKDSLETAITTAKSSEINVMVPGATPDGDWVEHALPQQYNHVFTE 463<br />
PT+S++ T GC +YL++D+L+ I +AKSSE+N+++P DGD+ E +P+Q+ +<br />
Sbjct 409 PTISINKTEGCHIYLSEDALDCEIVSAKSSEMNILIPQ---DGDYREFPIPEQFKTAWDG 465<br />
Query 464 GKFETTP 470<br />
K T P<br />
Sbjct 466 SKLITEP 472
At4g32700<br />
MDSDSSKSRIDQFYVSKKRKHQSPNLKSGRNEKNVKVTGERSPGDKGTLDSYLKASLDDKSTTNSGLQARQEAFTRKLDLEVSASSVGQNIHPCLPKPVS<br />
FATFKECLGQNGSQDLHKEGVAAETHATDGLLCANQKDNSELRDFATSFLSLYCSGVQSVVGSPPHQKENELKRRSSSSSLAQDIQISHKRRCESENIPS<br />
LDDLTNPLGSKPESLARNGNNRDKPVSDPTKKMPSNESVEIPMGLRKCSKAPESSAHLTEFHTPGSAIKSCPVGTPKSGCGSSMFSPGEAFWNEAIQVAD<br />
GLTIPIENFGSVEAKVRDQHVTILSCSKKTDKCTEKLERSLDLDEIRVKDKDAIGFSKVVEKHGRDFNKEVYQLPVKNLELLFQDKNINGGIQERCASFD<br />
QNNITLGSSRISESAFVGNKGCENLDIANNAQADKGLIGKMYPEPEGKKVLLCEENRGVRSVSMISNMRKPVGSSESEESHTPSSSHRNYDGLSLSTWLP<br />
SEVCSVYNKKGISKLYPWQVECLQVDGVLQKRNLVYCASTSAGKSFVAEVLMLRRVIRTGKMALLVLPYVSICAEKAEHLEVLLEPLGKHVRSYYGNQGG<br />
GTLPKDTSVAVCTIEKANSLINRLLEEGRLSELGIIVIDELHMVGDQHRGYLLELMLTKLRYAAGEGSSESSSGESSGTSSGKADPAHGLQIVGMSATMP<br />
NVGAVADWLQAALYQTEFRPVPLEEYIKVGSTIYNKKMEVVRTIPKAADMGGKDPDHIVELCNEVVQEGNSVLIFCSSRKGCESTARHISKLIKNVPVNV<br />
DGENSEFMDIRSAIDALRRSPSGVDPVLEETLPSGVAYHHAGLTVEEREIVETCYRKGLVRVLTATSTLAAGVNLPARRVIFRQPMIGRDFIDGTRYKQM<br />
SGRAGRTGIDTKGDSVLICKPGELKRIMALLNETCPPLQSCLSEDKNGMTHAILEVVAGGIVQTAKDIHRYVRCTLLNSTKPFQDVVKSAQDSLRWLCHR<br />
KFLEWNEETKLYTTTPLGRGSFGSSLCPEESLIVLDDLLRAREGLVMASDLHLVYLVTPINVGVEPNWELYYERFMELSPLEQSVGNRVGVVEPFLMRMA<br />
HGATVRTLNRPQDVKKNLRGEYDSRHGSTSMKMLSDEQMLRVCKRFFVALILSKLVQEASVTEVCEAFKVARGMVQALQENAGRFSSMVSVFCERLGWHD<br />
LEGLVAKFQNRVSFGVRAEIVELTSIPYIKGSRARALYKAGLRTSQAIAEASIPEIVKALFESSAWAAEGTGQRRIHLGLAKKIKNGARKIVLEKAEEAR<br />
AAAFSAFKSLGLDVNELSKPLPLAPASSLNGQETTERDISRGSVGPDGLQQSIEGHMECENFDMDNHREKPSEVLGDATLGVSSEINLTSRLPNFRPIGT<br />
AVGTNGPSAVSILSSDTFPIPVYDNREIKPKDNVEQHLTRNDHIPLSSNKDGTGEKGPVTAGNISGGFDSFLELWGSAGEFFFDLHYNKLQDLNSRISYE<br />
IHGIAICWNCSPVYYVNLNKDLPNLECVEKQKLIEDAVIGKSEVLASHNMLDVIKSRWNKISKIMGNVNTRKFTWNLKVQIQVLKSPAISIQRCTRLNLP<br />
EGIRDELVDGSWLMMPPLHTSHTIDMSIVIWILWPDEERHSNPNIDKEVKKRLSPEAAEAANRSGRWRNQIRRVAHNGCCRRVAQTRALCSALWKILVSE<br />
ELLQALTTIEMPLVNVLADMELWGIGIDIEGCLRARNILRDKLRSLEKKAFELAGMTFSLHNPADIANVLFGQLKLPIPENQSKGKLHPSTDKHCLDLLR<br />
NEHPVVPIIKEHRTLAKLLNCTLGSICSLAKLRLSTQRYTLHGRWLQTSTATGRLSIEEPNLQSVEHEVEFKLDKNGRDVSSDADRYKINARDFFVPTQE<br />
NWLLLTADYSQIELRLMAHFSRDSSLISKLSQPEGDVFTMIAAKWTGKAEDSVSPHDRDQTKRLIYGILYGMGANRLAEQLECTSDEAKEKIRSFKSSFP<br />
AVTSWLNETISFCQEKGYIQTLKGRRRFLSKIKFGNAKEKSKAQRQAVNSMCQGSAADIIKIAMINIYSAIAEDVDTAASSSSSETRFHMLKGRCRILLQ<br />
VHDELVLEVDPSYVKLAAMLLQTSMENAVSLLVPLHVKLKVGKTWGSLEPFQTD<br />
GENE ID: 10721 POLQ | polymerase (DNA directed), theta [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 606 bits (1563), Expect = 2e-172, Method: Compositional matrix adjust.<br />
Identities = 328/808 (41%), Positives = 482/808 (60%), Gaps = 71/808 (8%)<br />
Query 483 PSSSHRNYDGLSLSTW-LPSEVCSVYNKKGISKLYPWQVECLQVDGVLQKRNLVYCASTS 541<br />
P+ D L L+ W LP V Y+ G+ K++ WQ ECL + VL+ +NLVY A TS<br />
Sbjct 59 PTVPDYEIDKLLLANWGLPKAVLEKYHSFGVKKMFEWQAECLLLGQVLEGKNLVYSAPTS 118<br />
Query 542 AGKSFVAEVLMLRRVIRTGKMALLVLPYVSICAEKAEHLEVLLEPLGKHVRSYYGNQGGG 601<br />
AGK+ VAE+L+L+RV+ K AL +LP+VS+ EK +L+ L + +G V Y G+<br />
Sbjct 119 AGKTLVAELLILKRVLEMRKKALFILPFVSVAKEKKYYLQSLFQEVGIKVDGYMGSTSPS 178<br />
Query 602 TLPKDTSVAVCTIEKANSLINRLLEEGRLSELGIIVIDELHMVGDQHRGYLLELMLTKLR 661<br />
+AVCTIE+AN LINRL+EE ++ LG++V+DELHM+GD HRGYLLEL+LTK+<br />
Sbjct 179 RHFSSLDIAVCTIERANGLINRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKIC 238<br />
Query 662 YAAGEGSSESSSGESSGTSSGKADPAHGLQIVGMSATMPNVGAVADWLQAALYQTEFRPV 721<br />
Y + +S+S ++ SS ++ +QIVGMSAT+PN+ VA WL A LY T+FRPV<br />
Sbjct 239 YI----TRKSASCQADLASS----LSNAVQIVGMSATLPNLELVASWLNAELYHTDFRPV 290<br />
Query 722 PLEEYIKVGSTIYNKKMEVVRTIPKAADMGGKDPDHIVELCNEVVQEGNSVLIFCSSRKG 781<br />
PL E +KVG++IY+ M++VR + G D DH+V LC E + + +SVL+FC S+K<br />
Sbjct 291 PLLESVKVGNSIYDSSMKLVREFEPMLQVKG-DEDHVVSLCYETICDNHSVLLFCPSKKW 349<br />
Query 782 CESTARHISKLIKNVPVNVDG------------ENSEFMDIRSAIDALRRSPSGVDPVLE 829<br />
CE A I++ N+ +G E E +++ +D LRR PSG+D VL+<br />
Sbjct 350 CEKLADIIAREFYNLHHQAEGLVKPSECPPVILEQKELLEV---MDQLRRLPSGLDSVLQ 406<br />
Query 830 ETLPSGVAYHHAGLTVEEREIVETCYRKGLVRVLTATSTLAAGVNLPARRVIFRQPMIGR 889<br />
+T+P GVA+HHAGLT EER+I+E +R+GL+RVL ATSTL++GVNLPARRVI R P+ G<br />
Sbjct 407 KTVPWGVAFHHAGLTFEERDIIEGAFRQGLIRVLAATSTLSSGVNLPARRVIIRTPIFGG 466<br />
Query 890 DFIDGTRYKQMSGRAGRTGIDTKGDSVLICKPGELKRIMALLNETCPPLQSCLS-----E 944<br />
+D YKQM GRAGR G+DT G+S+LICK E + +ALL + P++SCL E<br />
Sbjct 467 RPLDILTYKQMVGRAGRKGVDTVGESILICKNSEKSKGIALLQGSLKPVRSCLQRREGEE 526<br />
Query 945 DKNGMTHAILEVVAGGIVQTAKDIHRYVRCTLL-NSTKPFQDVVKSAQDSLR-------- 995<br />
M AILE++ GG+ T++D+H Y CT L S K + ++ Q+S++<br />
Sbjct 527 VTGSMIRAILEIIVGGVASTSQDMHTYAACTFLAASMKEGKQGIQRNQESVQLGAIEACV 586<br />
Query 996 -WLCHRKFLEWNE-----ETKLYTTTPLGRGSFGSSLCPEESLIVLDDLLRAREGLVMAS 1049<br />
WL +F++ E E K+Y T LG + SSL P ++L + DL RA +G V+ +<br />
Sbjct 587 MWLLENEFIQSTEASDGTEGKVYHPTHLGSATLSSSLSPADTLDIFADLQRAMKGFVLEN 646<br />
Query 1050 DLHLVYLVTPI-NVGVEPNWELYYERFMELSPLEQSVGNRVGVVEPFLMRMAHGATVRTL 1108<br />
DLH++YLVTP+ +W ++ + +L + V VGV E FL R G V<br />
Sbjct 647 DLHILYLVTPMFEDWTTIDWYRFFCLWEKLPTSMKRVAELVGVEEGFLARCVKGKVVART 706<br />
Query 1109 NRPQDVKKNLRGEYDSRHGSTSMKMLSDEQMLRVCKRFFVALILSKLVQEASVTEVCEAF 1168<br />
R + + + KRFF +L+L L+ E + E+ + +<br />
Sbjct 707 ER-------------------------QHRQMAIHKRFFTSLVLLDLISEVPLREINQKY 741<br />
Query 1169 KVARGMVQALQENAGRFSSMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTSIPY 1228<br />
RG +Q+LQ++A ++ M++VF RLGWH++E L+++FQ R++FG++ E+ +L +<br />
Sbjct 742 GCNRGQIQSLQQSAAVYAGMITVFSNRLGWHNMELLLSQFQKRLTFGIQRELCDLVRVSL 801<br />
Query 1229 IKGSRARALYKAGLRTSQAIAEASIPEI 1256<br />
+ RAR LY +G T +A A+I E+<br />
Sbjct 802 LNAQRARVLYASGFHTVADLARANIVEV 829<br />
Score = 255 bits (651), Expect = 1e-66, Method: Compositional matrix adjust.<br />
Identities = 185/538 (35%), Positives = 264/538 (50%), Gaps = 94/538 (17%)<br />
Query 1696 ILVSEELLQALTTIEMPLVNVLADMELWGIGIDIEGCLRARNILRDKLRSLEKKAFELAG 1755<br />
+L E L +EMP LA +EL GIG C ++I++ KL ++E +A++LAG<br />
Sbjct 2063 LLQKENLQDVFRKVEMPSQYCLALLELNGIGFSTAECESQKHIMQAKLDAIETQAYQLAG 2122<br />
Query 1756 MTFSLHNPADIANVLFGQLKLP----IPENQSKGKL-----------------HPSTDKH 1794<br />
+FS + DIA VLF +LKLP + SK L ST K<br />
Sbjct 2123 HSFSFTSSDDIAEVLFLELKLPPNREMKNQGSKKTLGSTRRGIDNGRKLRLGRQFSTSKD 2182<br />
Query 1795 CLDLLRNEHPVVPIIKEHRTLAKLLNCTLGSICSLAKLRLSTQRYTLHGRWL-------- 1846<br />
L+ L+ HP+ +I E R + ++ K+ QR +L<br />
Sbjct 2183 VLNKLKALHPLPGLILEWRRITN----------AITKVVFPLQREKCLNPFLGMERIYPV 2232
Query 1847 -QTSTATGRLSIEEPNLQSVEHEVEFKLD------------------------KNGRDVS 1881<br />
Q+ TATGR++ EPN+Q+V + E K+ K G V+<br />
Sbjct 2233 SQSHTATGRITFTEPNIQNVPRDFEIKMPTLVGESPPSQAVGKGLLPMGRGKYKKGFSVN 2292<br />
Query 1882 SD---------ADR---YKINARDFFVPTQENWLLLTADYSQIELRLMAHFSRDSSLISK 1929<br />
ADR + I+ R FVP +L ADYSQ+ELR++AH S D LI<br />
Sbjct 2293 PRCQAQMEERAADRGMPFSISMRHAFVPF-PGGSILAADYSQLELRILAHLSHDRRLIQV 2351<br />
Query 1930 LSQPEGDVFTMIAAKWTGKAEDSVSPHDRDQTKRLIYGILYGMGANRLAEQLECTSDEAK 1989<br />
L+ DVF IAA+W +SV R Q K++ YGI+YGMGA L EQ+ ++A<br />
Sbjct 2352 LNTG-ADVFRSIAAEWKMIEPESVGDDLRQQAKQICYGIIYGMGAKSLGEQMGIKENDAA 2410<br />
Query 1990 EKIRSFKSSFPAVTSWLNETISFCQEKGYIQTLKGRRRFLSKIKFGNAKEKSKAQRQAVN 2049<br />
I SFKS + + ++ ET+ C+ G++QT+ GRRR+L IK N K+ A+RQA+N<br />
Sbjct 2411 CYIDSFKSRYTGINQFMTETVKNCKRDGFVQTILGRRRYLPGIKDNNPYRKAHAERQAIN 2470<br />
Query 2050 SMCQGSAADIIKIAMINIYSAIAEDVDTAASSSSSE----------TRFHMLKGR-CRI- 2097<br />
++ QGSAADI+KIA +NI + T S E +R L+G C I<br />
Sbjct 2471 TIVQGSAADIVKIATVNIQKQLETFHSTFKSHGHREGMLQSDRTGLSRKRKLQGMFCPIR 2530<br />
Query 2098 ----LLQVHDELVLEVDPSYVKLAAMLLQTSMENAVSLLVPLHVKLKVGKTWGSLEPF 2151<br />
+LQ+HDEL+ EV V A +++ ME+AV L V L VK+K+G +WG L+ F<br />
Sbjct 2531 GGFFILQLHDELLYEVAEEDVVQVAQIVKNEMESAVKLSVKLKVKVKIGASWGELKDF 2588<br />
>AT4G38970<br />
MASTSLLKASPVLDKSEWVKGQSVLFRQPSSASVVLRNRATSLTVRAASSYADELVKTAKTIASPGRGILAMDESNATCGKRLDSIGLENTEANRQAFRT<br />
LLVSAPGLGQYVSGAILFEETLYQSTTEGKKMVDVLVEQNIVPGIKVDKGLVPLVGSNNESWCQGLDGLSSRTAAYYQQGARFAKWRTVVSIPNGPSALA<br />
VKEAAWGLARYAAISQDSGLVPIVEPEILLDGEHDIDRTYDVAEKVWAEVFFYLAQNNVMFEGILLKPSMVTPGAESKDRATPEQVAAYTLKLLRNRVPP<br />
AVPGIMFLSGGQSEVEATLNLNAMNQAPNPWHVSFSYARALQNTCLKTWGGRPENVNAAQTTLLARAKANSLAQLGKYTGEGESEEAKEGMFVKGYTY<br />
GENE ID: 226 ALDOA | aldolase A, fructose-bisphosphate [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 347 bits (891), Expect = 2e-95, Method: Compositional matrix adjust.<br />
Identities = 184/350 (52%), Positives = 234/350 (66%), Gaps = 5/350 (1%)<br />
Query 54 ELVKTAKTIASPGRGILAMDESNATCGKRLDSIGLENTEANRQAFRTLLVSAPG-LGQYV 112<br />
EL A I +PG+GILA DES + KRL SIG ENTE NR+ +R LL++A + +<br />
Sbjct 69 ELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQLLLTADDRVNPCI 128<br />
Query 113 SGAILFEETLYQSTTEGKKMVDVLVEQNIVPGIKVDKGLVPLVGSNNESWCQGLDGLSSR 172<br />
G ILF ETLYQ +G+ V+ + V GIKVDKG+VPL G+N E+ QGLDGLS R<br />
Sbjct 129 GGVILFHETLYQKADDGRPFPQVIKSKGGVVGIKVDKGVVPLAGTNGETTTQGLDGLSER 188<br />
Query 173 TAAYYQQGARFAKWRTVVSI-PNGPSALAVKEAAWGLARYAAISQDSGLVPIVEPEILLD 231<br />
A Y + GA FAKWR V+ I + PSALA+ E A LARYA+I Q +G+VPIVEPEIL D<br />
Sbjct 189 CAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLARYASICQQNGIVPIVEPEILPD 248<br />
Query 232 GEHDIDRTYDVAEKVWAEVFFYLAQNNVMFEGILLKPSMVTPGAESKDRATPEQVAAYTL 291<br />
G+HD+ R V EKV A V+ L+ +++ EG LLKP+MVTPG + + E++A T+<br />
Sbjct 249 GDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACTQKFSHEEIAMATV 308<br />
Query 292 KLLRNRVPPAVPGIMFLSGGQSEVEATLNLNAMNQAP--NPWHVSFSYARALQNTCLKTW 349<br />
LR VPPAV GI FLSGGQSE EA++NLNA+N+ P PW ++FSY RALQ + LK W<br />
Sbjct 309 TALRRTVPPAVTGITFLSGGQSEEEASINLNAINKCPLLKPWALTFSYGRALQASALKAW 368<br />
Query 350 GGRPENVNAAQTTLLARAKANSLAQLGKYTGEGES-EEAKEGMFVKGYTY 398<br />
GG+ EN+ AAQ + RA ANSLA GKYT G++ A E +FV + Y<br />
Sbjct 369 GGKKENLKAAQEEYVKRALANSLACQGKYTPSGQAGAAASESLFVSNHAY 418<br />
>AT5G03430<br />
MEIDKAIGESDDKRLKTKYNNAIFVIKRALALYSIEEVAFSFNGGKDSTVLLHLLRAGYFLHKKEQTCSNGGLSSFPVRTIYFESPSAFTEINAFTYDAA<br />
QTYNLQLDIIRQDFKSGLEALLKANPIRAIFLGVRIGDPTAVGQEQFSPSSPGWPPFMRVNPILDWSYRDVWAFLLTCKVKYCSLYDQGYTSIGSIHDTV<br />
PNSLLSVNDTSSKEKFKPAYLLSDGRLERAGRVKKIASLKKDVDTESQKHEVLLASVIAVGDEILSGTVEDQLGLSLCKKLTSVGWSVQQTTVLRNDIDS<br />
VSEEVDRQRSTSDMVFIYGGVGPLHSDVTLAGVAKAFGVRLAPDEEFEEYLRHLISDQCTGDRNEMAQLPEGITELLHHEKLSVPLIKCRNVIVLAATNT<br />
EELEKEWECLTELTKLGGGSLIEYSSRRLMTSLTDVEVAEPLSKLGLEFPDIYLGCYRKSRQGPIIICLTGKDNARMDSAAQALRKKFKKDVFVEIK<br />
GENE ID: 80308 FLAD1 | FAD1 flav<strong>in</strong> aden<strong>in</strong>e d<strong>in</strong>ucleotide synthetase homolog (S.<br />
cerevisiae) [Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 161 bits (408), Expect = 2e-39, Method: Compositional matrix adjust.<br />
Identities = 88/220 (40%), Positives = 118/220 (53%), Gaps = 9/220 (4%)<br />
Query 15 LKTKYNNAIFVIKRALALYSIEEVAFSFNGGKDSTVLLHLLRAGYFLHKKEQTCSNGGLS 74<br />
L K A+ I+ +LA YS+ ++ FNGGKD T LLHL A + +K N<br />
Sbjct 279 LGKKVAGALQTIETSLAQYSLTQLCVGFNGGKDCTALLHLFHAA--VQRKLPDVPN---- 332<br />
Query 75 SFPVRTIYFESPSAFTEINAFTYDAAQTYNLQLDIIRQDFKSGLEALLKANP-IRAIFLG 133<br />
P++ +Y S S F E+ F D + YNLQ+ K L L +P + A+ +G<br />
Sbjct 333 --PLQILYIRSISPFPELEQFLQDTIKRYNLQMLEAEGSMKQALGELQARHPQLEAVLMG 390<br />
Query 134 VRIGDPTAVGQEQFSPSSPGWPPFMRVNPILDWSYRDVWAFLLTCKVKYCSLYDQGYTSI 193<br />
R DP + FSP+ PGWP FMR+NP+LDW+YRD+W FL V YC LYD+GYTS+
Sbjct 391 TRRTDPYSCSLCPFSPTDPGWPAFMRINPLLDWTYRDIWDFLRQLFVPYCILYDRGYTSL 450<br />
Query 194 GSIHDTVPNSLLSVNDTSSKEKFKPAYLLSDGRLERAGRV 233<br />
GS +TV N L ++PAYLL + ER R<br />
Sbjct 451 GSRENTVRNPALKCLSPGGHPTYRPAYLLENEEEERNSRT 490<br />
Score = 74.3 bits (181), Expect = 4e-13, Method: Compositional matrix adjust.<br />
Identities = 63/237 (26%), Positives = 107/237 (45%), Gaps = 32/237 (13%)<br />
Query 255 ASVIAVGDEILSGTVEDQLGLSLCKKLTSVGWSVQQTTVLRNDIDSVSEEVDRQRSTSDM 314<br />
A +I VGDEIL G +D LC+ L S+G V + +V+ +++ +++ EV +<br />
Sbjct 16 AGIIIVGDEILKGHTQDTNTFFLCRTLRSLGVQVCRVSVVPDEVATIAAEVTSFSNRFTH 75<br />
Query 315 VFIYGGVGPLHSDVTLAGVAKAFGVRLAPDEEFEEYLRHLISDQCTGDRNEMAQLPEGIT 374<br />
V GG+GP H DVT VA+AFG L P + E + L + +++ +P +<br />
Sbjct 76 VLTAGGIGPTHDDVTFEAVAQAFGDELKPHPKLEAATKALGGE----GWEKLSLVPS--S 129<br />
Query 375 ELLHH-------EKLSVPLIKCRNVIVLAATNTEELEKEWECLTELTKLGGGSLIEYSSR 427<br />
LH+ + PL+ RNV + E L + E + L + +++ S+<br />
Sbjct 130 ARLHYGTDPCTGQPFRFPLVSVRNVYLFPGI-PELLRRVLEGMKGLFQ---NPAVQFHSK 185<br />
Query 428 RLMTSLTDVEVAEPLS--------KLGL-EFPDIYLGCYR------KSRQGPIIICL 469<br />
L + + +A L+ +LGL +PD Y+ +GP+ CL<br />
Sbjct 186 ELYVAADEASIAPILAEAQAHFGRRLGLGSYPDWGSNYYQVKLTLDSEEEGPLEECL 242<br />
>AT5G07650<br />
MSLVEISGSDAMAAPMPGRVPPPPPRPPPMPRRLPPMFDAFDHTGAGMVWGFPRPAKKRASLKPLHWVKITSDLQGSLWDELQRRHGDSQTAIELDISEL<br />
ETLFFVEAKPEKIRLHDLRRASYRVFNVRSYYMRANNKVINLSMPLPDMMTAVLAMDESVVDVDQIEKLIKFCPTNEEMELLKTYTGDKAALGKYEQYLL<br />
ELMKVPRLEAKLRVFSFKTQFGTKITELKERLNVVTSACEEVRSSEKLKEIMKKIPCLGNTSNQGPDRGKSSVVDKNLSFSSGIQLKEIMKKIPCLGNTS<br />
KSNPRVGVKLDSSVSDTHTVKSMHYYCKVLASEASELLDVYKDLQSLESASKIQVKSLAQNIQAIIKRLEKLKQELTASETDGPASEVFCNTLKDFISIA<br />
ETEMATVLSLYSVVRKKADALPPYFGEDPNQCPFEQLTMTLFNFIKLFKKAHEENVKQADLEKKKAMKQIDLRRANDTEIMLTKVNIPLADMMAAVLGMD<br />
EYVLDVDQIENLIRFCPTKEEMELLKNYTGDKATLGKCEQLAKAKAPLKEHFRVINAFPSLTPQYFLEVMKVPGVESKLRAFSFKIQFGTQIAELNKGLN<br />
AVNSACEEVRTSEKLKEIMANILCMGNILNQGTAEGSAVGFKLKSLLILSDTCAPNSKMTLMHYLCKVLASKASDLLDFHKDLESLESASKIQLKSLAEE<br />
IQAITKGLEKLNKQLTASESDGPVSQVFRKVLKDFISMAETQVATVSSLYSSVGKNADALAHYFGEDPNHYPFEKVTTTLLSFIRLFKKAHEENVKQADL<br />
DKNKDAKEAEMEKTK<br />
GENE ID: 81624 DIAPH3 | diaphanous homolog 3 (Drosophila) [Homo sapiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 141 bits (355), Expect = 3e-33, Method: Compositional matrix adjust.<br />
Identities = 91/327 (27%), Positives = 163/327 (49%), Gaps = 26/327 (7%)<br />
Query 470 IDLRRANDTEIMLTKVNIPLADMMAAVLGMDEYVLDVDQIENLIRFCPTKEEMELLKNYT 529<br />
+D + A + I L+ +P ++ +L +DE L I+NLI+ P +E++ L +<br />
Sbjct 715 LDSKIAQNLSIFLSSFRVPYEEIRMMILEVDETRLAESMIQNLIKHLPDQEQLNSLSQFK 774<br />
Query 530 GDKATLGKCEQLAKAKAPLKEHFRVINAFPSLTPQYFLEVM-KVPGVESKLRAFSFKIQF 588<br />
+ + L CE P+ F+ VM V + +L A FK+QF<br />
Sbjct 775 SEYSNL--CE-----------------------PEQFVVVMSNVKRLRPRLSAILFKLQF 809<br />
Query 589 GTQIAELNKGLNAVNSACEEVRTSEKLKEIMANILCMGNILNQGTAEGSAVGFKLKSLLI 648<br />
Q+ + + AV++ACEE++ S+ +++ +L MGN +N G+ GF L SL<br />
Sbjct 810 EEQVNNIKPDIMAVSTACEEIKKSKSFSKLLELVLLMGNYMNAGSRNAQTFGFNLSSLCK 869<br />
Query 649 LSDTCAPNSKMTLMHYLCKVLASKASDLLDFHKDLESLESASKIQLKSLAEEIQAITKGL 708<br />
L DT + + K TL+H+L ++ K D+L+F DLE L+ ASK+ +++L + ++ + + L<br />
Sbjct 870 LKDTKSADQKTTLLHFLVEICEEKYPDILNFVDDLEPLDKASKVSVETLEKNLRQMGRQL 929<br />
Query 709 EKLNKQLTASESDGPVSQVFRKVLKDFISMAETQVATVSSLYSSVGKNADALAHYFGEDP 768<br />
++L K+L + F + F+ A+ Q T+S L+ ++ K ++ Y+ D<br />
Sbjct 930 QQLEKELETFPPPEDLHDKFVTKMSRFVISAKEQYETLSKLHENMEKLYQSIIGYYAIDV 989<br />
Query 769 NHYPFEKVTTTLLSFIRLFKKAHEENV 795<br />
E T L +F F +A +EN+<br />
Sbjct 990 KKVSVEDFLTDLNNFRTTFMQAIKENI 1016<br />
Score = 100 bits (248), Expect = 8e-21, Method: Compositional matrix adjust.<br />
Identities = 76/325 (23%), Positives = 154/325 (47%), Gaps = 31/325 (9%)<br />
Query 135 ANNKVINLS---MPLPDMMTAVLAMDESVVDVDQIEKLIKFCPTNEEMELLKTYTGDKAA 191<br />
A N I LS +P ++ +L +DE+ + I+ LIK P E++ L + + +<br />
Sbjct 720 AQNLSIFLSSFRVPYEEIRMMILEVDETRLAESMIQNLIKHLPDQEQLNSLSQFKSEYSN 779<br />
Query 192 LGKYEQYLLELMKVPRLEAKLRVFSFKTQFGTKITELKERLNVVTSACEEVRSSEKLKEI 251<br />
L + EQ+++ + V RL +L FK QF ++ +K + V++ACEE++ S+ ++<br />
Sbjct 780 LCEPEQFVVVMSNVKRLRPRLSAILFKLQFEEQVNNIKPDIMAVSTACEEIKKSKSFSKL 839<br />
Query 252 MKKIPCLGNTSNQGPDRGKSSVVDKNLSFSSGIQLKEIMKKIPCLGNTSKSNPRVGVKLD 311<br />
++ + +GN N G ++ + SS +LK D<br />
Sbjct 840 LELVLLMGNYMNAGSRNAQTF----GFNLSSLCKLK-----------------------D 872<br />
Query 312 SSVSDTHTVKSMHYYCKVLASEASELLDVYKDLQSLESASKIQVKSLAQNIQAIIKRLEK 371<br />
+ +D T +H+ ++ + ++L+ DL+ L+ ASK+ V++L +N++ + ++L++<br />
Sbjct 873 TKSADQKTT-LLHFLVEICEEKYPDILNFVDDLEPLDKASKVSVETLEKNLRQMGRQLQQ 931<br />
Query 372 LKQELTASETDGPASEVFCNTLKDFISIAETEMATVLSLYSVVRKKADALPPYFGEDPNQ 431<br />
L++EL + F + F+ A+ + T+ L+ + K ++ Y+ D +<br />
Sbjct 932 LEKELETFPPPEDLHDKFVTKMSRFVISAKEQYETLSKLHENMEKLYQSIIGYYAIDVKK 991
Query 432 CPFEQLTMTLFNFIKLFKKAHEENV 456<br />
E L NF F +A +EN+<br />
Sbjct 992 VSVEDFLTDLNNFRTTFMQAIKENI 1016<br />
>AT5G09350<br />
MQMAQFLSLVRGDSIESPREITSPSNLISESGSNGWLIRFFDSSFFCEWIAVSYLYKHQHSGVRDYLCNRMYTLPLSGIESYLFQICYLMVHKPSPSLDK<br />
FVIDICAKSLKIALKVHWFLLAELEDSDDNEGISRIQEKCQIAATLVGEWSPLMRPHNEPSTPGSKVLNKFLSSKQKLFSLTLSPPTQKSLLFSPTSGSN<br />
LQDDGSQLSADDNKIFKRLIPSPKVRDALLFRKSADKEDEECEKDGFFKRLLRDSRGEDDEQRSNSEGFFKRLLKDNKSEEEEISNNSEGFFKRLRSSKG<br />
DEEELTSSSDGFFKRLLRDNKGDEEELGANSEGFFKKLLRDSKNEDEEPNANTEGFFKKLFHESKNEDDKVSNAVDDEEKDGFLKKLFKEKFDEKRNGNE<br />
RNETDETVYTDETSGEDNGREGFFKKLFKEKFEDKPNIGKADDGNESEDDESSEFSLFRRLFRRHPEDVKTTLPSENCSNGGFVESSPGTENFFRKLFRD<br />
RDRSVEDSELFGSKKYKEKCPGSPKPQNNTPSKKPPLPNNTAAQFRKGSYHESLEFVHALCETSYDLVDIFPIEDRKTALRESIAEINSHLAQAETTGGI<br />
CFPMGRGVYRVVNIPEDEYVLLNSREKVPYMICVEVLKAETPCGAKTTSTSLKLSKGGIPLANGDAFLHKPPPWAYPLSTAQEVYRNSADRMSLSTVEAI<br />
DQAMTHKSEVKLVNACLSVETHSNSNTKSVSSGVTGVLRTGLESDLEWVRLVLTADPGLRMESITDPKTPRRKEHRRVSSIVAYEEVRAAAAKGEAPPGL<br />
PLKGAGQDSSDAQPMANGGMLKAGDALSGEFWEGKRLRIRKDSIYGNLPGWDLRSIIVKSGDDCRQEHLAVQLISHFFDIFQEAGLPLWLRPYEVLVTSS<br />
YTALIETIPDTASIHSIKSRYPNITSLRDFFDAKFKENSPSFKLAQRNFVESMAGYSLVCYLLQIKDRHNGNLLMDEEGHIIHIDFGFMLSNSPGGVNFE<br />
SAPFKLTRELLEVMDSDAEGLPSEFFDYFKVLCIQGFLTCRKHAERIILLVEMLQDSGFPCFKGGPRTIQNLRKRFHLSLTEEQCVSLVLSLISSSLDAW<br />
RTRQYDYYQRVLNGIR<br />
GENE ID: 5298 PI4KB | phosphatidyl<strong>in</strong>ositol 4-k<strong>in</strong>ase, catalytic, beta<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 280 bits (717), Expect = 3e-75, Method: Compositional matrix adjust.<br />
Identities = 135/287 (47%), Positives = 196/287 (68%), Gaps = 7/287 (2%)<br />
Query 830 EFWEGKRLRIRKDSIYGNLPGWDLRSIIVKSGDDCRQEHLAVQLISHFFDIFQEAGLPLW 889<br />
E W+ K RIR+ S YG+LP W L S+IVK GDD RQE LA Q++ I+++ +PLW<br />
Sbjct 535 EPWQEKVRRIREGSPYGHLPNWRLLSVIVKCGDDLRQELLAFQVLKQLQSIWEQERVPLW 594<br />
Query 890 LRPYEVLVTSSYTALIETIPDTASIHSIKSRYPNITSLRDFFDAKFKENSPSFKLAQRNF 949<br />
++PY++LV S+ + +IE + + SIH +K + ++ L F + +F AQRNF<br />
Sbjct 595 IKPYKILVISADSGMIEPVVNAVSIHQVK-KQSQLSLLDYFLQEHGSYTTEAFLSAQRNF 653<br />
Query 950 VESMAGYSLVCYLLQIKDRHNGNLLMDEEGHIIHIDFGFMLSNSPGGVNFESAPFKLTRE 1009<br />
V+S AGY LVCYLLQ+KDRHNGN+L+D EGHIIHIDFGF+LS+SP + FE++ FKLT E<br />
Sbjct 654 VQSCAGYCLVCYLLQVKDRHNGNILLDAEGHIIHIDFGFILSSSPRNLGFETSAFKLTTE 713<br />
Query 1010 LLEVMDSDAEGLPSEFFDYFKVLCIQGFLTCRKHAERIILLVEMLQD-SGFPCFKGGPRT 1068<br />
++VM GL + F+Y+K+L +QG + RKH ++++ +VE++Q S PCF G T<br />
Sbjct 714 FVDVMG----GLDGDMFNYYKMLMLQGLIAARKHMDKVVQIVEIMQQGSQLPCFHGS-ST 768<br />
Query 1069 IQNLRKRFHLSLTEEQCVSLVLSLISSSLDAWRTRQYDYYQRVLNGI 1115<br />
I+NL++RFH+S+TEEQ LV ++ S+ + T+ YD +Q + NGI<br />
Sbjct 769 IRNLKERFHMSMTEEQLQLLVEQMVDGSMRSITTKLYDGFQYLTNGI 815<br />
Score = 54.3 bits (129), Expect = 4e-07, Method: Compositional matrix adjust.<br />
Identities = 26/95 (27%), Positives = 49/95 (51%), Gaps = 3/95 (3%)<br />
Query 31 SGSNGWLIRFFDSSFFCEWIAVSYLYKHQHSGVRDYLCNRMYTLPLSGIESYLFQICYLM 90<br />
S WL+R F+S F +A+SYLY + GV+ Y+ NR++ ++ YL Q+ +<br />
Sbjct 124 SAKQSWLLRLFESKLFDISMAISYLYNSKEPGVQAYIGNRLFCFRNEDVDFYLPQLLNMY 183<br />
Query 91 VHK---PSPSLDKFVIDICAKSLKIALKVHWFLLA 122<br />
+H ++ +++ C +S+ +L+ L A<br />
Sbjct 184 IHMDEDVGDAIKPYIVHRCRQSINFSLQCALLLGA 218<br />
Score = 42.0 bits (97), Expect = 0.002, Method: Compositional matrix adjust.<br />
Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 5/86 (5%)<br />
Query 555 EFVHALCETSYDLVDIFPIEDRKTALRESIAEINSHLAQAETTGGICFPMGRGVYRVVNI 614<br />
EF+ +L L + P +++KT + I+E++ L + + P + VV +<br />
Sbjct 327 EFIKSLMAIGKRLATL-PTKEQKT--QRLISELS--LLNHKLPARVWLPTAGFDHHVVRV 381<br />
Query 615 PEDEYVLLNSREKVPYMICVEVLKAE 640<br />
P + V+LNS++K PY+I VEVL+ E<br />
Sbjct 382 PHTQAVVLNSKDKAPYLIYVEVLECE 407<br />
>AT5G17380<br />
MADKSETTPPSIDGNVLVAKSLSHLGVTHMFGVVGIPVTSLASRAMALGIRFIAFHNEQSAGYAASAYGYLTGKPGILLTVSGPGCVHGLAGLSNAWVNT<br />
WPMVMISGSCDQRDVGRGDFQELDQIEAVKAFSKLSEKAKDVREIPDCVSRVLDRAVSGRPGGCYLDIPTDVLRQKISESEADKLVDEVERSRKEEPIRG<br />
SLRSEIESAVSLLRKAERPLIVFGKGAAYSRAEDELKKLVEITGIPFLPTPMGKGLLPDTHEFSATAARSLAIGKCDVALVVGARLNWLLHFGESPKWDK<br />
DVKFILVDVSEEEIELRKPHLGIVGDAKTVIGLLNREIKDDPFCLGKSNSWVESISKKAKENGEKMEIQLAKDVVPFNFLTPMRIIRDAILAVEGPSPVV<br />
VSEGANTMDVGRSVLVQKEPRTRLDAGTWGTMGVGLGYCIAAAVASPDRLVVAVEGDSGFGFSAMEVETLVRYNLAVVIIVFNNGGVYGGDRRGPEEISG<br />
PHKEDPAPTSFVPNAGYHKLIEAFGGKGYIVETPDELKSALAESFAARKPAVVNVIIDPFAGAESGRLQHKN<br />
GENE ID: 26061 HACL1 | 2-hydroxyacyl-CoA lyase 1 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 466 bits (1198), Expect = 5e-131, Method: Compositional matrix adjust.<br />
Identities = 239/569 (42%), Positives = 359/569 (63%), Gaps = 18/569 (3%)<br />
Query 2 ADKSETTPPSIDGNVLVAKSLSHLGVTHMFGVVGIPVTSLASRAMALGIRFIAFHNEQSA 61<br />
++ +E + + G ++A++L V ++FG+VGIPVT +A A LGI++I NEQ+A<br />
Sbjct 4 SNFAERSEEQVSGAKVIAQALKTQDVEYIFGIVGIPVTEIAIAAQQLGIKYIGMRNEQAA 63<br />
Query 62 GYAASAYGYLTGKPGILLTVSGPGCVHGLAGLSNAWVNTWPMVMISGSCDQRDVGRGDFQ 121<br />
YAASA GYLT +PG+ L VSGPG +H L G++NA +N WP+++I GS ++ G FQ<br />
Sbjct 64 CYAASAIGYLTSRPGVCLVVSGPGLIHALGGMANANMNCWPLLVIGGSSERNQETMGAFQ 123<br />
Query 122 ELDQIEAVKAFSKLSEKAKDVREIPDCVSRVLDRAVSGRPGGCYLDIPTDVLRQKISESE 181<br />
E Q+EA + ++K S + + IP + + + ++ GRPG CY+DIP D + +++ +<br />
Sbjct 124 EFPQVEACRLYTKFSARXSSIEAIPFVIEKAVRSSIYGRPGACYVDIPADFVNLQVNVNS 183<br />
Query 182 ADKLVDEVERSRKEEPIRGSLRSEIESAVSLLRKAERPLIVFGKGAAYSRAEDELKKLVE 241<br />
+ +ER PI + S + +A S++R A++PL++ GKGAAY+ AE+ +KKLVE<br />
Sbjct 184 ----IKYMERCMS-PPISMAETSAVCTAASVIRNAKQPLLIIGKGAAYAHAEESIKKLVE 238<br />
Query 242 ITGIPFLPTPMGKGLLPDTHEFSATAARSLAIGKCDVALVVGARLNWLLHFGESPKWDKD 301<br />
+PFLPTPMGKG++PD H + AARS A+ DV ++ GARLNW+LHFG P++ D
Sbjct 239 QYKLPFLPTPMGKGVVPDNHPYCVGAARSRALQFADVIVLFGARLNWILHFGLPPRYQPD 298<br />
Query 302 VKFILVDVSEEEI-ELRKPHLGIVGDAKTVIGLLNREIKDDPFCLGKSNSWVESISKKAK 360<br />
VKFI VD+ EE+ KP + ++G+ V L E+ P+ + W +++ +K K<br />
Sbjct 299 VKFIQVDICAEELGNNVKPAVTLLGNIHAVTKQLLEELDKTPWQYPPESKWWKTLREKMK 358<br />
Query 361 ENGEKMEIQLAKDVVPFNFLTPMRIIRDAILAVEGPSPVVVSEGANTMDVGRSVLVQKEP 420<br />
N + +K +P N+ T +++ + VVSEGANTMD+GR+VL P<br />
Sbjct 359 SNEAASKELASKKSLPMNYYTVFYHVQEQLPR----DCFVVSEGANTMDIGRTVLQNYLP 414<br />
Query 421 RTRLDAGTWGTMGVGLGYCIAAAVASPDR----LVVAVEGDSGFGFSAMEVETLVRYNLA 476<br />
R RLDAGT+GTMGVGLG+ IAAAV + DR ++ VEGDS FGFS MEVET+ RYNL<br />
Sbjct 415 RHRLDAGTFGTMGVGLGFAIAAAVVAKDRSPGHWIICVEGDSAFGFSGMEVETICRYNLP 474<br />
Query 477 VVIIVFNNGGVYGG-DRRGPEEISGPHKEDPA--PTSFVPNAGYHKLIEAFGGKGYIVET 533<br />
++++V NN G+Y G D +E+ P +PN+ Y +++ AFGGKGY V+T<br />
Sbjct 475 IILLVVNNNGIYQGFDTDTWKEMLKFQDATAVVPPMCLLPNSHYEQVMTAFGGKGYFVQT 534<br />
Query 534 PDELKSALAESFA-ARKPAVVNVIIDPFA 561<br />
P+EL+ +L +S A KP+++N++I+P A<br />
Sbjct 535 PEELQKSLEQSLADTTKPSLINIMIEPQA 563<br />
>AT5G19440<br />
MANSGEGKVVCVTGASGYIASWLVKFLLSRGYTVKASVRDPSDPKKTQHLVSLEGAKERLHLFKADLLEQGSFDSAIDGCHGVFHTASPFFNDAKDPQAE<br />
LIDPAVKGTLNVLNSCAKASSVKRVVVTSSMAAVGYNGKPRTPDVTVDETWFSDPELCEASKMWYVLSKTLAEDAAWKLAKEKGLDIVTINPAMVIGPLL<br />
QPTLNTSAAAILNLINGAKTFPNLSFGWVNVKDVANAHIQAFEVPSANGRYCLVERVVHHSEIVNILRELYPNLPLPERCVDENPYVPTYQVSKDKTRSL<br />
GIDYIPLKVSIKETVESLKEKGFAQF<br />
GENE ID: 50814 NSDHL | NAD(P) dependent steroid dehydrogenase-like<br />
[Homo sapiens] (Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 53.1 bits (126), Expect = 1e-06, Method: Compositional matrix adjust.<br />
Identities = 63/250 (25%), Positives = 102/250 (40%), Gaps = 39/250 (15%)<br />
Query 3 NSGEGKVVCVTGASGYIASWLVKFLLSRGYTVKA-SVRDPSDPKKTQHLVSLEGAKERLH 61<br />
N + K V G SG++ +V+ LL+RGY V ++ D ++<br />
Sbjct 33 NQNQAKRCTVIGGSGFLGQHMVEQLLARGYAVNVFDIQQGFD-------------NPQVR 79<br />
Query 62 LFKADLLEQGSFDSAIDGCHGVFHTASPFFNDAKDPQAELIDPAVKGTLNVLNSCAKASS 121<br />
F DL + A+ G + VFH ASP + + + GT NV+ +C K +<br />
Sbjct 80 FFLGDLCSRQDLYPALKGVNTVFHCASP--PPSSNNKELFYRVNYIGTKNVIETC-KEAG 136<br />
Query 122 VKRVVVTSSMAAV--GYNGKPRTPDVTVDETWFSDPELCEASKMWYVLSKTLAEDAAWKL 179<br />
V+++++TSS + + G + K T D+ +Y +K L E A<br />
Sbjct 137 VQKLILTSSASVIFEGVDIKNGTEDLPYAMKPID----------YYTETKILQERAVLGA 186<br />
Query 180 AK-EKGLDIVTINPAMVIGPL---LQPTLNTSA--AAILNLINGAKTFPNLSFGWVNVKD 233<br />
EK I P + GP L P L +A + +I K + +F V++<br />
Sbjct 187 NDPEKNFLTTAIRPHGIFGPRDPQLVPILIEAARNGKMKFVIGNGKNLVDFTF----VEN 242<br />
Query 234 VANAHIQAFE 243<br />
V + HI A E<br />
Sbjct 243 VVHGHILAAE 252<br />
>AT5G20980<br />
MGQLALQRLQPLASLPRRPPSLPPPSSATPSLPCATASRRPRFYVARAMSSHIVGYPRIGPKRELKFALESFWDGKTNVDDLQNVAANLRKSIWKHMAHA<br />
GIKYIPSNTFSYYDQMLDTTAMLGAVPSRYGWESGEIGFDVYFSMARGNASAHAMEMTKWFDTNYHYIVPELGPDVNFSYASHKAVVEFKEAKALGIDTV<br />
PVLIGPMTYLLLSKPAKGVEKSFCLLSLIDKILPVYKEVLADLKSAGARWIQFDEPILVMDLDTSQLQAFSDAYSHMESSLAGLNVLIATYFADVPAEAY<br />
KTLMSLKCVTGFGFDLVRGLETLDLIKMNFPRGKLLFAGVVDGRNIWANDLSASLKTLQTLEDIVGKEKVVVSTSCSLLHTAVDLVNEMKLDKELKSWLA<br />
FAAQKVVEVNALAKSFSGAKDEALFSSNSMRQASRRSSPRVTNAAVQQDVDAVKKSDHHRSTEVSVRLQAQQKKLNLPALPTTTIGSFPQTTDLRRIRRE<br />
FKAKKISEVDYVQTIKEEYEKVIKLQEELGIDVLVHGEAERNDMVEFFGEQLSGFAFTSNGWVQSYGSRCVKPPIIYGDITRPKAMTVFWSSMAQKMTQR<br />
PMKGMLTGPVTILNWSFVRNDQPRHETCFQIALAIKDEVEDLEKAGVTVIQIDEAALREGLPLRKSEQKFYLDWAVHAFRITNSGVQDSTQIHTHMCYSN<br />
FNDIIHSIIDMDADVITIENSRSDEKLLSVFHEGVKYGAGIGPGVYDIHSPRIPSTEEIAERINKMLAVLDSKVLWVNPDCGLKTRNYSEVKSALSNMVA<br />
AAKLIRSQLNKS<br />
GENE ID: 550631 CCDC157 | coiled-coil doma<strong>in</strong> conta<strong>in</strong><strong>in</strong>g 157 [Homo sapiens]<br />
(10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 36.2 bits (82), Expect = 0.13, Method: Composition-based stats.<br />
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)<br />
Query 10 QPLASLPRRPPSLPP--PSSATPSLPCATASRRP 41<br />
QP S PR+P + PP P ++ P PC + SR+P<br />
Sbjct 137 QPCTSPPRQPCTSPPRQPCTSPPRQPCTSPSRQP 170<br />
>AT5G23680<br />
MAELQLVEGHQINGGFIPPAIINSIEAPETSAAAGVSVGSKRLRRPSVRLGDIGGDQYHQHVVAAYDSPQVRRPKWRPSGGGGGGGGNRKEPNNQSGKTT<br />
SSSRTRTMTNLSSGGYENTGTLDEDPVSIGSWRVKKWVKSSGGETAATTTTNTASAKRVRSNWATRNDGVEQGDEKFSGEEEEEEEDEELGGEEGFRDFS
REDSESPMKERRRRYENREVELLGDWQQSGGRGKEGVKIWLQE<br />
ELGLGRYWPMFEMHEV EVDEQVLPLLTLEDLK KDMGINAVGSRRKMYCAIQKLGREFS<br />
GENE ID: 800114<br />
BICC1 | biccaudal<br />
C homolog g 1 (Drosophila) ) [Homo sapiens]<br />
(10 or feweer<br />
PubMed l<strong>in</strong>ks) )<br />
Score = 555.5<br />
bits (132), Expect = 2e-07 7, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 24/55 (43%), , Positives = 35 5/55 (63%), Gapss<br />
= 0/55 (0%)<br />
Query 238<br />
Sbjct 881<br />
>AT5G27380<br />
MGSGCSSLSYSSSSSTCNATVFSISSSSPSSSSSLKLNPSSFL<br />
LFQNPKTLRNQSPLRC RCGRSFKMESQKPIFD DLEKLDDEFVQKLVYDALVWSSLHGLVVGDKK<br />
SYQKSGNVPGVVGLMHAPIALLPTAFPPEAYWKQACNVTPLFN<br />
NELIDRVSLDGKFLQD QDSLSRTKKVDVFTSR RLLDIHSKMLERNKKEDIRLGLHRFDYMLDEE<br />
ETNSLLQIEMNNTISCSFPGLSRLVSQQLHQSLLRSYGDQIGI<br />
IDSERVPINTSTIQFA FADALAKAWLEYSNPR RAVVMVIVQPEERNMY YDQHLLSSILREKHNII<br />
VVIRKTLAEVEEKEGSVQEDETLIVGGGQAVAVVYFRSGYTPN<br />
NDHPSESEWNARLLIEEESSAVKCPSIAYHL<br />
LTGSKKIQQELAKPGV VLERFLDNKEDIAKLRR<br />
KCFAGLWSLDDDSEIVKQAIEKPGLFVVMKPQREGGGNNIYGD<br />
DDVRENLLRLQKEGEE EEGNAAYILMQRIFPK KVSNMFLVREGVYHKH HQAISELGVYGAYLRSS<br />
KDEVIVNEQSGGYLMRTKIASSDEGGVVAAGFGVLDSIYLI<br />
GENE ID: 29937<br />
GSS | glutatthione<br />
synthetas se [Homo sapienss]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 3332<br />
bits (851), Expect = 1e-90 0, Method: Compoositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 186/432 (43% %), Positives = 272/432 (62%), Gaps = 15/432 (3%)<br />
Query 115<br />
Sbjct 47<br />
Query 175<br />
Sbjct 107<br />
Query 232<br />
Sbjct 167<br />
Query 292<br />
Sbjct 225<br />
Query 352<br />
Sbjct 283<br />
Query 411<br />
Sbjct 343<br />
Query 467<br />
Sbjct 401<br />
Query 526<br />
Sbjct 461<br />
>AT5G35630<br />
MAQILAASPTCCQMRVPKHSSVIASSSSKLWSSVVLKQKKQSN<br />
NNKVRGFRVLALQSDN DNSTVNRVETLLNLDT TKPYSDRIIAEYIWIG GGSGIDLRSKSRTIEKK<br />
PVEDPSELPKWWNYDGSSTGQAPGEDSSEVILYPQAIFRDPFR<br />
RGGNNILVICDTWTPA PAGEPIPTNKRAKAAE EIFSNKKVSGEVPWFG GIEQEYTLLQQNVKWPP<br />
LGWPVGAFPGPPQGPYYCGVGADKIWGGRDISDAHYKACLYAG<br />
GINISGTNGEVMPGQW QWEFQVGPSVGIDAGD DHVWCARYLLERITEQ QAGVVLTLDPKPIEGDD<br />
WNGAGCHTNYSSTKSMREEGGFEVIKKKAILNLSLRHKEHISA<br />
AYGEGNERRLTGKHET ETASIDQFSWGVANRG GCSIRVGRDTEAKGKG GYLEDRRPASNMDPYII<br />
VTSLLAETTLLLWEPTLEAEALAAQKLLSLNV<br />
pdb|2OJW|C Cha<strong>in</strong> C, CCrystal<br />
Structur re Of Human Gluttam<strong>in</strong>e<br />
Synthetas se In Complex<br />
With Adp Annd<br />
Phosphate<br />
12 more seequence<br />
titles<br />
Score = 3397<br />
bits (1021), , Expect = 2e-1 110, Method: Commpositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 196/371 (52% %), Positives = 249/371 (67%), Gaps = 12/371 (3%)<br />
Query 47<br />
Sbjct 20<br />
Query 107<br />
Sbjct 75<br />
Query 167<br />
Sbjct 135<br />
Query 227<br />
Sbjct 192<br />
Query 287<br />
Sbjct 252<br />
Query 343<br />
Sbjct 312<br />
Query 403<br />
Sbjct 372<br />
WLQELGLGRYWPMFFEMHEVDEQVLPLLTL<br />
LEDLKDMGINAVGSRR RRKMYCAIQKLGR 292<br />
+LGLG+Y +FF+<br />
E+D Q LT +DLK++GI G+RR RRKM AI +L +<br />
LFSKLGLGKYTDVFFQQQEIDLQTFLTLTD<br />
DQDLKELGITTFGARR RRKMLLAISELNK 935<br />
HAPIALLPTAFPEAAYWKQACNVTPLFNEL<br />
LIDRVSLDGKFLQDSLLSRTKKVDVFTSRLL<br />
LDI 174<br />
+AP L P+ P A +QA V FN L+D L VS + FL+ +LLS<br />
T K D FT+RL DI<br />
YAPFTLFPSLVPSAALLEQAYAVQMDFNLL<br />
LVDAVSQNAAFLEQTLLSSTIKQDDFTARLF<br />
FDI 106<br />
HSKMLERNKKEDIRRLGLHRFDYMLDEETN<br />
N---SLLQIEMNTISCCSFPGLSRLVSQLHQ<br />
QSL 231<br />
H ++L+ + + LGL+R DYM + +L QIE+NTIS SF GL+ +H+ +<br />
HKQVLKEGIAQTVFFLGLNRSDYMFQRSAD<br />
DGSPALKQIEINTISAASFGGLASRTPAVHR<br />
RHV 166<br />
LRSYGDQIGIDSERRVPINTSTIQFADALA<br />
AKAWLEYSNPRAVVMV MVIVQPEERNMYDQHL LLS 291<br />
L ++ + ++ N + A +A AKAW Y +P A+V++ +I Q +ERN++DQ +<br />
LSVLSKT--KEAGKKILSNNPSKGLALGIA<br />
AKAWELYGSPNALVLLLIAQEKERNIFDQRA<br />
AIE 224<br />
SILREKHNIVVIRKKTLAEVEKEGSVQEDE<br />
ETLIVGGQAVAVVYFR FRSGYTPNDHPSESEW WNA 351<br />
+ L + NI VIR+ +T ++ ++GS+ +D L V GQ +AVVYFR FR GY P + S W A<br />
NELLAR-NIHVIRRRTFEDISEKGSLDQDR<br />
RRLFVDGQEIAVVYFR FRDGYMPRQY-SLQNW WEA 282<br />
RLLIEESSAVKCPSSIAYHLTGSKKIQQEL<br />
LAKPGVLERFLDNK-EEDIAKLRKCFAGLWSLD<br />
410<br />
RLL+E S A KCP IA L G+KK+QQEL L++PG+LE L + E +A+LR FAGL+SLD<br />
RLLLERSHAAKCPDDIATQLAGTKKVQQEL<br />
LSRPGMLEMLLPGQPEEAVARLRATFAGLYSLD<br />
342<br />
DSE----IVKQAIEEKPGLFVMKPQREGGG<br />
GNNIYGDDVRENLLRL RLQKEGEEGNAAYILM MQR 466<br />
E + +A+ P FV+KPQREGGG GNN+YG+++ + L +LL<br />
K+ EE A+YILM M++<br />
VGEEGDQAIAEALAAAPSRFVLKPQREGGG<br />
GNNLYGEEMVQALKQL QL-KDSEE-RASYILM MEK 400<br />
IFPKVSNMFLVREGGVYHK-HQAISELGVY<br />
YGAYLRSKDEVIVNEQQSGYLMRTKIASSDEGG<br />
525<br />
I P+ L+R G + Q ISELG++ +G Y+R ++ +++N+ G+L+RTK + GG<br />
IEPEPFENCLLRPGGSPARVVQCISELGIF<br />
FGVYVRQEETLVMNKH KHVGHLLRTKAIEHAD DGG 460<br />
VAAGFGVLDSIY<br />
VAAG VLD+ Y<br />
VAAGVAVLDNPY<br />
537<br />
472<br />
FRVLALQSDNSTVNNRVETLLNLDTKPYSD<br />
DRIIAEYIWIGGSGIDDLRSKSRTIEKPVED<br />
DPS 106<br />
F+ +A N + +V L P +++ + A YIWI G+G LR K+RT++ +<br />
FQSMASSHLNKGIKKQVYMSL-----PQGE<br />
EKVQAMYIWIDGTGEGGLRCKTRTLDSEPKC<br />
CVE 74<br />
ELPKWNYDGSSTGQQAPGEDSEVILYPQAI<br />
IFRDPFRGGNNILVICCDTWTPAGEPIPTNK<br />
KRA 166<br />
ELP+WN+DGSST QQ+<br />
G +S++ L P A+ +FRDPFR N LV+CC+<br />
+ P TN R<br />
ELPEWNFDGSSTLQQSEGSNSDMYLVPAAM<br />
MFRDPFRKDPNKLVLCCEVFKYNRRPAETNL<br />
LRH 134<br />
KAAEIFSNKKVSGEEVPWFGIEQEYTLLQQ<br />
QNVKWPLGWPVGAFPGGPQGPYYCGVGADKIWG<br />
226<br />
I VS + PWFG+EQEYTL+ + P GWP FPGGPQGPYYCGVGAD+<br />
+G<br />
TCKRIMD--MVSNQQHPWFGMEQEYTLMGT<br />
TDGH-PFGWPSNGFPGGPQGPYYCGVGADRA<br />
AYG 191<br />
RDISDAHYKACLYAAGINISGTNGEVMPGQ<br />
QWEFQVGPSVGIDAGD GDHVWCARYLLERITEQA<br />
286<br />
RDI +AHY+ACLYAAG+<br />
I+GTN EVMP QWEFQ+GP Q<br />
GI GD GDH+W AR++L R+ E<br />
RDIVEAHYRACLYAAGVKIAGTNAEVMPAQ<br />
QWEFQIGPCEGISMGD GDHLWVARFILHRVCEDF<br />
251<br />
GVVLTLDPKPIEGDDWNGAGCHTNYSTKSM<br />
MREEGGFEVIKKAILNNLSLRHKEHISAY----<br />
342<br />
GV+ T DPKPI G+ +WNGAGCHTN+STK+M MREE G + I++AI LS RH+ HI AY<br />
GVIATFDPKPIPGNNWNGAGCHTNFSTKAM<br />
MREENGLKYIEEAIEKKLSKRHQYHIRAYDP<br />
PKG 311<br />
GEGNERRLTGKHETTASIDQFSWGVANRGC<br />
CSIRVGRDTEAKGKGY GYLEDRRPASNMDPYIVT<br />
402<br />
G N RRLTG HETT++I+<br />
FS GVANR SIR+ R + KGY GY EDRRP++N DP+ VT<br />
GLDNARRLTGFHETTSNINDFSAGVANRSA<br />
ASIRIPRTVGQEKKGY GYFEDRRPSANCDPFSVT<br />
371<br />
SLLAETTLLWE 4413<br />
L T LL E<br />
EALIRTCLLNE 3382
AT5G37590<br />
MSLLRILSTLYYKGTHRTSRSFSSSRNNLICTTFANPLSGKPR<br />
RISYQNDYGGHRTNLH LHLLDSRLWIILSGQA AAILGFCGNTVLAEDESMKSKSGDNMDESGNN<br />
TGLEKIEDGSVVVSNIHTSKWRVFTDSSGRDYFFQGKLEPAER<br />
RLFGSAIQEAKEGFGE GEKDPHVASACNNLAE ELYRVKKEFDKAEPLY YLEAVSILEEFYGPDDD<br />
VRVGATLHNLGGQLYLVQRKLEEARACCYELKGRVLGYNHPDY<br />
YAETMYHLGTEKIQMR MRKLLFWILLKYLRHE EGGQGESMAYIRRLRY YLSQIYIRSNRLAEAEE<br />
KLQRKLLHMMEELSKGWNSMEAITAAEEALALTLRLSGKLGEA<br />
ALELFEKCLNARKKLL LLPEGHIQIGGNLLHIAKTFMLQASQMRRTDNSEALSKLEKAKNYLL<br />
ENSARIAKDVLLHKLKNQKSKAQKDEKKSSAALRNYEHAALVI<br />
ILLQSLESLAALEMSKKNEIHEPKEENLHAA<br />
AEDSLLQCVTAYKEFG GYGTQLQDSSEVKSEYY<br />
LSCLKHLSALLLAKKETTLNSKASPISSLPELKEEIKRIDIDL<br />
LRSQKTG<br />
GENE ID: 899953<br />
KLC4 | k<strong>in</strong>ees<strong>in</strong><br />
light cha<strong>in</strong> n 4 [Homo sapienns]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 76. .3 bits (186), Expect = 1e-13, , Method: Compossitional<br />
matrix x adjust.<br />
Identitiess<br />
= 63/202 (31%) ), Positives = 97/202 9 (48%), Ga Gaps = 27/202 (13%)<br />
Query 130<br />
Sbjct 144<br />
Query 190<br />
Sbjct 204<br />
Query 246<br />
Sbjct 264<br />
Query 294<br />
Sbjct 316<br />
Score = 399.3<br />
bits (90), Expect = 0.014, , Method: Compossitional<br />
matrix x adjust.<br />
Identitiess<br />
= 25/70 (35%), , Positives = 38 8/70 (54%), Gapss<br />
= 2/70 (2%)<br />
Query 133<br />
Sbjct 273<br />
Query 192<br />
Sbjct 333<br />
>AT5G37600<br />
MSLVSDLINLNNLSDSTDKIIAEYIWVVGGSGMDMRSKARTLP<br />
PGPVTDPSQLPKWNYD YDGSSTGQAPGEDSEV VILYPQAIFKDPFRRG GNNILVMCDAYTPAGEE<br />
PIPTNKRHAAAAKVFSNPDVAAEVPWYYGIEQEYTLLQKDVKW<br />
WPVGWPIGGYPGPQGP GPYYCGIGADKSFGRD DVVDSHYKACLYAGIN NISGINGEVMPGQWEFF<br />
QVGPAVGISAAADEIWVARYILERITEEIAGVVVSFDPKPIPG<br />
GDWNGAGAHCNYSTKS KSMREEGGYEIIKKAIDKLGLRHKEHIAAYG<br />
GEGNERRLTGHHETADD<br />
INTFLWGVANRRGASIRVGRDTEKEGKKGYFEDRRPASNMDPY<br />
YIVTSMIAETTILWNP NP<br />
> gb|EEAW91118.1|<br />
[Homo sapieens]<br />
Length=384<br />
GENE ID: 22752<br />
GLUL | gluttamate-ammonia<br />
ligase l [Homo sappiens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)<br />
Score = 3397<br />
bits (1020), , Expect = 2e-1 110, Method: Commpositional<br />
matrix<br />
adjust.<br />
Identitiess<br />
= 184/341 (53% %), Positives = 243/341 (71%), Gaps = 7/341 ( 2%)<br />
Query 17<br />
Sbjct 24<br />
Query 77<br />
Sbjct 84<br />
Query 137<br />
Sbjct 142<br />
Query 197<br />
Sbjct 201<br />
Query 257<br />
Sbjct 261<br />
Query 313<br />
Sbjct 321<br />
YFFQGKLEPAERLFFGSAIQEAKEGFGEKD<br />
DPHVASACNNLAELYR YRVKKEFDKAEPLYLEAV<br />
189<br />
Y QG+ E A L A+++ + G P VA+ N LA +YR YR + ++ +A L + A+<br />
YAAQGRYEVAVPLCCKQALEDLERTSGRGH<br />
HPDVATMLNILALVYR YRDQNKYKEAAHLLND DAL 203<br />
SILEEFYGPDDVRVVGATLHNLGQLYLVQR<br />
RKLEEARA----CYELLKGRVLGYNHPDYAETM<br />
245<br />
SI E GPD V ATL+NL LY + K +EA E+ ++ +VLG NHPD A+ +<br />
SIRESTLGPDHPAVVAATLNNLAVLYGKRG<br />
GKYKEAEPLCQRALEIIREKVLGTNHPDVAK<br />
KQL 263<br />
YHL------------GTEKIQMRKLLFWIL<br />
LLKYLRHEGGQGESMA MAYIRRLR-YLSQIYIRS<br />
293<br />
+L<br />
E+ R L + EG G + R + L+ Y+ +<br />
NNLALLCQNQGKYEEAVERYYQRALAIY--<br />
-------EGQLGPDNP NPNVARTKNNLASCYL LKQ 315<br />
NRLAEAEKLQRKLLL---HMMEL<br />
312<br />
+ AEAE L +++LL<br />
H+ E<br />
GKYAEAETLYKEILLTRAHVQEF<br />
337<br />
QGKLEPAERLFGSAAIQEAKEGFGEKDPHV<br />
VASACNNLAELYRVKK KKEFDKAEPLYLEAVSI-<br />
191<br />
QGK E ER + AA+<br />
+ G +P+V VA NNLA Y + ++ +AE LY E ++<br />
QGKYEAVERYYQRAALAIYEGQLGPDNPNV<br />
VARTKNNLASCYLKQG QGKYAEAETLYKEILTRA<br />
332<br />
-LEEFYGPDD 2000<br />
++EF DD<br />
HVQEFGSVDD 3442<br />
glutamate-ammon nia ligase (gluttam<strong>in</strong>e<br />
synthetase),<br />
is<strong>of</strong>orm CR RA_b<br />
DKIIAEYIWVGGSGGMDMRSKARTLPGPVT<br />
TDPSQLPKWNYDGSSTTGQAPGEDSEVILYP<br />
PQA 76<br />
+K+ A YIW+ G+GG<br />
+R K RTL +LP+WN+DGSSTT<br />
Q+ G +S++ L P A<br />
EKVQAMYIWIDGTGGEGLRCKTRTLDSEPK<br />
KCVEELPEWNFDGSSTTLQSEGSNSDMYLVP<br />
PAA 83<br />
IFKDPFRRGNNILVVMCDAYTPAGEPIPTN<br />
NKRHAAAKVFSNPDVA VAAEVPWYGIEQEYTL LLQ 136<br />
+F+DPFR+ N LVV+C+<br />
+ P TN N RH ++ V+ + PW+G+EQEYTL L+<br />
MFRDPFRKDPNKLVVLCEVFKYNRRPAETN<br />
NLRHTCKRIMDM--VS VSNQHPWFGMEQEYTL LMG 141<br />
KDVKWPVGWPIGGYYPGPQGPYYCGIGADK<br />
KSFGRDVVDSHYKACLLYAGINISGINGEVM<br />
MPG 196<br />
D P GWP G+ +PGPQGPYYCG+GAD+ +++GRD+V++HY+ACLLYAG+<br />
I+G N EVM MP<br />
TDGH-PFGWPSNGFFPGPQGPYYCGVGADR<br />
RAYGRDIVEAHYRACLLYAGVKIAGTNAEVM<br />
MPA 200<br />
QWEFQVGPAVGISAAADEIWVARYILERIT<br />
TEIAGVVVSFDPKPIPPGDWNGAGAHCNYSTKS<br />
256<br />
QWEFQ+GP GIS D +WVAR+IL R+ E GV+ +FDPKPIPPG+WNGAG<br />
H N+STK+<br />
QWEFQIGPCEGISMMGDHLWVARFILHRVC<br />
CEDFGVIATFDPKPIPPGNWNGAGCHTNFSTKA<br />
260<br />
MREEGGYEIIKKAIIDKLGLRHKEHIAAY-<br />
----GEGNERRLTGHH HHETADINTFLWGVAN NRG 312<br />
MREE G + I++AII+KL<br />
RH+ HI AY G N RRLTG HHET++IN<br />
F GVAN NR<br />
MREENGLKYIEEAIIEKLSKRHQYHIRAYD<br />
DPKGGLDNARRLTGFH FHETSNINDFSAGVAN NRS 320<br />
ASIRVGRDTEKEGKKGYFEDRRPASNMDPY<br />
YIVTSMIAETTIL 3353<br />
ASIR+ R +E KKGYFEDRRP++N<br />
DP+ + VT + T +L<br />
ASIRIPRTVGQEKKKGYFEDRRPSANCDPF<br />
FSVTEALIRTCLL 3361<br />
>AT5G45340<br />
MDFSGLFLTLSSAAALFLCLLRFIAGVVRRSSSTKLPLPPGTM<br />
MGYPYVGETFQLYSQD QDPNVFFAAKQRRYGS SVFKTHVLGCPCVMISSPEAAKFVLVTKSHLL<br />
FKPTFPASKERRMLGKQAIFFHQGDYHHSKLRKLVLRAFMPDA<br />
AIRNMVPHIESIAQES ESLNSWDGTQLNTYQE EMKTYTFNVALISILG GKDEVYYREDLKRCYYY<br />
ILEKGYNSMPIINLPGTLFHKAMKARKKELAQILANILSKRRQ<br />
QNPSSHTDLLGSFMED EDKAGLTDEQIADNIIGVIFAARDTTASVLTWILKYLADNPTVLEAA<br />
VTEEQMAIRKDDKKEGESLTWEDTKKMMPLTYRVIQETLRAAT<br />
TILSFTFREAVEDVEY EYEGYLIPKGWKVLPL LFRNIHHNADIFSDPG GKFDPSRFEVAPKPNTT<br />
FMPFGSGIHSCCPGNELAKLEISVLIHHHLTTKYRWSIVGPSD<br />
DGIQYGPFALPQNGLP LPIALERKP<br />
GENE ID: 566603<br />
CYP26B1 | ccytochrome<br />
P450, , family 26, subbfamily<br />
B, poly ypeptide 1<br />
[Homo sapieens]<br />
(Over 10 PuubMed<br />
l<strong>in</strong>ks)
Score = 200 bits (508), Expect = 5e-51, Method: Compositional matrix adjust.<br />
Identities = 151/501 (30%), Positives = 245/501 (48%), Gaps = 53/501 (10%)<br />
Query 1 MDFSGLFLTLSAAALFLCL---------------LRFIAGVRRSSSTKLPLPPGTMGYPY 45<br />
M F GL L + A L CL LR+ A R S KLP+P G+MG+P<br />
Sbjct 1 MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAA--TRDKSCKLPIPKGSMGFPL 58<br />
Query 46 VGETFQLYSQDPNVFFAAKQRRYGSVFKTHVLGCPCVMISSPEAAKFVLVTKSHLFKPTF 105<br />
+GET Q F ++++ +YG+VFKTH+LG P + ++ E + +L+ + HL +<br />
Sbjct 59 IGETGHWLLQGSG-FQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEW 117<br />
Query 106 PASKERMLGKQAIFFHQGDYHSKLRKLVLRAFMPDAIRNMVPHIESIAQESLNSWDG--T 163<br />
P S +LG + GD H RK+ + F +A+ + +P I+ + Q++L +W<br />
Sbjct 118 PRSTRMLLGPNTVSNSIGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPE 177<br />
Query 164 QLNTYQEMKTYTFNVALISILGKDEVYYREDLKRCYYILEKGYN---SMPINLPGTLFHK 220<br />
+N YQE + TF +A+ +LG EDL + + ++ + S+P++LP + + +<br />
Sbjct 178 AINVYQEAQKLTFRMAIRVLLGFS--IPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRR 235<br />
Query 221 AMKARKELAQILANILSKRRQ-----NPSSHTDLL-GSFMEDKAGLTDEQIADNIIGVIF 274<br />
++AR+ L + L + ++ Q + S DLL S E +T +++ D + +IF<br />
Sbjct 236 GIQARQILQKGLEKAIREKLQCTQGKDYSDALDLLIESSKEHGKEMTMQELKDGTLELIF 295<br />
Query 275 AARDTTASVLTWILKYLADNPTVLEAVTEEQMAIRKDKKEG----ESLTWEDTKKMPLTY 330<br />
AA TTAS T ++ L +PTVLE + +E A G +L + +<br />
Sbjct 296 AAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLD 355<br />
Query 331 RVIQETLRAATILSFTFREAVEDVEYEGYLIPKGWKVLPLFRNIHHNADIFSDPGKFDPS 390<br />
VI+E +R T +S +R ++ E +G+ IPKGW V+ R+ H A +F D FDP<br />
Sbjct 356 CVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPD 415<br />
Query 391 RFEVAPKPNT-----FMPFGSGIHSCPGNELAKLEISVLIHHLTTKYRWS---------- 435<br />
RF A + ++PFG G+ +C G LAKL + VL L + R+<br />
Sbjct 416 RFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRIT 475<br />
Query 436 ---IVGPSDGIQYGPFALPQN 453<br />
++ P DG+ F L N<br />
Sbjct 476 LVPVLHPVDGLSVKFFGLDSN 496<br />
>AT5G48375<br />
MKFRALGLVLLLAVETCKAEEITCEETKPFTCNQTDRFNRKHFDDDFIFEGGKGRGLNVWDGFTHRYPEKGGPDLGNGDSTCGSYEHWQKDIDVMTELGV<br />
DGYRFSLAWSRIAPRESNQAGVKYYNDLIDGLLAKNITPFVTLFHWDLPQVLQDEYEGFLNHEIIDDFKDYANLCFKIFGDRVKKWITINQLYTVPTRGY<br />
AMGTDAPEPYIVAHNQLLAHAKVVHLYRKKYKPKQRGQIGVVMITRWFVPYDSTQANIDATERNKEFFLGWFMEPLTKGKYPDIMRKLVGRRLPKFNKKE<br />
AKLVKGSYDFLGINYYQTQYVYAIPANPPNRLTVLNDSLSAFSYENKDGPIGPWFNADSYYHPRGILNVLEHFKTKYGNPLVYITENGELLILSGCNVKG<br />
YFAWCLGDNYELWPSRSFHVSPFYLLHRKDKGAFPSFEA<br />
GENE ID: 197021 LCTL | lactase-like [Homo sapiens] (10 or fewer PubMed l<strong>in</strong>ks)<br />
Score = 246 bits (628), Expect = 7e-65, Method: Compositional matrix adjust.<br />
Identities = 155/416 (37%), Positives = 219/416 (52%), Gaps = 71/416 (17%)<br />
Query 54 GRGLNVWDGFTHRYPEKGGPDLGN--GDSTCGSYEHWQKDIDVMTELGVDGYRFSLAWSR 111<br />
G+G ++WD FTH G LGN D C Y Q+DI ++ EL V+ YRFSL+W R<br />
Sbjct 60 GKGPSIWDVFTH---SGKGKVLGNETADVACDGYYKVQEDIILLRELHVNHYRFSLSWPR 116<br />
Query 112 IAP-----RESNQAGVKYYNDLIDGLLAKNITPFVTLFHWDLPQVLQDEYEGFLNHEIID 166<br />
+ P + N+ G+++Y+DLID LL+ NITP VTL HWDLPQ+LQ +Y G+ N + +<br />
Sbjct 117 LLPTGIRAEQVNKKGIEFYSDLIDALLSSNITPIVTLHHWDLPQLLQVKYGGWQNVSMAN 176<br />
Query 167 DFKDYANLCFKIFGDRVKKWITINQLYTVPTRGYAMGTDAP-------EPYIVAHNQLLA 219<br />
F+DYANLCF+ FGDRVK WIT + + +GY G AP Y AH+ + A<br />
Sbjct 177 YFRDYANLCFEAFGDRVKHWITFSDPRAMAEKGYETGHHAPGLKLRGTGLYKAAHHIIKA 236<br />
Query 220 HAKVVHLYRKKYKPKQRGQIGVVMITRWFVPYD-STQANIDATERNKEFFLGWFMEPLTK 278<br />
HAK H Y ++ KQ+G +G+ + W P D S +++A ER +F LGWF P+<br />
Sbjct 237 HAKTWHSYNTTWRSKQQGLVGISLNCDWGEPVDISNPKDLEAAERYLQFCLGWFANPIYA 296<br />
Query 279 GKYPDIMRKLVGR----------RLPKFNKKEAKLVKGSYDFLGINYYQTQYVYAIPANP 328<br />
G YP +M+ +GR RLP F+ +E +KG+ DFLG+ ++ T+Y+ N<br />
Sbjct 297 GDYPQVMKDYIGRKSAEQGLEMSRLPVFSLQEKSYIKGTSDFLGLGHFTTRYI--TERNY 354<br />
Query 329 PNRLTVLNDSLSAFSYENKDGPIG----PWFNADS---YYHPRGILNVLEHFKTKYGNPL 381<br />
P+R SY+N I W + S Y P G +L +T+YG+P<br />
Sbjct 355 PSR--------QGPSYQNDRDLIELVDPNWPDLGSKWLYSVPWGFRRLLNFAQTQYGDPP 406<br />
Query 382 VYITENG------------------------ELL--ILSGCNVKGYFAWCLGDNYE 411<br />
+Y+ ENG E+L I G N+KGY +W L D +E<br />
Sbjct 407 IYVMENGASQKFHCTQLCDEWRIQYLKGYINEMLKAIKDGANIKGYTSWSLLDKFE 462<br />
>AT5G65540<br />
MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQIQLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSS<br />
VAVSKLNPNYLQLHGDDVYYTLENASLESGFQREGGIRHNPSLTKSLSKPSFTSGTRGSESDFSNLSQRSRFEELPDTWYTQFISRYGFKYGMSVGGQES<br />
DKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLAHMSRSSTHPSSGFDGSTSEDDILFLPETMFRMNCVPETALSPITRTQDNLKTEFYGVLDTLPQVTTRS<br />
HIMIERLGLMPEYHRMEERGVLRSRKAEKMGFSDDQAALVSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKMF<br />
LNTTGYSNLGSLAEIVKDGTRNHPPPNQKQPQVLQQQLHLQQQASLRLPQQIQRQMHPQMQQMVNPQNFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQ<br />
VKLENPSEMAVDGNAFNPMNPRHQQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPVKVEGFEQLMGGDSSLKHDSDDKLR<br />
SPPTK<br />
No significant homologies
ATCG00480<br />
MRTNPTTSNPEEVSIREKKNLGRIAQIIIGPVLDVAFPPGKMP<br />
PNIYNALVVKGRDTLG LGQEINVTCEVQQLLG GNNRVRAVAMSATEGL LKRGMDVVDMGNPLSVV<br />
PVGGATLGRIFFNVLGEPVDNLGPVDTTRTTSPIHKSAPAFIE<br />
ELDTKLSIFETGIKVV VVDLLAPYRRGGKIGL LFGGAGVGKTVLIMEL LINNIAKAHGGVSVFGG<br />
GVGERTREGNDDLYMEMKESGVINEQNNLAESKVALVYGQMNE<br />
EPPGARMRVGLTALTM TMAEYFRDVNEQDVLL LFIDNIFRFVQAGSEV VSALLGRMPSAVGYQPP<br />
TLSTEMGTLQEERITSTKKGSITSIQAAVYVPADDLTDPAPAT<br />
TTFAHLDATTVLSRGL GLAAKGIYPAVDPLDS STSTMLQPRIVGEEHY YETAQQVKQTLQRYKEE<br />
LQDIIAILGLDDELSEEDRLTVARARKKIERFLSQPFFVAEVF<br />
FTGSPGKYVGLAETIRRGFNLILSGEFDSLP<br />
PEQAFYLVGNIDEATA AKATNLEMESKLKK<br />
> ref| |NP_001677.2|<br />
GENE ID: 5506<br />
ATP5B | ATP synthase, H+ tr ransport<strong>in</strong>g, mittochondrial<br />
F1 complex,<br />
beta polypeeptide<br />
[Homo sappiens]<br />
(Over 10 PubMed l<strong>in</strong>ks)<br />
Score = 6667<br />
bits (1721), , Expect = 0.0, , Method: Compossitional<br />
matrix x adjust.<br />
Identitiess<br />
= 341/501 (68% %), Positives = 391/501 (78%), Gaps = 15/501 (2%)<br />
Query 1<br />
Sbjct 33<br />
Query 53<br />
Sbjct 92<br />
Query 113<br />
Sbjct 147<br />
Query 173<br />
Sbjct 207<br />
Query 233<br />
Sbjct 266<br />
Query 293<br />
Sbjct 326<br />
Query 353<br />
Sbjct 386<br />
Query 413<br />
Sbjct 446<br />
Query 473<br />
Sbjct 506<br />
MRTNPTTSNP---------EVSIREKKNLG<br />
GRIAQIIGPVLDVAFP FPPGKMPNIYNALVVK KGR 52<br />
+R PT +P<br />
S + GRI G +IG V+DV F G +P I NAL V+ GR<br />
LRAAPTAVHPVRDYYAAQTSPSPKAGAATG<br />
GRIVAVIGAVVDVQFD FDEG-LPPILNALEVQ QGR 91<br />
DTLGQEINVTCEVQQQLLGNNRVRAVAMSA<br />
ATEGLKRGMDVVDMGN GNPLSVPVGGATLGRIFN<br />
112<br />
+T + EV Q LG + VR +AM TEGL RG V+D G P+ +PVG TLGRI N<br />
ET-----RLVLEVAAQHLGESTVRTIAMDG<br />
GTEGLVRGQKVLDSGA GAPIKIPVGPETLGRIMN<br />
146<br />
VLGEPVDNLGPVDTTRTTSPIHKSAPAFIE<br />
ELDTKLSIFETGIKVV VVDLLAPYRRGGKIGL LFG 172<br />
V+GEP+D GP+ TT+<br />
+PIH AP F+E E+ + I TGIKVV VVDLLAPY +GGKIGL LFG<br />
VIGEPIDERGPIKTTKQFAPIHAEAPEFME<br />
EMSVEQEILVTGIKVV VVDLLAPYAKGGKIGL LFG 206<br />
GAGVGKTVLIMELIINNIAKAHGGVSVFGG<br />
GVGERTREGNDLYMEMMKESGVINEQNLAESKV<br />
232<br />
GAGVGKTVLIMELIINN+AKAHGG<br />
SVF GVGERTREGNDLY G<br />
EMM<br />
ESGVIN ++ A SKV<br />
GAGVGKTVLIMELIINNVAKAHGGYSVFAG<br />
GVGERTREGNDLYHEMMIESGVINLKD-ATSKV<br />
265<br />
ALVYGQMNEPPGARRMRVGLTALTMAEYFR<br />
RDVNEQDVLLFIDNIFFRFVQAGSEVSALLG<br />
GRM 292<br />
ALVYGQMNEPPGARR<br />
RV LT LT+AEYFR RD QDVLLFIDNIFFRF<br />
QAGSEVSALLG GR+<br />
ALVYGQMNEPPGARRARVALTGLTVAEYFR<br />
RDQEGQDVLLFIDNIFFRFTQAGSEVSALLG<br />
GRI 325<br />
PSAVGYQPTLSTEMMGTLQERITSTKKGSI<br />
ITSIQAVYVPADDLTDDPAPATTFAHLDATTVL<br />
352<br />
PSAVGYQPTL+T+MMGT+QERIT+TKKGSI<br />
ITS+QA+YVPADDLTDDPAPATTFAHLDATTVL<br />
PSAVGYQPTLATDMMGTMQERITTTKKGSI<br />
ITSVQAIYVPADDLTDDPAPATTFAHLDATTVL<br />
385<br />
SRGLAAKGIYPAVDDPLDSTSTMLQPRIVG<br />
GEEHYETAQQVKQTLQQRYKELQDIIAILGL<br />
LDE 412<br />
SR +A GIYPAVDDPLDSTS<br />
++ P IVG G EHY+ A+ V++ LQQ<br />
YK LQDIIAILG+ DE<br />
SRAIAELGIYPAVDDPLDSTSRIMDPNIVG<br />
GSEHYDVARGVQKILQQDYKSLQDIIAILGM<br />
MDE 445<br />
LSEEDRLTVARARKKIERFLSQPFFVAEVF<br />
FTGSPGKYVGLAETIRRGFNLILSGEFDSLP<br />
PEQ 472<br />
LSEED+LTV+RARKKI+RFLSQPF<br />
VAEVF FTG GK V L ETI+ +GF IL+GE+D LP PEQ<br />
LSEEDKLTVSRARKKIQRFLSQPFQVAEVF<br />
FTGHMGKLVPLKETIKKGFQQILAGEYDHLP<br />
PEQ 505<br />
AFYLVGNIDEATAKKATNLEME<br />
493<br />
AFY+VG I+EA AKKA<br />
L E<br />
AFYMVGPIEEAVAKKADKLAEE<br />
526<br />
>ATCG00490<br />
MSPQTETKASVVGFKAGVKEYKLTYYTTPEYETKDTDILAAFR<br />
RVTPQPGVPPEEAGAA AAVAAESSTGTWTTVW WTDGLTSLDRYKGRCY YHIEPVPGEETQFIAYY<br />
VAYPLDLFEEGGSVTNMFTSIVGNVFGGFKALAALRLEDLRIP<br />
PPAYTKTFQGPPHGIQQVERDKLNKYGRPLL<br />
LGCTIKPKLGLSAKNY YGRAVYECLRGGLDFTT<br />
KDDENVNSQPFFMRWRDRFLFCAEAIYYKSQAETGEIKGHYLN<br />
NATAGTCEEMIKRAVF VFARELGVPIVMHDYL LTGGFTANTSLSHYCR RDNGLLLHIHRAMHAVV<br />
IDRQKNHGMHFFRVLAKALRLSGGDHIIHAGTVVGKLEGDRES<br />
STLGFVDLLRDDYVEK EKDRSRGIFFTQDWVS SLPGVLPVASGGIHVW WHMPALTEIFGDDSVLL<br />
QFGGGTLGHPWWGNAPGAVANRVALEAACVQARNEGRDLAVEG<br />
GNEIIREACKWSPELA LAAACEVWKEITFNFP PTIDKLDGQE<br />
No significcant<br />
homologies to human protei <strong>in</strong>s<br />
ATP synth hase subunit betta,<br />
mitochondria al precursor [Ho omo sapiens]