12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf




128 P. Tompa

5.5 Prediction of the Function of IDPs

As suggested by the foregoing considerations, reliable all-round prediction of the functions of IDPs is still a long way off, and we have only taken the first steps towards this goal. As discussed in the next section, there are several approaches that may shed some light on the function of an IDP not yet experimentally characterized. Functional correlation of the global pattern of disorder (Lobley et al. 2007), sequence-based prediction of short LMs by a variety of algorithms (Davey et al. 2006; Neduva and Russell 2006), prediction of MoRFs in IDPs/IDRs (Mohan et al. 2006; Vacic et al. 2007), and combination of sequence information with disorder (Iakoucheva et al. 2004; Radivojac et al. 2006) are reasonable approaches to assess the function of an unknown piece of disordered protein.

5.5.1 Correlation of Disorder Pattern and Function

Jones and colleagues have taken a direct approach to find associations between the global pattern of disorder and the function of a protein (Lobley et al. 2007) described by standard Gene Ontology (GO) categories. It was first found that both location and length descriptors of disorder correlate with functional categories associated with signal transduction and transcription regulation. Both molecular function (MF) and biological process (BP) annotations were used.
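
To make the idea of location and length descriptors concrete, the following is a minimal Python sketch, not the procedure of Lobley et al. (2007), whose exact feature definitions are not given here. It assumes a per-residue disorder prediction is already available as a list of booleans. The split of the chain into thirds, the length bins, and all function names are illustrative assumptions. The 50- and 500-residue cut-offs simply mirror the region lengths discussed in this section.

```python
from itertools import groupby


def disordered_regions(mask):
    """Return (start, length) for each run of consecutive disordered residues."""
    regions = []
    pos = 0
    for is_disordered, run in groupby(mask):
        n = len(list(run))
        if is_disordered:
            regions.append((pos, n))
        pos += n
    return regions


def location_descriptors(mask):
    """Fraction of disordered residues in each third of the chain.

    Splitting into thirds is an assumption made for illustration only.
    """
    n = len(mask)
    bounds = [0, n // 3, 2 * n // 3, n]
    names = ["n_term", "middle", "c_term"]
    out = {}
    for name, lo, hi in zip(names, bounds, bounds[1:]):
        segment = mask[lo:hi]
        out[name] = sum(segment) / len(segment) if segment else 0.0
    return out


def length_descriptors(mask, short_max=50, long_min=500):
    """Count disordered regions per length bin (bin edges are assumptions)."""
    lengths = [length for _, length in disordered_regions(mask)]
    return {
        "short": sum(1 for x in lengths if x <= short_max),
        "medium": sum(1 for x in lengths if short_max < x <= long_min),
        "long": sum(1 for x in lengths if x > long_min),
    }


# Hypothetical 60-residue protein with one 30-residue disordered region.
example = [False] * 10 + [True] * 30 + [False] * 20
features = {**location_descriptors(example), **length_descriptors(example)}
```

A feature dictionary of this kind could be appended to sequence-derived features before training a classifier; which descriptors actually carry signal for a given GO category would have to be established empirically.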
The location descriptors displayed several trends associated with GO categories, such as an elevated level in the middle of the protein in transcription regulator, DNA binding, and RNA pol II transcription factor functions, in the C-terminus in transcription factor activator, transcription factor repressor, and transcription factor or in the N-terminus in potassium channel annotated proteins. Length descriptors showed even more significant associations with function than position descriptors. For example, disordered regions of more than 500 continuous residues are over-represented in transcription-related categories, whereas shorter regions of the order of 50 residues or fewer are over-represented in proteins performing metal ion binding, ion channel, and GTPase regulatory functions. The observed associations could be used to improve prediction of protein function: when an SVM predictor was applied to 26 GO categories, the prediction of 11 BP categories and 12 MF categories improved on addition of disorder features. In all, disorder adds significantly to the prediction of protein function, with more significant improvements observed in BP than in MF classification.

5.5.2 Predicting Short Recognition Motifs in IDRs

A completely different but relevant approach is to predict short sequence motifs in IDPs/IDRs, which may then be directly related to certain functions, such as post-translational modification or binding to cognate partners. As suggested
