View - ResearchGate

More documents

Recommendations

Info

Sybil: Multiple Genome Comparison and Visualization 103a. The results did not appear to be overly sensitive to the values chosen (i.e.,small changes in the parameter values in the neighborhood of 80% and 0.6 didnot produce disproportionately large changes in the composition of the resultingprotein clusters).b. The protein clusters produced were—in the judgment of the curator—a goodapproximation of the “true” paralogous families in each of the genomes in question.With respect to condition b it is worth noting that the Jaccard clustering phase ofthe clustering analysis can serve multiple purposes. Its primary goal is to clusterparalogs within each genome and prevent them from confusing the subsequent bidirectionalbest hit analysis. However, the Jaccard clustering phase can be viewed moregenerally as a kind of compression algorithm that eliminates duplicate or near-duplicatepolypeptides and their corresponding genes from the data set. In realistic data setssuch duplicates can be produced by processes other than recent gene duplication. Forexample, in one recent project (22) sequencing was performed on genomic DNAsampled from two distinct haplotypes and in this case the Jaccard clustering was usedto collapse the two extremely similar sets of polypeptides into one, which greatlysimplified the downstream analyses. Incomplete or erroneously assembled sequencecontigs in early versions of draft genomes may also contain small-scale duplicationsthat are artifacts of the assembly process and lead to duplicate gene calls.9. An earlier version of the clustering algorithm relied solely on the second phaseof the clustering process (see Fig. 5), which is acceptable for analyzing compactgenomes with relatively little recent gene duplication. But as a bidirectional besthit analysis is easily confounded by the presence of close paralogs, the initialJaccard clustering phase was introduced and the best hit analysis was modifiedto run on (Jaccard) clusters instead of individual polypeptides (see Fig. 6).10. The “highest-scoring” BLASTP match is determined by comparing BLAST E-values.In the case of a tie one of the matches is picked arbitrarily as the “highest-scoring.”The exact method for doing this is not important, but it should be deterministic sothat the algorithm generates reproducible results. In practice, it should not matterhow such ties are broken, because any two polypeptides that match a third equallywell are likely to be clustered together by the first phase of the algorithm.11. A consequence of using connected components is that the clustering of genes fromgenomes A and B may depend on the other genomes included in the analysis. Forexample, if genomes A, B, and C are clustered and gene A1 is a reciprocal best hitof B1 but not C1, and B1 is a reciprocal best hit of C1 but not A1, then A1, B1,and C1 will be placed in the same cluster. If, however, genome B were notincluded in the analysis then A1 and C1 would not be clustered. At first glance thismay seem to be an undesirable property of the algorithm. However, it is justifiablefrom a logical standpoint, because if it is believed that A1 and B1 are orthologsand B1 and C1 are orthologs then it follows from the definition of the term that itshould also be believed that A1 and C1 are orthologs.12. As particularly large clusters (in terms of the number of proteins) can take muchlonger to run through ClustalW, and may even cause the program to (eventually)fail, a parameter for this phase of the analysis allows the ClustalW computation to
Page 2:
Gene Function Analysis
Page 6:
METHODS IN MOLECULAR BIOLOGYGene Fu
Page 12:
PrefaceThis volume of Methods in Mo
Page 16:
Prefaceixcolleagues demonstrate how
Page 20:
xiiContentsPART III EXPERIMENTAL ME
Page 26:
ICOMPUTATIONAL METHODS I
Page 34:
4 BidautTable 1Input File Format Us
Page 38:
6 BidautTable 2Folder Layout to Use
Page 42:
8 Bidaut• alphaA: this is the num
Page 46:
10 Bidautcomputing the maximum corr
Page 50:
12 BidautFig. 3. The complete Clutr
Page 54:
Table 3Some Identified Patterns (5,
Page 58:
16 BidautFig. 4. This is a comparis
Page 62:
18 BidautReferences1. Hughes, T. R.
Page 66:
20 Kirov et al.way to associate gen
Page 70:
22 Kirov et al.based on a study ass
Page 74:
24 Kirov et al.1. Retrieve the gene
Page 78:
26Fig. 1. Functional associations f
Page 82:
28 Kirov et al.Fig. 2. Pathway anal
Page 86:
30 Kirov et al.3. Gene symbols usag
Page 90:
32 Kirov et al.9. OBO_Team, Open Bi
Page 94:
3Estimating Gene Function With Leas
Page 98:
Estimating Gene Function With LS-NM
Page 102:
Page 106:
Page 110:
Page 114:
Page 118:
Page 122:
50 Gonye et al.activity and problem
Page 126:
52 Gonye et al.Currently, PAINT can
Page 130:
54 Gonye et al.dynamic nature of th
Page 136:
Prediction Using PAINT 57represente
Page 140:
Prediction Using PAINT 59In PAINT,
Page 144:
Prediction Using PAINT 6114. On the
Page 148:
Prediction Using PAINT 634.2. Size
Page 152:
65Fig. 4. Localization of enrichmen
Page 156:
Prediction Using PAINT 673. Okubo,
Page 160:
5Prediction of Intrinsic Disorder a
Page 164:
Prediction of ID and Its Use in Fun
Page 168:
Table 1Summary of the Web Servers O
Page 172:
Page 176:
Page 180: Prediction of ID and Its Use in Fun
Page 208: IICOMPUTATIONAL METHODS II
Page 212: 94 Crabtree et al.genomes, which is
Page 216: 96 Crabtree et al.Fig. 2. Sybil pro
Page 220: 98 Crabtree et al.Fig. 3. Computing
Page 224: 100 Crabtree et al.3.1.5.1. FILTER
Page 228: 102 Crabtree et al.3. For the sake
Page 234: Sybil: Multiple Genome Comparison a
Page 238: Sybil: Multiple Genome Comparison a
Page 242: 7Estimating Protein Function Using
Page 246: Estimating Protein Function Using P
Page 282:
130 Davuluriinteracting proteins an
Page 286:
Table 1Web URLs of Promoter, TF Dat
Page 290:
134 DavuluriPWM-based models do not
Page 294:
136 DavuluriTF-map alignments of or
Page 298:
138 Davuluridiscussed which program
Page 302:
140 DavuluriTable 2ER-a-Responsive
Page 306:
Table 3Sample Data Matrix Represent
Page 310:
Table 3 (Continued)Class MYCMAX MYC
Page 314:
146 DavuluriFig. 3. (A) CART Tree:
Page 318:
148 Davuluri11. Vlieghe, D., Sandel
Page 322:
150 Davuluri44. Berezikov, E., Gury
Page 326:
9Mining Biomedical Data Using MetaM
Page 330:
Mining Biomedical Data Using MMTx a
Page 334:
Page 338:
Page 342:
Page 346:
Page 350:
Page 354:
Page 358:
Page 362:
172 Ho et al.Fig. 1. Artificial exa
Page 366:
174 Ho et al.allowing for cases whe
Page 370:
176 Ho et al.A different measure is
Page 374:
178 Ho et al.3.1.3. LA and Generali
Page 378:
180 Ho et al.The ECF-statistic can
Page 382:
182 Ho et al.In the special case of
Page 386:
184 Ho et al.Fig. 5. An illustratio
Page 390:
186 Ho et al.Fig. 7. The power curv
Page 394:
188 Ho et al.this section were not
Page 398:
190 Ho et al.References1. Schena, M
Page 402:
IIIEXPERIMENTAL METHODS
Page 406:
194 Caldwell et al.for sequences th
Page 410:
196 Caldwell et al.query because it
Page 414:
198 Caldwell et al.Fig. 1. (A) Prot
Page 418:
200 Caldwell et al.outside primer o
Page 422:
202 Caldwell et al.5. Targeting scr
Page 426:
204 Caldwell et al.will allow the s
Page 430:
206 Caldwell et al.3.1.6. Plasmid P
Page 434:
208 Caldwell et al.PCR amplify the
Page 438:
210 Caldwell et al.8. Thawing cells
Page 442:
212 Zhang et al.Going one step beyo
Page 446:
214 Zhang et al.Fig. 2. Generation
Page 450:
216 Zhang et al.Perform PCR cycles,
Page 454:
218 Zhang et al.Fig. 4. Schematic m
Page 458:
220 Zhang et al.Fig. 5. Replacement
Page 462:
13Construction of Simple and Effici
Page 466:
DNA Vector-Based shRNA-Expression S
Page 470:
Page 474:
Page 478:
Page 482:
Page 486:
Page 490:
Page 494:
Page 498:
Page 502:
244 Hust et al.overcome by two appr
Page 506:
246 Hust et al.Fig. 1. Schematic de
Page 510:
248 Hust et al.interaction during p
Page 514:
250 Hust et al.3.4. Titering1. Inoc
Page 518:
252 Hust et al.10. Shortly before u
Page 522:
254 Hust et al.activity by preservi
Page 526:
15A Bacterial/Yeast Merged Two-Hybr
Page 530:
Screening in Yeast With a Bacterial
Page 534:
Page 538:
Page 542:
Page 546:
Page 550:
Page 554:
Page 558:
Page 562:
Page 566:
Page 570:
Page 574:
Page 578:
Page 582:
Page 586:
Page 590:
Page 594:
16A Bacterial/Yeast Merged Two-Hybr
Page 598:
Dual Bait-Compatible Bacterial Two-
Page 602:
Page 606:
Page 610:
Page 614:
Page 618:
Page 622:
Page 626:
Page 630:
Page 634:
Page 638:
Page 642:
Page 646:
318 Thibodeau-Beganny and Joungbeen
Page 650:
320 Thibodeau-Beganny and JoungFig.
Page 654:
Page 658:
324 Thibodeau-Beganny and JoungTypi
Page 662:
Page 666:
328 Thibodeau-Beganny and JoungPCR
Page 670:
330 Thibodeau-Beganny and Joung16-1
Page 674:
332 Thibodeau-Beganny and Joung2. P
Page 678:
334 Thibodeau-Beganny and Joung11.
Page 682:
336 IndexKknockin (gene knockin) 19
show all

View - ResearchGate

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?