01.04.2015 Views

Gene Cloning

Gene Cloning

Gene Cloning

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

190 <strong>Gene</strong> <strong>Cloning</strong><br />

Contig 2<br />

Clone 12<br />

Clone 18<br />

Contig 4<br />

Contig 1<br />

Clone 12 Clone 18<br />

Contig 3<br />

Figure 7.8 Assembly of shotgun sequence reads into contiguous<br />

sequences (contigs). The solid black lines indicate the regions of each insert<br />

that have been sequenced, the dotted lines represent sequence that has not been<br />

determined and is therefore not known. Note some clones have been sequenced<br />

from both ends (e.g. 12 and 18); however, there is still some sequence in the<br />

middle that has not been determined for these clones.<br />

times to give six- to 10-fold coverage. At this point you will have sequenced<br />

tens of thousands of clones, but invariably you will not be able to assemble<br />

your sequence reads into one contiguous sequence, i.e. one contig covering<br />

the whole genome, as you will have gaps in your sequence information.<br />

These gaps will obviously mean that you do not have the complete genome<br />

sequence and also you may not know the relative position of each contig or<br />

their positions on the genome. This “draft” sequence now has to be “finished”.<br />

“Finishing” involves determining the sequence across the gaps to<br />

generate a complete genome sequence. The sequence gaps will fall into<br />

two categories. The first type are called sequencing gaps and these arise<br />

because you have often sequenced one clone from both ends and therefore<br />

have 500 bases of sequence from both ends but no sequence information<br />

from the middle, e.g. clone 12 in Figure 7.8, and it is possible that all or part<br />

of this middle sequence has not been determined from any other clones.<br />

The two different sequence reads from the same clone will therefore fall<br />

into two different contigs because there is no overlapping sequence information<br />

to join up the two reads (e.g. the two reads from clone 12 fall into<br />

contigs 2 and 3). However, you know that the two sequence reads come<br />

from the same clone and so you only need to go back to that clone (clone<br />

12 in this example) and sequence the middle section to fill the gap and join<br />

up the two contigs. Sequencing gaps are thus where you know that you

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!