01.04.2015 Views

Gene Cloning

Gene Cloning

Gene Cloning

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Genome Organization 11<br />

both of which have arisen through the duplication of regions of the<br />

genome during evolution. A duplicated region can either be on the same or<br />

a separate chromosome.<br />

Introns<br />

Most eukaryotic genes are made up of stretches of DNA that code for<br />

amino acids, interrupted by non-coding sequences. The coding and noncoding<br />

sections are called exons and introns respectively. An example of<br />

eukaryotic gene structure is shown in Figure 2.1, where the gene indicated<br />

has three exons separated by two introns. Eukaryotic RNA polymerases<br />

transcribe both the exons and introns to yield an RNA molecule called the<br />

primary transcript; it requires further processing before it can be used as a<br />

template for translation (Figure 2.2). This processing includes removal of<br />

the introns in a process called splicing, which edits the exons into a contiguous<br />

coding sequence that can be translated by the ribosome. Introns<br />

make up 24–28% of the human genome, and the “average” human gene is<br />

encoded by nine small exons of 145 bp each separated by large introns of<br />

over 3 kb each.<br />

The existence of introns has many implications for gene cloning and<br />

analysis. Most genetic engineering requires the manipulation of a<br />

sequence that is devoid of introns, i.e. is a single contiguous coding<br />

sequence. As you can see from Figure 2.2a, the mature messenger RNA<br />

contains an uninterrupted copy of the protein coding sequence. Therefore<br />

the easiest way to obtain a contiguous coding sequence is to isolate messenger<br />

RNA and to then create a DNA copy. A DNA copy of messenger RNA<br />

is called cDNA, and the way in which this is produced and used will be discussed<br />

in Section 4.18. Another issue that is raised by introns is the problem<br />

of identifying genes within whole genome sequences. This is made<br />

particularly difficult because of the fact that small exons are separated by<br />

very large introns and predicting exon-intron boundaries is a difficult challenge.<br />

The problem of identifying the coding sequences within genomic<br />

kb<br />

0 10 20 30 40 50 60 70 80 90 100<br />

<strong>Gene</strong>s<br />

CpG<br />

SSR<br />

ITDR<br />

Figure 2.1 Schematic representation of 100 kb of a human chromosome. The four rows<br />

represent different classes of sequence found within the genome. This region of the genome<br />

contains only one gene, the exons are shown as blocks and the introns as lines. The position of<br />

simple sequence repeats (SSRs), interspersed transposon-derived repeats (ITDRs) and CpG islands<br />

(CpG) are indicated by the blocks.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!