Unit 2
Unit 2
UNIT II
DNA LIBRARY (GENETIC LIBRARY)
DNA Library or Gene Library is simply the collection of DNA fragments cloned
into vectors and stored within host organisms. They contain either the entire genome of a
particular organism or the genes that are expressed at a given time.
The genome is vast and complex. To understand the entire genome or study specific genes, it is
important to study it in smaller and more manageable fragments. DNA libraries make the entire
genome accessible in small fragments. These libraries are used to identify, isolate, and study
particular genes of interest.
The development of molecular biology technologies including recombinant DNA (rDNA)
technology, cloning vectors, and techniques for transforming bacteria laid the foundation for the
construction of DNA libraries.
Types of DNA Library
Genomic information can be obtained by two primary methods. Based on this, DNA libraries are
divided into two types: genomic and cDNA library.
1. Genomic Library
A genomic DNA library is a collection of DNA fragments that represent all genetic
information of an organism. This includes both coding and noncoding regions of the DNA.
Genomic libraries are suitable for a wide range of applications, including genome mapping
and comparative genomics. It allows the study of regulatory elements and noncoding
sequences that are important in gene expression and regulation.
Genomic libraries also have certain disadvantages including the complexity and size of
handling large DNA fragments and the resource-intensive process of creating and
maintaining such libraries.
Construction (Preparation) of Genomic Library
1. Isolation of Genomic DNA
The construction of a genomic library begins with the isolation of genomic DNA from the
organism of interest.
Genomic DNA can be isolated using different methods such as cell lysis, protein digestion,
and phenol-chloroform extraction.
The isolated DNA represents the entire genome of the organism and contains both coding
and non-coding regions.
2. Fragmentation of Genomic DNA
The isolated genomic DNA obtained is often too large to be cloned directly into vectors.
So, it needs to be fragmented into smaller fragments suitable for cloning.
This fragmentation can be achieved using physical methods such as sonication, mechanical
shearing, or enzymatic methods involving restriction enzymes.
3. Cloning
The fragmented genomic DNA is then cloned into a suitable vector. Some common vectors
used for genomic library construction include plasmids, bacteriophages, bacterial artificial
chromosomes (BACs), and yeast artificial chromosomes (YACs).
Bacterial vectors are suitable for smaller DNA fragments, while YACs are used for larger
DNA fragments.
The vectors are treated with restriction enzymes to create sticky ends that are compatible
with the fragmented DNA.
DNA ligase enzyme is used to bind the DNA fragments to the vector, creating recombinant
DNA molecules.
4. Transformation
The recombinant vectors containing the cloned genomic DNA fragments are transformed
into a suitable host organism, usually E. coli and yeast.
The transformed host cells take up the recombinant vectors and are cultured on agar plates
containing selective media to allow the growth of colonies containing the recombinant
DNA. These colonies form a genomic DNA library.
Colony Hybridization
Probes used for hybridization
DNA or synthetic oligonucleotide probes can be used for identification of a clone from a genomic
library. A common method of labeling probes is the incorporation of a radioactive or other marker
into the molecule. A number of alternative labeling methods are also available that involve an
amplification process to detect the presence of small quantities of bound probe and avoid the use
of radioactivity. These methods involve the incorporation of chemical labels such as digoxigenin
or biotin into the probe which can be used for detection
Colony hybridization Colony hybridization, also known as replica plating, allows the screening of
colonies plated at high density using radioactive DNA probes. This method can be used to screen
plasmid or cosmid based.
Plaque hybridization
Plaque hybridization, also known as Plaque lift, was developed by Benton and Davis in
1977 and employs a filter lift method applied to phage plaques. This procedure is successfully
applied to the isolation of recombinant phage by nucleic acid hybridization and probably is the
most widely applied method of library screening. The method of screening library by plaque
hybridization is described below-
• The nitrocellulose filter is applied to the upper surface of agar plates, making a direct contact
between plaques and filter.
• The plaques contain phage particles, as well as a considerable amount of unpackaged
recombinant DNA which bind to the filter.
• The DNA is denatured, fixed to the filter, hybridized with radioactive probes and assayed by
autoradiography.
Advantages
• This method results in a ‘cleaner’ background and distinct signal (less background probe
hybridization) for λ plaque screening due to less DNA transfer from the bacterial host to the
nitrocellulose membrane while lifting plaques rather than bacterial colonies.
• Multiple screens can be performed from the same plate as plaques can be lifted several times.
• Screening can be performed at very high density by screening small plaques. High-density
screening has the advantage that a large number of recombinant Genetic Engineering &
Applications clones can be screened for the presence of sequences homologous to the probe in a
single experiment.
.
Immunological screening
This involves the use of antibodies that specifically recognize antigenic determinants on
the polypeptide. It does not rely upon any particular function of the expressed foreign protein, but
requires an antibody specific to the protein. The secondary antibody recognizes the constant
region of the primary antibody and is, additionally, conjugated to an easily assayable enzyme (e.g.
horseradish peroxidase or alkaline phosphatase) which can be assayed using colorimetric change
or emission of light using X-ray film.
• In this technique, the cells are grown as colonies on master plates and transferred to a solid
matrix. • These colonies are subjected to lysis releasing the proteins which bind to the matrix.
• These proteins are treated with a primary antibody which specifically binds to the protein (acts as
antigen), encoded by the target DNA. The unbound antibodies are removed by washing.
• A secondary antibody is added which specifically binds to the primary antibody removing the
unbound antibodies by washing.
• The secondary antibody carries an enzyme label (e.g., horse radishperoxidase or alkaline
phosphatase) bound to it which converts colorless substrate to colored product. The colonies with
positive results (i.e. colored spots) are identified and subcultured from the master plate.
Screening by PCR
PCR screening is employed for the identification of rare DNA sequences in complex
mixtures of molecular clones by increasing the abundance of a particular sequence. It is
possible to identify any clone by PCR only if there is available information about its
sequence to design suitable primers.
Preparation of a library for screening by PCR can be done by following ways-
● The library can be plated as plaques or colonies on agar plates and individually inoculated
into the wells of the multi-well plate. However it is a labor intensive process and can lead to
bias in favor of larger colonies or plaques.
● The alternative method involves diluting the library. It involves plating out a small part of
the original library (the packaging mix for a phage library, transformation for a plasmid
library) and calculating the titer of the library. A larger sample is diluted to give a titer of 100
colonies per mL. Dispensing 100 μL into each well theoretically gives 10 clones in each well.
These are then pooled and PCR reactions are carried out with gene-specific primers flanking
a unique sequence in the target to identify the wells containing the clone of interest. This
method is often used for screening commercially available libraries.
Applications
o BAC vectors have been used in studying large double-stranded DNA viruses for
academic research and d as a tool to develop advanced vaccines.
o BAC are utilized for the construction of genomic libraries. They have been used for
basic science research to industrial research like animal husbandry.
o BAC are also used in determining phylogenetic lineage between different species.
o They are also used in the study of horizontal gene transfer.
Advantages of BACs:
• They are capable of accommodating large sequences without any risk of rearrangement.
• BACs are frequently used for studies of genetic or infectious disorders.
• High yield of DNA clones is obtained.
Disadvantages of BACs:
• They are present in low copy number.
• The eukaryotic DNA inserts with repetitive sequences are structurally unstable in BACs
often resulting in deletion or rearrangement.
The primary components of a YAC are the ARS, centromere, and telomeres
from Saccharomyces cerevisiae.
ARS: This sequence in YAC is an autonomously replicating sequence.
CEN: This represents the yeast centromere region. This ensures chromosome partitioning
between two daughter cells and a selective marker gene.
TEL: TEL region provides the telomeres. They are not complete telomere sequences but
once they go inside the yeast nucleus they act as seeding sequences in which telomeres build.
Additionally, selectable marker genes, such as antibiotic resistance and a visible marker, are
utilized to select transformed yeast cells. Without these sequences, the chromosome will not
be stable during extracellular replication, and would not be distinguishable from colonies
without the vector.
Structure of Yeast Artificial Chromosomes
A yeast artificial chromosome cloning vector consists of two copies of a yeast telomeric
sequence (telomeres are the sequences at the ends of chromosomes), a yeast centromere, a
yeast ars (an autonomously replicating sequence where DNA replication begins), and
appropriate selectable markers.
Working Principle of Yeast Artificial Chromosomes
The yeast artificial chromosome, which is often shortened to YAC, is an artificially
constructed system that can undergo replication. The design of a YAC allows extremely large
segments of genetic material to be inserted. Subsequent rounds of replication produce many
copies of the inserted sequence, in a genetic procedure known as cloning.
The principle is similar to that for plasmids or cosmids.
The experimenter introduces some typical elements that are necessary for correct
replication.
In the case of YACs, the replication origins are the centromeres and telomeres of the
yeast chromosomes, which must be inserted into the DNA being cloned.
The constructs can be transformed in yeast Spheroplast and are then replicated there.
In contrast to the vectors, YACs are not circular; they are made of linear DNA.
Process of Yeast Artificial Chromosomes
YAC vector is initially propagated as circular plasmid inside bacterial host utilizing
bacterial ori sequence.
The circular plasmid is cut at a specific site using restriction enzymes to generate a
linear chromosome with two telomere sites at terminals.
The linear chromosome is again digested at a specific site with two arms with
different selection marker.
The genomic insert is then ligated into YAC vector using DNA ligase enzyme.
The recombinant vectors are transformed into yeast cells and screened for the
selection markers to obtain recombinant colonies.
CHROMOSOME WALKING
When a probe is used for the identification of a gene sequence in a genomic library,
the probe may hybridize with a number of clones, each carrying a part of a large gene
fragmented during preparation of genomic library. If we obtain partial digests (by digesting
the DNA only partially) from the genome, different genomes (from large number of cells)
may give fragments which have overlapping sequences, because sites cleaved in different
genomes of the same organism, will differ being random. Since none of these fragments may
have its entire sequence represented in the probe, overlapping sequences may be used to
construct the original genomic sequence. Identification of fragments with an overlapping
sequence may be a key to the reconstruction or characterization of large chromosome regions.
This is achieved by the technique popularly called chromosome walking.
Introduction
Chromosome walking is a method of positional cloning used to find, isolate, and
clone a particular allele in a gene library. Chromosome Walking was developed by Welcome
Bender, Pierre Spierer, and David S. Hogness in the early 1980's. There are nearly half a
dozen positional cloning tests that are done prior to a chromosome walk. Each clone in the
cosmic library has a DNA insert of 50 KB.The walking starts at the closest gene that has
already been identified, known as a marker gene.
Once the markers on either side of an unmapped sequence are found, the chromosome
walk can begin from one of the markers. Each successive gene in the sequence is tested
repeatedly, known as overlap restrictions and mapped for their precise location in the
sequence. Eventually, walking through the genes reaches the mutant gene in an unmapped
sequence that binds to a fragment of a gene of that particular disease. The testing on each
successive clone is complex, time-consuming, and varied by species. This series of
overlapping clones could for example consist of Bacterial Artificial Chromosomes.
Hybridization Probe
A more straightforward approach thus is to use the insert DNA from the starting clone as a
hybridization probe to screen all the other clones in the library. Positive hybridization signals
that are given by clones, whose inserts overlap with the probe, are used as new probes to
continue the walk. There are about 96 clones that a library consists of and each clone contains
a different insert. A probe may have a genome wide repetition of sequences. This can be
reduced by blocking the repeat sequence with pre-hybridization with unlabeled genomic
DNA. But this isn’t that affective solution especially in the case when high capacity vectors
such as BACs or YACs are used in the walk. Therefore for chromosome walks with human
DNA which have a high rate of repetition, intact inserts are not used in general. Instead the
probe is taken from the end of an insert which has a lesser chance of repetition. The walk can
also be sped up by using the PCR instead of hybridization.
Disadvantages
There is a limitation to the speed of chromosome walking because of the small size of
the fragments that are to be cloned.
Another limitation is the difficulty of walking through the repeated sequence that are
scattered through the gene.
If the markers were too far away, it simply was not a viable option.
Additionally, chromosome walking could easily be stopped by unclonable sections of
DNA.
A solution to this problem was achieved with the advent of chromosome jumping
(Marx, 1989), which allows the skipping of unclonable sections of DNA.