BIO 301 Lecture Notes 2024-25
BIO 301 Lecture Notes 2024-25
1
specifically the chromosomes within it, contained the factors responsible for inheritance.
Weismann’s ideas were revolutionary because they connected the physical structures
observed in cells (chromosomes) with the abstract concept of heredity proposed by
Mendel.
In the early 20th century, American geneticist Thomas Hunt Morgan and his team used
the fruit fly (Drosophila melanogaster) to study the role of chromosomes in heredity.
Morgan’s experiments provided definitive proof of the Chromosome Theory of
Inheritance. Morgan discovered sex-linked traits, such as eye colour in fruit flies, which
were inherited differently between males and females. He linked these traits to the X
chromosome, demonstrating that specific genes are located on specific chromosomes.
Morgan’s student, Calvin Bridges, provided further evidence by studying chromosomal
abnormalities and their effects on inheritance.
Griffith’s Transformation Experiment
Frederick Griffith, a British bacteriologist, in 1928 was studying the bacterium
Streptococcus pneumoniae (pneumococcus) to understand the mechanisms behind
bacterial virulence and immunity. His experiments unexpectedly revealed a phenomenon
he called transformation, which demonstrated that genetic material could be transferred
between bacteria, altering their characteristics.
Griffith worked with two strains of Streptococcus pneumoniae. The smooth (S) strain
produced a polysaccharide capsule that makes it virulent (disease-causing). The capsule
protects the bacteria from the host’s immune system, allowing it to cause pneumonia in
mice while the rough (R) strain lacks the polysaccharide capsule and is non-virulent.
Without the capsule, the bacteria are easily destroyed by the host’s immune system.
When mice were injected with live S strain bacteria, they developed pneumonia and died
and when they were injected with live R strain bacteria, they survived, as the bacteria
were non-virulent. When mice were injected with heat-killed S strain bacteria, they
survived, as the bacteria were no longer viable and when mice were injected with a
mixture of heat-killed S strain and live R strain bacteria, they unexpectedly developed
pneumonia and died. Moreover, live S strain bacteria were recovered from the dead mice.
Griffith concluded that something in the heat-killed S strain bacteria had transformed the
live R strain bacteria into the virulent S strain. He called this phenomenon transformation
and hypothesized that a "transforming principle" from the dead S strain bacteria was
responsible for converting the non-virulent R strain into the virulent S strain.
2
In 1944, American scientists Oswald Avery, Colin MacLeod, and Maclyn McCarty built on
Griffith’s work to identify the transforming principle. They conducted a series of
experiments to isolate and test the components of the heat-killed S strain bacteria. Their
experiments included:
1. Enzymatic Degradation: They treated the heat-killed S strain extract with enzymes
(proteases to degrade proteins, RNases to degrade RNA, DNases to degrade DNA) that
destroyed proteins, RNA, and DNA separately.
2. Transformation Test: They mixed the treated extracts with live R strain bacteria and
tested for transformation.
3. Results: Transformation only occurred when DNA was present. When DNA was
destroyed, transformation did not occur, even if proteins or RNA were intact.
Avery, MacLeod, and McCarty concluded that DNA was the molecule responsible for
transformation, providing the first direct evidence that DNA carries genetic information.
This discovery challenged the prevailing belief that proteins were the carriers of genetic
information and paved the way for the discovery of the double-helix structure of DNA.
The Hershey-Chase experiment, conducted in 1952 by Alfred Hershey and Martha Chase,
was a fundamental study in molecular biology that provided conclusive evidence that
DNA, not protein, is the genetic material responsible for heredity. This experiment built
on the work of earlier scientists, such as Frederick Griffith and Avery-MacLeod-McCarty,
and helped resolve the debate over whether DNA or proteins carry genetic information.
Hershey and Chase used bacteriophages (viruses that infect bacteria) to demonstrate that
DNA is the molecule responsible for transmitting genetic information during infection.
Hershey and Chase used bacteriophage T2, a virus that infects Escherichia coli bacteria,
to determine whether protein or DNA directs the production of new viruses.
Bacteriophages consist of a protein coat (capsid) surrounding a DNA core. To track the
fate of these components, they employed radioactive labelling. Phages were grown in a
medium containing radioactive sulphur-35 (³⁵S) to label the protein coat, as sulphur is
found in the amino acids cysteine and methionine but not in DNA, and in a separate
medium containing radioactive phosphorus-32 (³²P) to label the DNA, as phosphorus is
a component of DNA but not proteins. These labelled phages were then used to infect E.
coli. Following infection, the mixtures were agitated in a blender to dislodge phage
particles from the bacterial surface and centrifuged to separate the heavier bacterial cells
from lighter phage particles. Analysis showed that for ³⁵S-labeled phages, radioactivity
was found in the supernatant, indicating that the protein coat did not enter the bacteria.
In contrast, with ³²P-labeled phages, radioactivity was detected inside the bacterial cells,
demonstrating that DNA, not protein, entered the bacteria to direct the production of new
viruses. The results demonstrated that DNA, not protein, is the genetic material that
enters the bacterial cell and directs the production of new phages. This provided
conclusive evidence that DNA is the molecule responsible for heredity.
3
Hershey-Chase experiment
Chargaff’s Discovery
Erwin Chargaff, an Austrian-American biochemist, made significant contributions to the
understanding of DNA structure and composition in the 1940s and 1950s. Through his
meticulous analysis of the base composition of DNA from various organisms, Chargaff
formulated two key principles, now known as Chargaff’s Rules. These rules provided
insights into the structure of DNA and played a pivotal role in the discovery of the double-
helix model by James Watson and Francis Crick in 1953.
Chargaff and his team used a technique called paper chromatography to separate and
quantify the four nitrogenous bases in DNA: adenine (A), thymine (T), cytosine (C), and
guanine (G). They analysed DNA from a wide range of organisms, including bacteria,
plants, and animals, to determine the relative amounts of these bases. Chargaff’s analysis
led to the formulation of two fundamental rules:
1. Base Pairing Rule: the amount of adenine (A) is equal to the amount of thymine (T) and
the amount of cytosine (C) is equal to the amount of guanine (G).
In mathematical terms: A = T and C = G.
This rule indicates a specific pairing relationship between the bases, which was later
explained by the complementary base pairing in the DNA double helix.
2. Species-Specific Variation: the ratio of A+T to C+G varies between different species.
However, within a species, the ratio remains constant across different tissues and is
characteristic of that species, highlighting the diversity of DNA composition among
organisms while emphasizing the consistency within a single species.
4
Watson and Crick’s Double Helix Model of DNA
The discovery of the double helix structure of DNA by James Watson and Francis Crick in
1953 is one of the most significant achievements in the history of science. This model
provided the first accurate representation of DNA’s molecular structure and explained
how genetic information is stored, replicated, and transmitted.
In the early 1950s, scientists knew that DNA was the molecule responsible for heredity,
thanks to experiments like those of Avery, MacLeod, McCarty, and Hershey-Chase.
However, the three-dimensional structure of DNA remained unknown. Several
researchers, including Linus Pauling, Rosalind Franklin, and Maurice Wilkins, were
working to uncover this structure. Watson and Crick’s success was built on the
contributions of these scientists, particularly Franklin’s X-ray diffraction images of DNA.
Franklin, a British biophysicist, produced high-resolution X-ray diffraction images of DNA
fibres. The images revealed a helical structure with a regular, repeating pattern,
suggesting that DNA had a helical shape. Linus Pauling had previously discovered the
alpha-helix structure of proteins, which inspired Watson and Crick to consider a helical
structure for DNA while Erwin Chargaff’s discovery that A = T and C = G in DNA provided
clues about base pairing, which Watson and Crick incorporated into their model.
Watson and Crick proposed that DNA consists of two strands that wind around each other
to form a double helix. The two strands run in opposite directions (antiparallel), one in
the 5' to 3' direction, while the other runs in the 3' to 5' direction. The outer edges of the
helix are formed by alternating sugar (deoxyribose) and phosphate molecules, which
create a stable backbone while the interior of the helix contains the nitrogenous bases:
adenine (A), thymine (T), cytosine (C), and guanine (G). The bases on one strand pair with
complementary bases on the other strand through hydrogen bonds.
Characteristics of the Genetic Material
Genetic material must fulfil certain important criteria to support the development,
functioning, reproduction, and evolution of organisms. It must store vast amounts of
information, a role DNA fulfils through its sequence of nitrogenous bases (A, T, C, G),
which encode genetic instructions that determine traits. It must also replicate accurately,
as DNA does via a semi-conservative mechanism where each strand serves as a template
for synthesizing a complementary strand, ensuring precise inheritance during cell
division. The ability to transmit information is another essential criterion, and DNA
achieves this by being packaged into chromosomes and passed to offspring during
meiosis.
Furthermore, the genetic material should reserve the ability to undergo mutations to
enable genetic variation, a key driver of evolution, which DNA achieves through base
sequence changes such as substitutions, insertions, and deletions. Stability is also vital,
and DNA’s double-helix structure, strong covalent bonds in its sugar-phosphate backbone,
and repair mechanisms maintain the integrity of genetic information over time.
Additionally, DNA must express information by directing the synthesis of proteins and
other molecules through transcription and translation, where its base sequence
determines amino acid sequences in proteins essential for cellular functions. Finally, DNA
5
must be universal, present in all living organisms and functioning similarly across species,
from bacteria to plants and animals, using the same principles of replication,
transcription, and translation.
Molecular Basis of the Genetic Material
The molecular basis of genetic material revolves around the structure, function, and
transmission of DNA and ribonucleic acid (RNA), which are the molecules responsible for
storing and expressing genetic information in living organisms.
Structure of DNA
DNA is composed of repeating units called nucleotides, which serve as its building blocks.
Each nucleotide consists of three components: a phosphate group, a sugar molecule
(deoxyribose), and a nitrogenous base. The nitrogenous bases include adenine (A),
thymine (T), cytosine (C), and guanine (G).
Pyrimidines Purines
These nucleotides are linked together through phosphodiester bonds, forming a single
strand of DNA with a sugar-phosphate backbone and projecting nitrogenous bases. The
sequence of these bases along the DNA strand encodes genetic information, which is
essential for the development, functioning, and reproduction of all living organisms.
Structure of a nucleotide
6
Phosphodiester bond
Two complementary DNA strands come together to form the double helix structure. The
strands are held together by hydrogen bonds between the nitrogenous bases, with
adenine pairing with thymine and cytosine pairing with guanine. This complementary
base pairing ensures the accurate replication and transmission of genetic information.
The two strands run in opposite directions, a configuration known as antiparallel, with
one strand running 5' to 3' and the other running 3' to 5'. The double helix twists into a
spiral shape, with approximately 10 base pairs per turn, creating major and minor
grooves that provide binding sites for proteins and other molecules.
7
Structure and Composition of the DNA
The stability of the DNA double helix is maintained by several forces, including hydrogen
bonds between base pairs, hydrophobic interactions that push the bases inward, van der
Waals forces between stacked bases, and electrostatic interactions that neutralize the
negatively charged phosphate backbone. DNA can also adopt different structural
conformations, such as the common B-DNA, the shorter A-DNA, and the left-handed Z-
DNA, depending on environmental conditions. The double helix structure enables DNA
replication, transcription, and the storage of genetic information.
The formation of different DNA forms, such as B-DNA, A-DNA, and Z-DNA, is influenced
by a variety of factors, including environmental conditions, DNA sequence, supercoiling,
protein interactions, and chemical modifications. Environmental conditions like
hydration levels, salt concentration, and temperature play a significant role. For instance,
B-DNA, the most common form, is stable under normal physiological conditions with high
hydration, while A-DNA forms under low hydration or high salt concentrations. Z-DNA, a
left-handed helix, often arises in regions with alternating purine-pyrimidine sequences,
such as CG repeats, and is favoured under conditions of negative supercoiling. The DNA
sequence itself also determines its conformation, as regions with high GC content tend to
stabilize B-DNA, while repetitive or alternating sequences may promote alternative
structures like Z-DNA or hairpins.
8
B-DNA, A-DNA and Z-DNA
Supercoiling, which results from the over- or under-winding of the DNA helix, further
influences DNA conformation. Negative supercoiling, for example, can induce the
formation of Z-DNA, while positive supercoiling may stabilize B-DNA. Protein
interactions are another critical factor, as proteins like histones, transcription factors,
and enzymes can bind to DNA and alter its structure. Histones, for instance, help package
DNA into nucleosomes, influencing its local conformation, while enzymes like
topoisomerases can relieve torsional stress, affecting DNA supercoiling and structure.
Chemical modifications, such as DNA methylation or the binding of small molecules, can
also stabilize specific conformations or induce structural changes.
During processes like transcription and replication, DNA must unwind, creating regions
of single-stranded DNA or inducing supercoiling, which can favour alternative
conformations. In eukaryotic cells, DNA is organized into chromatin, and the degree of
compaction—whether in euchromatin (less condensed) or heterochromatin (highly
condensed)—can influence whether DNA adopts B-DNA or other forms.
RNA as a Genetic Material
The RNA molecule serves as both a carrier of genetic information and a functional
molecule in cellular processes. While DNA is the primary genetic material in most
organisms, RNA plays a crucial role in gene expression and, in some cases, acts as the
genetic material itself. RNA is composed of nucleotides, each consisting of a phosphate
group, a ribose sugar, and one of four nitrogenous bases: adenine (A), uracil (U), cytosine
(C), or guanine (G). Unlike DNA, RNA is typically single-stranded, allowing it to fold into
complex three-dimensional structures that enable it to perform diverse functions, such
as catalysing biochemical reactions and regulating gene expression.
In certain viruses, RNA serves as the primary genetic material instead of DNA. These RNA
viruses can be classified into single-stranded RNA (ssRNA) viruses and double-stranded
RNA (dsRNA) viruses. Single-stranded RNA viruses can have either positive-sense RNA,
which can directly serve as mRNA for protein synthesis, or negative-sense RNA, which
must first be transcribed into a complementary RNA strand. Examples of RNA viruses
include retroviruses like HIV, which use reverse transcriptase to convert their RNA
9
genome into DNA, and influenza viruses, which have negative-sense RNA genomes. SARS-
CoV-2, the virus responsible for COVID-19, also has a positive-sense RNA genome.
In cellular organisms, RNA plays a central role in gene expression, acting as an
intermediary between DNA and proteins. Messenger RNA (mRNA) carries genetic
information from DNA to the ribosome, where it is translated into proteins. Transfer RNA
(tRNA) brings amino acids to the ribosome during protein synthesis, matching them to
the codons on mRNA, while ribosomal RNA (rRNA) forms the core of the ribosome and
catalyses protein synthesis. These roles highlight the importance of RNA in translating
genetic information into functional proteins, which are essential for cellular processes.
RNA has several unique properties that make it suitable as genetic material in certain
contexts. Its single-stranded nature allows it to fold into complex structures, enabling it
to perform catalytic and regulatory functions. RNA viruses often replicate faster than
DNA viruses, allowing them to adapt quickly to changing environments. In positive-sense
RNA viruses, the RNA genome can directly serve as mRNA, simplifying the process of
protein synthesis. However, RNA also faces challenges as genetic material, such as its
relative instability compared to DNA due to the presence of the 2' hydroxyl group, which
makes it more prone to degradation. Additionally, RNA-dependent RNA polymerases lack
proofreading mechanisms, leading to higher mutation rates in RNA viruses, and RNA
genomes are generally smaller than DNA genomes, limiting the amount of genetic
information they can carry.
Non-coding RNAs (ncRNAs) are a diverse group of RNA molecules that do not encode
proteins but play essential roles in regulating gene expression and maintaining cellular
functions. Key types include microRNAs (miRNAs) and small interfering RNAs (siRNAs),
which silence gene expression by targeting mRNAs for degradation or translational
inhibition; long non-coding RNAs (lncRNAs), which regulate gene expression at
transcriptional and post-transcriptional levels; Piwi-interacting RNAs (piRNAs), which
protect genome integrity by silencing transposable elements in germ cells; small
nucleolar RNAs (snoRNAs), which guide chemical modifications of other RNAs; circular
RNAs (circRNAs), which act as miRNA sponges and regulate splicing; and transfer RNA-
derived fragments (tRFs), which influence gene expression and stress responses. These
ncRNAs are integral to cellular processes such as development, differentiation, and
genome stability, highlighting their critical roles in health and disease.
Central Dogma of Molecular Biology
The Central Dogma of Molecular Biology is a framework that describes the flow of genetic
information within a biological system. Proposed by Francis Crick in 1958, it outlines how
genetic information is transferred from DNA to RNA to proteins, which are the functional
molecules that drive cellular processes. The Central Dogma consists of three main stages:
replication, transcription, and translation, each of which plays a critical role in gene
expression and inheritance. Transcription and translation are the two central processes
by which genetic information in DNA is used to synthesize proteins, guided by the genetic
code.
10
DNA Replication
DNA replication is the process by which a cell makes an identical copy of its DNA before
cell division, ensuring the accurate transmission of genetic information from one
generation of cells to the next. It is a semi-conservative process, meaning each new DNA
molecule consists of one original (parental) strand and one newly synthesized (daughter)
strand. Replication begins at specific sites called origins of replication, where the enzyme
helicase unwinds the DNA double helix, creating replication forks. Topoisomerase
relieves torsional stress ahead of the replication fork by preventing the accumulation of
supercoils and ensuring the replication machinery can function without interruption
while single-stranded binding proteins (SSBs) stabilize the unwound strands, while
primase synthesizes short RNA primers to provide a starting point for DNA synthesis. The
enzyme DNA polymerase then adds nucleotides to the growing DNA strand, following the
base-pairing rules (A-T and C-G). Because DNA polymerase can only synthesize DNA in
the 5' to 3' direction, the leading strand is synthesized continuously, while the lagging
strand is synthesized discontinuously in short segments called Okazaki fragments. After
synthesis, RNase H removes the RNA primers, and DNA ligase joins the Okazaki fragments
to create a continuous strand. The process is highly accurate, with DNA polymerase
proofreading for errors and additional repair mechanisms correcting mistakes.
DNA replication
Transcription
Transcription begins when the enzyme RNA polymerase binds to a specific region of DNA
called the promoter, unwinding the DNA double helix to expose the template strand. RNA
polymerase then synthesizes a complementary mRNA strand in the 5' to 3' direction,
using RNA nucleotides (A, U, C, G) that pair with the DNA template (A-U, T-A, C-G, G-C).
Once RNA polymerase reaches a terminator sequence, transcription ends, and the newly
synthesized mRNA is released. In eukaryotes, the mRNA undergoes additional processing,
including the addition of a 5' cap, a poly-A tail, and the removal of non-coding introns
through splicing.
11
Translation
Translation
After transcription, the mRNA then moves to the ribosomes for translation, where the
genetic information is decoded into a polypeptide chain. Translation begins when the
small ribosomal subunit binds to the mRNA near the start codon (AUG), which codes for
methionine. The initiator tRNA, carrying methionine, binds to the start codon, and the
large ribosomal subunit joins to form a complete ribosome. During elongation, the
ribosome moves along the mRNA, reading each codon and recruiting the corresponding
transfer RNA (tRNA) molecule carrying the appropriate amino acid. The ribosome
catalyses the formation of peptide bonds between amino acids, elongating the
polypeptide chain. When the ribosome encounters a stop codon (UAA, UAG, or UGA), a
release factor binds, causing the ribosome to release the completed polypeptide. The
ribosome then disassembles, and the mRNA and tRNA are released.
Translation
12
After translation, the polypeptide may undergo post-translational modifications, such as
folding into its three-dimensional structure, cleavage to remove specific segments, or
chemical modifications like phosphorylation or glycosylation, to become a functional
protein. Together, transcription and translation, guided by the genetic code, enable the
synthesis of proteins that perform virtually all cellular functions.
The Genetic Code
The genetic code is the set of rules that defines how nucleotide sequences in mRNA are
translated into amino acid sequences. It is a triplet code, with each three-nucleotide
codon specifying one amino acid. There are 64 possible codons, encoding the 20 standard
amino acids, as well as start and stop signals. The code is degenerate/redundant, meaning
multiple codons can code for the same amino acid (e.g., leucine is encoded by six codons),
and universal, with nearly all organisms using the same codons for the same amino acids.
Genetic code
Chromosomes and Genome Organization
Chromosomes and genome organization are fundamental to the storage, transmission,
and regulation of genetic information in all living organisms. Chromosomes are thread-
like structures composed of DNA and proteins, primarily histones, which package DNA
into a compact and organized form. This packaging occurs through the formation of
nucleosomes, where DNA is wrapped around histone proteins, and further coiled into
higher-order structures called chromatin. Chromatin exists in two forms: euchromatin
and heterochromatin. Euchromatin is less condensed and transcriptionally active,
meaning it contains genes that are actively being expressed. It is typically found in regions
of the genome that require frequent access for transcription and other regulatory
processes. In contrast, heterochromatin is highly condensed and transcriptionally
inactive, playing a role in silencing genes and maintaining the structural integrity of
13
chromosomes. Heterochromatin is often found in regions such as centromeres and
telomeres, where gene expression is tightly regulated or suppressed. Chromosomes also
contain specialized regions, such as the centromere, which ensures proper segregation
during cell division, and telomeres, which protect chromosome ends from degradation.
In humans, chromosomes are classified into autosomes (22 pairs) and sex chromosomes
(XX in females and XY in males), with each chromosome carrying thousands of genes that
encode proteins and regulate cellular processes.
14
MOLECULAR ASPECTS OF GENE REGULATION
Gene regulation is the process by which cells control the expression of genes, ensuring
that the right genes are turned on or off at the right time and in the right amounts. This is
essential for cellular differentiation, development, and response to environmental
changes. It occurs at multiple levels, including transcriptional, post-transcriptional,
translational, and post-translational stages. Each level involves specific mechanisms and
molecular components that work together to fine-tune gene expression.
Transcriptional Regulation
This is the primary level of gene control, determining whether a gene is transcribed into
mRNA. It involves an interplay of DNA sequences, regulatory proteins, and chromatin
structure to ensure that genes are expressed at the right time, in the right place, and in
the right amounts. In prokaryotes, transcriptional regulation often involves operons,
which are clusters of genes transcribed as a single mRNA molecule and regulated
together. A classic example is the lac operon in E. coli, which controls the metabolism of
lactose. The lac operon consists of three structural genes (lacZ, lacY, and lacA) that are
transcribed together, along with a promoter (specific DNA sequence where RNA
polymerase binds to start transcription of downstream genes), an operator (specific DNA
sequence where repressor proteins bind to regulate transcription), and a regulatory gene
(lacI). The lac repressor, encoded by lacI, binds to the operator sequence to inhibit
transcription in the absence of lactose. When lactose is present, it binds to the repressor,
causing it to release from the operator and allowing transcription. This system allows
bacteria to efficiently regulate gene expression in response to environmental conditions.
15
Lactose absent, repressor active, operon off
16
Transcriptional regulation in eukaryotes
Chromatin structure also plays a critical role in transcriptional regulation in eukaryotes.
DNA is packaged into chromatin, which can exist in two forms: euchromatin, which is less
condensed and transcriptionally active, and heterochromatin, which is highly condensed
and transcriptionally inactive. Chemical modifications to histone proteins, such as
acetylation and methylation, can alter chromatin structure and influence gene expression.
Acetylation, typically carried out by histone acetyltransferases, loosens chromatin and
activates transcription, while methylation can either activate or repress transcription
depending on the context. Additionally, DNA methylation, the addition of methyl groups
to cytosine residues in DNA, often represses transcription by preventing the binding of
transcription factors or recruiting repressive proteins.
Post-transcriptional Regulation
Post-transcriptional regulation is a process that controls the processing, stability, and
translation of mRNA after transcription. This level of regulation ensures that the correct
amount of protein is produced at the right time and place, allowing cells to fine-tune gene
expression in response to internal and external signals.
One key mechanism of post-transcriptional regulation is RNA splicing, which occurs in
eukaryotic cells. After transcription, the initial RNA transcript, known as pre-mRNA,
contains both coding regions (exons) and non-coding regions (introns). Spliceosomes,
which are complexes of proteins and small nuclear RNAs (snRNAs), remove the introns
and join the exons to produce mature mRNA. This process is essential for generating
functional mRNA that can be translated into proteins.
17
RNA splicing
Additionally, alternative splicing allows different combinations of exons to be joined,
producing multiple mRNA variants from a single gene. This increases protein diversity
and enables cells to produce different protein isoforms from the same gene, depending
on cellular needs or environmental conditions.
Alternative Splicing
18
Another important mechanism is RNA editing, which involves the enzymatic modification
of RNA sequences after transcription. For example, adenosine-to-inosine (A-to-I) editing
and cytosine-to-uracil (C-to-U) editing can alter the coding potential or stability of mRNA.
These modifications can change the amino acid sequence of the encoded protein or affect
how the mRNA is processed and regulated.
The stability and degradation of mRNA are also tightly regulated processes that influence
gene expression. MicroRNAs (miRNAs) are small non-coding RNAs that play a significant
role in this regulation. miRNAs bind to complementary sequences in the mRNA, usually
in the 3' untranslated region, leading to either mRNA degradation or inhibition of
translation. This allows cells to rapidly reduce the levels of specific proteins without
altering transcription. Additionally, RNA-binding proteins (RBPs) bind to mRNA and
influence its stability, localization, and translation. Some RBPs stabilize mRNA and
enhance translation, while others promote mRNA degradation. These proteins help
regulate the lifespan and activity of mRNA, ensuring that proteins are produced only
when and where they are needed.
Post-translational Regulation
Post-translational regulation refers to the modifications and processes that occur after a
protein is synthesized. This level of regulation is essential for fine-tuning protein function
and ensuring that cellular processes are tightly controlled. Post-translational regulation
includes protein modifications, protein degradation, and protein localization, each of
which plays a critical role in maintaining cellular homeostasis and responding to
environmental changes.
Protein modifications are chemical changes that alter a protein's structure and function.
These modifications are often reversible and can act as molecular switches to regulate
protein activity. One of the most common modifications is phosphorylation, which
involves the addition of phosphate groups to specific amino acids (e.g., serine, threonine,
or tyrosine) by enzymes called kinases. Phosphorylation can activate or deactivate
proteins, change their interactions, or alter their localization. For example,
phosphorylation of the p53 tumour suppressor protein activates its function in response
to DNA damage, while phosphorylation of glycogen synthase inhibits its activity to
regulate glucose metabolism. Another important modification is ubiquitination, the
process of attaching ubiquitin molecules to a protein. This modification typically marks
proteins for degradation by the proteasome, a large protein complex that breaks down
unwanted or damaged proteins. For instance, ubiquitination of cyclins regulates the cell
cycle by ensuring their timely degradation at specific phases. Additionally, glycosylation
involves the addition of sugar molecules to proteins, usually in the endoplasmic reticulum
or Golgi apparatus. This modification influences protein folding, stability, and
interactions, particularly for proteins that are secreted or located on the cell surface. For
example, glycosylation of cell surface receptors is crucial for their function in cell
signalling and immune responses.
19
Protein degradation is the process of removing damaged, misfolded, or unnecessary
proteins, ensuring cellular quality control and regulating protein levels. One major
pathway for protein degradation is the proteasome, a large protein complex that
degrades ubiquitin-tagged proteins. For example, the degradation of misfolded proteins
by the proteasome prevents their accumulation, which could otherwise lead to diseases
such as Alzheimer's or Parkinson's. Another important degradation pathway is
autophagy, a process by which cells degrade damaged or unnecessary proteins and
organelles. Autophagy involves the formation of autophagosomes, which engulf cellular
components and deliver them to lysosomes for degradation. This process helps cells
survive under stress conditions, such as nutrient deprivation, by recycling cellular
components to generate energy.
Protein localization ensures that proteins are directed to the correct cellular
compartments, where they can perform their specific functions. Proteins often contain
short amino acid sequences, called signal sequences, that direct them to specific
organelles or cellular compartments. For example, the nuclear localization signal (NLS)
directs proteins to the nucleus, while the mitochondrial targeting sequence (MTS) directs
proteins to mitochondria. The transcription factor NF-κB, for instance, is kept inactive in
the cytoplasm until a signal trigger its translocation to the nucleus, where it activates
gene expression. Proteins can also be transported to specific locations through
interactions with transport proteins or vesicles. For example, proteins destined for
secretion are transported through the endoplasmic reticulum and Golgi apparatus before
being packaged into vesicles for delivery to the cell surface.
Epigenetic Regulation
Epigenetic regulation involves heritable changes in gene expression that do not alter the
underlying DNA sequence. These mechanisms allow cells to control which genes are
turned on or off in response to developmental cues, environmental signals, or cellular
needs. Epigenetic regulation plays a critical role in processes such as cellular
differentiation, development, and disease. The three main mechanisms of epigenetic
regulation are DNA methylation, histone modifications, and the action of non-coding
RNAs. Each of these mechanisms contributes to the dynamic control of gene expression
by modifying the structure of chromatin or influencing the stability and translation of
RNA.
DNA methylation is one of the most well-studied epigenetic mechanisms. It involves the
addition of a methyl group to the 5' position of cytosine residues, typically in CpG islands
(i.e. regions of DNA with a high frequency of cytosine-phosphate-guanine) sequences.
DNA methylation is often associated with the repression of gene expression because it
can prevent the binding of transcription factors or recruit proteins that compact
chromatin into a transcriptionally inactive state. For example, the methylation of tumour
suppressor genes can lead to their silencing, contributing to cancer development. DNA
methylation is also essential for processes such as genomic imprinting and X-
chromosome inactivation.
Genomic imprinting is an epigenetic phenomenon where only one allele of a gene is
expressed, depending on its parental origin (either the maternal or paternal allele). This
20
means that for certain genes, only the copy inherited from the mother or the father is
active, while the other copy is silenced. DNA methylation is the primary mechanism that
establishes and maintains this parent-specific silencing. During gamete formation,
specific regions of DNA are marked with methyl groups in a parent-specific manner. For
example, the H19 gene is methylated on the paternal allele and expressed from the
maternal allele, while the Igf2 gene is methylated on the maternal allele and expressed
from the paternal allele. Imprinting ensures that the correct dosage of certain genes is
maintained, which is critical for normal development. In females, who have two X
chromosomes, X-chromosome inactivation ensures dosage compensation by silencing
one of the two X chromosomes. This process prevents the overexpression of X-linked
genes, which could otherwise disrupt cellular function. Early in embryonic development,
one X chromosome is randomly inactivated in each cell. This inactivation is initiated by
the XIST gene, which produces a long non-coding RNA (lncRNA) that coats the X
chromosome and recruit proteins to modify its chromatin structure. DNA methylation
then locks the inactive X chromosome into a silenced state. X-chromosome inactivation
ensures that females, like males, have only one active copy of X-linked genes in each cell.
Histone modifications are another key epigenetic mechanisms that influences gene
expression by altering the structure of chromatin. Histones are proteins around which
DNA is wrapped to form nucleosomes, the basic units of chromatin. Chemical
modifications to histone proteins, such as acetylation, methylation, phosphorylation, and
ubiquitination, can either loosen or tighten chromatin structure, thereby regulating
access to DNA. For example, histone acetylation, carried out by histone acetyltransferases
(HATs), neutralizes the positive charge on histones, reducing their affinity for DNA and
making chromatin more accessible for transcription. Conversely, histone deacetylation,
performed by histone deacetylases (HDACs), compacts chromatin and represses
transcription. Histone methylation can have different effects depending on the specific
amino acid residue that is modified.
Non-coding RNAs (ncRNAs) are RNA molecules that do not encode proteins but play
crucial roles in regulating gene expression at both the transcriptional and post-
transcriptional levels. Two major classes of ncRNAs involved in epigenetic regulation are
microRNAs (miRNAs) and long non-coding RNAs (lncRNAs). miRNAs are small RNA
molecules that bind to complementary sequences in messenger RNAs (mRNAs), leading
to their degradation or the inhibition of their translation. This allows miRNAs to fine-tune
gene expression by reducing the levels of specific proteins. For example, miR-21 is known
to promote cancer progression by targeting tumour suppressor genes. lncRNAs, on the
other hand, are longer RNA molecules that regulate gene expression through various
mechanisms, including chromatin remodelling, transcriptional interference, and the
formation of RNA-protein complexes. For instance, the lncRNA XIST is essential for X-
chromosome inactivation in females.
21