Transcription
Transcription
o Gene expression is the process by which cells interpret and "read out" the
instruc�ons encoded in genes to produce proteins.
o Gene expression involves two main stages: transcrip�on (copying DNA to RNA) and
transla�on (conver�ng RNA into protein).
o Cells can produce mul�ple iden�cal RNA copies from a single gene, and each RNA can
generate mul�ple iden�cal protein molecules. This enables large-scale protein
synthesis from a single gene when needed.
2. Variability in Expression:
o Efficiency Differences: Genes are transcribed and translated with varying efficiencies,
allowing cells to control the quan�ty of each protein.
o Some genes produce a high quan�ty of protein (e.g., Gene A in the example), while
others produce less (e.g., Gene B), depending on cellular needs (see Figure 6-3 for
illustra�on).
Transcrip�on
1. Defini�on of Transcrip�on:
1. Single-Stranded Nature:
o This single-stranded nature allows RNA to fold into unique three-dimensional shapes.
o Func�on: RNA polymerase is the enzyme responsible for catalyzing RNA synthesis.
o It binds to the DNA and moves along the template strand, unwinding the helix as it
progresses.
o Primer Requirement: RNA polymerase can start an RNA chain without a primer, unlike
DNA polymerase, which requires a primer.
o Error Rate and Proofreading: RNA polymerase has a higher error rate (approximately
1 in 10,000 nucleo�des) compared to DNA polymerase due to fewer proofreading
mechanisms. However, errors in RNA are less consequen�al as RNA does not store
gene�c informa�on permanently.
o Release of RNA: A�er the RNA transcript is synthesized, it is quickly displaced from
the DNA, allowing the DNA helix to reform.
2. Energy Requirement:
o Ribonucleoside Triphosphates (ATP, CTP, UTP, GTP) are the substrates for RNA
synthesis.
o Rapid Synthesis: Since RNA strands are immediately released, mul�ple RNA
polymerase molecules can transcribe the same gene simultaneously, allowing
thousands of transcripts to be generated in a short �me.
1. Evolu�onary Origins:
o Divergent Evolu�on: X-ray crystallographic studies show that DNA and RNA
polymerases are structurally dis�nct, despite both containing a cri�cal Mg²⁺ ion at
the cataly�c site.
o These enzymes likely evolved separately in early cellular evolu�on, with different
lineages leading to modern DNA polymerases and RNA polymerases.
o RNA polymerase has a specialized structure that enables it to unwind DNA, catalyze
RNA synthesis, and release the RNA strand as it moves along the DNA.
o Figure 6-8 and 6-9: Illustrates that RNA polymerase synthesizes RNA as a single-
stranded molecule complementary to one DNA strand, using the DNA template
strand’s sequence.
o 3D Structure: RNA molecules can fold into specific three-dimensional structures due
to conven�onal and nonconven�onal base-pair interac�ons within the molecule.
2. Func�onal Implica�ons of Folding:
o The ability to fold allows RNA to have structural and cataly�c roles, similar to how
proteins func�on. Some RNA molecules, like those involved in catalysis, can form
highly specific structures to perform their tasks.
Given that DNA and RNA polymerases both carry out template-dependent nucleo�de
polymeriza�on, it might be expected that the two types of enzymes would be structurally related.
However, x-ray crystallographic studies reveal that, other than containing a cri�cal Mg2+ ion at the
cataly�c site, the two enzymes are quite different. Template-dependent nucleo�de-polymerizing
enzymes seem to have arisen at least twice during the early evolu�on of cells.
o The large pale blue area represents RNA polymerase, the enzyme responsible for
synthesizing RNA.
o RNA polymerase moves along the DNA template, unwinding the double helix as it
progresses. This structure surrounds the DNA and provides the necessary
environment for RNA synthesis.
o The yellow strand represents the template DNA. It’s the strand that RNA polymerase
reads to synthesize a complementary RNA strand.
o The direc�on of the DNA template is labeled with 3' and 5' ends, indica�ng its
orienta�on.
o The light blue line on the le� is the newly synthesized RNA transcript. This strand is
complementary to the template DNA.
o The RNA transcript grows in the 5' to 3' direc�on as RNA polymerase adds
nucleo�des.
o This area represents a brief hybrid of DNA and RNA where they form a short DNA-
RNA helix.
o Approximately nine nucleo�de pairs of DNA and RNA temporarily hybridize here. This
region allows the RNA strand to base-pair with the DNA template before detaching.
o The segment of the yellow DNA strand to the right of the polymerase is the
downstream DNA double helix, which has not yet been unwound by RNA
polymerase.
o The direc�on of transcrip�on (movement of RNA polymerase along the DNA) is also
indicated.
o The red dot represents a magnesium ion (Mg²⁺) at the ac�ve site of RNA polymerase.
o This channel in the blue RNA polymerase structure is the ribonucleoside triphosphate
uptake channel.
o Through this channel, incoming ribonucleoside triphosphates (ATP, UTP, CTP, and GTP)
enter RNA polymerase. These are the building blocks for the RNA strand, and they
provide the energy needed for polymeriza�on through their phosphate bonds.
TYPES OF RNAs
Transcrip�on Unit:
• In eukaryotes, a transcrip�on unit usually corresponds to one gene, producing a single RNA
or protein.
• In bacteria, a transcrip�on unit can include mul�ple genes, producing mRNA for several
proteins together (operon structure).
• RNA makes up a small percentage of a cell’s dry weight, whereas proteins comprise about
50%.
• rRNA is the most abundant RNA type in cells; mRNA is only 3-5% of total RNA in mammalian
cells.
• Each mRNA species is present in low amounts, with an average of 10-15 molecules per
species per cell.
1. Ini�a�on of Transcrip�on
• Importance of Ini�a�on:
o Core Enzyme: The bacterial RNA polymerase core enzyme is a mul�subunit complex
that synthesizes RNA by reading the DNA template.
o Sigma (σ) Factor: An addi�onal protein subunit, the sigma factor, binds to the core
enzyme, forming the RNA polymerase holoenzyme. The σ factor is essen�al for
recognizing promoter regions on the DNA, which signal the start of transcrip�on.
• RNA Polymerase Holoenzyme Binding:
o The RNA polymerase holoenzyme ini�ally binds weakly to DNA and slides along it.
When it encounters a promoter sequence (a specific DNA sequence indica�ng the
start of transcrip�on), the σ factor binds specifically to the DNA, causing the
holoenzyme to adhere �ghtly.
o The �ghtly bound holoenzyme at the promoter opens up the double-stranded DNA to
expose about 10 nucleo�des of unpaired DNA, forming a transcrip�on bubble.
o The σ factor binds to the unpaired bases on one strand, stabilizing the bubble.
o The exposed DNA strand in the bubble acts as a template for complementary base-
pairing with incoming ribonucleo�des.
o The polymerase joins the first two ribonucleo�des together, ini�a�ng the RNA chain.
• Scrunching Mechanism:
o This scrunching creates stress, which can lead to abor�ve ini�a�on, where short RNA
segments are o�en released as stress is relieved, and the polymerase restarts RNA
synthesis.
• Promoter Clearance:
o A�er several rounds of abor�ve ini�a�on, the stress generated by scrunching enables
the core enzyme to release its �ght hold on the promoter DNA.
o Once the core enzyme detaches from the σ factor, the polymerase is ready to proceed
with transcrip�on.
o The RNA polymerase moves along the DNA, adding nucleo�des to the growing RNA
strand in a stepwise manner. It advances one base pair for each added nucleo�de.
o The transcrip�on bubble expands at the front of the enzyme and contracts at the rear
as it progresses.
• Elonga�on Rate:
o Transcrip�on con�nues un�l the polymerase reaches a termina�on signal. This signal
is typically a sequence of A-T pairs in the DNA, followed by a region with twofold
symmetry.
o The symmetric sequence in the DNA, when transcribed into RNA, folds into a hairpin
structure through complementary base-pairing.
o This hairpin structure disrupts the interac�on between the RNA transcript and the
RNA polymerase ac�ve site, helping to release the newly synthesized RNA.
o A�er termina�on, the RNA polymerase releases both the DNA template and the RNA
transcript.
o The free RNA polymerase core enzyme then binds with a new σ factor to form
another holoenzyme, ready to start a new cycle of transcrip�on.
o These changes help open and posi�on the DNA in the polymerase ac�ve site and
ensure �ght binding so that the enzyme does not dissociate prematurely.
o The folding of RNA into specific structures, such as hairpins, plays a crucial role in
various transcrip�on stages, including termina�on. This structural folding influences
how effec�vely RNA polymerase reads and decodes gene�c informa�on.
Ini�a�on Complex Forma�on (Step
1 in Figure):
• Mechanism:
o At the promoter, the holoenzyme binds �ghtly due to specific interac�ons between
the σ factor and the DNA, forming a closed complex.
• Transcrip�on Bubble:
o The σ factor stabilizes the unpaired DNA strands by binding to one of them, preparing
the template strand for RNA synthesis.
• Ini�al RNA Synthesis: RNA polymerase begins synthesizing a short RNA strand.
• Scrunching Mechanism:
o RNA polymerase pulls addi�onal upstream DNA into its ac�ve site without moving
forward, which creates tension in the transcrip�on bubble.
• Abor�ve Cycles:
o The ini�al RNA transcripts are o�en short and may be released prematurely. This
process is called abor�ve ini�a�on.
o RNA polymerase may repeat this process several �mes before it successfully
synthesizes an RNA strand long enough to con�nue.
• Breaking Free: The stress generated from scrunching aids RNA polymerase in overcoming its
interac�on with the promoter.
• Release of σ Factor:
o Once RNA polymerase synthesizes around 10 nucleo�des of RNA, it detaches from the
promoter.
o The σ factor is released, allowing RNA polymerase to transi�on from the ini�a�on to
the elonga�on phase.
• RNA Synthesis:
o RNA polymerase moves along the DNA template, adding nucleo�des to the growing
RNA strand in a 5' to 3' direc�on.
o The transcrip�on bubble con�nues to move forward with RNA polymerase, with DNA
strands re-pairing behind it.
• Termina�on Signal: RNA polymerase eventually encounters a specific DNA sequence, known
as a terminator.
• Termina�on Hairpin:
o This sequence o�en contains A-T rich regions that, when transcribed into RNA, form a
hairpin structure (inverted repeat sequence in RNA).
o The hairpin destabilizes the RNA-DNA hybrid within RNA polymerase, leading to
termina�on.
• RNA Release:
o RNA polymerase releases both the newly synthesized RNA transcript and the DNA
template.
o The holoenzyme is now ready to start another round of transcrip�on on a new gene
or promoter.
A consensus nucleo�de sequence is derived by comparing many sequences with the same basic
func�on and tallying up the most common nucleo�des found at each posi�on. It therefore serves as
a sum mary or “average” of a large number of individual nucleo�de sequences.
• Eukaryo�c nuclei contain three types of RNA polymerases (I, II, and III), unlike bacteria,
which have only one.
The 5.8S, 18S, and 28S rRNA molecules are ribosomal RNA (rRNA) components of
the eukaryo�c ribosome
o RNA Polymerase II: Transcribes all protein-coding genes, as well as snoRNA genes,
miRNA genes, siRNA genes, lncRNA genes, and most snRNA genes.
o RNA Polymerase III: Transcribes tRNA genes, 5S rRNA genes, some snRNA genes, and
other small RNA genes.
• General Transcrip�on Factors are essen�al for correctly posi�oning RNA polymerase II at
the promoter. They also help separate the DNA strands for transcrip�on and release RNA
polymerase to ini�ate elonga�on.
o These transcrip�on factors are denoted as TFIIA, TFIIB, TFIIC, TFIID, etc. (TFII stands
for “Transcrip�on Factor for Polymerase II”).
o General transcrip�on factors func�on similarly to the σ factor in bacteria, aiding in the
ini�al stages of transcrip�on.
• The transcrip�on process begins with the binding of TFIID to the TATA box, a specific DNA
sequence rich in T and A nucleo�des.
o The TATA box is generally located 25 nucleo�des upstream of the transcrip�on start
site.
o The TATA-binding protein (TBP), a subunit of TFIID, recognizes the TATA box and binds
to it, distor�ng the DNA as a landmark for the promoter.
• TFIID, via its TBP subunit, binds to the TATA box, causing a significant DNA distor�on.
• This distor�on acts as a landmark, enabling other transcrip�on factors to recognize the
promoter region and begin assembling.
• TFIIB binds adjacent to TFIID, interac�ng with BRE elements in the promoter, which
posi�ons RNA polymerase II at the transcrip�on start site.
• TFIIF binds to RNA polymerase II and helps stabilize its interac�on with TFIID and TFIIB,
preparing the complex for transcrip�on ini�a�on.
It contains helicase ac�vity that uses ATP hydrolysis to unwind the DNA at
the transcrip�on start point, exposing the template strand.
• TFIIH phosphorylates Ser5 in the CTD of RNA polymerase II. This phosphoryla�on triggers:
o The release of RNA polymerase II from the general transcrip�on factors, enabling it to
proceed to elonga�on.
• At the top of the image (part A), you can see a small
rectangular TATA box (green) in the DNA. This TATA box is a
specific DNA sequence that signals where transcrip�on
should start.
• Other general transcrip�on factors, including TFIIE (orange), TFIIF (brown), and TFIIH
(yellow), assemble around RNA Polymerase II.
o TFIIF helps stabilize RNA Polymerase II’s interac�on with the DNA.
o It uses energy from ATP to unwind the DNA strands at the transcrip�on start site. This
unwinding creates a single-stranded region of DNA, which RNA Polymerase II can use
as a template.
o TFIIH also phosphorylates the C-terminal domain (CTD) of RNA Polymerase II
(illustrated as a light blue tail extending from RNA Polymerase II). This
phosphoryla�on changes the polymerase’s structure, freeing it from the general
transcrip�on factors so it can begin transcribing the DNA into RNA.
• As indicated by the yellow phosphate groups on the CTD tail, phosphoryla�on has occurred,
allowing RNA Polymerase II to enter the elonga�on phase, where it will read the DNA
template and synthesize RNA
• Eukaryo�c DNA is �ghtly packed into nucleosomes and organized into higher-order
chroma�n structures.
• Transcrip�on Ac�vators are proteins that bind to specific DNA sequences known as
enhancers, which are usually located far from the promoter region.
• These ac�vators help atract RNA Polymerase II to the promoter, enhancing the efficiency of
transcrip�on ini�a�on.
• The Mediator complex acts as a bridge between the ac�vators and RNA Polymerase II,
facilita�ng communica�on between them.
• It helps coordinate the interac�ons of ac�vator proteins, RNA Polymerase II, and the general
transcrip�on factors, ensuring efficient transcrip�on ini�a�on.
• These enzymes modify chroma�n structure to increase DNA accessibility, allowing the
transcrip�on machinery to bind to the DNA.
• Over 100 protein subunits come together at the transcrip�on start site to form the pre-
ini�a�on complex (PIC).
• The PIC includes transcrip�on ac�vators, general transcrip�on factors (e.g., TFIID, TFIIB, TFIIE,
TFIIF, TFIIH), RNA Polymerase II, Mediator, chroma�n remodelers, and histone-modifying
enzymes.
• TFIIB binds next, stabilizing the TFIID-DNA interac�on and posi�oning RNA Polymerase II.
Step 6: DNA Unwinding and Phosphoryla�on of RNA Polymerase II (Key Role of TFIIH)
• TFIIH uses ATP to unwind the DNA at the transcrip�on start site, exposing the template
strand.
• TFIIH also phosphorylates the C-terminal domain (CTD) of RNA Polymerase II at Ser5
residues.
• This phosphoryla�on signals RNA Polymerase II to begin transcrip�on and helps recruit RNA
processing factors.
• RNA Polymerase II must be released from the general transcrip�on factors and other
components of the PIC to start moving along the DNA.
• This release may involve the breakdown (proteolysis) of certain ac�vator proteins, freeing
RNA Polymerase II for elonga�on.
• RNA Polymerase II begins moving along the DNA template, synthesizing a nascent (newly
formed) RNA transcript.
• A�er transcribing about 25 nucleo�des, RNA Polymerase II has ini�ated the early elonga�on
phase.
• As the nascent RNA grows to about 25 nucleo�des, the phosphorylated CTD of RNA
Polymerase II (with Ser5 phosphoryla�on) recruits enzymes for 5ʹ capping.
• The 5ʹ cap is a 7-methylguanosine structure added to the beginning of the RNA, which will
later protect it from degrada�on and aid in transla�on ini�a�on.
• This phosphoryla�on recruits addi�onal RNA processing factors for splicing and 3ʹ-end
processing.
o When Ser5 is phosphorylated, capping proteins (highlighted at the top of the image)
bind to the CTD. These proteins add a 5' cap to the emerging RNA (shown at the top
right with the RNA exi�ng from RNA Polymerase II). The capping process is important
for RNA stability and future processing steps.
4. Splicing Proteins and 3'-End Processing Proteins (Middle and Lower Sec�ons of Image):
o Further down, 3'-end processing proteins (highlighted in green in the lower sec�on of
the image) bind to the CTD. These proteins are responsible for the cleavage and
polyadenyla�on of the RNA, crea�ng a mature 3' end, which stabilizes the RNA and
aids in its export from the nucleus.
o Once RNA Polymerase II has transcribed an en�re gene, it releases the RNA transcript
and soluble phosphatases remove the phosphate groups from Ser2 and Ser5 on the
CTD. This dephosphorylated state is essen�al for RNA Polymerase II to begin another
round of transcrip�on, as only the fully dephosphorylated form can ini�ate RNA
synthesis at a new promoter site.
Summary of Events
• Elonga�on: Ser2 phosphoryla�on recruits splicing proteins, while the RNA con�nues
elonga�ng.
• Termina�on: 3'-end processing proteins bind for RNA cleavage and polyadenyla�on.
• Recycling: Phosphatases remove phosphates from the CTD to reset RNA Polymerase II for
new transcrip�on.
o RNA synthesis starts, 5ʹ cap is added, and CTD phosphoryla�on at Ser2 signals
readiness for splicing and 3ʹ-end processing.
• 5' Capping: The addi�on of a modified guanosine cap to the 5' end of the RNA, which helps
protect the RNA from degrada�on and assists in ribosome binding during transla�on.
• 3' Polyadenyla�on: The addi�on of a poly-A tail at the 3' end, which also stabilizes the RNA
molecule and aids in its export from the nucleus.
• Splicing: The removal of introns from pre-mRNA and joining of exons to form a con�nuous
coding sequence, producing mature mRNA that can be translated.
o A specific adenine nucleo�de within the intron, located at a region called the
branch-point site, ini�ates the first reac�on.
o The 2' hydroxyl (–OH) group of this branch-point adenine atacks the 5' splice site,
which is located at the boundary between the exon and intron.
o This reac�on cuts the RNA at the 5' splice site and forms a unique 2'-5'
phosphodiester bond between the branch-point adenine and the 5' end of the
intron, crea�ng a lariat structure (a looped shape).
o The 3' hydroxyl (–OH) group of the upstream exon, which was freed in the first
reac�on, now atacks the 3' splice site at the boundary between the intron and the
downstream exon.
o This reac�on joins the two exons together and releases the intron as a lariat
structure.
o The lariat is later degraded, while the spliced exons form a con�nuous coding
sequence in the mRNA.
• Spliceosome Catalysis:
o A�er splicing, the spliceosome disassembles, releasing the mRNA and the lariat
intron.