Unit 2 Flashcards
monomers of nucleic acids
nucleotides
3 components of a nucleotide
1) base
2) sugar
3) phosphate
2 types of nucleotide bases
1) purine
2) pyrimidine
longer word, smaller structure
Adenine
Guanine
Cytosine
Thymine (DNA)
Uracil (RNA)
nucleoside
base connected to a pentose sugar
-for RNA, the sugar is ribose
-for DNA, the sugar is deoxyribose because it lacks the 2’OH
What is the structural (sugar) difference between RNA and DNA?
DNA lacks the 2’OH of ribose (making it deoxyribose), while RNA does not
connection between base and sugar
glycosidic bond (covalent) forms between the 1’ position on the sugar and a nitrogen on the base
nucleotides
the building block of nucleic acids
-a nucleoside with phosphoryl group(s) attached via ester linkage
nucleoside monophosphates
what the final nucleic acid polymer is composed of
nucleoside triphosphates
serve as high energy building blocks used by the cell to synthesize the nucleic acid polymer
phosphodiester linkage
covalent bond that connects two nucleotides to form polymer (nucleic acid)
mechanism of phosphodiester bond formation
1) Base activates 3’OH
2) 3’OH acts as a nucleophile, attacking the alpha phosphate (phosphate closest to the sugar) of a nucleoside triphosphate
3) Pyrophosphate acts as a leaving group to drive the reaction forward
Note: The nucleic acid polymer always grows in the 5’ to 3’ direction (i.e., nucleotides always added at 3’ end)
How do two strands of nucleotides interact?
complementary base pairing (Watson-Crick-Franklin)
-bases of nucleotides have hydrogen bond donors/acceptors
-C/G and A/T (DNA) or A/U (RNA)
How many hydrogen bonds are formed in A/T complementary base pairing?
2 hydrogen bonds (less energy required to separate; lower melting point)
How many hydrogen bonds are formed in C/G complementary base pairing?
3 hydrogen bonds (more energy required to separate; higher melting point)
B-form double-stranded DNA double-helix
-right-handed turn
-strands are anti-parallel
-sugar-phosphate backbone on the outside
-nucleobases on the inside
-asymmetrical
Also:
-orthogonal base-pairing
-base-stacking interactions
dsDNA base-stacking interactions
-van der Waals interactions between hydrophobic nucleobase faces (steric effects)
-pi-stacking (electronic effects)
These interactions have the effect of re-enforcing the individual A/T and C/G hydrogen-bonding interactions to drive massive stabilization in the larger context of the dsDNA helix
major groove
wide and deep groove in B-form dsDNA that provides access to the nucleobases from the outside
-proteins can bind through the major groove in a sequence specific manner
minor groove
shallow and narrow groove in B-form dsDNA that provides access to the nucleobases from the outside
puckered nucleotide sugar
in three dimensions, the sugar is puckered and can exist in a C3’-endo or C2’-endo conformation
-RNA nucleotides prefer the C3’-endo conformer because the C2’-endo conformation creates steric problems due to the 2’OH
-DNA nucleotides are predominated by the C2’-endo conformation, but the 2’ position is a hydrogen so either conformation is possible
RNA nucleotide conformation (sugar puckering)
the C3’-endo conformation is preferred because the C2’-endo conformation creates steric problems due to the 2’OH
DNA nucleotide conformation (sugar puckering)
the C2’-endo conformation predominates, but the 2’ position contains a hydrogen (as opposed to OH in RNA) so either conformation is possible
-B-form is C2’-endo (phosphates farther apart)
-A-form is C3’-endo (phosphates closer together)
C2’-endo vs C3’-endo in dsDNA
C2’endo places phosphates farther apart than C3’-endo
-B-form is C2’-endo (phosphates farther apart)
-A-form is C3’-endo (phosphates closer together); RNA prefers A-form DNA for this reason
syn- vs anti- base conformations
syn- has the base “over” the sugar, while anti- has the base swung out like a flag; the anti-conformation is energetically more favorable
-only purines can adopt the syn conformation (although it’s very rare and results in Z-form DNA)
-traditional Watson-Crick-Franklin base pairs are anti/anti, but Hoogsteen base pairs can form
Z-form DNA
results from the syn-conformation of purines and is very rare
-left-handed and has a zig-zag shape
Hoogsteen base pairs
forming of base pairs from syn-nucleotides (only one base needs to be syn)
-allows 3-4 strands to be present in a helix as part of some nucleic acid structures
-found in damaged DNA and DNA bound by drugs, but may also be present normally to regulate gene expression
tertiary structure in DNA/RNA
regions of secondary structure in nucleic acids can fold into a complex 3D structure (tertiary); possible in DNA but mostly seen in RNA molecules
-stabilized by metal ions and unusual base pair geometries
polynucleotide backbone
sugar-phosphate via phosphodiester linkage forms polynucleotide backbone
Is RNA typically single- or double-stranded?
typically single-stranded
What do the 5’ and 3’ ends refer to?
5’ = the 5’ carbon of a nucleotide
3’ = the 3’ hydroxyl of a nucleotide
Why is the genome DNA and not RNA?
RNA is more susceptible to degradation because the 2’OH can lead to breakdown of the chain
Other reason: B-form DNA allows easier access to info via the major groove
Why do we need to separate annealed DNA strands in the genome?
the annealed form is great for long-term storage but we also need to separate the strands to make use of base-pairing for replication, transcription, etc.
Tmelting(m)
the temperature at which the helix is half double-stranded, half single-stranded (50% denatured)
-stable helix = high Tm
-unstable helix = low Tm
linking number (Lk)
describes the topology of a DNA fragment; the sum you get from counting twist and writhe in a strcuture (Lk=Tw+Wr)
twist (Tw)
number of times each of the curves rotates around the central axis C of the double helix
writhe (Wr)
the number of times the intact B-form helix twists about itself
Can linking number be changed by simply deforming the structure?
No, if you try to increase or decrease twist, you will introduce writhe to ensure that the linking number remains unchanged; conversely, if you introduce writhe, the number of twists will adapt to endure that the linking number remains unchanged
When can we say DNA molecules have the same topology?
any time the linking number is the same; if linking number is different, the configuration of DNA is a different topology
natural twist (Lk0)
the helical structure of B-form DNA imparts a natural twist to the DNA polymer
Lk0 = (# of base-pairs)/10.4
This is essentially just the number of turns
Lk, Lk0, and DNA stability
B-form DNA is extremely energetically favorable, so for a given Lk, the Tw component will be ~Lk0, and the Wr will account for the difference between Lk and Lk0
difference between linking number and natural twist
deltaLk = Lk - Lk0
-if deltaLk < 0, the DNA is underwound and negative-supercoiling will result
-if deltaLk > 0, the DNA is overwound and positive-supercoiling will result
-if deltaLk = 0, the DNA is relaxed
How do cells change DNA topology (linking number)?
topoisomerase enzymes cut a strand, allowing the unbroken DNA strand to pass through break in first strand, then the cleaved strand is religated
What is the coiled state of most DNA in a cell?
as a general rule, most DNA inside the cell is negatively supercoiled
supercoiling
results from changing twist of dsDNA
nucleosomes
DNA wound in histones; this is how DNA is compacted in eukaryotes
How is DNA compacted in eukaryotes?
via the wrapping of DNA in histones and additional compaction; this drives chromosome assembly
histone acetylation
if histones are acetylated, chromatin opens and info becomes accessible; i.e., euchromatin
-histones can be acetylated on lysine residues by HATs
-acetylation can be removed from histones by HDACs
-acetylation reduces affinity for DNA
-acetylation marks can recruit transcriptional activators
histone methylation
if histones are methylated, heterochromatin proteins such as HP1 bind across them to promote chromatin compaction; i.e., heterochromatin
-histones can be methylated on lysines by histone methyltransferases
-histone methylation can be removed by histone demethylases
-methylated histones interact with heterochromatin proteins that oligomerize to coat and compact methylated regions
euchromatin
accessible, open regions of chromosomes; transcription can occur here (active)
-marked by histone acetylation
heterochromatin
inaccessible, compact regions of chromosomes; transcription cannot occur here (silent)
-marked by histone methylation and binding of heterochromatin proteins
parent strand
serves as a template to generate the daughter strand in replication; daughter helix composed of one strand from parent and one entirely new strand
semi-conservative genome replication
each daughter helix gets one strand from the parent and one completely new strand
What catalyzes nucleotide addition?
DNA polymerase
How does DNA polymerase catalyze nucleotide addition?
-adds dNTPs (nucleotide triphosphates) to the 3’ end of that last nucleotide in the polynucleotide strand
-synthesis can only proceed off an already existing double-stranded fragment
DNA polymerase (general)
catalyzes nucleotide addition by adding dNTPs to the 3’ end of a polynucleotide strand
-requires a primer and Mg2+ (metal)
Mg2+ and DNA pol/transcription
-activates 3’OH attack on dNTP phosphate, makes alpha phosphate more electrophilic
-stabilizes phosphate’s negative charge
Why does DNA polymerase require a primer?
base-pairing interactions are more specific in the context of an existing double-stranded structure than on their own
-dsDNA/primer facilitates addition of complimentary base-pairs to the growing strand by taking advantage of the energy of base-stacking
tautomerization of nucleobase
rare situation in which the position of H-bond donors/acceptors are different
-allows for C/A and T/G pairing
nontautomeric mispairing
wobble mispairing resulting from bases with an extra proton that can still bind but often with a mismatched nucleotide
“normal” mispairing
wobble mispairing between normal bases that nonetheless bond inappropriately because of a slight shift in position of the nucleotides in space
wobble mispairing
mispairings that occur because DNA double helix is flexible enough to accomodate slightly misshaped pairings
-nontautomeric (bases with extra proton) mispairings
-“normal” mispairings
DNA polymerase proofreading
DNA polymerase has 3’ to 5’ proofreading activity to correct mistakes
-the exonuclease domain removes errant nucleotides from the end of a DNA strand
two domains of DNA polymerase
1) polymerase - adds nucleotides
2) exonuclease - removes nucleotides
How does the DNA pol exonuclease correct errors?
an errant base destabilizes the dsDNA structure, causing the 3’ end to “flop” into the exonuclease site where the terminal nucleotide can be removed
DNA polymerase III
the “workhorse” polymerase (main replicative polymerase) that is built for speed and processivity (once it gets going it’s going to run for as long as it can)
DNA polymerase I
the “handyman” polymerase (odd-jobs polymerase) that polymerizes small stretches of DNA as part of cleanup or repair jobs
-built for accuracy instead of speed
ori
origin of replication
-prokaryotes have a single ori
-euaryotes have multiple oris
helicases
unwind DNA using energy of ATP hydrolysis
primase
synthesizes a short complementary RNA sequence to serve as a primer for DNA polymerase
Okazaki fragments
the fragments that make up the discontinuous strand
How are primers removed in replication?
DNA pol I recognizes the RNA primers and removes them, replacing them with DNA sequence
DNA ligase
joins together Okazaki fragments of the lagging strand and nicks created by DNA pol I
How does DNA ligase catalyze the reaction joining Okazaki fragments?
1) Ligase uses an ATP to adenylate itself, forming Ligase-AMP
2) AMP is transferred from ligase to the 5’ phosphate of the nicked DNA strand
3) Base catalyzed nucleophilic attack by 3’OH can seal the strand
single-stranded binding proteins (SSBs)
help keep ssDNA from reannealing after being melted by helicase
telomerase
prevents ends from shrinking by extending leading strand (brings its own RNA primer)
telomeres
regions of repetitive sequences at the end of eukaryotic chromosomes
What are the two main paths to mutations in the genome?
1) polymerase makes a mistake
2) chemistry takes its toll
mismatch repair pathway
1) Cells specifically mark the parental strands of DNA with methylation marks
2) Protein complex detects mismatch, indentifies which strand is methylates, and cleaves the OTHER (daughter) strand error
3) Resulting gap is filled in by DNA pol III and the nick is sealed by DNA ligase
alkylation chemical modification
adds a methyl or ethyl group to the nucleobase, disrupting a position of H-bond donor/acceptor
-can be mutagenic or not
-corrected by methyltransferase
mutagenic
chemical modification that disrupts base-pairing
deamination chemical modification
replaces a nucelobase amine with a carbonyl group, changing the nucleotide as a whole
-thymine cannot undergo deamination
-corrected by base-excision repair
-makes alternative base-pairing between deaminated bases possible
depurination chemical modification
loss of the entire purine base from the nucleotide
-can’t base-pair (literally doesn’t have a base)
-corrected by base-excision repair
UV light chemical modification
UV light can cause adjacent, stacked pyrimidines to dimerize
-these dimers produce a distorted DNA structure that is often misread by DNA pol and thus results in replication errors
-corrected by nucleotide-excision repair
-pyrimidines only
direct repair by methyltransferase (alkylation repair)
the alkyl groups can be transferred from the nucleobase to an alkyltransferase, often becoming permanently stuck; therefore, an entire protein takes on the damage from an alkylated base in a sacrificial manner
base-excision repair (depurination repair)
1) AP nuclease recognizes abasic sites and nicks the offending backbone at the site of damage, exposing a stretch of dsDNA with a free 3’OH and single nucleotide of ssDNA (this provides a template for DNA pol to extend)
2) DNA pol hops on, fills in the gap, and displaces some DNA
3) Ends are ligated together
Note: exact same as deamination repair but without glycosylase enzyme
base-excision repair (deamination repair)
1) Simply convert the offending nucleobases into an abasic site using a glycosylase enzyme
2) AP nuclease recognizes abasic sites and nicks the offending backbone at the site of damage, exposing a stretch of dsDNA with a free 3’OH and single nucleotide of ssDNA (this provides a template for DNA pol to extend)
3) DNA pol hops on, fills in the gap, and displaces some DNA
4) Ends are ligated together
Note: exact same as depurination repair after glycosylase enzyme is applied
glycosylase enzyme
removes base from a nucleotide
-used to correct deamination in the base-excision repair pathway
nucleotide excision repair (UV dimer repair)
1) Surveillance complex scans DNA structure and locates error
2) Excinuclease cuts both sides of the lesion, strand is unwound
3) DNA pol I fills in the gap and ligase connects the ends
endonuclease vs exonuclease
endonuclease cleaves the phosphodiester backbone within a nucleotide chain, while exonuclease cleaves the phosphodiester backbone at the 3’ end of a nucleotide chain
homologous recombination
1) 5’->3’ exonuclease generates free 3’ overhangs (allows for DNA polymerase activity)
2) 3’ overhang performs base pairing interactions with target (homologous segment), displacing one strand of DNA (“strand invasion”)
3) Polymerase extends off free 3’ end using homology target as template
4) Four-way DNA structure called a “Holliday junction” results
5) Holliday junction resolved by cutting either vertically or horizontally (either way, “origins” of DNA in each chromosome is altered)
Holliday junction
formed by two crossovers in homologous recombination
-B-form is preserved
-helicases can translocate the position of the cross-over point in a process called branch migration
-can be resolved by cutting either vertically or horizontally
branch migration
process by which helicases can translocate the position of the cross-over point in a Holliday junction
two options following cleavage of a Holliday junction
1) DNA only “exchanged” between chromosomes along the site of repair
2) An entire arm of chromosome is exhanged outside the repair site
homologous recombination and variety
homologous recombination also functions to generate variety in the distribution of paternal and maternal genes on your chromosomes:
1) Provides a way to ensure that homologous chromosomes correctly pair during gamete production (must have same sequences to initiate recombination)
2) Resolution of Holliday junctions leads to exchange between chromosomes
sequence specific recombination
induces DNA rearrangements at defined sites; can be carried out by either tyrosine recombinase or serine recombinase
-orientation of recombinase leads to different recombination products
tyrosine recombinase mechanism
1) Tyrosine -OH attacks phosphate in DNA backbone, nicking the backbone and forming phosphodiester bond between tyrosine and DNA
2) DNA -OH attacks phosphodiester bond between tyrosine and DNA, kicking off tyrosine and forming a Holliday junction between fragments
3) Tyrosine -OH in recombinase attacks phosphate in DNA backbone again
4) DNA -OH agains attacks phosphodiester bond between tyrosine and DNA, kicking off tyrosine
5) Final product
tyrosine recombinase takeaways
1) Two sequential single strand breaks
2) Holliday junction intermediate
serine recombinase mechanism
1) Two double strand breaks produced simultaneously - one across each dsDNA fragment
2) Fragments on one side of the break are rotated so that they switch places
3) Double strand break is then repaired by religation of the broken pieces
i.e., cleave, rotate, connect back together
serine recombinase takeaways
1) Two double strand breaks introduced simultaneously
2) Repair proceeds without a Holliday junction
tissue-specific promoter
a recombinase protein can be placed under a tissue-specific promoter so that the recombination only occurs within certain locations in the body
two sequence specific recombination outcomes
1) insertion/excision
2) inversion
transposons
transposable elements that copy/paste and cut/paste themselves into the genome; selfish - only purpose is to use your genome to propogate themselves
Does transposon movement always cause changes in gene expression?
No, only if inserted into a gene coding region or a sequence that regulates gene transcription; this in turn creates genetic diversity (corn example from slides)
two major transposon types
1) Class I: retrotransposons (copy/paste) - copied to an RNA intermediate that is used to make a second DNA copy for insertion
2) Class II: DNA transposon (cut/paste) - DNA sequence is excised and pasted elsewhere in the genome
genetic mosaics
the movement of transposons within a genome means that different cells in the ame tissue can have different genomes from each other; they are genetic mosaics
What strand do we copy in transcription?
an mRNA copy of the coding strand is made using base-pairing interactions with the template strand
nucleotide addition by RNA polymerase
RNA polymerase is used to add nucleotides to mRNA in transcription (proceeds like DNA polymerase)
-Base activation of 3’OH allows it to act as a nucleophile, attacking the 5’ alpha phosphate of an NTP
-MG2+ ons are critical to activate -OH and stabilize the negative charge on the phosphates during the reaction
DNA polymerase vs RNA polymerase
-RNA polymerase has an intrinsic helicase (it unwinds/melts DNA on its own) and does not require a primer
-RNA polymerase does not have a separate exonuclease site for proofreading, but can “backtrack,” allowing the same active site used for polymerization to act as a nuclease and give a second chance to correctly transcribe
Can both strands of DNA be transcribed?
Yes, but not at the same time; only one strand at a time is transcribed
-this is because RNA polymerase holds onto both the template strand and coding strand in the complex
Why might RNA polymerase not need a primer but DNA polymerase does?
1) DNA replication needs to be more accurate to prevent deleterious mutations from locking in the genome
2) RNA polymerases initiate transcription at promoters
gene promoter
“landing strip” of DNA that specifies where RNA polymerase will initiate transcription
-positions the RNA polymerase/DNA complex to initiate at a specific location and to use the correct strand as a template
sigma subunit
facet of RNA polymerase in bacteria that recognizes the promoter in transcription
How does RNA polymerase initiate transcription?
-in bacteria, sigma subunit recognizes promoter sequence, RNA polymerase binds here, and the sequence positions the complex to initiate at a specific location and use the correct strand as a template
-RNA polymerase melts the DNA strand and begins to add complementary nucleotides, causing sigma to dissociate and the polymerase to “escape” the promoter
elongation state
state of RNA polymerase in transcription caused by mRNA extension proceeding rapidly
RNA polymerase in prokaryotes vs eukaryotes
-eukaryotes have 3 forms of RNA polymerase, and each type has its own initiation specificity (recognizes different DNA sequences as promoters)
-transcription in eukaryotes is also regulated by enhancer elements and other sequences that might be far from the transcription site
Rho-dependent termination (prokaryotes)
Rho factor is a helicase that translocates from the Rho binding site to the 3’ end of the transcript (“climbs” to RNA polymerase), where it unwinds/”peels” RNA polymerase off the template strand
Rho factor
helicase that acts as a terminator for transcription by translocating (“climbing”) towards the 3’ end of the transcript, where it unwinds/”peels” RNA polymerase off the template strand
Rho-independent termination (prokaryotes)
if RNA polymerase transcribes through a stretch of DNA that will produce a G/C rich region and a U-rich segment downstream, a hairpin could form, causing RNA polymerase to stall; the U-rich region downstream can only form weak base-pairing interactions with the template strand, and the combination of the stall and weak interaction causes the polymerase to fall off
hairpin structure
forms when the template strand in transcription produces a G/C rich region in mRNA, stalling the RNA polymerase and thereby acting as a terminator when paired with a U-rich segment downstream
When does a bacterial (prokaryotic) cell use the lac operon to produce lactose?
in the absence of glucose, as glucose is the preferred energy source
repressor
binds to operator sequence and blocks RNA polymerase binding, preventing transcription
-operon is only repressed in the absence of a ligand metabolite, as the ligand binds repressor and prevents it from binding to DNA
What is the only situation in which the lac genes are not expressed (transcription does not occur)?
when a repressor is present but not lactose metabolite
-any time the repressor is absent or a lactose metabolite is present, gene expression occurs
lac operon activator synthesis (switch from glucose production to lactose production)
1) Glucose inhibits cAMP production, so as glucose levels decrease, cAMP levels increase
2) cAMP binds to a protein called CAP
3) CAP binds to a promoter proximal site and stimulates RNA polymerase recruitment, making it an activator of transcription
CAP
activated by elevated levels of cAMP in response to low levels of glucose, binds to a promoter proximal site and stimulates RNA polymerase recruitment to the promoter, making it an activator of transcription
two mechanisms for turning a gene on in the presence of a ligand
1) Repressor (no ligand) = gene off —> repressor + ligand = gene on
2) Activator (no ligand) = gene off —> activator + ligand = gene on
two mechanisms for turning off a gene in the presence of a ligand
1) Activator - ligand = gene on —> activator + ligand = gene off
2) Repressor - ligand = gene on —> repressor + ligand = gene off
Why is eukaryotic gene regulation/transcription so much more complex?
the fact that DNA is wrapped in nucleosomes means it will always take extra effort to transcribe any gene
eukaryotic transcription activators
activators bind to enhancer elements to recruit coactivators and the basal machinery to stimulate transcription
-basal transcription machinery is needed to be able to assemble eukaryotic RNA polymerase II at promoter sites
relationship between eukaryotic activators and repressors
eukaryotic activators and repressors are modular proteins that can infuence transcription and influence each other; allows for combinatorial control of gene expression
-Activators:
1) recruit coactivators
-Repressors:
1) block coactivator recruitment
2) block activator binding to DNA
3) recruit corepressors
sequence-specific interaction of repressor and DNA
hydrogen bonds form between specific amino acids and nucleotide bases based on patterns of H-bond donors and acceptors in each molecule; interactions occur in the major groove of B-form dsDNA
-in this way, proteins can read the bases without needing to “melt” the DNA
What determines sequence-specificity of DNA binding proteins (e.g., repressor)
motifs in the protein structure that make direct contact with the nucleobases through the major groove of B-form dsDNA
How do researchers test if two proteins interact?
exploiting the modularity of transcriptional activators (DNA binding activity is often separate from regulatory activity); allows researchers to screen protein-protein interactions quickly
epigenetics
changes in gene expression not caused by changes in DNA sequence
-3 focal types of epigenetic regulation:
1) Regulation of chromatin structure
2) Chemical modification of nucleosides
3) microRNA-mediated regulation
two regions of chromatin (accessibility)
1) heterochromatin - transcriptionally silent
2) euchromatin - transcriptionally active
5-methyl cytosine
epigenetically inherited DNA modification that causes methylation with each replication, thereby leading to transcriptional silencing (useful for silencing transposons by turning off gene expression)
histone methyltransferases
methylate histones on lysines
histone acetyltransferase (HAT)
acetylate histones on lysines
transcription factors
activators and repressors
-bind to enhancers
-stabilize or destabilize RNA polymerase on the promoter
final mRNA transcript length
in eukaryotes, the final mRNA transcript is shorter than the DNA template from which it is transcribed due to removal of introns
introns
sequences removed from the RNA transcript
exons
sequences that remain in the RNA transcript and are expressed
spliceosome
complex enzymatic machine (RNP) composed of both proteins and non-coding RNAs called snRNAs that facilitates splicing, the removal of introns
-only eukaryotes have spliceosomes
splicing
the process of removing introns from transcribed RNA
5’ modification to transcribed mRNA
7-methyl guanosine (m7G) cap is added to the 5’ end of eukaryotic mRNA
-aids in formation of translation initiation complex
-protects 5’ end from degradation
-signal for nuclear export
3’ modification to transcribed mRNA
poly-A tail added to 3’ end of eukaryotic mRNA by polyadenylate polymerase (PAP)
-signals nuclear export
-stabilizes mRNA
-promotes translation
alternative isoforms from mRNA processing
alternative splicing contributes to the idea that many proteins can be coded from a single gene
mRNA processing and surveillance
mRNA processing ensures bona fide mRNA has features that provide proof to the cell that the RNA was made by it (rather than it being a virus)
Why are pre-mRNAs spliced and processed?
1) Alternative isoforms
2) Surveillance against invaders
ribonucleoproteins (RNPs)
any machine with protein/RNAs working together
-spliceosome is an RNP
spliceosome pre-mRNA splicing mechanism
snRNAs use base-pairing interactions to locate the intron/exon boundaries within the pre-mRNA; once located, the spliceosome activates and proceeds in 2 steps, both involving nucleophilic attack of a hydroxyl on the phosphodiester backbone:
1) 2’OH from the ribose of an adenosine within the intron cleaves the backbone at the 5’ splice site
2) Free 3’OH of 5’ first exon attacks the phosphate connecting the intron to the second exon, releasing an RNA lariat
group I intron mechanism
1) Free guanosine nucleotide attacks phosphate at 5’ splice site
2) 3’OH of 5’ exon initiates nucleophilic attack on the phosphodiester backbone at the 3’ exon, releasing a linear fragment
group II intron mechanism
1) 2’OH of an adenosine in the intron attacks the phosphate at 5’ splice spite
2) 3’OH of the 5’ exon attacks the phosphate connecting the intron to the second exon, releasing an RNA lariat
microRNAs
small ~22nt genome-encoded RNAs that regulate gene expression
-base-pairing is used to locate miRNA targets; degree of base-pairing sets type of regulation:
1) if 100% miRNA form WCF base-pairs, RNA is destroyed
2) imperfect match - repression but not destruction of RNA
three main types of RNA in the cell
1) mRNA - encodes protein sequences
2) tRNA
3) ribosomal RNA (rRNA) - non-coding
translation
going from mRNA to protein
codons
3 nucleotides that code for a single amino acid
properties of the genetic code
1) triplet - codon corresponds to one amino acid
2) redundant - except for methionine and tryptophan, each amino acid is encoded by more than one codon
3) universal - same code is used in all organisms
tRNA
adapter molecules that link codons to associated amino acids by presenting an anti-codon that uses WCF base-pairing to read triplet codons in mRNA
charging of tRNA
amino acid is activated by adenylation of the carboxyl group, then tRNA synthetase selects the matching tRNA and “charges” it, transferring the amino acid to the tRNA 3’OH
-i.e., a charged tRNA is simply a tRNA with an amino acid attached
Why are their fewer than 64 species of tRNA but 64 possible codons?
some tRNA can pair with multiple, synonymous codons, all of which encode the same animo acid
-wobble hypothesis
wobble hypothesis
first two bases create coding specificity, while third base does not make strong interaction and does not need to be a canonical base-pairing interaction
-can occur only at the 3’ end of RNA (5’ end of anti-codon)
-this allows the cell to get around needing 64 tRNAs
inosine
placed in the anti-codon stem by some tRNAs, inosine can base-pair with multiple different nucleobases, making it well-suited to place in the wobble position (5’ end of anti-codon)
reading frame
consecutive, non-overlapping codon sequences translated into a polypeptide; reading fram start is set by AUG (START codon)
UTR
untranslated region that does not code for protein
-outside the START and STOP codons
silent mutation
changes the mRNA sequence to a synonymous codon and therefore has no effect on resulting polypeptide
missense mutation
changes the mRNA sequence from a codon for one amino acid to a codon for a different amino acid, changing the polypeptide sequence
nonsense mutation
changes the mRNA sequence from a codon for an amino acid to a STOP codon, terminating translation and resulting in a truncated polypeptide
frameshift mutation
disrupts the normal reading frame by inserting or deleting nucleotides that are not multiples of 3 (in this case frame isn’t shifted)
-insertion or deletion
ribosome in translation
1) Initiates synthesis at START codon
2) Synthesis: forms peptide bonds between amino acids to make a polypeptide
3) Terminating synthesis at STOP codon
ribosomal initiation of translation (prokaryotes)
ribosome RNA base-pairs with mRNA at the Shine-Dalgarno sequence, aligning ribosome at intended AUG START codon
Shine-Dalgarno sequence
sequence of mRNA in prokaryotes that aligns ribosome RNA at intended AUG START codon
ribosomal initiation of translation (eukaryotes)
1) Ribosome assembes initiation complex mediated by 5’m7G cap and poly-A tail interactions
2) Ribosome scans from 5’ cap toward 3’ end of mRNA searching for AUG
Note: very poorly understood beyond this
three sites for tRNA/mRNA interactions on ribosome
1) A site - holds an aminoacyl tRNA
2) P site - holds the tRNA with growing polypeptide attached
3) E site - holds a tRNA that will exit
ribosomal condensation reaction
A-site N-terminal amine attacks P-site’s ester linkage to polypeptide’s C-terminal end and the polypeptide is transferred from the P-site tRNA to the A-site tRNA (ribosomal RNA catalyzes the formation of a peptide bond between the amino sites bound to tRNAs at P and A sites; one amino acid added to the end of tRNA in the process)
release factor
protein that recognizes STOP codon and promotes polypeptide release and translation termination
-not a tRNA, but protein component that mimics tRNA
-reads STOP codon sequence using amino acids, not bases
tRNA translocation in ribosome
tRNA moves from A-site (amino acid) to P-site (protein) to E-site (exit) in a ribsome
N-terminal end of protein
corresponds to the 5’ end of mRNA; polypeptide grows from N-terminus to C-terminus
physical coupling of transcription and translation (prokaryotes)
transcription and translation are physically coupled in prokaryotes, meaning the mRNA begins being translated before its finished being transcribed); but this is not so in Eukaryotes where mRNAs are made in the nucleus and translated in the cytoplasm
oligonucleotide synthesis (solid state synthesis)
chemical (laboratory) synthesis of short (10-80nt) DNA fragments that are used as primers for PCR
-unique in that fragments are synthesized from 3’->5’
polymerase chain reaction (PCR)
uses oligos as primers to amplify defined DNA fragments
-repeated cycles of denaturation, primer annealing, and polymerase extension lead to exponential amplification of a target region of DNA
Gibson assembly
mimics a recombination reaction by assembling smaller overlapping fragments into larger genes
-5’->3’ exonuclease
-high-fidelity DNA polymerase
-taq DNA ligase
plasmid
a circular DNA molecule that replicates separately from the host chromosome; designed genes can be cloned into plasmids for propagation and downstream applications
Key components:
1) Ori - recruits cellular enzymes to initiate replication
2) Selectable genetic marker - enables selection of cells with the recombinant plasmid
3) Polylinker - allows vector to be cut with restriction enzymes
Sanger method
involves the incorporation of ddNTPs to terminate synthesis, allowing multiple products to be generated that allow us to identify the locations of a base
-highly accurate, but can only sequence one clonal DNA at a time
Illumina method
single DNA molecules are spotted on their “own location” on a slide, locally amplified and sequenced using microscopy to read out each base that is added
PacBio method
single molecules of DNA are trapped in tiny wells, and the sequence of DNA is read out by repeatedly resequencing the same fragment over and over again
-allows us to read longer sequences
Nanopore method
DNA is fed through a protein-based pore in a membrane and the changes in conductance as the DNA goes through provides the identity of different bases
-many more bases sequenced, but less accuracy
Cas9
RNA-programmable DNA endonuclease that uses WCF base pairing between “guide RNA” and dsDNA to locate the target site
Cas9 gene modification
1) Use Cas9 to induce a break at a desired location
2) Use synthetic DNA fragment as repair template
dCas9
“dead” Cas9 that binds target DNA but doesn’t cut