Information Transfer Flashcards
How big is the human genome in diploid cells?
6x10^9 bp
6x10^9 bp * 0.34nm/bp = 2.04x10^9nm = 2m/cell
37x10^12 cells in human body = 74x10^9 km of DNA in body
What is the structure of chromosomes?
- Single DNA molecule wrapped around scaffold of proteins
- highest condensed structure
- made up of condensed chromatin fibers
- average human mitotic chromosomes is 5μm in length and contains 5cm of DNA
- unwound within the nucleus, only appear during mitosis
What is the structure of chromatin?
- first step in the folding events to condense DNA 10,000 fold from it’s extended form into compact mitotic chromosome
- made up of DNA, histone protein, and RNA
- made up of nucleosomes
what is a karyotype?
full set of chromosomes of an individual
humans have 22 pairs of autosomes and 1 pair of sex chromosomes
How is there only one genome?
Every cell has the same genome, but the information differs from person to person
- epigenetics is the reason the body can make different cell types and organs
- some genes are modified to either be expressed or not expressed
- whole different transcriptome can be expressed in different cells
- not always considered a change in sequence, but cancer cells may have mutations related to epigenetics
What are the two levels of epigenetic modifications?
DNA modifications
Histone modifications
How are histones involved?
- 150bp wrapped twice around histone core protein
- each histone has N-terminal tail that’s unstructured and chemically modified by many types of enzymes
what are the three tail residues and how can they be modified?
K (Lysine): acetylated, methylated, ubiquitinated
T/S (Threonine/serine): phosphorylated
What are histones?
- key DNA protectors of information storage
- main packaging proteins
What are the 5 classes of histones?
H1: linker histone (not as conserved between species)
H2A, H2B, H3, H4: core histones
- 2 copies of each make up nucleosome octamer which forms the “beads on a string” structure
what is an important characteristic of histones?
- very basic, rich in lysine and arginine
- arginine can be methylated
what are introns and exons?
intron: intervening sequences
exon: expressed sequences
What are 3 things found in the promoter region (where transcription factors bind to recruit RNA pol)?
- Trans-acting elements (DNA)
- Cis-acting elements (proteins) that bind to trans elements
- TATA box: recruits binding factors that initiate transcription
where does transcription start and stop?
start codon: ATG
stop codon: AUG
What is gene expression?
- series of events where info in DNA sequence is converted to RNA product and then to protein, which performs biochemical/biological function in the cell or organism
what is the central dogma of gene expression?
Gene (DNA) –––(transcription, processing)––→RNA–––(translation)––→ Protein
what are 3 things that gene expression helps us understand?
- why different types of cells have different functions (1 genotype, many phenotypes)
- why organisms differ from one another
- how cells adapt and respond to different signals and changes in their environment
what is a gene?
the entire DNA sequence that is necessary for the synthesis of a protein or RNA molecule
- includes the promoter, translational coding region, untranslated regions, introns, transcription termination signal, and regulatory sites
what 5 things determines the amount of each protein in a cell?
abundance of mRNA is determined by:
1. the rate of its synthesis (transcription of gene)
2. the rate of mRNA degradation (stability)
3. the availability of the RNA molecule (mRNA sequestration)
- efficiency of translation
- processing and stability of the protein
purpose of prokaryotic gene control
allows cells to optimize growth and division in response to changing environments
purpose of eukaryotic gene control
regulate a genetic program that underlies embryonic development and tissue differentiation
mRNA processing
-RNA synthesized by RNA pol II is termed pre-transcript/pre-mRNA and is extensively modified before transported to cytosol for translation
4 reasons for processing mRNA
- identifies it as mRNA
- allows transport to the cytosol
- makes it competent for translation
- stabilizes and protects in from degradation (RNA is inherently unstable)
Overview of eukaryotic mRNA processing
(DNA)
- transcription by RNA pol II
- addition of 5’ cap when 20-30 nt of pre-mRNA made
- addition of 3’ polyA tail (longer tail = more stable the pre-mRNA)
(pre-mRNA)
- RNA splicing: introns removed
(mRNA)
- export out of nucleus
- translation
(Polypeptide)
architecture of final mRNA product
- 5’ cap is guanosine added to 5’ end of pre-mRNA backwards (via 3 phosphates instead of by sugar) and methyl group is added to it to protect mRNA from degradation and determine translation efficiency
- open reading frame (ORF) determines amino acid sequence of protein
- polyadenylation signal ( in the 3’ UTR) determines stability
2 main parts of mRNA processing
- capping
- splicing
describe capping
first 2 ribose sugars are methylated (guanine cap and first nt after it), first methylation of G cap is Cap0 and nt methylation is Cap1
- functions to regulate mRNA transport out of nucleus, prevent 5’ exonuclease degradation, promotes translation via binding to ribosomes, and promotes 5’ proximal exon excision
what are the 4 steps of 5’ capping?
- RTPase removes a phosphate from pre-mRNA
- GTase pulls a phosphate off of GTP and adds it back to 5’ end of pre-mRNA backwards which regenerates triphosphate
- N7MTase (methyltransferase) pulls methyl group from S-adenosyl methionine (SAM) and adds it to G cap (Cap0)
- SAM becomes SAH (S-adenosyl homocysteine)
- if not enough SAM, can’t properly methylate cap = problems - repeat reaction to methylate Cap1 (catalyzed by 2’OMTase)
How is capping catalyzed?
-guanylyltransferase (GTase) is recruited by the CTD
- GTase is associated with CTD to ensure that each mRNA is capped as its transcribed and in position to interact with the newly synthesized mRNA 5’ end
- GTase only dissociates once pre-mRNA is capped, then the cap-binding complex (CBC) binds with CTD and G cap so the 5’ end is kept protected
describe splicing together of exons and removal of introns
- exons: expressed sequences
- introns: intervening sequences
- coding sequence starts with exon and ends with exon
- eukaryotic genes can have dozens of introns
- exons are usually short (50-250 bp) and introns range from short to long (50-20,000 bp)
why did exon/intron structure evolve in eukaryotic genes?
- may be result of organisms attempting to increase # of genes
- enhances rate of expression of genes
- differential splicing provides a way to obtain different proteins from a single gene
- makes it hard for toxic agents to damage coding sequence when low percent of gene is actually expressed
What is the structure of pre-mRNA
5’exon-GU———branch point A—–(pyr rich region)—-AG-3’exon
what are the splice sites of introns?
- always has 5’ and 3’ splice sites
- 5’ splice site has a conserved GU
- 3’ splice site has conserved AG
- branch point A is always within 15-45 bases from the 3’ splice site
what is the structure of U#snRPs in splicing
- U1/U# is a complex of RNA and protein
- snRP = small nuclear RNA protein
- RNAs are always highly structured and form stem loops when homology exists which form binding sites for protein
- explains why all pre-mRNAs begin with an exon: splicing apparatus (U1snRP) needs 5’ intron GU site (which base pairs with U1’snRP’s 5’ RNA end) to initiate cleavage and is just downstream of first exon
- if started with intron, the first intron wouldn’t be spliced out because GU isn’t there
9 steps of splicing cycle
- U1 (5’ end) binds 5’ splice site
- U2 (5’ end) binds internal branch point A (but has no U to pair with branch point A so that is free to attack the GU at the 5’ end)
- U4/5/6 recruited and bring together U1/2
- U1 is ejected from complex which allows U6 to bind 5’ splice site
- U4 is ejected and NTC enters which activates the complex (U4 is only to stabilize complex before activation)
- 5’ splice site is brought to branch point A
- 5’ splice site is joined to branch point A (branch point A attacks GU, 5’ exon is cleaved but not released by the complex)
- the 5’ and 3’ exons are brought together and joined
- intron lariat (discarded product) is detached and entire complex is recycled for next intron
Describe polyadenylation
- most eukaryotic mRNAs have 50-200 A residues at the 3’ end which are not coded for by the gene
- this poly A tail is added on by an enzyme (poly A polymerase: PAP) following the cleavage of mRNA at specific site near the 3’ end
- mRNAs have polyadenylation signal AAUAAA near the 3’ ends, coded for by the last exon in the gene which dictates where the polyA tail should be added
- polyA tail shortens as mRNA ages
How does polyadenylation happen?
- polyadenylation signal triggers cleavage 10-35 nts downstream of poly A signal on pre-mRNA
- PAP polyadenylates the tail
what are 3 main roles of polyadenylation
- protect from degradation
- signal for mRNA transport out of nucleus
- required for efficient translation
what allows assembly of pre-cleavage complex?
- CPSF-73 (processing nuclease) performs the cleavage event including bending the RNA to expose the cleavage site
steps of how RNA pol II terminates transcription
- RNA pol II transcribes exons and introns
- splicing occurs and introns are removed
- RNA pol II reaches polyA signal
- mRNA gets cleaved and polyadenylation occurs
- RNA pol II continues even though the mRNA is complete and reaches the cotranscription cleavage site (CoTC) and transcribes RNA that isn’t needed
- exonuclease Xm2 recognizes the cleaved site on the useless RNA and begins chewing it back
- Xm2 reaches RNA pol II and it is released from the template, resulting in termination
what are 6 additional ways of transcript processing?
- RNA editing
- selection of different promoters
- Selection of different AUG translation start sites
- selection of alternative polyadenylation sites
- Alternative splicing
- mRNA stability
describe RNA editing
C to U occurs by deamination and can result in premature stop codon at truncated protein
- mechanism: bisulfite deaminates C to U - used to determine amount of C methylation in a sequence since methylated Cs are protected from bisulfite conversion
- C residue is selected; mRNA forms hairpin loop; ACF is recruited and clamps on to RNA; ACF recruits APOBEC1 (deaminating enzyme)
- if APOBEC1 is overexpressed there are too many conversions which can lead to cancer
- mRNA melts so that APOBEC 1 has a C in it’s grip and converts it to U
- complex dissolves and hairpin RNA reforms but with C as U
describe selection of different promoters
- ex: PPAR𝜸 gene has many promoters and 2 start sites that can be used
- transcription can start 1 of 3 promoters and translation can start from one of the 2 AUG start codons in the transcribed RNA
- if starting from 𝜸1, transcription of A1 and A2 but start site B is cut out
- if starting from 𝜸2 or 𝜸3, only get transcription of B or A2, respectively
- use of 𝜸1 and 𝜸3 results in different mRNAs but the same protein because they use the same start codon
- differential selection of transcription start site is often coupled with differential splicing
describe selection of different AUG translation start sites
- if there are three potential start sites you can have three different proteins
- in most cases the AUGs are all in frame, so everything downstream of the start (whether it’s AUG1, 2, or 3) will be all in frame and the triplet codes all match us so amino acid sequence will be the same
Describe selection of alternative polyadenylation sites
- 70% of human gene transcripts undergo polyadenylation
- can have impact on 3’ UTR region and regulatory sequences therein such as miRNA binding sites
- selection of polyA site is linked to rate of transcription and loading of polyA complex onto the site
- weaker polyA sites will be passed over if pol II is moving quickly
- RNA binding protein regulators affect interactions between core machinery and poly A sites
- strongest poly A site is usually most distal, and weakest most proximal
describe alternative splicing
- ex: calcitonin gene can be expressed into 2 different gene products (calcitonin and CGRP which have different biological activities)
- involves alternative splicing and alternative selection of polyA sites
- some exons may be spliced out
- decision of which exon is removed is determined by RNA sequence elements and protein regulators, and is associated with rates of transcriptional elongation
what are the 4 categories of sequence elements in alternative splicing
- ESEs (exonic splicing enhancers)
- ESSs (exonic slicing silencers)
- ISEs (intronic splicing enhancers)
- ISSs (intronic splicing silencers)
- all are cis-elements that are recruited to DNA
- enhancers in exons/introns facilitate splice-site recognition and exon/intron definition and are important when they’re weak non-consensus sites
- they slow down elongation of RNA pol II which may facilitate recognition of weak splice sites
- speeding up rate may facilitate non-recognition
- there are specific proteins that bing to these mRNA sequence elements
- if proteins bind enhancer sequence they facilitate splicing/recruit spliceosome
- of proteins bind silencer sequence, they silence splicing and don’t recruit spliceosome
- splice site recognition can be inhibited which allows some exons to be excluded (some sequence elements inhibit site recognition by binding proteins that in turn block access of SNRPs or enhancer binding proteins so splicing of an exon can’t occur)
describe mRNA stability
- half lives of eukaryotic mRNAs vary significantly, so mRNA from housekeeping genes generally have long half-lives and mRNA genes that need to be dynamically regulated and expressed in short bursts have shorter half lives