BIOL 3080 pt 1 Flashcards
Shotgun Strategy
- DNA extraction
- DNA fragmentation (broken up into tiny pieces)
- Cloned into vectors
- Transform Bacteria, hybridize and grow, isolate the vector DNA
- Sequence genome library
- Assemble continuous fragments using computer software
Primer Walking Strategy
- Start sequencing from specific site in genomic DNA
- Design pimer at a site based on sequence information
- Start sequnecing with designed primer
- Repeat until formed sequence
- Often used to “close the gaps” in shotgun sequencing, or obtain small sequence
Enzymatic Method
- Single stranded molecules are sequenced
- All strands of different lengths end with ddA - causes it to stop
- Strands are separated by length in gel electrophoresis
- Anneal primer to one dna strand
- Split into 4 reactions: dATP, dTTP, dCTP, dGTP
- DNA polymerase is added to all of the rxns
- longest DNA strand is furthest from primer
- shortest strand is shortest from primer
- causes chain termination
DNTP
OH group allows strand extension at 3’ end
ddNTP
does not allow extension at 3’ end
Standard Vector Primers for sequencing
- need new primer for every new DNA template sequenced
- cannot design a primer if no previous sequence information is known
- insertion site are binding sites for vector primers
Automated Sanger DNA sequencing
- allows for large scale DNA projects
- Use a different colour fluorescent dye to tag each ddNTP
- The DNA primer is unlabeled
ADVANTAGES: - Can read up to 900 nucleotides per Rx
- Cost effective —› <$4 per sequence determination with a standard plasmid vector primer
- Can perform all 4 ddNTPs Rx in one sample and load in same lane of gel
Next Gen Sequencing
Examples
- High-throughput sequencing (HTSeq)
- Massively Parallel Sequencing
- Sequencing By Synthesis - SBS (Illumina)
- Pyrosequencing (454 Life Sciences)
- Ion semiconductor sequencing (Ion Torrent Systems)
- SOLiD - Sequencing by Ligation (Applied Biosystems)
ADVANTAGES:
- Can obtain 800 - 5000x more DNA sequence per day
- 6 - 25x cheaper than “old” technology
- Disadvantage over Sanger method
ELIMINATES:
- insertion and cloning of DNA
- transformation of vector to BAC
- isolation of plasmid
ADDED:
- ligate “adaptor sequences” to each end of DNA fragment and PCR amplify
Prep of genomic library
- isolate genomic DNA
- fragment genomic DNA
- Ligate DNA primers (adaptors) to each end of genomic fragments
- attach tagged DNA fragments to slide
- PCR amplify to abtain large #
- perform DNA sequencing reaction directly through the slide
Third Gen Sequencing
- Single molecule real-time sequencing (SMRT) – Pacific Biosciences (PacBio)
- Nanopore sequencing – MinION Portable Gene Sequencer (Oxford Nanopore)
-Pool of dNTPs, each labeled with a different colour fluorescent tag attached to g-phosphate is added to well. - Due to location of laser and detector -> only the labeled dNTP transiently held by DNA pol could be detected -> recorded as a pulse of coloured light.
- When the dNTP is incorporated into DNA being synthesized -> the a-phosphate is cleaved off -> pulse of light is lost.
- Pulses of different colour light correspond to different dNTP incorporated by single DNA polymerase in PacBio (SMRT) sequencing
- Each type of dNTP is labeled with different colour fluorescent tag. In this example: yellow -> C, red -> G, blue -> A, green -> T.
DISADVANTAGES in contrast to illumina - HIGHER ERROR RATE
- INCREASED COST
- NOT AS WELL ACCESSED
Illumina sequencing
- DNA sequencing by synthesis but does not involve permanent chain termination
- All 4 colours can be viewed after each cycle
- Each modified nucleotide has a different coloured fluorescent tag
T - green
A - yellow
C - red
G - blue - a solution is added to remove the block off the 3’ end
3 examples of large scale genome projects
Personal genome project
–> Correlate genotypes with health info
Cancer genome project
–>Correlate drug sensitivity
Pediatric Cancer Genome
–>Find genetic causes of childhood cancers
Reading Frames
- There are 6 potential protein sequences that can be derived from any double-stranded DNA fragment
What is the rate of stop codons
1/21.3
What is the correct reading frame
The longest strand with no stop codon
What is the central dogma
DNA–>RNA–> protein
- DNA is for permanent storage of genetic information
- RNA is for transient storage of genetic information
- Protein is the final gene product
RNA polymerase
- major enzyme involved in transcription
- RNA is synthesized from 5’ —› 3’ from DNA template
- Catalyzes formation of phosphodiester bond from linkage of ribonucleotides (rNTPs)
- Does not require primer to initiate synthesis
- Modest proofreading mechanism → - Error rate of 1/104 rNTPs incorporated
DNA polymerase
- major enzyme involved in DNA replication
- DNA is synthesized from 5’ —› 3’ from DNA template
- Catalyzes formation of phosphodiester bond from linkage of deoxyribonucleotides dNTPs)
- Require pre-existing primer (either DNA or RNA) to initiate synthesis
- Good proofreading mechanism → - Error rate of 1/107 dNTPs incorporated
- can stop or stall causing hairpins
What are mRNAs
Messanger RNAs
- code for proteins
Percent?
3-5% of RNA
What are rRNAs
Ribosomal RNAs
- form the basic structure of the ribosome and catalyze protein synthesis
Percent?
Majority of RNA 75%
what are tRNAs
Transfer RNAs
- central to protein synthesis as the adaptors between mRNA and amino acids
Percent
10-15%
Telomerase RNA
Serves as a template for telomerase enzyme that extends ends of chromosomes
<5%
snRNAs
Small nuclear RNAs
- function in a variety of nuckear processes, including splicing pre-mRNA
<5%
snoRNAs
Small nucleolar RNA
- help to process and chemically modify rRNAs
<5%
IncRNAs
Long non-coding RNAs, not all of which appear to have a function, some serve as scaffolds and regulate diverse cell processes, including X chromosome inactivation
<5%
miRNAs
MicroRNAs
regulate gene expression by blocking translation of specfic mRNAs and causing their degradation
<5%
siRNAs
Small-interfering RNAs
turn off gene expression by directing the degradation of selective mRNAs and helping to establish repressive chromatin structures
<5%
piRNAs
Piwi-interacting RNAs
bind to piwi proteins and protect the germ line from transposable elements
<5%
First Human Genome Sequencing and who was involved
- cost 2.7 billion
- took 15 years
- 2013
- Francis Collins
- Craig Ventor
Genome Library
All possible DNA fragments of genomes
What is a Contig
A series of overlapping DNA sequences used to make a physical map that reconstructs the original DNA sequence of a chromosome or a region of a chromosome
What would happen in Sanger sequencing if too much ddA is present
The odds of pulling a ddA are very high which would cause chain termination or a stoppage
Contrast, if there was too little ddA, chains will be longer
Tell me about gel electrophoresis
- bottom is the positive node
- from bottom to top; smallest to largest
- Bcuz negative and positives attracts(smaller the space smaller the DNA)
- 5’ end is lined up at the positive node
Why are adaptors added in Next Gen sequencing
- you know the sequence of the adapter therefore you know what primer you need to use
Why is a PCR step added in Next Gen sequencing
- To build up an amount of copies to have a bigger signal
Would a signal with 2x the regular intensity be likely to 2x incorporation of the same nucleotide in the SAME cycle in Illumina sequencing
No
Which of the following statements are FALSE about comparison between RNA and DNA polymerase:
- The proofreading of RNA pol is not as good as DNA pol
- Both RNA and DNA pol catalyze the formation of phosphodiester bonds
- While DNA pol synthesizes DNA from a DNA template, RNA pol synthesize RNA from RNA template
- While DNA pol requires primers to initiate DNA synthesis, RNA pol can initiate RNA synthesis de nova
- None of the above
While DNA pol synthesizes DNA from a DNA template, RNA pol synthesize RNA from RNA template
Which of the following is TRUE about DNA sequencing:
- The DNA sequences for both strands of double-stranded DNA template are usually obtained to check for any technical errors in the sequencing procedure
- In sanger dye terminator sequencing with fluorescent tags, the DNA sequence of both strands of double stranded dna template can be determined simultaneously in the same tube
- In sanger dye terminator sequencing it is still necessary to split the reactions into 4 samples for each ddNTP
- Nanopore is another form of sequencing involving chain termination by the incorporation of modified nucleotides
- 2 of the above
The DNA sequences for both strands of double-stranded DNA template are usually obtained to check for any technical errors in the sequencing procedure
Error rate of DNA polymerase
1/107
Error rate of RNA polymerase
1/104
sigma factor
called holoenzyme
Completes the enzyme and binds to promoter sequence
Transcription cycle of RNA polymerase
Initiation starts and sigma factor and core subunit associate to form holoenzyme
holoenzyme slides along DNA
sigma factor recognizes promoter sequence on DNA
Transcription bubble forms: ~10 ntd of single stranded DNA
Scrunching occurs - short fragments of RNA, pulls DNA towards promoter
RNA polymerase breaks free of promoter, and releases sigma factor
Elongation starts, the jaw closes and increases rate of transcription
Termination starts, RNA pol reaches and transcribes termination sequence on template DNA
Hairpin structure formed in RNA corresponding to the DNA termination sequence causes the RNA pol to release its hold on RNA
Hairpin
Forms inverted repeats and only forms on RNA because RNA is single stranded so it snaps together.
Which of the following statements are false:
Initiation of transcription occurs at the ATG start codon
Promoters are regulatory DNA sequences found infront of genes
Initiation of transcription occurs at the ATG start codon
- This is translation start
UTR
Transcribed not translated
Consensus Sequences
are located within the promoter region
High expression Gene
Large amount of protein and RNA
Low expression gene
small amount of protein and DNA
No expression gene
No protein
How many strands of DNA are used as a template for RNA synthesis
One strand of DNA
Promoter Sequences
Are asymmetrical so RNA polymerase can only bind in one direction
Bottom strand is tempate strand if RNA pol is proceeding 5’–> 3’ , left to right
Top strand is template strand if RNA pol is proceeding 3’–>5’ right to left
What RNA is transcribed by RNA pol I
5.8S, 18S, 28S rRNA
alpha amanitin - insensitive
What RNA is transcribed by RNA pol II
mRNA, snoRNA, miRNA, siRNA, LncRNA, snRNA
alpha amanitin - very sensitive
What RNA is transcribed by RNA pol III
tRNA, 5S RNA, some snRNA and other small RNA
alpha amanitin - moderately sensitive
What does eukaryotic RNA pol require the presence of to bind to promoters
Requires the presence of additional initiation proteins (transcription factors) before they can bind selectively to different promoter sequences.
Order of transcription binders
TBP, TFIID, TFIB, TFIIA, TFIIF, TFIIH, TFIIE
What does a typical eukaryotic RNA polymerase II core promoter consist of
TATA box
Initiator element (INR)
Transcriptional start site is marked with what
+1 arrow
TBP
TATA binding protein
usually the first transcription factor to bind
Formation of the PIC
Transcription pre-initiation Complex
INR is start of transcription
TBP binds to TFIID
TFIIB then binds to TFIIA
Both bind together
The kink in the promoter is the physical location
TFIIF then binds to RNA polymerase
TFIIH then TFIIE then all bind together
This begins initiation of transcription (UTP, ATP, CTP, GTP,) which begins elongation
Phosphorylation of Ser in Hepapeptide
Still have to write this skip
Which direction is protein synthesis in
N terminus to the C terminus
N terminus
Amino end - H3N+
C terminus
carboxyl end - C-O=O-
Peptide bond
O=C-N-H
N and C terminus correlated
Role of tRNA in translation
tRNAs are RNA molecules between 70-90 nucleotides long
transferrnas that transfer genetic information into protein sequence
rate of tRNAs in amino acids
1/20
Clover structure
leaf 1 - D end - has D codons
leaf 2 - anticodon loop - GAA anticodon
leaf 3 - T loop - Psi located
3’ end - attached to Phe
Where is an activated amino acid located
3’ end of each tRNA
3 binding sites
E site, P site, A site
E site
Exit site
tRNA exits ribosome
P site
Peptide site
Peptide binding
A site
Aminoacyl tRNA
how many binding sites for mRNA and tRNA
1 binding site for mRNA and 3 sites for tRNA
How many tRNAs can be bound on the ribosome at the same time
2 tRNAS
Where is the tRNA that is connected to the polypeptide chain
P-site
Step 1 in translation of mRNA
Incoming Aminoacyl-tRNA binds to a site
tRNA linked to growing Peptide chain isat Psite
Spent tRNA Exits from E site
Step 2 in translation of mRNA
Peptide bond forms between amino acids 3 and 4
peptide chain is now linked to tRNA
Step 3 in translation of mRNA
Large ribosome subunit moves relative to small subunit so sites are unaligned creating hybrid sites
tRnA 3 is at E/P hybrid site
tRNA 4 is at P/A hybrid site
Step 4 in translation of mRNA
Small ribosome subunit shifts 3 nucleotides down mRNA - resets
A site is empty and available for incoming tRNA
Spent tRNA can dissociate from E site