Test 3 Flashcards
Genomics
Chromosomes/DNA
DNA copy-number assessment
– Comparative genome hybridization to DNA microarray
Mutation screening
- DNA sequencing
- Mass-spectrometry-based genotyping
- Mutation-specific PCR
Transcriptomics
DNA/mRNA
Gene-expression profiling
- DNA microarray
- Multiples PCR
MicroRNA-expression profiling
- DNA microarrays
- Multiplex PCR
Proteomics
Protein
Proteomic profiling
– Mass spectrometry
Phosphoproteromic profiling
– Mass spectrometry after immunoprecipitation specific antibodies
Methods of determining the sequence of DNA
Sanger sequencing (chain termination/dideoxy method)
Shotgun sequence method
2nd gen (eg. pyrosequencing)
Sanger Sequencing: protocol
- Denaturation
- Primer attachment and extension of bases (like PCR)
- Termination w/dye-labeled ddNTPs
- Gel electrophoresis
- - Run four separate reactions each with different ddNTPs
- - Run on a gel in four separate lanes
- - Read the gel from the bottom up
(Can be automated)
Sanger Sequencing: disadvantages
Only good for 500-750bp rxn
Expensive
Takes a while
The human genome is ~3 bil bp
Why was the Human Genome Project started?
Began in 1990
To study:
Human evolution
Nature v. nurture
Causes of disease
Shotgun Sequencing: protocol
Used to sequencing whole genome
- DNA extraction
- DNA fragmentation: DNA is broken up randomly into smaller fragments
- - Clone into vectors
- - Transform bacteria, grow, isolate vector DNA - Sequence the library: dideoxy method produces reads
- Reconstruction: assemble contiguous fragments
- - Look for overlap of reads
Pyrosequencing
Visible light is generated and is proportional to the number of incorporated nucleotides.
– Double peak heights indicate incorporations of two nucleotides in a row
DNA + NTP + DNA pol = DNA-NP = PPi
PPi + APS + ATP sulfurylase = ATP + SO4^2-
ATP + Luciferin + O2 + Luciferase =
AMP + PPi + Oxyluciferin + CO2 + LIGHT!!!!
Pyrosequencing:
disadvantages
Smaller sequences
Nonlinear light response after more than 5-6 identical nucleotides
Clonal Single Molecule Microarray: protocol
Attach DNA to flow cell
1) Prepare genomic DNA sample: randomly fragment genome DNA and ligate adaptors to both ends of the fragments
2) Attach DNA to surface: bind singel stranded fragments randomly to the inside surface of the flow cell channels
Bridge Amplification
3) Bridge amplificaiton: add unlabled nucleotides and enzyme to initiate solid phase bridge amplification
4) Fragments become double-stranded
Cluster Generation
5) Denature the double-stranded molecules
- - repeat cycles of solid phase bridge amplification
6) Completion of amplification: on completion, several million dense clusters of double-stranded DNA are generated in each channel of the flow cell
~1000 molecules/ 1um cluster
~ 20-30,000 clusters/tile
~ 40 M cluster/flowcell
Sequencing by Synthesis (SBS): protocol
Cycle 1:
- Add sequencing reagents
- First base incorporated
- Remove unincorporated bases
- Detect signal
Cycle 2-n: add sequencing reagents and repeat
(Same as reversible terminator chemistry?)
Reversible Terminator Chemistry
All 4 labelled nucleotides in 1 rxn
Higher accuracy
No problems w/homopolymer repeats
Steps:
- Incorporation
- Detection
- Deblock: fluor cleaved/removed
(same as sequencing by synthesis (sbs)?)
Base calling from images
The identity of each base of a cluster is read off from sequential images
1x flowcell = 8 lanes 1x lane = 3 columns (rows) 1x column = 100 tiles 1x tile = 4 images/cycle = 345,600 images for a 36-cycle run
Glass Slide Array/Affymetrix Gene Chip: protocol (general)
- RNA extraction
- Reverse transcription: cDNA reaction, purfication, and labeling by IVT
- Fragmentation (heat + Mg2+)
- Hybrization (label incorporation, Cy3/5)
- Washing
- Laser scanning
(Glass Slide: scan cy5 channel + cy3 channel & overlay images)
(Affymetrix GeneChip: photolithography) - Quantify
— load into database —
computer analysis –> bioinformatics
Glass Slide cDNA Microarray: advantages/disadvantages
Hybridize two samples/chip for direct comparison of samples
Non-standardized production can affect reproducibility (although there are now many quality-controlled commercial arrays available)
Longer sequences can have cross-hybridization with other genes
Don’t necessarily need to know all the genes in the genome. Can use unsequenced ESTs, for instance.
Affymetrix GeneChip: advantages/disadvantages
Limits 1:100,000 transcripts, ~5 transcripts/cell
Internal control lane with mismatch olgionucleotide probe cells to prevent false positives.
Can hybridize only one sample/chip. No direct comparisons of 2 samples.
Standardized production tends to give good reproducibility.
Limited amount of probe sequence can be problematic, but can also be helpful in limiting cross-hybrization
ChIP-chip: protocol
ChIP: chromatin immunoprecipitation
- Add formaldehyde and sonicate DNA to ~1kb
—- 1/2 sample —-
2. Add specific antibody
3. Immunoprecipitation
4. Reverse cross links and purify DNA
5. Amplify and label with Cy5
Hybridize to microarray
- — 1/2 sample —-
2. Reverse cross links and purify DNA
3. Amplify and label with Cy3
4. Hybridize to microarray
Tiled microarray
Cover a genomic region (or whole genome) at hight coverage.
Probes are designed to cover virtually every basepair of the sequence, usually excluding only simple sequence repeats.
In this way, there is no bias toward known transcribed regions.
Probe size and spacing determines resolution of the array.
Antibody Array: protocol
- Extract proteins from 2 samples
- Label 2 samples with Cy3/Cy5 and then mix
- Incubate on the array (with antibodies)
- Scan array
Single Cell Transcriptomic Approaches (3x): protocols
1)
- Introduce cell-unique barcoded primer beads
- Intracellular RT makes barcoded cDNA beads
- Cleave barcoded cDNA from beads
- Sequence barcoded cDNAs
2)
- Introduce primers and reagents, preform RT and RCA
- RCA generates ‘rolonies’ directly in cell sections.
- Sequence rolonies directly in cell sections.
3)
- Capture RNAs on surface bound primers, perform RT
- Eliminate all cell debres, except bound cDNA
- Single molecule sequencing of cDNA
Analysis Methods
T-test
ANOVA
Mann Whitney U Test
Type I error
alpha
false positives
p-value
Single gene analysis
Molecular cloning
Bacteria are usually the host cell used for basic cloning experiments
Applications of biotechnology
Virus-resistant crop plants and livestock
Diagnostics for detecting genetic diseases and acquired diseases
Therapies that use genes to cure diseases
Recombinant vaccines to prevent disease
Biotechnology can also aid the environment through bioremediation
Definition of biotechnology
Any technique that uses living organisms or substances to make or modify a product, to improve plants, animals, or microorganisms for specific uses
Process/Workflow of bacterial cloning
Source = DNA target
Fragmentation
Ligation to linear cloning vector to for chimera
Introduce DNA into host cell
Isolate cells with cloned gene on agar plate
- suspension of bacteria plated and spead
- isolated colony dervied from single partent cell = clones
Allow to replicate and produce protein from cloned gene (binary fission)
Restriction Enzymes
Type II restriction endonucleases (most common)
- cut DNA like scissors at specific sites called restriction sites
- cut across the sugar-phosphate backbone of DNA
- DNAse cuts into random pieces, while Hae III has specific site cleavage (cut in same place)
Recognition/cleavage sites of type II restriction enzymes
Cuts usually occurs at a palindromic sequence
Homodimeric ptns to help find palindromic sites.
SmaI or HindII: produces blunt ends (more difficult to ligate together)
5´ CCCGGG 3´
3´ GGGCCC 5´
EcoRI: produces sticky ends (good for adding target DNA with complement ends)
5’ GAATTC 3´
3´ CTTAAG 5´
Isoschizomers
(‘iso-sticky’)
Cut at same sequence but different end configurations
“pairs of restriction enzymes specific to the same recognition sequence. For example, SphI (CGTAC/G) and BbuI (CGTAC/G) are isoschizomers of each other.” ~ Wikipedia
Isocaudomers
Different recognition site but give same cleavage products
“pairs of restriction enzymes that have slightly different recognition sequences but upon cleavage generate identical termini… an enzyme that recognizes a slightly different sequence, but produces the same ends.” ~ Wikipedia
Meganuclease I-Sce I
Unusual restriction enzyme
Homing endonuclease I-Sce I
18-base pair sequence TAGGGATAACAGGGTAAT
4 base pair 3’ hydroxyl overhang.
Sequence will occur once in every 6.9 x 1010 base pairs.
This sequence does not normally occur in a human or mouse genome.
Enzyme encoded by an intron in yeast mitochondria
Methylation Sensitve Restriction Enyzmes
HpaII: only cuts when non-methylated (methylation sensitive Restriction Mapping)
Isoschizomer is MspI: Does not matter whether Meth or not
Separating Restriction Fragments and Visualizing DNA
Pieces of DNA are generated by restriction enzymes can be separated, viewed and manipulated based on SIZE using gel electrophoresis.
http://arbl.cvmbs.colostate.edu/hbooks/genetics/biotech/gels/virgel.html
Gel Electrophoresis
The gel is submerged in a buffer solution, and DNA samples are loaded in the wells.
Electricity is applied to electrodes at opposite ends of the gel to create an electrical field in the gel and the buffer.
Pores in the agarose catch the DNA pieces and slow down movement through the gel. Supercoiled DNA moves faster than nicked DNA form II.
Sugar-P backbone is negative, runs to postive.
Agarose gel and separation
Agarose percentage = pore sizes.
- Lower percentages = larger pores, (separating large DNA)
- Higher percentages = smaller pores. [smaller pieces of DNA]
What else affects resolution?
Voltage gradient used.
Visualizing DNA in gels
Stains [ethidium bromide] added to the gel to visualize DNA.
Ethidium bromide* molecules lodge in between the bases of DNA
‘Glows’ when an ultraviolet light is used
- Intercalator: DNA gets longer!
- ITS ALSO MUTAGENIC (genotoxic)
DNA ligase for joining DNA fragments together
Enzymes that cut with staggered cuts result in complementary ends that can be ligated together.
HindIII - leaves 5’ overhangs (“sticky”)
5’ --A AGCTT--3' 3’ --TTCGA A--5’ => 5’ --AAGCTT-- 3’ 3’ --TTCGAA-- 5’
Sticky ends that are complementary (from digests with the same or different enzymes) can be ligated together.
Sticky ends that are not complementary cannot be ligated together.
Plasmids
vehicles for cloning
Plasmids are naturally occurring extra-chromosomal DNA molecules.
Plasmids are circular, double-stranded DNA.
Plasmids are the means by which antibiotic resistance is often transferred from one bacteria to another.
Plasmids can be cleaved by restriction enzymes, leaving sticky or blunt ends.
Artificial plasmids can be constructed by linking new DNA fragments to the sticky ends of plasmid.
Copy #s vary: Low (10) to High (100s)
Cloning Vectors
A cloning vector is a plasmid that can be modified to carry new genes.
Plasmids useful as cloning vectors must have:
An origin of replication.
A selectable marker (antibiotic resistance gene, such as ampr and tetr).
Multiple cloning site (MCS) (site where insertion of foreign DNA will not disrupt replication or inactivate essential markers).
Easy to purify away from host DNA.
Alkaline Phosphatase: Helps prevent recircularized plasmid (Fig. 3.1)
Incorporating plasmid to bacteria
Electroporation or someing equitable
Selection and Counterselection of transformed bacteria
1) Plate with antibiotic:
Only bacteria with plasmid antibiotic resistance will survive. Original colonies will have another resistence gene. But clones will not. So will have to plate onto second plate with second antibiotic, and clone colonies will not grow.
2) LacZ with blue and white selection
Ampicillin
Inhibits cell wall formation
Inactivated by beta-lactamase
Streptomycin
Blocks protein initiation complex formation and causes misreading during translation
Inactivated by phosphotranserfase
Tetracyline
Prevents binding of aminoacyl-tRNA to 30S ribosomal subunit
Resistence gene encodes an inner cell membrane portein that passes the antibiotic out of the cell and blocks the passage of the antibiotic through the cell wall
Bonferroni correction
The traditional way to confront multiple testing problem.
Instead of p
False discovery rate (FDR or q-value)
The expected proportion of false-postives among the postive results.
At q = 0.05, 50/1000 significantly changed genes might be false postives.
Clustering approaches to analysis
Divides or groups gene/samples into groups “clusters” based on similarities and differences.
Number of groups is user defined.
Algorithms:
Hierarchial clustering/tree
Kmeans clustering
Self organising maps
Distance Metrics
Distance between 2 expression vectors (relative amounts, rather than absolute values).
proteonomics
Large-scale study of proteins, particularly their structures and functions.
SDS PAGE
The speed of migration in an electrical field depends on the dimension, form, and charge of the molecules.
For deaggregation and denaturation of the proteins, SDS, beta-mercaptoethanol or DTT, and heat is used.
SDS (strongly anionic detergent) provides negative charge to the proteins
Silver staining v. Coomassie Blue Staining
2D gel
Isoelectric focusing gel based on charge/pH.
Then equilibrate in SDS, and apply orthogonal electric field (based on size)
How to study proteonomics
Confocal microscopy
Fluorescent resonance energy transfer (FRET)
Co-immunoprecipitation
Far western blot
(Utilizes biotin modificaiton of purified bait protein probe. Prey proteins separated in-gel or transferred to membrane can be probed w/biotinylated bait. Detection w/streptividin-horseradish peroxidase conjugate + chemiluminscence)
Yeast two hybrin assay
(Fusion of yeast reporter gene that is activated every time gene of interest expressed. eg. Lac Z/beta-galatosidase)
Mass analysis/spectrometry/Matrix Assisted Laser Desorption/Ionization (MALDI)/Tandem Mass (MS/MS)
Gas chromatography
Metabolomics and profiling methods
High-throughput analysis of metabolites
Simultanous measurement of the levels of a large number of cellular metabolites
Gas chromatography/Mass-spectrometry (GC/MS)
Liquid Chromatography/Mass-Spectrometry (LC/MS)
Nuclear Magnetic Resonance (NMR) Spectroscopy
What info do we get from: DNA, RNA, Protein, Metabolites?
DNA: the ultimate potential of a cell
- What is possible
RNA: the current direction of a cell
- What appears to be happening
Proteins: the functional capabilities of a cell
- What makes it happen
Metabolites: the limiting currency of a cell
- What is happening
Metabolic Profiling Methods: Gas Chromatography/Mass-Spectrometry (GC/MS)
In GC/MS, it may be necessary to first derivatize the sample to increase metabolite stability and volatility. The derivatized mix is then fractionated by a gas chromatograph that is coupled to a mass spectrometer.
The mass spectrometer scans the peaks emerging from the GC column at frequent intervals (~1 sec) and so acquires the mass spectrum of each peak, from which peaks can be identified and quantified. Mass spectrometry ‘weighs’ ionized individual molecules and their fragments. Molecules are identified from their fragmentation pattern and ‘weights’ (mass/ charge ratios – m/z values), with the help of mass spectra libraries, and can be quantified from peak size.
Metabolic Profiling Methods: Liquid Chromatography/Mass-Spectrometry (LC/MS)
In LC/MS (also termed high performance liquid chromatography, HPLC/MS) the samples are not derivatized before analysis and an HPLC instrument is used for separation. LC/MS is more suitable than GC/MS for labile compounds, for those that are hard to derivatize, or hard to render volatile. LC/MS is less developed than GC/MS. A closely related method is capillary electrophoresis (CE)/ MS.
Metabolic Profiling Methods: Nuclear Magnetic Resonance (NMR) Spectroscopy
Advantages of NMR over MS: - NMR does not destroy the sample - NMR can detect and quantify metabolite because the signal intensity is only determined by the molar concentration - NMR can provide comprehensive structural information, including stereochemistry Many atoms have nuclei that are NMR active, but most NMR data are collected for 1H and 13C since these are present in all organic molecules.
The main weakness of NMR is low sensitivity relative to MS. It is therefore less suited for analysis of trace compounds. As the natural abundance of 13C is only 1.1%, 13C-NMR is less sensitive than 1H-NMR. Recent developments have considerably increased sensitivity, making it less of a problem
Restriction Fragment Length Sizes(predicted)
If 25% A, 25% T, 25% G, 25% C and Random Distribution of Nucleotides (probability of given base is 0.25), then Distance between cut sites is equal to 4^n bases (n = #bp in recognition site)
n = 4 --> 256 bp n = 6 --> 4096 bp n = 8 --> 65.5k bp
Clone Screening: Hybridization, Immunological
Hybridization:
Make specific DNA probe
Hybridization screen (Southerns with colony lifts).
- Denature/anneal
Immunological:
Expression based screening
Requires mono-specific Antibody