Sequencing Genomes Flashcards
Whole genome shotgun approach steps
- Extract DNA from human cells
- Cut DNA into small, overlapping fragments (contigs) using restriction enzymes (reaction is performed at suboptimal conditions- enzymes don’t cut at every site)
- Clone contigs to make genomic library
- Sequence each clone using Sanger sequencing
- Use computers to reassemble sequences of contigs by puzzling together overlapping sequences
- Deposit sequences into database, NCBI GenBank
Other names for Sanger sequencing
Dideoxy sequencing or chain terminating sequencing
Designing a primer for an unknown sequence
Create universal primer that anneals to MCS of cloning vector just upstream of unknown sequence
Dideoxynucleotide (ddNTP)
Nucleotide that is lacking -OH group on 3’ carbon of pentose sugar
When polymerase adds this, sequencing reaction ends (can’t add nucleotide to it)
Sanger sequencing steps
- Cloning vector containing DNA of interest is amplified by PCR
- When polymerase randomly adds ddNTP with attached fluorescent probe, sequencing reaction stops
- Different sized segments of DNA are created
- Segments are separated by size using capillary gel electrophoresis (electrophoresis in a capillary tube)
- Detector identifies ddNTPs based on their unique fluorescence
- Sequence is recreated by reading ascending order of ddNTPs
of base pairs in human genome
3.2 billion
of protein-coding genes in human genome
20,000
% of genome that is genes
2%
% of sequence that is common amongst people
99.9%
Pseudogene
Gene that has lost protein coding ability through mutation
Hard to tell this apart from gene and dubious ORF
Dubious ORF
Gene that hasn’t necessarily been mutated, but still produces no product
Hard to tell this apart from gene and pseudogene
Annotation
Identification and description of genes and their important sequences
Can be done using computer algorithms to predict open reading frames: look for promoters, start sites, and polyadenylation sites
Next generation sequencing/pyrosequencing steps
- Extract DNA
- Cut into contigs
- Affix DNA to solid support (bead, chip, etc.)
- One by one, wash dNTPs across the DNA
- If known dNTP is incorporated by polymerase, then light is emitted
- Reassemble overlapping sequences
DNA marker
Specific region of DNA that varies among individuals
Used to create a detailed map of the individual’s genome
Haplotype
Set of SNPs that are close together on a chromosome
SNPs are inherited together: low recombination rates