Sequencing Genomes Flashcards

1
Q

Whole genome shotgun approach steps

A
  1. Extract DNA from human cells
  2. Cut DNA into small, overlapping fragments (contigs) using restriction enzymes (reaction is performed at suboptimal conditions- enzymes don’t cut at every site)
  3. Clone contigs to make genomic library
  4. Sequence each clone using Sanger sequencing
  5. Use computers to reassemble sequences of contigs by puzzling together overlapping sequences
  6. Deposit sequences into database, NCBI GenBank
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Other names for Sanger sequencing

A

Dideoxy sequencing or chain terminating sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Designing a primer for an unknown sequence

A

Create universal primer that anneals to MCS of cloning vector just upstream of unknown sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Dideoxynucleotide (ddNTP)

A

Nucleotide that is lacking -OH group on 3’ carbon of pentose sugar
When polymerase adds this, sequencing reaction ends (can’t add nucleotide to it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sanger sequencing steps

A
  1. Cloning vector containing DNA of interest is amplified by PCR
  2. When polymerase randomly adds ddNTP with attached fluorescent probe, sequencing reaction stops
  3. Different sized segments of DNA are created
  4. Segments are separated by size using capillary gel electrophoresis (electrophoresis in a capillary tube)
  5. Detector identifies ddNTPs based on their unique fluorescence
  6. Sequence is recreated by reading ascending order of ddNTPs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

of base pairs in human genome

A

3.2 billion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

of protein-coding genes in human genome

A

20,000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

% of genome that is genes

A

2%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

% of sequence that is common amongst people

A

99.9%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Pseudogene

A

Gene that has lost protein coding ability through mutation

Hard to tell this apart from gene and dubious ORF

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Dubious ORF

A

Gene that hasn’t necessarily been mutated, but still produces no product
Hard to tell this apart from gene and pseudogene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Annotation

A

Identification and description of genes and their important sequences
Can be done using computer algorithms to predict open reading frames: look for promoters, start sites, and polyadenylation sites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Next generation sequencing/pyrosequencing steps

A
  1. Extract DNA
  2. Cut into contigs
  3. Affix DNA to solid support (bead, chip, etc.)
  4. One by one, wash dNTPs across the DNA
  5. If known dNTP is incorporated by polymerase, then light is emitted
  6. Reassemble overlapping sequences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

DNA marker

A

Specific region of DNA that varies among individuals

Used to create a detailed map of the individual’s genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Haplotype

A

Set of SNPs that are close together on a chromosome

SNPs are inherited together: low recombination rates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Haplogroup

A

Group of individuals that share a common ancestor

Individuals have similar haplotypes

17
Q

Tag SNP

A

SNP used to represent an entire haplotype

18
Q

SNP chips or arrays

A

Method that allows for analysis of thousands of SNPs at once

19
Q

Haplotype map (hapmap)

A

Collection of all combinations of haplotypes present in a population
Used to study inheritance of complex traits and evolutionary relatedness