Exam 1 Homework and Quizzes Flashcards

Question 1

Q

The human genome project was launched in ____, produced the first draft assembly in _____, and was finished in _____

Answer

A

1990
2001
2003

Question 2

Q

Abbreviations in order from smallest to largest

Answer

A

bp Kb Mb Gb Tb Pb

Question 3

Q

approximately how many bases are in a typical diploid mammalian genome

Answer

A

6 million

Question 4

Q

approximately how many bases are in a typical mammalian mitochondrial genome

Answer

A

16,000 bp

Question 5

Q

Approximately what proportion of a mammalian genome codes for proteins

Question 6

Q

Approximately 50% of a mammalian genome is comprised of what type of DNA element

Answer

A

repetitive DNA

Question 7

Q

The polymerase chain reaction has four key ‘ingredients’ necessary to replicate DNA in a tube

Answer

A

DNA template
DNA polymerase
Nucleotides
Primers

Question 8

Q

What are the three stages of the polymerase chain reaction

Answer

A

denaturing
annealing
extending

Question 9

Q

Sanger sequencing differs from PCR in one key element, what is that key element

Answer

A

sanger uses dideoxynucleotides along with the deoxynucleotides

Question 10

Q

Illumina sequencing requires that the library contain fragments in a certain size range. what size range are typical of whole genome sequencing libraries

Answer

A

300-350 bp

Question 11

Q

PacBio and Oxford Nanopore sequencing are very different but share one characteristic in common, and this characteristic differentiates them from Illumina technology. What is that characteristic?

Answer

A

they need a long DNA template strand to start sequencing

Question 12

Q

Five domains of genome research

Answer

A

understanding the structure of genomes
-understanding the biology of genomes
-understanding the biology of disease
-advancing the science of medicine
-improving the effectiveness of health care

Question 13

Q

What is the largest current bottleneck in genomics

Answer

A

analyzing the stream of data from technological advances

Question 14

Q

In the Illumina process, the nucleotides are very specialized. they have two key attributes, what are they

Answer

A

flour specific for the identity of the nucleotides
3’ hydroxyl group is blocked with a chemical blocker so next step can be accurately detected

Question 15

Q

____ of reads to the reference sequence is the first step to identify variation of all types

Answer

A

alignment

Question 16

Q

Long read sequencers such as the PacBio instrument are a departure from short read sequencings such as Illumina. What is the first major requirement for these long read technologies that is different from short read technologies

Answer

Study These Flashcards

A

high molecular weight genomic DNA
must be sufficient quality to allow for >30Kb shearing to produce PacBio continuous reads

Question 17

Q

A typical workflow of whole exome sequencing analysis consists of the following steps

Answer

Study These Flashcards

A

-raw data QC
-Pre-processing
-mapping
-post-alignment processing
-variant calling
-annotation
-prioritization

Question 18

Q

standard preprocessing procedure includes

Answer

Study These Flashcards

A

-3’ end adapter removal
-trimming of low quality bases at the ends of the reads

Question 19

Q

many different tools have been developed for short reads mapping. In general, they use two algorithms for aligning sequences

Answer

Study These Flashcards

A

-Burrows-Wheeler transformation- compression technique
-smith-waterman- dynamic programming algorithm

Question 20

Q

Of the sequence aligners they evaluated which two were the fastes

Answer

Study These Flashcards

A

Bowtie 2
BWA

Question 21

Q

After mapping reads to the reference genome, a three-step post-alignment processing procedure is recommended to minimize the artifacts that may affect the quality of downstream variant calling. It consists of

Answer

Study These Flashcards

A

-read duplicate removal
-indel realignment
-base quality score recalibration (BQSR)

Question 22

Q

Variant analysis consists of

Answer

Study These Flashcards

A

genotyping
variant calling
annotation
prioritization

Exam 1 Homework and Quizzes Flashcards

(22 cards)