Shane 3 (AI) Molecular Diagnostics - Whole Genoms/Whole Exome Sequencing Flashcards

1
Q

What percentage of the human genome is similar across individuals?

A

99.9%

The differences in the human genome are only found in 0.1% of the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the significance of genetic mutations in cancer?

A

Mutations are usually found in many different genes.

An example is the BRCA1 mutation, which often occurs alongside other mutations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the relationship between mutations and cancer development?

A

Multiple mutations occur before someone develops cancer.

This indicates that cancer is often the result of cumulative genetic changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Mendelian disorders.

A

Mutations in one gene.

Examples include Cystic Fibrosis (CF) and Prader-Willi Syndrome (SNRPN gene).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some characteristics of Mendelian disorders?

A

Some remain mysterious with unknown causative genes.

This highlights gaps in current genetic understanding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a method for prenatal diagnosis?

A

Cell-free fetal DNA from the mother.

This method is less invasive compared to traditional techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What was previously used for prenatal diagnosis but is no longer commonly done?

A

Amniocentesis.

It was considered too risky for routine use.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Fill in the blank: Genetic mutations associated with cancer usually occur in _______.

A

[many different genes].

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or False: The majority of human genome differences are found in more than 0.1% of the genome.

A

False.

Differences are only in 0.1% of the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What year did Watson and Crick infer DNA’s structure?

A

1953

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What significant event in 1977 involved nucleotide sequencing?

A

The first nucleotide sequence of the gene encoding yeast alanine tRNA was reported

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Who introduced initial DNA sequencing methods in 1980?

A

Sanger, Maxam, and Gilbert

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What was the first human linkage map based on in 1983?

A

Restriction fragment length polymorphism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What significant discovery was made in 1985?

A

Mullis discovered the PCR technique

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What idea was proposed in 1986?

A

The idea of human genome sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What was cloned in 1990?

A

The first human disease gene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What was accomplished in 1993 regarding homozygosity?

A

The first homozygosity mapping was done

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What was the focus of the first positional cloning done in 1995?

A

A recessive disease gene on the basis of linkage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What was reported in 1996 regarding genome sequencing?

A

The first genome sequence of an organism (Hemophilus influenza)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What major milestone occurred in 2001?

A

The first human chromosome was sequenced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What was the first organism whose genome was sequenced in 2000?

A

Fruit fly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

In what year was the first draft of the human genome sequence published?

A

2001

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What key event occurred in 2003 regarding the human genome?

A

The human genome sequence was completed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What technology emerged in 2005 that transformed sequencing?

A

Massively parallel sequencing platforms (next-generation sequencing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What year saw the introduction of the first NGS instrument?

A

2009

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What was the first application of WES reported in 2010?

A

Proof of principle: disease gene identification by WES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Fill in the blank: The PCR technique was discovered by _______.

A

Mullis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

True or False: The first individual genome based on NGS was published in 2010.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What percentage of Watson’s genome was published in 2008?

A

33%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What major advancement in genetics was made in 1985?

A

PCR technique

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Who received a Nobel Prize for the development of the PCR technique?

A

Mullis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What did Watson’s genome reveal about his health?

A

Predisposed to lots of conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is the primary purpose of Whole Genome Sequencing?

A

To determine the complete DNA sequence of an organism’s genome

This includes identifying all of the organism’s genes and their functions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is a BAC in the context of genome sequencing?

A

Bacterial Artificial Chromosome

BACs are used to clone large fragments of DNA for genomic studies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is the first step in the Hierarchical Shotgun Approach?

A

Fragmenting genomic DNA into large pieces

This initial step allows for the organization and mapping of the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What does the term ‘minimal tiling path’ refer to in genome sequencing?

A

The process of using restriction enzymes to digest DNA into smaller fragments

This approach helps to create a more manageable set of sequences for assembly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Fill in the blank: In the shotgun approach, DNA is fragmented into different sizes like _______.

A

shotgun pellets

This metaphor illustrates the random nature of fragmenting DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is the significance of overlapping reads in genome assembly?

A

They allow for the reconstruction of the original sequence

Overlapping reads are crucial for accurate genome assembly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

True or False: The process of preparing chromosome spreads is simple and quick.

A

False

Preparing chromosome spreads is described as a tedious process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is the role of restriction enzymes in the shotgun sequencing process?

A

To cut DNA into fragments of varying sizes

This facilitates the cloning and sequencing of DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What are shotgun clones made from?

A

BAC clones that have been digested into smaller fragments

Shotgun clones are the actual sequences that are read during sequencing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What was a major reason why the Human Genome Project took approximately 10 years to complete?

A

The tedious process of assembling sequences from overlapping reads

The complexity of genome sequencing contributed to the lengthy timeline.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What does the term ‘reads’ refer to in the context of genome sequencing?

A

Short sequences of nucleotides obtained from shotgun clones

Reads are the data points used to assemble the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is the primary technique used in whole genome sequencing known as the shotgun approach?

A

The shotgun approach involves randomly breaking up DNA into smaller fragments and then sequencing those fragments to reconstruct the entire genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

True or False: The shotgun sequencing method requires the genome to be assembled in a specific order before sequencing.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Fill in the blank: In shotgun sequencing, the DNA is first ______ into smaller fragments.

A

fragmented

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Which of the following is a key advantage of the shotgun sequencing method? A) It is labor-intensive B) It requires prior knowledge of the genome C) It allows for rapid sequencing of large genomes

A

C) It allows for rapid sequencing of large genomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What are the main steps involved in the shotgun sequencing process?

A

The main steps are fragmentation of DNA, sequencing of the fragments, and computational assembly of the sequences.

49
Q

What does BAC stand for in BAC libraries?

A

Bacterial Artificial Chromosome

50
Q

True or False: BAC libraries are used to clone large fragments of DNA for genomic studies.

A

True

51
Q

Fill in the blank: BAC libraries can contain inserts of DNA fragments up to _____ kilobases in size.

A

300

52
Q

How are BAC libraries utilized in whole genome sequencing?

A

BAC libraries are used to organize and sequence large sections of the genome, allowing for more efficient mapping and assembly.

53
Q

Which of the following is a primary advantage of using BAC libraries in genomic research? A) Small insert size B) Stability of cloned DNA C) High mutation rate D) Low cloning efficiency

A

B) Stability of cloned DNA

54
Q

What is a minimal tiling path (MTP) in whole genome sequencing?

A

A minimal tiling path is the shortest sequence of overlapping DNA fragments that covers the entire genome of an organism.

55
Q

True or False: A minimal tiling path includes gaps between DNA fragments.

A

False

56
Q

Fill in the blank: The primary purpose of a minimal tiling path is to ______ the genome efficiently for sequencing.

A

cover

57
Q

Which of the following best describes the utility of a minimal tiling path in genome sequencing? A) Reducing redundancy B) Increasing fragment size C) Enhancing computational complexity D) None of the above

A

A) Reducing redundancy

58
Q

How does a minimal tiling path contribute to genome assembly?

A

It provides a framework for assembling overlapping sequences into a complete genome, ensuring all regions are represented.

59
Q

What is the Celera Genomics Approach?

A

A private method for faster DNA sequencing that skips mapping of clones and goes straight to shotgun clones.

This approach allowed for quicker sequencing compared to public methods.

60
Q

What sequencing method did Celera Genomics utilize?

A

Shotgun sequencing.

This method involves directly sequencing random DNA fragments.

61
Q

What is the main advantage of the Celera Genomics Approach over public methods?

A

It took a lot less time to sequence DNA.

This efficiency was due to the direct sequencing of shotgun clones.

62
Q

What percentage of the genome was Celera Genomics attempting to patent?

A

28%.

This indicates their focus on a specific portion of the genome for commercial purposes.

63
Q

What is the significance of skipping the mapping of clones in the Celera Genomics Approach?

A

It allows for a more rapid sequencing process.

Mapping typically requires significant time and resources.

64
Q

Fill in the blank: The Celera Genomics Approach went straight to _______ clones.

A

shotgun.

Shotgun clones are fragments of genomic DNA that are sequenced directly.

65
Q

What is the starting material for Whole Genome Sequencing?

A

Double-stranded genomic DNA.

66
Q

How are DNA fragments prepared for sequencing?

A

DNA is fragmented using a sonicater, such as Covaris.

67
Q

What is the size range of DNA fragments for sequencing?

A

Small fragments, typically 200-500 nucleotides long.

68
Q

How do you verify the size of DNA fragments?

A

Run the fragments on a gel alongside a size marker.

69
Q

What is added to the DNA fragments to facilitate ligation?

A

A nucleotides (deoxy version of A nucleotide) to create an A overhang.

70
Q

Why is an A overhang important?

A

It allows for easier ligation of an adaptor starting with a T.

71
Q

What can you do if the DNA fragments are not the correct size?

A

Run the fragments on a gel again to check their size.

72
Q

What technique can be used to recover a specific DNA sequence from a gel?

A

Cut the sequence out of the gel using a blade and melt down the agarose.

73
Q

What can be designed using the adaptor sequences?

A

Primers for sequencing the DNA.

74
Q

What are the three types of genetic variation among people?

A
  • Single nucleotide variants (SNVs)
  • Short insertions/deletions (indels)
  • Structural variation (SV)

Each type represents different forms of genetic differences that can occur in the human genome.

75
Q

Define single nucleotide variants (SNVs).

A

Variations at a single nucleotide position in the genome, with approximately 3.7-4.1 million occurring per person.

SNVs can affect gene function, but many do not result in changes to the amino acid sequence.

76
Q

What is the range of short insertions/deletions (indels) in base pairs?

A

1-100 bp

Individuals can have approximately 300-600,000 indels in their genomes.

77
Q

What is structural variation (SV) in genetics?

A

Large blocks (1000 bp or more) of DNA that are deleted, inserted, or inverted, with over 2,500 differences relative to the reference human genome sequence.

SVs can be associated with various genetic conditions.

78
Q

What technology is primarily used for sequencing complete genomes?

A

Massively parallel DNA sequencing, with Illumina being the most efficient.

Other sequencing technologies can also be employed.

79
Q

What percentage of people have structural variations associated with genetic conditions?

A

26%

Structural variations can lead to serious genetic conditions, especially if DNA is deleted.

80
Q

What is the significance of short insertions/deletions in the human genome?

A

They are very common, and a person can have thousands of these variations.

Many occur in non-coding regions, leading to no observable effects.

81
Q

True or False: Most single nucleotide variants change the amino acid that is expressed.

A

False

Many single nucleotide variants do not result in changes to the protein produced.

82
Q

Fill in the blank: A person can have approximately _______ differences in structural variation compared to the reference human genome.

A

2500

This includes large changes such as deletions, insertions, and inversions.

83
Q

What notable aspect is mentioned about Watson’s genome?

A

It contains lots of mutations and was sequenced when he was about 80 years old.

This highlights the accumulation of genetic variations over a lifetime.

84
Q

What was the first complete genome sequenced using massively parallel DNA sequencing?

A

The genome of James D. Watson

Sequenced by Wheeler DA et al. in 2008

85
Q

How long did it take to sequence James D. Watson’s genome?

A

Two months

Using massively parallel sequencing technology

86
Q

What was the redundancy level achieved in sequencing James D. Watson’s genome?

A

7.4-fold redundancy

Indicates the depth of coverage during sequencing

87
Q

How many single nucleotide polymorphisms were identified in Watson’s genome?

A

3.3 million

Includes 10,654 that cause amino acid substitutions

88
Q

What is the range of base pairs for copy number variations identified in Watson’s genome?

A

26,000 to 1.5 million base pairs

Refers to large-scale gain and loss of chromosomal segments

89
Q

What percentage of James D. Watson’s genome variants caused amino acid substitutions?

A

Approximately 0.3%

10,654 out of 3.3 million variants

90
Q

What types of polymorphisms were identified in Watson’s genome?

A

Small-scale insertion and deletion polymorphisms

Ranging from 2 to 40,000 base pairs

91
Q

True or False: James D. Watson was about 80 years old when his genome was sequenced.

A

True

Indicates the age of Watson at the time of sequencing

92
Q

Fill in the blank: The sequencing of Watson’s genome revealed _______ variants.

A

[3.3 million]

Includes a significant number of mutations

93
Q

What is the significance of the amino acid substitutions found in Watson’s genome?

A

They can be very similar or very different, potentially pathologic

The impact of substitutions varies based on the properties of the amino acids

94
Q

What was the primary goal of the 1000 Genomes Project?

A

To sequence the genomes of individuals from populations around the world

The project aimed to provide a comprehensive map of human genetic variation.

95
Q

How many individuals’ genomes were sequenced in the 1000 Genomes Project?

A

1,092 individuals

These individuals were selected from diverse populations globally.

96
Q

What types of genetic variations were mapped in the 1000 Genomes Project?

A

SNPs, short insertions and deletions, larger deletions

Specifically, the project identified 38 million SNPs, 1.4 million short insertions and deletions, and more than 14,000 larger deletions.

97
Q

What is the significance of the data obtained from the 1000 Genomes Project?

A

Used to screen variants in genetic disorders and cancer genome projects

This data helps researchers understand the genetic basis of diseases.

98
Q

What is a SNP?

A

Single Nucleotide Polymorphism

SNPs are the most common type of genetic variation among people.

99
Q

What is considered normal variation in humans according to the 1000 Genomes Project?

A

Genetic variations found in healthy individuals

These variations include common SNPs, insertions, and deletions.

100
Q

What is the current status of the 1000 Genomes Project?

A

Morphed into 10,000 genomes and aims for 100,000 genomes

The project continues to expand its research on genetic variation.

101
Q

Fill in the blank: The 1000 Genomes Project provides a map of _______ variations in the human genome.

A

genetic

This includes SNPs, insertions, deletions, and other variations.

102
Q

True or False: The 1000 Genomes Project only sequenced individuals with genetic disorders.

A

False

The project focused on healthy individuals to establish normal genetic variation.

103
Q

What is the purpose of screening out common SNPs in genetic research?

A

To focus on rarer SNPs

Rarer SNPs may have more significant implications for understanding diseases.

104
Q

Where is the information from the 1000 Genomes Project stored?

A

In libraries like Ensembl

Ensembl is a genome browser that provides access to genomic data.

105
Q

What is a shotgun library in molecular biology?

A

A collection of DNA fragments used for sequencing and mapping

Shotgun libraries are created to facilitate the sequencing of large genomes by breaking them into smaller pieces.

106
Q

What percentage of the human genome is made up of exons?

A

Approximately 1%

Exons constitute about 180,000 exons and 30 million base pairs of the human genome.

107
Q

What percentage of disease-causing mutations are found in protein-coding genes (exons)?

A

85%

This highlights the importance of exome sequencing in identifying genetic diseases.

108
Q

What system is used for hybrid capture of target sequences in exome sequencing?

A

Agilent SureSelect Capture Array

This system utilizes a solution-based approach for capturing specific DNA sequences.

109
Q

What is the purpose of the shearing step in preparing genomic DNA?

A

To produce small DNA fragments

This step is crucial for creating a library suitable for sequencing.

110
Q

What are biotinylated RNA library baits used for?

A

To hybridize with sample DNA for target sequence capture

These baits are designed to be complementary to specific exon sequences.

111
Q

What is the role of streptavidin in the capture process?

A

It binds to biotin to isolate target DNA sequences

Streptavidin has a high affinity for biotin, allowing for effective separation of bound and unbound sequences.

112
Q

How are non-exons removed during the capture process?

A

By washing away unbound DNA after magnetic pull-down

This ensures that only the desired exonic sequences remain for further analysis.

113
Q

Fill in the blank: The process of _______ involves digesting RNA probes to isolate pure DNA.

A

RNase treatment

RNase is an enzyme that specifically digests RNA, leaving behind only DNA.

114
Q

What type of assay is used to pull down target sequences in the capture process?

A

Magnetic pull-down assay

This assay utilizes magnetic beads coated with streptavidin to isolate biotin-labeled DNA.

115
Q

True or False: Most of the human genome is coding DNA.

A

False

Most of the human genome is non-coding; only about 1% consists of exons.

116
Q

What is the primary goal of whole exome sequencing?

A

To identify mutations in protein-coding regions of the genome

This approach focuses on the parts of the genome that are most likely to affect disease.

117
Q

What are cRNA baits?

A

Complementary RNA baits used to capture specific DNA sequences

These baits are critical for targeting exons during exome sequencing.

118
Q

What is the significance of high-temperature conditions during hybridization?

A

To maintain stringency and prevent unwanted binding

High temperature helps ensure that only perfectly matched sequences hybridize.