Lecture 3 Flashcards

Question 1

Q

Define “massive parallel sequencing” in NGS:

Answer

A

Massive: several regions at once; parallel: several samples at a time

Question 2

Q

What are the 2 main NGS platforms currently used?

Answer

A

Ion torrent and illumina

Question 3

Q

What is ion torrent also referred to as?

Answer

A

“Label-free sequencing” - fluorescence or spikes of light are not used

Question 4

Q

What does ion torrent measure?

Answer

A

Changes in ph

Question 5

Q

What is ion torrent characterized by?

Answer

A

High accuracy and good coverage

Question 6

Q

How long are the reads sequenced by ion torrent?

Answer

A

About 200 bp

Question 7

Q

How long are the average Sanger sequence reads?

Answer

A

600-800 bp up to 1000

Question 8

Q

What is ilumina characterized by?

Answer

A

Bridge amplification

Question 9

Q

How is illumina visualized?

Answer

A

By the use of fluorescence → each nucleotide is linked to a different fluorophore which emits a unique signal

Question 10

Q

Describe the library structure of an illumina sample:

Answer

A

A DNA insert with “read1” and “read2” ( primers similar to the forward and reverse primers in Sanger) on either side, two indexes on either side that serve as an 8 bp “barcode” exclusive to each sample, and 2 adaptors complimentary to those linked on the flow cell called p5 and p7

Question 11

Q

What is the process called in which all samples collected by illumina sequencing are pulled together to observe associations between the obtained sequence and the sorted samples?

Answer

A

Demultiplexing → each read is associated to a unique sample

Question 12

Q

What is cluster generation?

Answer

A

Amplification of the flow cell

Question 13

Q

What is a quality score?

Answer

A

A prediction of the probability of an error in base calling

Question 14

Q

When measuring Phred quality score, what probability corresponds to high accuracy?

Answer

A

Low probability

Question 15

Q

What is read depth?

Answer

A

The total number of bases sequenced and aligned at a given reference base position

Question 16

Q

What is coverage defined as?

Answer

Study These Flashcards

A

The average number of sequenced bases that align to each base of the reference DNA → ex: a whole genome sequenced at 30x coverage means that each base in that genome analysis was sequenced 30 times on average