1. Sequencing intro Flashcards

1
Q

What is the biggest currently running genome project in UK?

A

Darwin Tree of Life (part of Earth BioGenome Project) - aim to seqeunce all living species

  • so far 500 species - mostly insects because smaller genomes - easier to sequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the uses of DNA sequencing?

A

Why sequence DNA:
- detect new species
- genotype individuals
- identify the presence of organisms (ex take air / water samples - detect organisms from the found DNA)
- determine epigenetic patterns - gene expression regulation
- determine gene expression patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the classification on sequencing based on sequence length? What methods are used for each?

A

Short-read:
- Illumina 150-300 bp both ways

Long-read:
- Sanger sequencing 1000 bp
- Oxford nanopore technology (ONT)
- Pacific Biosciences (PacBio)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain the classical sequencing mechanism

A

Sanger sequencing:
- based on synthesis + base-specific termination
- ssDNA sequence - adding specific primer - must know the sequence for the primer
- adding radioactively / ** fluorescently** labelled bases - termination of synthesis by ddNTPs (ddATP/ddGTP/ddCTP/ddTTP) - **lack 3’ OH **- no further nucleotide can be added
- for random integration 99% dNTPs + 1% ddNTPs of the specific base A/T/C/G - produces different length fragments - figure out position of the ddNTP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is manual and automated Sanger sequencing

A

Manual: radioactively labelled ddNTPs - manually figure out the sequence of bases by travelled fragment distance

Automated: fluorescentlly labelled ddNTPs - use detector to record fluoresence at each fragment - sequencing chromatogram (peaks of each base)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Compare all sequencing technologies based on read lengths and error rates

A
  1. Illumina: short read, low error rate - uses universal adaptor (no primer)
  2. Sanger: medium length reads, low error rate, requires a sequence-specific primer
  3. ONT: very long reads, high error rate + minion portable sequencer, uses universal adaptor (no primer)
  4. PacBio: long reads, low error rate (because of HiFi), uses universal adaptor (no primer)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What measure evaluates error rates in sequencing?

A

Q value - Phred quality score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain Illumina sequencing

A

Illumina NGS:
1) Sample preparation: generating DNA library by sonication (DNA fragmented)
2) Cluster generation: ligation to 2 adaptors - ‘bridge amplification’ (cluster amplification) - when enough bridges - denaturation of one strand => high density clusters
3) Sequencing by synthesis: sequencing using dNTPS (dATP, dGTP, dTTP, dCTP) + reversible 3’ - universal primer annealed - DNA pol - sequening of all sites started at once - imaging records fluorescent colour at each position - after imaging dye cleaved => cycle repeated many times for all bases to be sequenced
4) Data analysis: overlapping reads aligned - data anaysed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How are next generation sequencing (NGS) technologies different to original DNA sequencing methods?

A

Different NGS technologies compared to original methods:
- Sequence DNA directly
- DNA cut into small fragments ~200 bp (ex by sonication)
- DNA fragments immobilised into solid support - DNA molecules physically separated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the physical platform used in Illumina NGS

A

Illumina NGS uses a glass flowcell - short ss oligonucleotides adaptors (P5, P7) bound to surface or nanowells - dense lawn formed for adaptors (ligated to sequences) to bind to their OH end

The bound oligonucletides will act as primers for DNA polymerysation - bound sequence with adaptors acts as a template strand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explain the process of sonication

A

Sonication: using high-frequency sound waves to fragment DNA sequence into smaller pieces

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain cluster generation in Illumina NGS

A

When sonicated DNA added:
1) sonicated DNA fragments with ligated adaptors bind to embedded oligonucleotides
2) density of attached DNA adjusted - single DNA molecule at a separated well
3) Initial extension: DNA pol adds dNTPS to make ds DNA from 3’ end - oligonucleotides P5 and P7 act as primers - sonicated DNA as template strand)
4) Denaturation performed - original sonicated DNA washed off - ss copy left
5) Cluster generation: renaturation conditions created - non-bound adaptor bind to another embedded oligonucleotide - bridge formed - DNA pol - another round of DNA synthesis = bridge amplification
=> at each step two strands separated to act as templates for next strand synthesis

Steps 3) -5) repeated x35 times to create an identical sequence cluster in close proximity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain sequencing part in Illumina NGS

A

Illumina sequencing (sequencing of all DNA fragments at once):
- universal sequencing primer annealed to adapter sequences
- DNA pol uses dNTPs with different fluorescent groups: dATP, dGTP, dTTP, dCTP + 3’ reversible block
- incorporation of fluorescent dNTP + temporarily blocks - detector reads fluorescence at each DNA fragment
- the fluore + block removed - new 3’ OH open for next polymerization step - next fluor and block = repeated in cycles until all fragment recorded (leaves the nt but fluore+block removed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain adapter ligation to sample DNA fragments in Illumina

A

Adapter ligation: adapters ligated at both ends of DNA fragment - different on each end -> on glass flowcell adaptors bind (base pair) to oligonucelotides P5 and P7 - which act as primers for DNA polymerization (ds to the bound ss sequence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the difference between the primers and oligonucleotides bound to glass flowcells in Illumina?

A

Primers: bind to sonicated DNA sequences - allow binding to oligos embedded on glass flowcells

Oligonucleotides (P5 and P7): embedded in glass flowcells (the surface) - after binding act as primers for ds DNA synthesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is the density adjusted for only one DNA molecule to bind to glass flowcell well in Illumina?

A

When sonicated DNA added - density of attached DNA adjusted - single DNA molecule at separated wells => because in the end equal replicates of each sequence needed - ‘bridge amplification’ will amplify each sequence the needed # for fluorescence to be detectable

17
Q

How are cluster prepared for sequencing in Illumina?

A

Cluster preparation for sequencing - before sequencing ds DNA -> ss DNA:
- ddNTPS added to block further binding to oligonucleotides
- at oligo P5 bridge cleaved - one strand removed by denaturation
- a sequencing primer bound

18
Q

What is used to separate different DNA clusters on glass flowcell in Illumina?

A

Distance - identical sequence clusters are separated by distance on the glass flowcell

19
Q

Explain the structure of the glass flowcell in Illumina

A
20
Q

What are the main differences between Sanger and Illumina sequencing?

A
  • Sanger uses blocking terminator ddNTPs while Illumina uses reversible terminator ddNTPs (dye cleaved off after reading - 3’ OH end free for next ddNTP to be added)
  • Sanger needs sequence specific primer - Illumina uses universal adaptors as primers
21
Q

Explain PacBio sequencing method

A

PacBio uses single molecule long read (SMLR) and SEQUEL II/Revio technologies:
- uses fluorescently labelled dNTPs - when dNTP added by fixed polymerase - fluorescence released
- colour of fluorescence read - added base determined -> real-time monitoring of nucleotide incorporation (raw data a movie - not pictures)
- high error rate - fixed with HiFi libraries

22
Q

Explain what are HiFi libraries in PacBio sequencing method

A

HiFi produces many reads of the same fragment - more reliable data generated - higher accuracy for long reads

23
Q

Explain ONT sequencing technology

A

ONT technology:
- uses a synthetic membrane with very small protein pores - only DNA fits no proteins
- When DNA goes through the pore - change in current flow - current deviations can be converted into base data by modelling the current using neural net computing - diff base changes the current uniquely

24
Q

What are the two sequencers offered by ONT?

A
  • minION: portable (in fields) - 500 pores - 500 molecules read
  • promethION: stationary (in labs) - 3000 pores - 3000 molecules read simultaneously