Sequencing Flashcards

1
Q

what did Sanger first sequence and won Nobel prize for?

A

1958 AA structure of insulin

1980 developed first DNA sequencing technology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

describe Sanger sequencing

A

uses DNA polymerase and a template strand with a primer (DNA pol needs primer).
DNA pol can attach 5’ hydroxyl of a new NT to 3’ of primer and bases.
- use mixtures including all nucleoSides and also one type of ddNTP (didepoxynucleoside triphosphate). this stops the chain from extending, because it has no OH, just H.
- ddNTPs get randomly incorporated, forming a mixture of fragments of different lengths, each ending in ddNTP. this is done for each ddNTS (4 mixtures).
- the correct proportions of ddNTPs must be added cover just enough of corresponding base. too much - all fragments are short. too little - doesn’t capture every complementary NT.
- polyacrylamide gel electrophoresis - separates oligonucteltides. originally used radioactive label on ddNTPs
Fragments are nested so fragments overlap and can determine order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the Maxam Gilbert sequencing method?

A
  • developed near time of Sanger sequencing
  • ssDNA labelled with radioactivity at 5’ end (rather than NT as in Sanger)
  • base specific reagents cleave DNA.
    Dimethyl sulfate (+NaOH) - A and G (+HCl, just A)
    Hydrazine and Piperidine (T and C)
    Hydrazine, piperidine and NaCl (C)
  • separated on gel, read sequence straight from gel.
  • doesn’t use primed DNA synthesis so is limited to sequences near restriction sites.
    -toxic reagents.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what was a major advancement in Sanger sequencing?

A
  • using fl dyes rather than radioactivity

- can attach different dye to each ddNTP and do one reaction mixture instead of 4. Run on same gel lane.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what allowed automation of DNA sequencing?

A

Capillary systems replace flat gels.
1986
can have 96 capillaries at once.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how long a sequence can Sanger reaction produce?

A

1000bp

automated machines can produce 1Mb per day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is a shred score?

A

Sequence quality/ accuracy
q
q=20 means probability of <1% error per base.
Now, usually q range is 80 - 90 (1 base in 100000000 wrong)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a common unit of sequencing cost?

A

USD per 1,000 bases determined to q=20 accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when did price of sequencing drop?

A

2008 - invention of next gen sequencing technology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

work flow of illumination sequencing

A
  1. library generation
  2. library fragments bind to complimentary oligos in nanowells on flow cell.
  3. bridge amplification - amplifies a single molecule into a cluster of replicated fragments. 1 cluster per well.
  4. Cycles exposing clusters to nucleoside triphosphate mixtures. polymerase adds a NT each time. only one NT added each stem due to blocking group in reagents.
    Laser pulses in each cycle excite newest added NT which has fl tag.
  5. distribution of colors in the image shows which base added to cluster.
  6. remove fl tags and blocking groups after imaging. cycle repeats.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is involved in DNA library preparation prior to illuminating sequencing?

A
  • fragment DNA (300-800bp)
  • ligate adapter sequence to both ends.
    adapter = a sequencing primer binding site, an index, and a flow cell attachment site.
    adaptors on each end of fragment are different.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is one of the most popular set machines today?

A

Illumina HiSeq 4000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what amount of DNA sequences can illumine flow cell produce?

A

720Gb in 48-60 hours.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the primary error in SBS?

A

Phase error:
in a cluster, 1000 identical sequences. all produce same signal together in each cycle.
possible for some to get out of sync, eg if 2 bases incorporated instead of 1(incomplete blocking).
more cycles, more will get out of phase. eventually unphased signal will swamp in phase signal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

examples of NGS technologies

A

Illumina - very versatile, only one to survive
Roche 454 - pyrosequencing
ABI SOLiD - seq by oligoNT ligation and detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

2 new third gen seq technologies

A

SMRT - Single molecule real time (PacBio)

nanopore seq

17
Q

How does PacBio SMRT work?

A

uses single base extension of a primer
each sample is a single molecule, not fragmented.
each molecule in a well 70nm diameter, 100nm deep.
wells flooded with NTPs with fl label attached to phosphate.
incorporation of bases gives fl signal.
lack of amplification = longer reads 10kb.

18
Q

How does PacBio SMRT work?

A

uses single base extension of a primer
each sample is a single molecule, not fragmented.
each molecule in a well 70nm diameter, 100nm deep.
wells flooded with NTPs with fl label attached to phosphate.
incorporation of bases gives fl signal.
lack of amplification = longer reads 10kb.
reduced signal to noise ratio, solving problem of phase error.
v sophisticated optical system needed.

19
Q

what is the index in illumina?

A

8bp sequence
for identification of the sample
also gets sequenced and can distinguish samples in the same lane.
must know which index gives which sample. 4^8 possible sequences.

20
Q

how does Oxford nano pore work?

A

channel inserted into artificial membrane
electric field sends current through pore
passage of organic molecules through pore perturb or block the current.
ssDNA fed through pore and detect change in current = base seq.

21
Q

Advantages of Oxford nanopore.

A
  • can process long single strands, with read lengths of hundreds of kilobases
  • direct readout of the sequence, with no need for library construction, amplification, or labelling.
  • independent of a polymerase.
22
Q

2 variations of ox nanopore

A

Oxford nanopore NinIon - tiny machine, USB.

PromethIon - holds 48 flow cells, can generate 12 Tbp of seq in 48 hours

23
Q

how can high error rate be corrected for?

A

increasing coverage
however, non random errors cannot be corrected for.
costs increase with coverage.

24
Q

Which sequencing method has highest error rate?

A

Ox nanopore 38% compared to sanger 0.001% and illuminate 0.1%

25
Q

which seq method has highest no reads per run?

A

Illumina HiSeq high output - 8x10^9

26
Q

Which is the fasted seq technology?

A

PacBio - 0.5-4h

Sanger - 0.5-3h

27
Q

Cheapest?

A

Illumina 0.3$per million bases

most exp = Sanger - $500 per million bases