Next generation sequencing Flashcards
State the basic steps involved in sangar sequencing
- DNA templates go through enzymatic reaction
- Products of enzymatic reaction separated by size via gel or capillary electrophoresis
- Detection of fluorescently labelled ddNTPs at end of each elongated strand sequence
- Re-contruction of original template sequence using fluorescently labelled ddNTPs
Explain the process of the enzymatic reaction involved in sangar sequencing
- Clonal population of DNA templates are placed in a reaction mixture with an oligonucleotide primer, DNA-dependent DNA polymerase and deoxyribonucleoside triphosphates (dNTPs) and fluoresecently labelled dideoxyribonucleoside triphosphates, ddNTPs.
- Reaction mixture heated up to dentaure the DNA templates to make them single stranded
- Oligonucleotide primer binds to the single stranded DNA primers to form a partial duplex
- DNA-dependent DNA polymerase binds to partial duplex to form initiation complex
- DNA-dependent DNA polymerase then begins to synthesise new complementary strand from the position of the primer by incorporating dNTPs into new strand
- Eventually DNA-dependent DNA polymerase will incorporate one of the fluorescently-labelled ddNTPs at which point it won’t be able to add any other nucleotides into the newly sythesised strand
- Elongation of other strands continues until a ddNTP has been incorporated into every newly sythesised strand thus stopping the reaction.
Explain the processes that occur after the enzymatic reaction in Sangar sequencing
- After enzymatic reaction you separate the products of the reaction by size using capillary/gel electrophoresis
- Each fragment that is produced represents a newly sythesised strand with a terminal fluorescently-labelled ddNTP at a different position so each fragement is scanned to identify teh colour fluorescence it gives off
- Once you identify the colour fluorescence you can identify which ddNTP was incorporated at that particular position and use that information to re-contruct the DNA template sequence
Name the 4 steps involved in next generation sequencing
- DNA library Construction
- Cluster Generation
- Sequencing-by-synthesis
- Data analysis
What is a DNA library?
A collection of random DNA fragments of a specific sample to be used for further study, e.g. next generation sequencing
Explain the process of DNA library construction
- Obtain DNA sample, usually from a patients blood, and then fragment that DNA into small pieces in a process called shearing
- Shearing of the DNA produces single stranded overhangs at the end of the DNA fragments which are repaired in a process called end-repair
- You then add Adenine nucleotides to opposite ends of each strand of the DNA fragment, to produce adenine overhangs, in a process called A-tailing
- These Adenine overhangs then allow for adapters with thymine overhangs to ligate to the both strands of the DNA fragments
What structural elements do the adaptors have that allow the DNA libraty fragemnts to be sequenced using next generation sequencing?
- P5 and P7 anchors on the end to allow for attachment of library fragments to the flow cell
- Sequencing primer binding sites which allow for the hybridisation of primers to the DNA library fragment
Explain the process of cluster generation
- Involves Hybridisation of the DNA library fragments to the flow cell
- On the bottom of the flow cell there are oligonucleotides which match one of the 2 anchor sequences on the end of the adapator sequences attached to the DNA fragments either P5 or P7.
- These oligonucleotides bind to the complementary adaptor sequence on the DNA fragments causing the DNA fragments to become immobilised on the flow cell.
Why do the DNA fragments need to be amplified when hybridising to the flow cell?
DNA fragemnts need to be amplified because without amplification they are too small to be seen/measured on the flow cell
What is the name of the process used to amplify the DNA fragements on the flow cell?
Bridge amplification
How does bridge amplification lead to the amplification of the DNA fragements on the flow cell?
- Once the DNA fragments are attached, the fragment bends over and the adaptor on top of the fragment attaches to its complementary oligonucleotide, on the flow cell, to form a bridge-like shape.
- A primer and a DNA polymerase bind to the fragment and the polymerase produces a strand complementary to the DNA fragment (reverse strand)
- Once the reverse strand is produced you now have a double stranded DNA strand. This double stranded DNA strand is denatured so now you have the forward strand (original strand) and the reverse strand (complementary strand
- The 2 strands then separately attach to their complementary oligonucleotides on the flow cell
- This process happens across the flow cell creating clusters of amplified DNA fragments
Explain the process of sequencing-by-synthesis
- Before this process begins, all of the reverse strands (complementary strands) are washed away leaving only the forward strands on the flow cell
- Primers bind to the forward strand and a DNA polymerase adds a single fluorescently-labelled deoxyribonucleoside triphosphate (dNTP) which extends the sequence by one nucleotide from the primer
- Once the dNTPs have been added to the forward strand the flow cell is washed
- The fluorescence emitted from every single position on the flow cell is scanned by the machine so it is able to identify which dNTP has been added to each newly synthesised sequence
- The flow cell is then washed with an enzyme that cleaves the terminator chemical group allowing for the cycle to be repeated
- This cycle is then repeated until every single sequence on the flow cell has been scanned entirely
What data is produced from the next generation sequence machine about each sequence that it “re-constructs?”
- ID number which shows the position of each cluster on the flow cell - normally a coordinate
- DNA sequence itself
- Whether strand is positive or negative
- Thread quality score - how confident the machine is that it has chosen the correct base call
How can Next generation sequencing data be analysed?
- Short read sequences from the sequencing machine are re-assembled to form the entirety of the original DNA sequence that was extracted from the patient
- Location of Short read sequences within the reference genome can also be identified
- Short read sequences can also be used to generate a consensus sequence, a sequence that tells you the nucleotide most likely to be present at every position within the sequence.
- This consensus sequence can then be compared with the reference genome and any positions that are different between the 2 show the presence of a genetic variant within the orginal DNA sequence from the patient
What are some of the applications of Next generation sequencing?
- Whole genome sequencing
- Whole exome sequencing
- RNA-seq