SEQUENCING TECHNOLOGIES AND APPLICATIONS Flashcards
SANGER SEQUENCING?
Sanger sequencing is a method of DNA sequencing that involves electrophoresis and is based on the random incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication. After first being developed by Frederick Sanger and colleagues in 1977, it became the most widely used sequencing method for approximately 40 years. More recently, higher volume Sanger sequencing has been replaced by next generation sequencing methods, especially for large-scale, automated genome analyses. However, the Sanger method remains in wide use for smaller-scale projects and for validation of deep sequencing results. It still has the advantage over short-read sequencing technologies (like Illumina) in that it can produce DNA sequence reads of >500 nucleotides and maintains a very low error rate with accuracies around 99.99%. Sanger sequencing is still actively being used in efforts for public health initiatives such as sequencing the spike protein from SARS-CoV-2 as well as for the surveillance of norovirus outbreaks through the Center for Disease Control and Prevention’s (CDC).
Sanger sequencing, also known as the “chain termination method,” was developed by the English biochemist Frederick Sanger and his colleagues in 1977. This method is designed for determining the sequence of nucleotide bases in a piece of DNA (commonly less than 1,000 bp in length). Sanger sequencing with 99.99% base accuracy is considered the “gold standard” for validating DNA sequences, including those already sequenced through next-generation sequencing (NGS). Sanger sequencing was used in the Human Genome Project to determine the sequences of relatively small fragments of human DNA (900 bp or less). These fragments were used to assemble larger DNA fragments and, eventually, entire chromosomes.
WHEN WAS PCR (TECHNIQUE FOR AMPLIFYING DNA) DEVELOPED?
1983
THE HUMAN GENOME PROJECT; KEY DATES AND FIGURES
1984 - PLANNING STARTED
1990 - PROJECT STARTED!!!!
2001 - 1ST DRAFT PUBLISHED
2003 - DECLARED COMPLETED!!!!
- ONE PERSON’S GENOME ONLY
- USED SANGER SEQUENCING
- HUMAN GENOME DETERMINED TO HAVE CCA 3 BILLION NUCLEOTIDE PAIRS
- $2.7 BILLION INVESTED (PRICE = CCA $1 PER BASE)
NUCLEOTIDES IN DNA ARE PAIR/HELD TOGETHER BY?
HYDROGEN BONDS
WHICH ENZYME IS RESPONSIBLE FOR DNA BASE PAIRING?
DNA POLYMERASE
DNA CAN ONLY GROW IN WHICH DIRECTION?
5’ TO 3’ DIRECTION (meaning that nucleotides are added by the DNA polymerase only to the 3’ end of the growing strand)
WHAT DO DNA NUCEOTIDES CONTAIN?
- 5 CARBON SUGAR DEOXYRIBOSE
- A NUCLEOBASE (C,G,A,T)
- A PHOSPHATE GROUP
WHAT TYPE OF BOND MAKES UP THE BACKBONE OF DNA?
PHOSPHODIESTER BOND
HOW DOES SANGER SEQUENCING WORK?
The Sanger sequencing method consists of 6 steps:
(1) The double-stranded DNA (dsDNA) is denatured into two single-stranded DNA (ssDNA)
(2) A primer that corresponds to one end of the sequence is attached (a DNA primer complementary to the template DNA (the DNA to be sequenced) is used to be a starting point for DNA synthesis)
(3) Four types of dNTPs (deoxynucleotide triphosphates (dNTPs: A, G, C, and T), the polymerase extends the primer by adding the complementary dNTP to the template DNA strand) and ddNTPs are added (four dideoxynucleotide triphosphates (ddNTPs: ddATP, ddGTP, ddCTP, and ddTTP) labeled with a distinct fluorescent dye are used to terminate the synthesis reaction) —-> The ratio of dNTP:ddNTP varies from around 10:1 to 300:1, depending on desired read length, buffer conditions, the polymerase used, and the electrophoresis conditions.
(4) The DNA synthesis reaction initiates and the chain extends until a termination nucleotide is randomly incorporated (Compared to dNTPs, ddNTPs have an oxygen atom removed from the ribonucleotide, hence cannot form a link with the next nucleotide)
(5) The resulting DNA fragments are denatured into ssDNA.
(6) The denatured fragments are separated by gel electrophoresis and the sequence is determined. (the sizes of molecules are different because each one had a different point in which DNA polymerase randomly incorporated ddNTP instead of dNPT)
In short: DNA is unwrapped and a primer is put on strand which we want to sequence. DNA polymerase starts DNA synthesis in a solution where the dNTPs are available (i.e. the typical bases: A,T,G,C) but also one type of ddNTPs, which, when incorporated into DNA, cause termination of synthesis. This produces DNA molecules of different length, based on when DNA polymerase randomly takes up ddNTP. Sequence of each fragment can then be determined.
Mechanism in history: Sanger sequencing was first done in 4 separate reactions using radioactivity (primers labelled with 32P); each reaction contained a different ddNTP at 1:100 ratio with dNTPs. After completion each reaction was resolved using SDS-PAGE (sodium dodecyl sulphate–polyacrylamide gel electrophoresis) to separate the chain termination products by size. The DNA was then transferred onto a membrane and the chain termination products visualised using X-ray film. The DNA sequence was determined manually by writing down the letters. Dangerous and long technique (2-3 weeks to sequence a couple of DNA molecules that were just 100-150 nucleotides long)
Mechanism now: Uses ddNTPs that are fluorescent in 4 different colours, meaning each reaction gives info on each base. Reactions are run on capillary gels (running 384 samples at the time), and read by lasers and detectors as the sample passes a detection window. Each reaction can read the sequence of a 1000 bp DNA molecule. This increases speed, accuracy and throughput.
OTHER NAMES FOR SANGER SEQUENCING?
‘CHAIN-TERMINATION’ OR ‘DIEDOXY’ METHOD
WHICH dideoxynucleotide triphosphates (ddNTPs) ARE USED IN SANGER SEQUENCING AND WHAT ARE THEY?
DdNTP refers to Dideoxynucleotides triphosphates which are used in Sanger dideoxy method to produce different lengths of DNA strands for DNA sequencing. DdNTP includes ddATP, ddTTP, ddCTP and ddGTP. DdNTP are useful in the analysis of DNA’s structure as it stops the polymerisation of a DNA strand during a DNA replication, producing different lengths of DNA strands replicated from a template strand. These newly synthesised DNA strands are used later in gel-electrophoresis to generate a series of bands pattern which are useful to analyse the sequence of the DNA strand.
DdNTP differs from dNTP by the lack of 3’-OH group on the pentose sugar structure. A hydrogen group was found on the position 3’ instead of OH-group. This results in the termination of DNA polymerisation(or DNA elongation) process because this process needs a 3’-OH group to continue.
DdNTP are often dyed(labelled with a certain fluorescent) to ease the analysis of the DNA sequence. Dyed DdNTP will fluoresces at different wavelengths which can be detected by modern technologies. A diagram of different wavelength analysed will be generated with base labelled.
MODERN SANGER SEQUENCING RUNS ON CAPPILARY GELS. HOW MANY SAMPLES CAN BE RUNNING SIMULTANEOUSLY?
384
2021: PRICE FOR SEQUENCING HUMAN GENOME?
MOORE’S LAW IN CONTEXT OF GENOME SEQUENCING?
Although Moore’s Law mainly applies to computing hardware, predicting a doubling of computing power every two years, DNA sequencing cost has followed a similar pattern for many years, approximately halving each two years. However since January 2008 there has been a break in that trend, with sequencing costs rapidly declining after that date. This applies to both the cost per megabase of DNA sequence and the total cost per genome.
WHEN WAS THE FIRST ‘HIGH THROUGHPUT SEQUENCING’ (HTS) INSTRUMENT RELEASED AND WHAT WAS IT?
2005, ROCHE 454
WHEN WAS THE FIRST PERSONAL HTS (HIGH THROUGHPUT SEQUENCING) INSTRUMENT RELEASED? WHAT WAS IT?
2010, MiSeq by company Illumina
DIFFERENCE BETWEEN SHORT READ AND LONG READ SEQUENCERS? WHAT DOES ‘READ’ REFER TO?
READ = THE LENGTH OF THE DNA MOLECULE BEING SEQUENCED
SHORT: LESS THAM 1000 BASES
LONG: GREATER THAN 1000 BASES
2022 SEQUENCING LANDSCAPE; MAJOR COMPANIES INVOLVED?
- ILLUMINA
- PACBIO
- ION TORRENT
- NANOPORE