Genomics : DNA Sequencing Flashcards

1
Q

Why sequence the whole genome?

A
  • Provides complete genetic information of an organism.
  • Detects mutations, SNPs, and structural variations across the genome.
  • Helps identify genetic causes of diseases.
  • Enables tailored treatments based on genetic profiles.
  • Reveals genetic relationships and evolutionary history.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What makes up a nucleotide?

A
  • Phosphate group: Linked to the sugar molecule.
  • Sugar: Deoxyribose in DNA, ribose in RNA.
  • Nitrogenous base: One of four types:
    - Purines: Adenine (A),
    Guanine (G)
    - Pyrimidines: Cytosine
    (C), Thymine (T) in DNA;
    Uracil (U) in RNA.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What bond is formed when 2 nucleotides join together?

A
  • Phosphodiester bond:
  • Forms between the phosphate group of one nucleotide and the sugar of another.
  • Links the 3’ carbon of one sugar to the 5’ carbon of the next sugar.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the principle of Sanger Sequencing?

A

Sanger sequencing works by using special building blocks of DNA, called dideoxynucleotides (ddNTPs), which act as “stop signals” during DNA synthesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How is the Sanger Library prepped?

A
  1. DNA Extraction: Isolate the DNA of interest from cells or tissue.
  2. Fragmentation: The DNA is fragmented into smaller, manageable pieces using enzymes or mechanical methods.
  3. Adaptor Ligation (if needed): Short, known sequences (adaptors) may be ligated to the DNA ends for primer binding.
  4. Cloning into Vectors: DNA fragments are inserted into plasmid vectors for amplification in bacterial cells.
  5. Transformation: The recombinant plasmids are introduced into competent bacteria.
  6. Amplification and Isolation: Bacterial colonies are grown, and plasmid DNA is extracted to isolate the cloned DNA fragments.
  7. Template Preparation: Purified DNA is used as the template for sequencing.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is mechanical DNA shearing?

A

Mechanical DNA shearing is a method used to fragment DNA into smaller pieces through physical forces. It creates random breaks in DNA, making it useful for preparing DNA libraries for sequencing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the common methods of mechanical shearing?

A
  1. Sonication
    - Ultrasonic waves are used to create vibrations that shear DNA.
  • The length of fragments depends on the duration and intensity of sonication.
  1. G-Tube
    - forcing DNA through a narrow constriction (the G-tube), creating consistent shear forces that fragment the DNA into specific, predictable sizes.
  2. Hydrodynamic Shearing:
    DNA is passed through a constricted tube, and the sudden pressure change causes shearing.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is enzymatic DNA shearing?

A

Enzymatic DNA shearing is a method of DNA fragmentation that uses enzymes like DNase I or endonucleases to cleave DNA into smaller fragments.

The fragment size can be controlled by adjusting reaction time and enzyme concentration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the importance of Sanger Sequencing?

A

Gold standard for accuracy: provides highly accurate, low error rate DNA sequences. Used to validate results from high-throughput NGS

Small-Scale Applications: suitable for sequencing small DNA fragments, used in clinical diagnostics, gene cloning and identifying mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is next-generation sequencing?

A

an advanced sequencing technology for determining the sequence of DNA or RNA to study genetic variation associated with diseases or other biological phenomena.

It uses the concept of massive parallel processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the 454 Library Prep?

A

454 Library Preparation is a method used in 454 sequencing (a type of Next-Generation Sequencing) to prepare DNA fragments for sequencing by attaching adapters, amplifying the DNA, and preparing it for pyrosequencing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain the key steps in 454 Library Prep.

A
  1. DNA Fragmentation:
    - DNA is sheared into
    fragments of 300–800 base
    pairs (bp).
    - Ends are polished by
    removing unpaired bases to
    create blunt ends.
  2. Adapter Ligation:
    - Adapters (named A and B)
    are attached to both ends of
    the fragments.
    - DNA is made single-
    stranded at this stage.
  3. Bead Attachment:
    - One adapter contains biotin,
    which binds to streptavidin-
    coated beads.
    - The ratio of beads to DNA is
    controlled to ensure that
    each bead gets only one
    DNA molecule attached.
  4. Emulsion PCR:
    - Oil is added to the beads,
    creating an emulsion where
    each aqueous droplet serves
    as a micro-reactor.
    - PCR is performed within
    these droplets, amplifying
    the DNA.
    - Each bead ends up coated
    with millions of identical
    copies of the original DNA
    fragment.
  5. Preparation for Sequencing:
    - The DNA-coated beads are used in 454 sequencing for pyrosequencing reactions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain the brief chemistry on 454 sequencing.

A
  1. Nucleotide Incorporation:

When a base (dNTP) is incorporated by DNA polymerase, pyrophosphate (PPi) is released.

  1. Conversion to ATP:

PPi reacts with adenosine 5’-phosphosulfate (APS) in the presence of the enzyme sulfurylase, generating ATP.

  1. Light Signal Production:

Luciferase uses ATP to convert luciferin into oxyluciferin, releasing light as a byproduct.
The amount of light is proportional to the number of bases added (e.g., a homopolymer of “AAA” will produce a stronger signal).

  1. Degradation of Unused Reagents:

The enzyme apyrase degrades excess ATP and unused dNTPs before the next nucleotide cycle begins.

  1. Flowgram Analysis:

The light signals are recorded as peaks on a flowgram, with each peak corresponding to the nucleotide added (A, T, G, or C).
Flowgrams are used to reconstruct the DNA sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

454: why is it not commonly used anymore?

A

454 sequencing has largely been replaced by more advanced next-generation sequencing (NGS) technologies due to several limitations:

cost, error rate, and scalability, combined with the emergence of better alternatives,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explain the steps in enzymatic DNA shearing.

A
  1. Preparation and Input DNA:

Start with a minimal DNA input; often as low as 1 nanogram is sufficient.
This method works for small genomes, amplicons, and plasmids.

  1. Enzymatic Reaction:

DNA is incubated with an enzyme cocktail optimized to cleave at specific points.
The reaction time, temperature, and enzyme concentration are controlled to achieve desired fragment sizes.

  1. Normalisation of DNA:

Innovative sample normalization eliminates the need for manual library quantification.
This step ensures consistent DNA input across samples.

  1. Adapter Ligation (if applicable):

Once sheared, DNA fragments are ligated to adapters that are required for downstream sequencing.

  1. Library Preparation and Amplification:

The prepared DNA fragments are amplified using PCR to generate a sequencing-ready library.

  1. Sequencing:

Prepared DNA is loaded into a sequencing platform (e.g., MiSeq) for analysis.
Rapid Prep: The entire workflow from DNA to data can be completed in less than 8 hours.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is sample pooling?

A

Sample pooling is a method in DNA sequencing where multiple DNA samples are combined into a single sequencing run. Unique identifiers (indices) are added to each sample to differentiate them during analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is indexing/barcoding?

A

Indexing or barcoding is a technique in DNA sequencing where unique DNA sequences (indices) are added to multiple samples, allowing them to be sequenced together (multiplexed) and computationally separated (demultiplexed) later.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the advantages and disadvantages of indexing/barcoding?

A

Advantages:
Reduces Costs: Multiple samples can be processed in a single run, saving on reagents and sequencing costs.
Faster Turnaround Time: Speeds up processing as multiple samples are sequenced simultaneously.

Disadvantages:
Reduced Read Number per Sample: Since all samples share the same sequencing capacity, the number of reads per sample is lower.
Normalization Required: Ensures equal representation of each sample to minimize variation in read numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Illumina - What is cluster generation?

A

Cluster generation is a process in Illumina sequencing where individual DNA fragments are amplified in a flow cell to create clusters of clonal DNA molecules.

These clusters generate enough fluorescent signals during sequencing to be detected accurately.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Steps in Cluster Gen

A
  1. DNA Binding to Flow Cell:

DNA fragments, ligated with P5 and P7 adapters, bind to complementary oligonucleotides on the flow cell surface.

  1. Bridge Amplification:

The DNA fragment bends and hybridises to nearby oligos, creating a “bridge.”
PCR amplifies the DNA, forming double-stranded molecules.

  1. Denaturation and Clonal Amplification:

Double-stranded molecules are denatured into single strands.
The process repeats, creating clusters of identical DNA molecules.

  1. Purpose:

Amplify enough DNA to ensure the fluorescent signal from incorporated nucleotides during sequencing is strong enough for detection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the purpose of P5 and P7 adapters in Illumina sequencing?

A

P5 and P7 adapters allow DNA fragments to bind to the flow cell and serve as primers for PCR amplification during cluster generation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Why is bridge amplification important?

A

Bridge amplification creates clusters of clonal DNA, ensuring that each cluster generates sufficient signal for accurate sequencing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the primary goal of cluster generation?

A

To amplify individual DNA fragments into clusters, producing enough signal for sequencing by synthesis (SBS).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How does Illumina ensure each cluster represents a single DNA fragment?

A

By using precise ratios of DNA and oligonucleotide sites on the flow cell, each DNA molecule is amplified into a distinct cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Illumina - What is the chemistry behind this?

A

Illumina sequencing uses sequencing-by-synthesis (SBS), which involves the incorporation of fluorescently labeled nucleotides to build a DNA strand one base at a time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are the key steps in Illumina chemistry?

A
  1. Dye-Termination Similar to Sanger Sequencing:

DNA synthesis starts with a primer.
Each added nucleotide is fluorescently labeled, emitting a signal when incorporated.

  1. Blocking of the 3’-OH Group:

Each nucleotide has a reversible terminator that blocks the 3’-OH, preventing further base additions.
After imaging, the fluorescent dye and blocking group are removed, allowing the next nucleotide to bind.

  1. Fluorescent Signal Detection:

A camera captures the signal emitted by the fluorescent dye, identifying which nucleotide (A, T, G, or C) was incorporated.
Unlike pyrosequencing, Illumina avoids issues with homopolymer stretches because it processes one base at a time.

  1. Cycle Repetition:

The process is repeated 50–300 times, extending the DNA strand and generating sequence reads.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

How does Illumina differ from pyrosequencing?

A

Illumina uses reversible terminators and avoids issues with homopolymer stretches, whereas pyrosequencing relies on pyrophosphate detection, which can misread homopolymers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the role of reversible terminators in Illumina sequencing?

A

They block further nucleotide incorporation until the dye and blocking group are removed, ensuring one base is added at a time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Why is fluorescence used in Illumina sequencing?

A

Fluorescent signals allow precise detection of which nucleotide is added during each cycle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is the significance of paired-end indexed sequencing

A

Paired-end indexed sequencing is a method in which both ends of a DNA fragment are sequenced, providing more comprehensive information about the DNA sequence and its context. It is particularly useful for discovering genome variation and improving sequencing accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How does paired-end sequencing improve genome coverage?

A

It provides information about both ends of a fragment, reducing gaps and improving coverage in repetitive regions.

32
Q

What types of genome variations can paired-end sequencing detect?

A

Insertions, deletions, structural variations, inversions, and translocations.

33
Q

Why are indices (i5 and i7) used in paired-end sequencing?

A

Indices allow multiple samples to be multiplexed, reducing costs and increasing throughput.

34
Q

What is 3rd gen sequencing?

A

Third-generation sequencing refers to advanced sequencing technologies that allow real-time sequencing of single molecules of DNA or RNA without the need for amplification. These methods are faster and can produce longer reads compared to earlier technologies like Illumina (2nd generation).

35
Q

What are the two sequencing modes in PacBio?

A

LS (Long Sequencing Reads):
- Designed for large insert sizes (20 kb to >100 kb).
Generates one pass on each DNA molecule.

CCS (Circular Consensus Sequencing):
- Designed for small insert sizes (<10 kb).
Generates multiple passes on each DNA molecule to improve accuracy.

36
Q

What is the difference between LS and CCS modes in PacBio sequencing?

A
  • LS (Long Sequencing): Focuses on sequencing long DNA fragments in a single pass.
  • CCS (Circular Consensus): Focuses on generating highly accurate reads by repeatedly sequencing the same smaller DNA fragment.
37
Q

When would you choose CCS over LS in PacBio sequencing?

A

CCS: When high accuracy is critical, such as for small-scale projects or highly sensitive applications.

LS: When long-read data is needed to resolve structural variations or for large-scale de novo genome assemblies.

38
Q

What is the chemistry behind PacBio?

A

PacBio sequencing uses Single-Molecule Real-Time (SMRT) sequencing, which enables real-time observation of DNA synthesis at the molecular level.

39
Q

What are the key feature of Pacbio Chemistry?

A

Triphosphate-Linked Fluorophores:

Unlike Illumina and Sanger methods that attach fluorophores to the nucleotide base, PacBio attaches the fluorophore to the triphosphate group of the nucleotide.
This design minimises steric hindrance, allowing natural and efficient incorporation of nucleotides by DNA polymerase.

Real-Time Sequencing:

DNA polymerase synthesizes DNA in real-time, with fluorescence emitted as each nucleotide is incorporated.
The signal ends when the fluorophore is cleaved off during incorporation.

Zero-Mode Waveguides (ZMWs):

ZMWs are nanostructures that confine light to a very small observation volume.
These allow the detection of fluorescence from single molecules (e.g., the polymerase-nucleotide complex) despite the presence of unincorporated nucleotides in the background.

High Sensitivity:

The use of ZMWs ensures PacBio can detect individual base incorporations with minimal interference.

40
Q

How does PacBio differ from Illumina and Sanger in fluorophore attachment?

A

PacBio attaches fluorophores to the triphosphate group, whereas Illumina and Sanger attach fluorophores to the nucleotide base.

41
Q

What is the role of Zero-Mode Waveguides (ZMWs)?

A

ZMWs confine the light to an extremely small volume, allowing the detection of fluorescence from single nucleotide incorporation events while suppressing background noise.

42
Q

How does the PacBio sequencing work?

A

PacBio sequencing uses Single-Molecule Real-Time (SMRT) technology to directly observe DNA synthesis as it happens in real time.

43
Q

What is the role of Zero-Mode Waveguides (ZMWs) in PacBio sequencing?

A

ZMWs are nanostructures that allow light to focus on a tiny volume, enabling the detection of fluorescence signals from a single DNA polymerase and DNA molecule during sequencing.

44
Q

How does PacBio sequencing achieve real-time monitoring of DNA synthesis?

A

Fluorescently labeled nucleotides emit a signal as they are incorporated by DNA polymerase.

A laser excites the fluorophore, and the emitted light is measured to determine the nucleotide sequence in real time.

45
Q

How does PacBio sequencing detect methylated bases?

A

PacBio detects methylation through changes in polymerase kinetics, such as increased interpulse duration (IPD), the time between successive nucleotide incorporations.

Modified bases like methylated cytosine slow down DNA synthesis, creating detectable kinetic signals.

46
Q

Why is detecting methylation important in sequencing?

A

Methylation affects gene expression and can be associated with diseases like cancer, where it plays a role in malignant cellular transformation.

47
Q

What is the raw read error rate in PacBio sequencing?

A

PacBio sequencing has a raw read error rate of about 90%. However, this is corrected through multiple passes to generate a consensus sequence.

48
Q

PacBio sequencing has a raw read error rate of about 90%. However, this is corrected through multiple passes to generate a consensus sequence.

A

Multiple passes of the same DNA fragment are sequenced, generating subreads that are combined into a consensus sequence. This reduces errors to:

99% accuracy with 4 passes (Q20).
99.9% accuracy with 10 passes (Q30).
99.99% accuracy with 20 passes (Q40).
99.999% accuracy with >25 passes (Q50)

49
Q

What is Oxford Nanopore Sequencing?

A

Oxford Nanopore Sequencing is a third-generation sequencing technology that directly reads the sequence of DNA or RNA by detecting changes in electrical current as molecules pass through a tiny pore called a nanopore.

50
Q

Oxford Nanopore: How does it work?

A

Constant Current Flow:

A constant current is passed through a nanopore embedded in a membrane.
DNA Tethering:

A tethered motor protein binds to a DNA strand and feeds it through the nanopore, unzipping the double-stranded DNA into single strands.
Disruption of Current:

As the DNA moves through the nanopore, each nucleotide (A, T, G, or C) causes a unique disruption in the electrical current.
These disruptions create a distinct electrical signal for each base.
Base Identification:

The unique signal of each base is decoded into a DNA sequence by software.
High Speed:

Sequencing occurs at approximately 400 bases per second.

51
Q

What are the key features of oxford nanopore sequencing?

A

Portable Devices:

Devices like the MinION allow sequencing to be done in real-time and outside traditional laboratory settings.
No Amplification Required:

Directly sequences single molecules of DNA or RNA.
Long Reads:

Can sequence fragments up to hundreds of kilobases, making it ideal for resolving structural variants and sequencing entire chromosomes.
Real-Time Data Generation:

Allows sequencing to start immediately, useful for time-sensitive applications.

52
Q

What is a HiFi read in PacBio sequencing?

A

A HiFi (high-fidelity) read is a highly accurate sequence produced by combining multiple subreads from the same DNA molecule.

53
Q

What are the key steps in Sanger Sequencing?

A
  1. Denaturation: DNA is denatured into single strands.
  2. Primer Annealing: A primer binds to the single-stranded DNA.
  3. Extension: DNA polymerase extends the primer using a mixture of normal deoxynucleotides (dNTPs) and fluorescently labeled ddNTPs, which terminate elongation.
  4. Fragment Separation: DNA fragments of varying lengths are generated, each terminating at a ddNTP.
  5. Detection: Fragments are separated by capillary electrophoresis, and the sequence is determined from the fluorescence signal of the ddNTPs.
54
Q

What is the key concept of Sanger Sequencing?

A

ddNTPs lack the 3’-OH group required for DNA chain elongation, causing termination at specific bases.

55
Q

What are the key steps in enzymatic DNA Shearing?

A

Nick DNA with enzymes (e.g., DNase I):

The enzyme introduces single-strand nicks into the DNA.
Cleavage at ssDNA sites:

Endonucleases recognize and cleave at single-stranded DNA (ssDNA) sites, breaking DNA into fragments.
Fragment Length Control:

By adjusting reaction time and enzyme concentration, you can produce DNA fragments of specific lengths.

56
Q

What are the limitations of Sanger Sequencing?

A
  • Less cost-effective

-Slower for large-scale sequencing projects compared to NGS

-Limited to shorter DNA sequences

57
Q

What is the maximum read length for Sanger sequencing?

A

The maximum read length for Sanger sequencing is up to 700 base pairs (bp) per sequence.

58
Q

How does Sanger sequencing differ from PCR?

A

Sanger Sequencing uses only a single primer and polymerase to synthesize new single-stranded DNA (ssDNA).

It incorporates both regular nucleotides (A, C, G, T) and dideoxynucleotides (ddNTPs), which act as chain terminators.

PCR amplifies DNA, while Sanger generates fragments for sequence determination.

59
Q

What is the role of dideoxynucleotides (ddNTPs) in Sanger sequencing?

A
  1. Terminate DNA chain elongation because they lack the 3’-OH group required for adding the next nucleotide.
  2. Are fluorescently labelled, allowing detection during electrophoresis.
60
Q

What are the differences between unmodified nucleotides and dideoxynucleotides?

A

Unmodified Nucleotides:
- Have a 3’-OH group necessary for chain elongation.
- Used for normal DNA synthesis.

Dideoxynucleotides (ddNTPs):
- Lack the 3’-OH group, causing chain termination.
- Are labeled with different fluorescent dyes for each base (A, T, C, G).

61
Q

How is the DNA sequence determined in Sanger sequencing?

A

The DNA fragments terminated by ddNTPs are separated by capillary electrophoresis.

Fluorescent labels on ddNTPs are detected to determine the sequence of bases in the DNA.

62
Q

Why does Sanger sequencing include both regular nucleotides and ddNTPs?

A

Regular nucleotides (dNTPs) allow for normal DNA elongation, while dideoxynucleotides (ddNTPs) randomly terminate the synthesis at different points. This creates a series of DNA fragments of varying lengths that represent the sequence of the template DNA.

63
Q

What are the chemical differences between purines and pyrimidines?

A

Purines (Adenine, Guanine): Double-ring structures.

Pyrimidines (Thymine, Cytosine): Single-ring structures.

64
Q

What is the typical output of a single Sanger sequencing reaction?

A

A single Sanger sequencing reaction produces a chromatogram with a read length of approximately 600–1000 base pairs (bp).

65
Q

What does a chromatogram represent in Sanger sequenci

A

A chromatogram represents the fluorescent signals from the dideoxynucleotide (ddNTP) chain terminators, indicating the sequence of nucleotides in the DNA.

66
Q

How are bases identified in a Sanger sequencing chromatogram?

A

Each peak in the chromatogram corresponds to a specific nucleotide (A, T, G, or C).

The colour of the peak indicates the nucleotide:
Adenine (A): Green
Thymine (T): Red
Cytosine (C): Blue
Guanine (G): Black or yellow (depending on the software).

67
Q

What is the significance of peak height in a Sanger sequencing chromatogram?

A

Peak height reflects the strength of the fluorescence signal.

Taller peaks indicate higher confidence in base identification.

68
Q

What is the raw read accuracy of Sanger sequencing?

A

Sanger sequencing has a raw read accuracy of 99.99%, corresponding to a Q40 quality score.

69
Q

What are the key limitations of Sanger sequencing?

A

Cost: Sanger sequencing is expensive compared to next-generation sequencing (NGS).

Throughput: It has low throughput, producing single-read outputs rather than parallel processing.

Labor Intensive: Requires significant manual effort for setup and analysis.

Low Sensitivity:
Mutations must be present in more than 30% of cells to be detectable.

This limits its application in detecting rare mutations, such as in cancer samples.

70
Q

NGS Applications?

A

Genome sequencing, transcriptomics, epigenomics, and cancer research.

71
Q

What are the key steps in 454 sequencing?

A
  1. Emulsion PCR Products:

After emulsion PCR, oil is removed, and the DNA-coated beads are placed into wells of a pico-titer plate, where each well is just large enough to hold one bead.

  1. Addition of Pyrosequencing Enzymes:

Smaller beads carrying pyrosequencing enzymes (e.g., polymerase, luciferase) are added to each well.

  1. Sequential Washing with
    Nucleotides:

The plate is sequentially washed with each of the four nucleotides (A, T, G, C) in a controlled cycle.

  1. Light Detection:

When a nucleotide is incorporated into the growing DNA strand, a pyrophosphate (PPi) is released.
PPi triggers a chemical reaction that produces light, detected by a CCD camera.
The intensity of light correlates with the number of nucleotides added (e.g., homopolymers like AAA produce stronger signals).

  1. Data Capture:

The light flashes recorded from each well are converted into DNA sequence data.

72
Q

What is Illumina?

A

Illumina is a next-generation sequencing (NGS) technology that uses sequencing-by-synthesis (SBS) to generate massive amounts of DNA sequence data in a highly accurate and cost-effective manner.

73
Q

What are the key features of Illumina?

A
  1. Sequencing-by-Synthesis (SBS):

Fluorescently labelled nucleotides are incorporated into a growing DNA strand one at a time.
Each base emits a unique fluorescence signal, which is detected to determine the sequence.
After detection, the fluorescent label is cleaved off, allowing the next base to be added.

  1. Bridge Amplification:

DNA fragments are attached to a flow cell and amplified in clusters via bridge PCR.
Each cluster represents millions of identical DNA molecules, enhancing signal strength.

  1. High-Throughput:

Millions to billions of DNA fragments are sequenced in parallel on a single flow cell.

74
Q

What are the advantages of enzymatic shearing?

A

Rapid Prep: Only 15 minutes of hands-on time, with total prep in ~90 minutes.

No Library Quantification Needed: Simplifies workflow and reduces errors.

Ultra-Low Input: Works with very small amounts of DNA.

Optimised for Specific Applications: Especially effective for small genomes, PCR amplicons, and plasmids.

75
Q

What are the key components in sample pooling?

A
  1. Locus-Specific Primers (Green):
    Bind to target DNA regions to allow specific amplification.
    Include a tagmentation site to facilitate adapter attachment.
  2. Index 1 & Index 2 (Blue):
    8-base-pair (bp) DNA sequences unique to each sample.
    Allow sequencing reads to be assigned to individual samples after sequencing (also known as barcoding).

3.P5/P7 Tails (Purple):

Adapter sequences (P5 and P7) bind the amplified DNA to the flow cell during sequencing.
Ensure compatibility with the sequencing platform (e.g., Illumina).