Bioinformatics 14: High throughput sequencing 2 Flashcards
What is SMRT sequencing? Why is it a novel technology?
SMRT -> Single Molecule Real-Time
Sequencing by single, natural polymerase molecules
-> incorporation of fluorescently labelled dNTPs captured in real time
Zero Mode Waveguide imaging
Real-time output of 1000s of polymerases captured simultaneously (fast computing)
Revolutionary read length (~4kb)
450Mb in hours (speeeeeeed)
How does SMRT sequencing work?
Zero Mode Waveguide (ZMW) contains single polymerase (complexed with template)
- Only bottom of the well is illuminated
Nucleotides diffuse in and out of the well (microsecond events)
Incorporation by the polymerase takes milliseconds (1000x slower)
Real time data collection -> computer detects release of fluorescent label (incorporation event)
Pros and cons of SMRT sequencing?
Pros
Single molecule (no amplification) - easy sample prep
Long templates (circular template read by rolling circle replication)
Speedy, low cost reagents
Cons
DNA only
Higher error rate
Low no. of features vs 454 + Illumina
Late to market
Applications of SMRT?
Fast sequencing + long reads useful for quick, complete genome sequencing
Haiti cholera outbreak, Sproutgate
What format is NGS data universally stored in? What information does the format give?
fastq
gives sequence AND quality score
quality = ‘Phred’ score (Q = -10 x log10 of the error probability) -> 10 = 1/10 error, 20 = 1/100 error, 30 = 1/1000 error, and so on
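The relationship between the quality string and the error rate can be sketched in a few lines of Python. This is a minimal illustration, not a full parser: it assumes the common Phred+33 ASCII encoding (other offsets exist, which the card does not cover), and the function names and the example read are made up for the purpose.

```python
def phred_to_error(q):
    """Error probability for a Phred score: P = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def parse_fastq_record(lines, offset=33):
    """Parse one 4-line fastq record into (name, sequence, Phred scores).

    offset=33 is an assumption (Phred+33 encoding); adjust for other encodings.
    """
    header, seq, plus, qual = (line.rstrip("\n") for line in lines)
    if not header.startswith("@") or not plus.startswith("+"):
        raise ValueError("not a fastq record")
    if len(seq) != len(qual):
        raise ValueError("sequence and quality lengths differ")
    # Each quality character encodes one score as ASCII code minus the offset
    scores = [ord(c) - offset for c in qual]
    return header[1:], seq, scores


# Hypothetical record: quality characters '+', '5', '?', 'I' encode 10, 20, 30, 40
record = ["@read1", "ACGT", "+", "+5?I"]
name, seq, scores = parse_fastq_record(record)

for base, q in zip(seq, scores):
    print(base, q, phred_to_error(q))
```

Each base is printed with its Phred score and the matching error probability, so a score of 20 sits next to an error chance of about 1 in 100.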
Other forms of NGS?
Exome sequencing
Nanopore