P4: ASV Inference Flashcards
generally explain the 16S rRNA workflow
sample collection –> DNA extraction –> library preparation –> sequencing
16S rRNA workflow - DNA extraction
- extract nucleic acid
- can be done with RNA as well
16S rRNA workflow: DNA extraction - why 16S
- 16S is a ribosomal subunit that combines with proteins and its present in both mitochondria and chloroplasts
- this makes it the best marker for analyzing DNA
16S rRNA workflow - library prep
- PCR 1 and 2 (with cleanup 1 and 2 stages between them)
- amplifying targets via primers
- result is the final library
16S rRNA workflow: library prep - PCR1
- region specific
- amplifies specific hypervariable regions
- has required primer overhangs
- goes on to clean up 1
16S rRNA workflow: library prep - PCR2
- indexing
- 2nd amplification
- adds barcodes/indexes (to identify the specific sequence)
- adds sequence adaptors
- needs a different primer than PCR1 and will go on to clean up 2
16S rRNA workflow: library prep - final library
- has the adaptor proteins necessary for sequencing
- from left to right: priming site for sequence reaction, library index, and flowcell handle
- will go on to sequencing
16S rRNA workflow - how is clean up 1 and 2 done
based on magnetic beads
how are sequencing results shown
- Fastq files
- they are a text-based format that contains the nucleotide sequence and its corresponding quality scores
- every 4 line represents 1 specific sequence
Fastq files - how to read them
- contains 4 levels of information
1. header
2. sequence results
3. base and Q
4. Q scores
Fastq files - header
- starts with “@” symbol
- has the barcode provided by sequencing authority
Fastq files - Base and Q
tells you what strand the gene was sequenced on (leading vs lagging)
Fastq files - Q scores
- shown as ASCII characters
- shows how reliable every sequenced nucleotide is
- numbered through 0-40
- 40: reliable
- </= 20: unreliable
- should have a ton of errors and quality drops in the beginning of a sequence
what are other (non-sequencing) 16S rRNA pipelines
- amplicon sequencing variants
- operational taxonomic units
- PhyloChips
other 16S rRNA pipelines - ASV
- distinguishing rogue amplicons by reducing noise (denoising) made by sequencing errors and keeping the reliable ones
- more resolution than OTUs
- intraspecific