Reference mapping SNPs Flashcards

1
Q

What Phred quality score requires the experiment to be restarted?

A

<20 as 20% is the minimum required to not consider as low quality, where low-quality reads are removed during quality control to prevent bias during analysis. 20 represents 99% accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can FastQC check in a genome sequence?

A

Per base quality distribution: poor distribution is due to degraded quality over long runs, QUALITY TRIMMING helps.

Per tile sequence quality: average Phred scores per tile.

Quality scores: Low-quality reads must be REMOVED/TRIMMED as they could introduce BIAS during downstream analysis.

Per Sequence and per base GC Content: GC content across sequence is compared to a normal QC distribution where SHARP PEAKS indicate CONTAMINANTS or DIVERSITY.

Adapter sequence removal.

Per Base Sequence Content: Compares the proportion of each base and GC content where OVERREPRESENTED SEQUENCES may cause BIAS in the overall composition.

Sequence duplication: low duplication levels indicate high coverage of the target sequence, but high duplication levels indicate ENRICHMENT BIAS (e.g. PCR)

Per Base N Content: Bases with POOR QUALITY SCORES are indicated as ‘N’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is de novo assembly?

A

It refers to the K-mer approach in reconstructing the DNA sequence. Repetitive sequences present will cause assembly errors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is reference mapping used for in next-generation sequencing?

A

Next-generation sequencing reads are mapped onto a reference genome to identify SNPs, insertions and deletions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the types of Single-nucleotide polymorphism (SNP)?

A

SNPs are single nucleotide variations in a DNA sequence between 2 or more organisms. It can be in the coding sequence or intergenic regions of DNA.

Synonymous SNPs: No change in amino acid sequence
Non-synonymous SNPs can change the amino acid sequence when a codon codes for a different amino acid which affects protein function (missense) or by creating a premature stop codon resulting in loss of gene function (nonsense).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is TRAMS: Tool For Rapid Annotation of Microbial SNPs?

A

TRAMS is a program that annotates SNPs as synonymous, non-synonymous or nonsense.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly