Core lecture Flashcards
Which format usually stores sequencing data?
FASTA (sequence only); raw reads with per-base quality scores are usually stored as FASTQ
What does “>” mean in a FASTA file?
It marks the header line that starts a new sequence entry
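As a rough illustration of how the “>” marker splits a FASTA file into entries, here is a minimal Python sketch. The function name parse_fasta and the toy records are made up for this card, and real files are normally read with an established library rather than by hand.

```python
from typing import Dict, Iterable, Optional


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Return {header: sequence} from FASTA-formatted lines (illustrative helper)."""
    records: Dict[str, str] = {}
    header: Optional[str] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # ">" opens a new entry; the rest of the line is the header
            header = line[1:]
            records[header] = ""
        elif header is not None:
            # sequence lines are appended to the current entry
            records[header] += line
    return records


# Toy input, not real data
example = [">seq1 toy record", "ACGT", "TTGA", ">seq2", "GGCC"]
records = parse_fasta(example)
print(records)
```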
What do the ASCII character and Phred score represent?
Each ASCII character encodes a Phred quality score, which indicates how reliable each nucleotide call in a sequencing run is.
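The link between a quality character and its Phred score can be sketched in a few lines of Python. This assumes the Phred+33 offset used by most current FASTQ files; the cards do not state the offset, so treat it as an assumption. The name decode_phred is hypothetical.

```python
from typing import List


def decode_phred(quality_string: str, offset: int = 33) -> List[int]:
    """Convert quality characters to Phred scores.

    Assumes Phred+33 encoding by default (an assumption, not stated in the cards).
    """
    return [ord(ch) - offset for ch in quality_string]


# 'I' is ASCII 73, '5' is ASCII 53, '!' is ASCII 33
scores = decode_phred("I5!")
print(scores)  # [40, 20, 0]
```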
What does Q20 mean?
Q20 means a quality value of 20. The error probability can be calculated as P = 10^(-Q/10) = 10^(-20/10) = 1/100 = 0.01, i.e. 1 expected error per 100 base calls (99% accuracy).
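The same arithmetic, written as a small Python sketch. The function names are illustrative only; the formula is the standard Phred relation P = 10^(-Q/10).

```python
import math


def error_probability(q: float) -> float:
    """Probability that a base call with Phred score q is wrong."""
    return 10 ** (-q / 10)


def phred_from_probability(p: float) -> float:
    """Inverse relation: Phred score for a given error probability."""
    return -10 * math.log10(p)


for q in (10, 20, 30):
    # Q10 is about 1 error in 10, Q20 about 1 in 100, Q30 about 1 in 1000
    print(q, error_probability(q))
```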
What is the typical purpose of base calling in NGS analysis?
To identify the nucleotide sequence from the raw data.
What are paired end reads?
Sequences read from both ends of a DNA fragment
What is the significance of multiplexing?
It allows multiple samples to be sequenced together
What is the main advantage of storing quality scores as ASCII characters?
It reduces the file size by using only 1 byte per score
What is the implication of having a “mate-pair”?
The two reads in the pair lie far apart and face away from each other
What is the “seed-and-extend algorithm”?
The seed-and-extend algorithm is used to align sequenced DNA reads to a reference genome.
Seed: the algorithm starts by finding a small piece of the sequencing read (the seed) that matches the reference genome exactly. This is usually done quickly using a hash table.
Extend: once a seed match is found, the algorithm tries to extend the match in both directions to align the rest of the read to the genome. It keeps extending until too many mismatches are found.
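The two steps can be sketched in Python as below. This is a simplified, ungapped toy version under stated assumptions: one seed per read, a fixed mismatch limit, no indels. Real aligners are considerably more elaborate. The names build_index, seed_and_extend, seed_offset and max_mismatches are made up for this example.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Tuple


def build_index(reference: str, k: int) -> Dict[str, List[int]]:
    """Hash table mapping every k-mer in the reference to its start positions."""
    index: Dict[str, List[int]] = defaultdict(list)
    for i in range(len(reference) - k + 1):
        index[reference[i:i + k]].append(i)
    return index


def seed_and_extend(read: str, reference: str, index: Dict[str, List[int]],
                    k: int, seed_offset: int = 0,
                    max_mismatches: int = 2) -> Optional[Tuple[int, int]]:
    """Return (reference start, mismatches) of the best ungapped hit, or None."""
    seed = read[seed_offset:seed_offset + k]
    best: Optional[Tuple[int, int]] = None
    for hit in index.get(seed, []):          # seed step: exact hash lookup
        start = hit - seed_offset            # where read position 0 lands
        if start < 0 or start + len(read) > len(reference):
            continue
        # extend step: walk left of the seed, then right of it
        positions = list(range(seed_offset - 1, -1, -1)) + \
            list(range(seed_offset + k, len(read)))
        mismatches = 0
        for i in positions:
            if read[i] != reference[start + i]:
                mismatches += 1
                if mismatches > max_mismatches:
                    break                    # too many mismatches: drop this hit
        else:
            if best is None or mismatches < best[1]:
                best = (start, mismatches)
    return best


# Toy data: the read is reference[5:13] with one substituted base
reference = "TTGACCGTAGGCTAACGT"
read = "CGTAGGAT"
index = build_index(reference, k=4)
result = seed_and_extend(read, reference, index, k=4, seed_offset=2)
print(result)  # (5, 1)
```

Taking the seed from inside the read (seed_offset=2) makes the extension run in both directions, as the card describes.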
What does NGS stand for?
Next-generation sequencing
Which method is commonly used for sequencing DNA in NGS?
PCR amplification of the library fragments, followed by sequencing by synthesis on platforms such as Illumina
What is a primary application of NGS technology?
Genetic disorder diagnosis
How does NGS differ from traditional Sanger sequencing?
NGS allows parallel sequencing of multiple fragments
What is a significant advantage of NGS over previous sequencing technologies?
Higher throughput
In NGS, what is the purpose of using barcodes in sequencing libraries?
To track the sample origin
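A minimal Python sketch of how barcodes let pooled reads be sorted back to their samples (demultiplexing). It assumes, purely for illustration, that the barcode is the first few bases of each read and must match exactly; many platforms read the index separately and tolerate mismatches. The barcode sequences, sample names and the function name demultiplex are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List


def demultiplex(reads: List[str], barcodes: Dict[str, str]) -> Dict[str, List[str]]:
    """Group reads by sample using an inline barcode at the start of each read.

    barcodes maps barcode sequence -> sample name; all barcodes are assumed
    to have the same length (an assumption of this sketch).
    """
    length = len(next(iter(barcodes)))
    bins: Dict[str, List[str]] = defaultdict(list)
    for read in reads:
        sample = barcodes.get(read[:length], "undetermined")
        bins[sample].append(read[length:])   # strip the barcode from the read
    return dict(bins)


# Toy barcodes and reads, not real data
barcodes = {"ACGT": "sample_A", "TTAG": "sample_B"}
reads = ["ACGTGGGCCC", "TTAGAAATTT", "ACGTCCCAAA", "GGGGTTTTAA"]
by_sample = demultiplex(reads, barcodes)
print(by_sample)
```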
What type of biological molecules can be sequenced using NGS?
DNA and RNA
How has NGS impacted the field of personalized medicine?
It has enabled tailored treatments based on genetic makeup
What is one of the challenges in handling NGS data?
The high volume of data
What does the term “read length” refer to in NGS?
The number of bases in a single sequencing read
How has NGS technology influenced cancer research?
By identifying genetic mutations associated with cancers
What is the role of bioinformatics in NGS?
To analyze and interpret the vast amount of sequencing data
What does de novo sequencing mean in the context of NGS?
Sequencing without a reference genome
How does NGS contribute to the study of rare genetic disorders?
By enabling identification of the genetic mutations responsible for them
In NGS, what does the term “coverage” refer to?
The number of times a given nucleotide position is sequenced, i.e. how many reads cover it
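Coverage can be illustrated with a short Python sketch that counts how many aligned reads overlap each position. The alignments here are invented (start, length) pairs on a 10-base toy genome, and per_base_coverage is a hypothetical helper, not a function from any sequencing toolkit.

```python
from typing import List, Tuple


def per_base_coverage(genome_length: int,
                      alignments: List[Tuple[int, int]]) -> List[int]:
    """Depth at each position, given (0-based start, read length) alignments."""
    depth = [0] * genome_length
    for start, length in alignments:
        for pos in range(start, min(start + length, genome_length)):
            depth[pos] += 1
    return depth


# Three toy reads aligned to a 10-base genome
alignments = [(0, 5), (3, 5), (4, 6)]
depth = per_base_coverage(10, alignments)
mean_coverage = sum(depth) / len(depth)
print(depth)          # [1, 1, 1, 2, 3, 2, 2, 2, 1, 1]
print(mean_coverage)  # 1.6
```

Mean coverage is the total number of aligned bases divided by the genome length, here 16 / 10.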
What is the significance of multiplexing in NGS?
Allows sequencing of multiple samples simultaneously
Which is an important consideration in NGS data analysis?
The accuracy and integrity of the data
How does NGS facilitate the study of microbial communities?
Through metagenomics, sequencing DNA from environmental samples