VL10 Genomics Flashcards

1
Q

What is de novo assembly in genomics?

A

De novo assembly involves piecing together a genome from short DNA sequences without a reference genome, relying on overlaps of identical sequence regions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two main types of sequencing reads used in de novo assembly?

A

Long read data (e.g., Sanger, Pacbio) using overlap-layout-consensus

short read data (e.g., Illumina) using de Bruijn graphs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the biggest challenge in de novo assembly?

A

The biggest challenge is dealing with repeats, such as transposable elements, centromeres, and telomeres, which can lead to fragmented assemblies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does a high-quality genome provide?

A

A high-quality genome serves as a reference for other genomes from the same or closely related species and is essential for analyzing genetic variation within species.

e.g. can be used as reference for re-sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is structural genome annotation?

A

Structural annotation involves identifying the locations of genes, repeats, regulatory elements, and non-coding RNAs within the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is functional genome annotation?

A

Functional annotation determines the function of genes and other genomic elements, often using data from related species and RNA-seq data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is re-sequencing in genomics?

A

Re-sequencing involves mapping sample reads to a reference genome to identify genetic variants and support genome annotation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the benefits of transcriptomics/RNA-seq?

A

RNA-seq helps study:
* gene expression levels,
* alternative splicing,
* and provides support for genome annotation

by sequencing expressed genes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a pangenome?

A

A pangenome includes genetic information from all sequenced individuals of a species, representing the genetic diversity within the species.

Not set how many genomes it is including

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are runs of homozygosity (ROH)?

A
  • ROH are tracts of consecutive homozygous genotypes
  • two copies of an ancestral haplotype are inherited from a recent common ancestor
  • longer haplotypes from recent common ancestors
  • shorter haplotypes from distant common ancestors.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the main methods used for sequencing genomes?

A

The main platforms include Illumina for short reads and Pacbio for long reads, each with different strengths and weaknesses for genome assembly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the principle of the de Bruijn graph in genome assembly?

A

The de Bruijn graph breaks reads into k-mers and uses them to construct a graph where edges are k-mers and nodes are overlaps, aiding in assembling the target sequence.

Biggest probem: Duplkations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How are repeats in the genome a challenge for sequencing?

A

Repeats longer than the read length can lead to fragmented assemblies, making it difficult to accurately assemble the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the role of paired-end and mate-pair reads in genome assembly?

A

Paired-end and mate-pair reads help link contigs into scaffolds by providing information about the relative positions of sequences, aiding in the assembly process

Paired-end reads: Short DNA fragments (200-800 bp) are sequenced from both ends, providing two reads per fragment. These reads help link adjacent contigs by providing information about the distance and orientation between them.

Mate-pair reads: Longer DNA fragments (2-5 kb or more) are circularized, cut, and sequenced from both ends, producing reads that are further apart than paired-end reads. These longer distance reads help link contigs over larger genomic regions, aiding in assembling repetitive and complex regions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the importance of the N50 metric in genome assembly?

A

The N50 metric indicates the quality of the genome assembly, representing the length at which half of the genome is contained in contigs or scaffolds of that length or longer.

higher N50 value indicates a more contiguous and complete assembly, with fewer gaps and larger pieces of the genome accurately assembled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the purpose of genome annotation?

A

Genome annotation identifies the functional elements within a genome, such as genes, regulatory elements, and repeats, to understand their roles and interactions.

17
Q

How does RNA-seq support genome annotation?

A

RNA-seq provides information on expressed genes, helping to identify gene locations and structures and study gene expression levels and alternative splicing.

18
Q

How does the use of hybrid long-read technology improve genome assemblies?

A

Hybrid long-read technology combines long reads with short reads to produce better assemblies by resolving repeats and producing longer, more accurate contigs and scaffolds.

repair errors with more accurate short reads

19
Q

What is the role of methylation in gene expression regulation?

A

Methylation involves adding methyl groups to DNA, often repressing gene expression and playing a role in regulating genes in response to environmental factors. (epigeetics)

20
Q

What are the main challenges in de novo genome assembly?

A

Challenges include dealing with repetitive sequences, ensuring accurate assembly of complex regions, and producing complete and contiguous assemblies.

21
Q

What is the significance of a reference genome?

A

A reference genome provides a standard against which other genomes can be compared, facilitating studies on genetic variation, evolution, and functional genomics.

22
Q

What are the benefits of population genomics?

A

Population genomics involves sequencing many individuals from a population to study genetic variation, identify adaptive traits, and understand population structure and dynamics.

23
Q

What is the future of evolutionary genetics?

A

The future includes easy access to genomic data, advances in bioinformatics and machine learning, and flexibility to adapt to new technologies and methodologies in genetic research.