Data science in medicine: the data Flashcards

1
Q

What is a research question that could be researched in the field of genomics?

A

(How) can we identify the disease causing mutation(s)?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the difference between a monogenic and complex disease?

A
  • Monogenic disease → (usually rare) disease caused by variation in a single gene
  • Complex disease → (usually common) disease that results from the contributions of multiple genomic variants and genes in conjunction with significant influences of physical and social environments.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can monogenic and complex disease be identified?

A
  • Monogenic → whole-exome sequencing (WES)
  • Complex → Genome-wide association studies (GWAS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does whole-exome sequencing (WES) work?

A

Whole exome sequencing is a technique where all of the protein-coding regions of genes in a genome are sequenced.
- First the subset of DNA is selected that only encodes proteins (i.e. exons).
- After this, illumina sequencing can be applied to find out what the exome is of the patient.
- The patient’s exome can then be compared with reference DNA sequences.
- By identifying differences between the patient’s DNA and reference DNA, candidate mutations can be identified.
- By validating these candidate mutations, the disease causing mutation can be found.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can we reduce the amount of candidate genes during exome sequencing and filtering of candidate genes?

A

By including more filters:
- list of differences: patients compared to reference
- list of mutations: shared between patients
- List of synonymous mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can a candidate gene be identified as the causal gene (i.e. validated as the causal gene) with the use of exome sequencing?

A
  • If the mutation is present in new patient samples.
  • If → de novo mutation → absent in DNA of both parents
  • Absent in control samples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are limitations of exome sequencing?

A
  • Technical failure: no capture probe, lack of sequence coverage, misalignment
  • Filtering may accidently remove the causal variant
  • Genetic heterogeneity makes finding the causal variant more difficult
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is whole-genome sequencing?

A

Instead of only sequencing the exome (-1% of the human genome), the whole genome is sequenced.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are advantages and disadvantages of whole-genome sequencing?

A

Advantages:
- More sensitive, also non-coding mutations can be causal
- Can also be used to detect other chromosomal aberrations (translocations and large deletions/insertions)

Disadvantages:
- Higher costs
- Less power → more samples needed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What three steps are needed before tests such as whole genome sequencing can be implemented inside the clinic?

A
  1. Test development optimization
  2. Test validation
  3. Quality management
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a genetic association study?

A

A study that is used to find candidate genes or genome regions that contribute to a specific disease by testing for a correlation between disease status and genetic variation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Single Nucleotide Polymorphism (SNP)?

A

A point mutation that has persisted in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

There are two hypotheses regarding diseases and variants of complex diseases:
1. Many genetic variants underlying complex diseases are common.
2. Genetic contributions to complex diseases arise from many variants, all of which are rare.

Describe for which hypothesis it is more likely to use an association study to detect genetic variants.

A

Many genetic variants underlying complex diseases are common.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a Genome Wide Association Study (GWAS)?

A

A type of observational study of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait. GWAS typically ocus on associations between SNPs and traits like major human diseases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What applications does GWAS have?

A
  • Gaining insight into a phenotype’s underlying biology
  • Estimating its heritability
  • Making clinical risk predictions → polygenic risk score (PRS)
  • Informing drug development programmes
  • Inferring potential causal relationship between risk factors and health outcomes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a Polygenic Risk Score (PRS)?

A

The prediction of an individual’s risk for physical and mental disease based on their genetic profile.

17
Q

What are methodological limitations of GWAS?

A
  • Population stratification → presence of multiple genetically distinct subpopulations that differ in their mean phenotypic value → spurious or biased associations.
  • Polygenecity makes gaining biological insight very challenging
18
Q

What are ethical limitations of GWAS?

A
  • Consent for future use of samples and data, storage and reuse of samples and data.
  • Sharing data with individual participants
  • Diversity and inclusion
  • Direct to consumer laboratory testing
19
Q

Eloborate/give an example on the methodological limitation → population stratification.

A

One disease is more common in one subgroup than in another. This results in cases tending to be over-sampled from that group, relative to controls. Thus, any allele that is more common in that group, will appear to be associated with the disease.

20
Q

Explain the steps in RNA sequencing.

A
  1. Isolating RNA
  2. Fragmentation
  3. Random priming
  4. Second strand synthesis
  5. Size selection
  6. Sequencing
  7. Mapping
  8. Counting
  9. Statistical analysis
21
Q

What are challenges of RNA sequencing (compared to DNA-sequencing)?

A
  • RNAs consist of small exons that may be separated by large introns (→ mapping reads to genome is more challenging)
  • The relative abundance of RNAs vary wildly (since RNA sequencing works by random sampling, a small fraction of highly expressed genes may consume the majority of reads)
  • RNAs come in a wide range of sizes
  • RNA is fragile compared to DNA
  • Measurment biases and variability
22
Q

What is meant with X-omics or multi-omics?

A

Profiling an individual using more than 1 omics technique (→ more information for each individual)