Genetics Flashcards

1
Q

CADD score

A

Combined annotation-dependent depletion score
- predicts pathogenicity (disease-causing potential) of variants/indels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

2 scores for predicting LoF constraint and their cut-offs

A
  • pLI (probability of LoF Intolerant): =/> 0.9
  • LOEUF (LoF observed/expected upper bound fraction): <0.35
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How to judge missense constraint on gnomAD

A

Using missense constraint Z-score >3.1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Concept of “constraint” in genetics

A

Constraint describes how tolerant a gene is to genetic variation (different variants), ie a gene with high constraint is intolerant to variation.
E.g. LoF constraints (measured by pLI and LOEUF) and missense constraint (z-score)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Concept of “depletion” and “enrichment” in genetics

A

Depletion/depleted: genetic variant observed as less common or less frequent than the expected value
Enriched: variant more common/over-represented in specific population than expected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the pext score

A

Proportion expressed across transcripts score: per base expression pattern across transcripts and exons as well as in tissue of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When is the pext score useful in gnomAD?

A

Gives biological relevance of variant. When given variant is LoF and strong evidence for disease causing. A low pext score (<0.2) suggests variant not biological relevant (as it’s not expressed across the transcripts or across tissues of interest).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Mendelian disease and some examples

A

Aka monogenic disorders. Caused by mutations in single gene.
Eg. cystic fibrosis, Huntington’s, Sickle Cell, Duchenne’s, Tay-Sach, PKU, Marfan, ADPKD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define Expressivity and Penetrance

A

Expressivity: Severity of the phenotype that develops in patient with the pathogenic variant
Penetrance: the proportion of individuals carrying the pathogenic variant who display a phenotype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the “seed sequence” in relation to CRISPR

A

10-12 bps adjacent to the PAM (3’ end of the gRNA) that determines Cas9 specificity
- 1-5 bps = true seed region (from immunoprecipitation and ChIP-seq data - Zhang 2015)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the causes of the LOF pathogenic variants appearing in GnomAD?

A
  • Transcript error
  • Sequencing error
  • Mapping error
  • Last exon
  • Other annotation error
  • Rescue
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is homopolymer

A

Homopolymer refers to a stretch of DNA or RNA sequence where only one type of nucleotide is repeated consecutively, eg AAAAAAAAA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a “rescue splice variant”

A

A type of rescue mechanism in which alternative splicing of mRNA mitigates effect of LOF/pathogenic mutation, which preserves function of protein

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Definition of nonsense-mediated decay

A

Surveillance pathway that reduces errors in gene expression by eliminating mRNA transcripts that contain premature stop codons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Types of mRNA surveillance pathways

A
  1. Nonsense mediated decay (NMD)
  2. Nonstop mediated decay (NSD)
  3. No-go mediated decay (NGD)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does the location of the termination codon (from truncating mutations) in the last exon affect NMD?

A

The location of the last exon-exon junction complex (EJC) relative to stop codon NB. If stop codon downstream or within 50 nucleotides of final EJC, transcript translated normally. If upstream >50 nucleotides, NMD occurs.

16
Q

Why does truncating mutations in the last exon generally not pathogenic

A
  • Not subject to NMD
  • mutations in 3’ UTR region
  • Protein truncation tolerance - critical domains not affected
  • haploinsufficiency tolerance - better toeralted if functional copy still present
17
Q

Define linkage disequilibrium

A

Tendency of alleles to be transmitted more or less often than expected by chance alone - usually caused by close proximity of genes on the same chromosome

18
Q

Define epistasis

A

Phenomenon in genetics whereby the effect of a gene mutation is dependent on the presence or absence of mutations in one or more other genes, termed modifier genes

19
Q

Define heritability

A

The measure of proportion of the phenotypic variance of a population that can be attributed to genetic differences

20
Q

What is the “missing heritability” question

A
  • clear conclusion from multiple GWAS studies that highly significant hits accounts for small proportion of the heritability of disease
  • amount of heritability explained by GWAS findings much smaller than estimated heritability from family and twin studies
21
Q

Narrow sense heritability (h2)

A

Narrow-sense heritability (h2) is an important genetic parameter that quantifies the proportion of phenotypic variance in a trait attributable to the additive genetic variation generated by ALL causal variants

22
Q

What are the explanations for the missing heritability problem?

A
  1. large number of common variants of small effects not yet discovered
  2. rare variants with large effect sizes not tagged on genotyping arrays
  3. overestimation of h2 (narrow sense heritability) in siblings/families due to environmental factors or epigenetics
23
Q

Define tag SNP

A

A representative SNP in a genomic region with high LD that represent/called a haplotype

24
Q

Define fine-mapping

A

Process of determining the genetic variant(s), ie causal variant(s), responsible for complex traits, given evidence of association of genomic region with a trait and assuming at least one causal variant exists

25
Q

Types of chromatin annotations

A
  1. Open chromatin regions (indicate regions available for TF binding)
  2. histone modifications (highlights enhancer and promotor regions)
  3. DNA methylation
26
Q

What are DNAse Hypersensitive Sites (DHS)?

A
  • DHS are regions of DNA that are particularly accessible to cleavage by DNAse I, characterised by lack of nucleosomes.
  • Indicates regions of high regulatory activity/regulation of gene expression. Correspond to regulatory elements eg promotors, enhancers, silencers etc.
  • Used in SNP enrichment analysis (a type of chromatin mark)
27
Q

What is ATAC-Seq

A

Assay for Transposase-Accessible Chromatin with sequencing.
Technique used to investigate chromatin accessibility at genome-wide scale. Used to identify areas of open chromatin - ie areas of high regulatory activity (promotor, enhancers and TF binding sites)

28
Q

Steps of ATAC-Seq

A
  1. Transposase tagmentation: uses hyperactive Tn5 transposase enzyme that simultaneously fragments DNA and adds adapters to ends of DNA
  2. Selective fragmentation: Tn5 transposase selectively inserts adapters into regions of open chromatin
  3. Library preparation: PCR amplification of tagged sequences
  4. Sequencing
29
Q

What is pleiotropy

A

A phenomenon when one gene influences two or more seemingly unrelated phenotypic traits, aka a gene that exhibits multiple phenotypic expression

30
Q

What is allelic heterogeneity

A

Phenomenon is which multiple causal variants at the same locus influence a particular disease or trait

31
Q

Difference between epistasis and allelic heterogeneity (AH)

A

Epistasis: effect of one gene variant affects (or is dependent of) another gene variant at a DIFFERENT locus
AH: different mutations within SAME locus of SAME gene influence the particular trait

32
Q

What are phenocopy conditions?

A

Variations in phenotype that is caused by environmental conditions and not by genotype

33
Q

Which domains in MYH7 are mostly affected in HCM

A

Globular head and hinge regions

34
Q

What is the Non-stop Decay pathway and its mechanisms

A

NSD - targets mRNA (and peptide) for degradation if lacking a proper stop codon (ie the translation keeps going after where the stop codon should’ve been).
1. Recognition of non-stop mRNAs - ribosome stalls and signals NSD machinery
2. Ski complex (Ski2, Ski3 and Ski8) recruited to stalled ribosome, interacts with exosome (3’ to 5’ exonuclease activity)
3. Ribosome disassembled (by Pelota-Hbs1) - recycles ribosome
4. Faulty mRNA degraded by exosome assisted by Ski complex
5. Proteosomal degradation after ubiquitination

35
Q

Causes of non-stop mRNAs

A

Errors in transcription, splicing or premature polyadenylation

36
Q

Causes of ribosome stalling

A
  1. Defective mRNA:
    - non-stop mRNA
    - Damaged mRNA
    - Secondary structures within mRNA (eg hairpins)
    - Rare codons (due to low availability of corresponding tRNAs)
  2. Amino acid deprivation
    - leading to shortage of charged tRNA
  3. Aberrant translation events - misincorporation of amino acids or other errors during translation
  4. Protein quality control mechanisms - interaction with faulty nascent polypeptides that do not fold properly
37
Q

Slipped Strand Mispairing

A

SSM, or Replication Slippage
mutation process which occurs during DNA replication. It involves denaturation and displacement of the DNA strands, resulting in mispairing of the complementary bases. Slipped strand mispairing is one explanation for the origin and evolution of repetitive DNA sequences
Leads to dinucleotide or trinucleotide repeats
At sites of tandem repeats

38
Q
A