Genetics Flashcards
CADD score
Combined annotation-dependent depletion score
- predicts pathogenicity (disease-causing potential) of variants/indels
2 scores for predicting LoF constraint and their cut-offs
- pLI (probability of LoF Intolerant): =/> 0.9
- LOEUF (LoF observed/expected upper bound fraction): <0.35
How to judge missense constraint on gnomAD
Using missense constraint Z-score >3.1
Concept of “constraint” in genetics
Constraint describes how tolerant a gene is to genetic variation (different variants), ie a gene with high constraint is intolerant to variation.
E.g. LoF constraints (measured by pLI and LOEUF) and missense constraint (z-score)
Concept of “depletion” and “enrichment” in genetics
Depletion/depleted: genetic variant observed as less common or less frequent than the expected value
Enriched: variant more common/over-represented in specific population than expected
What is the pext score
Proportion expressed across transcripts score: per base expression pattern across transcripts and exons as well as in tissue of interest
When is the pext score useful in gnomAD?
Gives biological relevance of variant. When given variant is LoF and strong evidence for disease causing. A low pext score (<0.2) suggests variant not biological relevant (as it’s not expressed across the transcripts or across tissues of interest).
What is a Mendelian disease and some examples
Aka monogenic disorders. Caused by mutations in single gene.
Eg. cystic fibrosis, Huntington’s, Sickle Cell, Duchenne’s, Tay-Sach, PKU, Marfan, ADPKD
Define Expressivity and Penetrance
Expressivity: Severity of the phenotype that develops in patient with the pathogenic variant
Penetrance: the proportion of individuals carrying the pathogenic variant who display a phenotype
What is the “seed sequence” in relation to CRISPR
10-12 bps adjacent to the PAM (3’ end of the gRNA) that determines Cas9 specificity
- 1-5 bps = true seed region (from immunoprecipitation and ChIP-seq data - Zhang 2015)
What are the causes of the LOF pathogenic variants appearing in GnomAD?
- Transcript error
- Sequencing error
- Mapping error
- Last exon
- Other annotation error
- Rescue
What is homopolymer
Homopolymer refers to a stretch of DNA or RNA sequence where only one type of nucleotide is repeated consecutively, eg AAAAAAAAA
What is a “rescue splice variant”
A type of rescue mechanism in which alternative splicing of mRNA mitigates effect of LOF/pathogenic mutation, which preserves function of protein
Definition of nonsense-mediated decay
Surveillance pathway that reduces errors in gene expression by eliminating mRNA transcripts that contain premature stop codons
Types of mRNA surveillance pathways
- Nonsense mediated decay (NMD)
- Nonstop mediated decay (NSD)
- No-go mediated decay (NGD)