W8L1 Thu from Gwas to phenotype Flashcards

1
Q

rs8050136 SNPs marker

A
  • Identified in a GWAS of 72,598 European ancestry individuals.
  • p = 5 x 10-36
  • Strong association and large effect on:
  • BMI
  • Weight
  • T2D risk (comorbidity?)
  • Risk allele explains ~1kg in someone 1.7m tall
  • Located in an intron of the FTO gene
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

understanding manhattan plot

A

-if over the line, the result is significant
-y axis is significant and x is the position
-lead SNP is the most significant, the one that is normally reported on

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

rs8050136 and gwas

A
  • One of the early success of the GWAS field!
  • The logical assumption was that it impacted FTO expression levels somehow, since the phenotype made perfect sense.
  • All that was missing was the actual mechanism.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

problem of rs8050136

A

– variation in FTO expression could never be correlated to variation in genotypes at rs8050136
* rs8050136 was the wrong SNP… and FTO the wrong gene!
* rs9930506 – less significant, not an eQTL for FTO…
but an eQTL for nearby gene IRX3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the main challenges to decoding trait biology

A
  1. > 90% of GWAS hits fall outside protein-coding regions of genome
    * How to link them to genes?
  2. Relevant cell type is not always readily apparent from GWAS hits
    * How to link them to cell types?
  3. Most loci have small effect sizes on overall trait values
    * How to link them to phenotypes?
    Need to consider expression level
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The human transcriptome

A
  • Definitely a lot fewer protein-coding genes than we expected (19K) But transcriptionally (63K) and translationally rather complex… and tightly
    regulated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The human genome

A

-protein coding gene is only 1.5% of the total genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Large scale catalogues of expression and
regulation in humans

A
  • GTEx: 17,382 RNA sequencing samples from 52 tissues from 838 postmortem donors
  • eQTL meta-analyses (combining a bunch of studies together, harder than it sounds):
  • eQTLs in whole blood across > 30,000 people
  • ENCODE:
  • Extensive regulatory characterisation of 6 cell types, less extensive in 147 cell lines
  • Roadmap Epigenomics:
  • Chromatin state across 111 primary cell types and 16 cultured cell types
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What we learn from GTEx

A

-look at alot of eQTL and long noncoding in many tissue
-cis-sQTL for 66% of protein
-trans eQTL for 121 protein coding gene and 22lincRNA genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

eQTLs are overrepresented within GWAS hits

A
  • look at over 200 study with GWAS
    -64 study where eQTL gene is the causal gene
    -51 study found eQTL but no gene is found
    -29 study have eQTL found but causal gene not present
    -104 did not find eQTL at locus
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

looking at gene regulation with GWAS

A
  • Profiled multiple histone modifications across 127 human cell types, alongside whole genome sequences, RNA-seq, DNA-methylation and others
    -use regulatory annotation to provide useful info for GWAS SNPs analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

problem with variant , CF example

A
  • CFTR is one of the longest genes in the genome.
  • ~200,000 bp long
  • Many mutations in CFTR lead to cystic fibrosis.
  • Effect of some mutations is known, but most are classified as Variants of Unknown Significance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Variants of Unknown Significance

A

-dont know how a gene/mutation can cause the problem
* VUS are widespread even within protein coding regions, more so within non- coding…
* 4.6 million VUS in gnomAD
-50% of mutation variant that lead to problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Testing VUS the old fashioned way

A
  • Put a VUS into a plasmid w/ other reporter construct (GFP, luciferase, whatever)
  • Compare the reporter activity of cells carrying each version of the plasmid, learn something.
  • You will be dead well before you finish with all 4.6 million VUS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Multiplex Assays of Variant Effects (MAVE)

A
  • MAVEs are a family of experiments designed to test the effect of many variants at once by using DNA barcode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

TYPE of MAVE

A
  • MPRA: Massively Parallel Reporter Assay
  • DMS: Deep Mutational Scanning, also known
    as Saturation Mutagenesis
17
Q

Understanding BRCA1 mutations with DMS

A
  • ~50-70% of women with BRCA1 mutations will develop breast or ovarian cancer by age 70.
  • Majority of SNPs within BRCA1 are VUS
    -Use MAVE to find mutation that are benign/pathogenic
18
Q

MAVe and non-coding region

A
  • Mutations in TERT promoter often observed in cancer, associated with upregulation of telomerase in cancer cells
  • -124C>T or -146C>T are very common
  • Saturation mutagenesis of TERT promoter reveals these and other variants of possible therapeutic interest