W3 - Using The Result Of Genetic Studies Flashcards

1
Q

What are the two large data analysis techniques?

A

 GWAS
– Identify loci in the genome associated with a phenotype/disease - identifies millions of SNPs then use stats for identification of disease to pinpoint loci.
 RNAseq
– Take a sample tissue and sequence all the RNA to provide a snapshot of gene expression levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the difficulties of using GWAS data?

A

 Large GWAS for complex diseases detect many loci
– Prioritisation - so thinking which area we need to focus on.
 90% of GWAS SNPs are in non-coding regions of the genome
– Causal genes?
 What is the mechanism of action explaining the association?
– Tissue/cell type?
– Molecular mechanism?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Linkage Disequilibrium?

A

Where alleles (DNA markers) occur together more often than can be accounted for by chance because of their physical proximity on a chromosome.

This is to do with the crossing over and recombination. The further away the loci, the more chance of cross-overs and if close together, more chance of being inherited togethe. If it were in linkage equilibrium, there would be a 50% change of two loci being inherited at the same time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is RNAseq data?

A

Looking to see which genes have more or fewer mRNA copies present.
 Relative expression data for every gene
 Need to set significance threshold
– P-value
– Fold change
 Novel, allele specific expression, and
alternative transcripts may be identified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the osteocyte transcriptome?

A

 Same cell type at different locations show different transcriptomes
 >100 novel transcripts identified
 Distinct pattern of different expressed genes in different tissues/cell types

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the other applications of RNAseq?

A

 Cell populations response to treatments
 How gene expression changes through development or under disease conditions
 Single cell transcriptome analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the difficulties of using RNAseq data?

A

 Many expression changes likely to be found
– Difficult to differentiate real from methodological artifacts
 Transcriptome is a snapshot of expression in a specific cell/tissue and at a specific time
 Identification of differential expression does not provide biological reasoning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How useful are GWAS and RNA sequencing in when it comes to complex diseases and large data sets?

A

GWAS
 Identifies associations across
whole genome
 Large number of loci
 Doesn’t identify causal variants or
genes
 Doesn’t identify cell
type/tissue/developmental stage

RNA Sequencing
 Transcriptome of single cell/tissue
type
 Large number of differentially
expressed genes
 Misses changes in other cell types
or stages of development
 Doesn’t identify reason for
differential gene expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a pathway analysis?

A

You can take a list from a large data set and see which genes are enriched and related.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is personalised medicine?

A

 Applying the results of genetic studies to the healthcare management of an individual
– Predict and prevent disease
– Diagnosis
– Personalised interventions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is GWAS loci relating to biological reasons?

A

Links loci to disease traits.
-Causal mutation/gene for each loci
-Genes or pathways identifying with disease
-Prioritising what to investigate further
- Validating findings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are difficulties linking loci to gene?

A

 Linkage Disequilibrium makes it difficult to distinguish causal variant
 90% of GWAS SNPs are in non-coding regions
– Regulatory elements
* Promoters, Enhancers, TF binding sites
 May act at a distance from effected gene(s)
 Need to determine relevant cells/tissues

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is fine mapping?

A

 High resolution study of loci attempting to pinpoint individual variants directly effecting trait
 Statistical and probabalistic methods
or comparison to a SNP correlation reference panel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How are causal genes assigned?

A

 Proximity
 Non-synonymous exonic change
 Chromatin conformation capture

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the relevant cell types for fine mapping?

A

 Often unclear what are the causal cells
 SNP enrichment analysis
– Gene expression
– Regulatory elements
– Open chromatin - likely to indicate genes actively expressed. So you can assign each gene a regulatory activity score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the basic stages of analysing GWAS?

A

 100s of GWAS loci, 90% in non-coding regions
 Fine mapping to attempt to define causal variants at loci
 Analysis of causal SNP location to predict causal gene
 Cell type SNP enrichment analysis to determine relevant cell types

17
Q

How can we combine genotype and expression data?

A

 Can we use gene expression data to help annotate our GWAS loci?
 eQTL - A locus that explains a fraction of the genetic variance of a gene
expression phenotype
 Colocalisation analysis – Compare the GWAS and eQTL at a locus to determine if they are due to the same causal variants
 TWAS -Transcriptome wide association studies

18
Q

What is co-localisation analysis?

A

GWAS and eQTL can be combined to pinpoint which gene and SNP is likely to associate with what you are looking for. You can compare if they are shown by the same signal or if they are two different signals that happens to be together.

19
Q

What is an explanation for locus overlap?

A

 GWAS and eQTL loci can overlap for 3 reasons:
 Independent causal variants
 A single causal SNP
 Pleiotropy

20
Q

What is a Transcriptome wide association study?

A

 Directly test for associations between gene expression levels and phenotypes. So this is basically combining GWAS and RNAseq at once.
– Overcomes most issues with GWAS and RNAseq
– Currently not generally feasible
 Gene expression is highly heritable
– Tissue specific eQTL maps
 You can use genotype to predict gene expression levels
– GWAS studies give us the SNP genotypes

21
Q

How do we prioritise results from GWAS?

A

We can use our knowledge of the genome and gene expression to prioritise loci/genes for further investigation
– eQTLs – expression quantative trait loci
– Colocalisation analysis to annotate causal genes
 TWAS to directly associate gene expression to trait phenotype

22
Q

How do you validate your results?

A

 Once you have prioritized genes for investigations what do you do?
– Cell Studies
– Functional Phenotyping
– High throughput screens

23
Q

How do you narrow in on interesting genes?

A

 GWAS looking at BMD
– 518 loci
– Fine mapping
– SNP annotation to assign causal genes
– Gene annotation to identify interesting results
– Dishevelled Associated Activator Of
Morphogenesis 2 (DAAM2)

24
Q

What is the functional analysis animal models?

A

 Knockout animals
– Total knockout
– Cell specific
– Inducible
– Gene editing

25
Q

What is in vivo functional analysis?

A

 Weaker bones are the relevant functional phenotype to osteoporosis

26
Q

What are reverse genetic screens?

A
27
Q

What are reverse genetics with modern tools?

A

 Classical reverse genetics
– Cloning to introduce mutations in target gene
– Observe phenotypic outcome
 Targeted genetics
– Gene targeting and trapping
– Gene editing

28
Q

What is the International Mouse Phenotyping Consortium?

A

 98% of human genes have a homolog in the
mouse genome
 IMPC is generating knockout mice strains for
each mouse gene
– ~20,000 genes
 All mice are same background, kept under the
same conditions
– Basic phenotyping
– Results freely available