Using the results of genetic studies Flashcards
Large data genetic analysis techniques
GWAS - identify loci in genome associated with a phenotype/disease
RNAseq - provides a snapshot of gene expression levels
difficulties using GWAS data
large GWAS for complex diseases detect many loci = prioritisation
90% of GWAS sap’s are in non-coding regions of the genome
RNA seq data
relative expression data for everyone
need to set significance threshold = p value, fold change
novel, allele specific expression and alternative transcripts may be identified
osteocyte transcriptome
same cell type at diff locations show diff transcriptomes
> 100 novel transcripts identified
distinct pattern of diff expressed genes in diff tissues/cell types
RNAseq other applications
cell populations response to treatment
how gene expression changes through development or under disease conditions
single cell transcriptome analysis
what are the difficulties using RNAseq data
many expression changes likely to be found = difficult to differentiate real from methodological artifacts
transcriptome is a snapshot of expression in a specific cell/tissue and at a specific time
identification of differential expression does not provide biological reasoning
what do GWAS do
identifies associations across whole genome
large number of loci
doesn’t identify causal variants or genes
doesn’t identify cell type/tissue/developmental stage
what does RNA sequencing do
transcriptome of single cell/tissue type
large number of differentially expressed genes
misses changes in other cell types or stages of development
doesn’t identify reason for differential gene expression
describe osteocytes
embedded in lacunae in mature bone
connected via processes through canalicular channels
form a mechanosensory network throughout bone
describe pathway analysis
generate a gene set and compare database
gene ontology and kyoto encyclopaedia of genes and gnomes
allows you to identify new biology by determining the type of genes with association/differential expression
personalised medicine.
applying the results of genetic studies to the healthcare management of an individual
predict and prevent disease
diagnosis
personalised interventions
difficulties linking loci to gene
linkage disequilibrium makes it difficult to distinguish causal variant
90% of GWAS SNPs are in non-coding regions
may act at a distance from effected genes
need to determine relevant cells/tissues
fine mapping
high resolution study of loci attempting to pinpoint individual variants directly effecting trait
statistical and probabilistic methods or comparison to a SNP correlation
assigning causal genes
proximity
non synonymous exonic change
chromatin conformation capture
what is the relevant cell type
often unclear what are the causal cells
SNP enrichment analysis
- gene expression
- regulatory elements
- open chromatin
GWAS to biological reasoning
100s of GWAS loci, 90% in non-coding regions
fine mapping to attempt to define causal variants at loci
analysis of causal SNP location to predict causal gene
cell type SNP enrichment analysis to determine relevant cell types
explanations for locus overlap
GWAS and eQTL loci can overlap for 3 reasons:
independent causal variant in LD
a single causal SNP
pleiotropy
transcriptome wide association studies
- directly test for associations between gene expression levels and phenotype
- overcomes most issues with GWAS and RNAseq
- currently generally
- gene expression is highly heritable = tissue specific eQTL maps
you can use genotype to predict gene expression levels = GWAS studies give us the SNP genotypes
prioritising from GWAS results
use knowledge of genome and gene expression to prioritise loci/genes for further investigations
eQTLs = expression quantitative trait loci
colocalisation analysis to annotate causal genes
TWAS to directly associate gene expression to trait phenotype
functional analysis - model animals
knockout animals = total knockout cell specific inducible gene editing
reverse genetics with modern tools
classical reverse genetics
- cloning to introduce mutations in target gene
- observe phenotypic outcome
targeted genetics-
- gene targeting and trapping
- gene editing
international mouse phenotypic consortium
98% of human genes have a homolog in the mouse genome
IMPC is generating knockout mice strains for each mouse gene = 20,000 genes
all mice are same background, kept under the same conditions
- basic phenotypic
- results freely available