Genetics of common disease Flashcards
Describe mendelian diseases
- Mainly due to 1 gene and one mutation
- e.g CF deltaF508 - a three-base deletion, which removes a phenylalanine codon.
- Inheritance pattern is clear (recessive loss of function, autosomal dominant, X-linked)
- Minimal influence from environment
Describe common diseases
- Are MULTIFACTORIAL
- Multiple genes affect the disease trait
- There is a strong influence of the environment
- The effect of every single gene may be neglibile
- Multiple common alleles influence susceptibillity
What is heritability?
Measure of how well differences in peoples genes account for differences in their traits
What does a heritability close to 1 indicate?
Almost all variability in a trait comes from genetic differences, with very little contribution from environment factors
e.g Cystic Fibrosis has a heritability of 1
How do we calculate heritability?
Using twin studies
- Monozygotic twins have 100% DNA
- Dizygotic twins have 50% DNA
- Both share the same environment therefore any differences would be due to disease/trait
How do you interpret twin studies?
When looking at a trait e.g height measure height in both sets and assess their concordance
- The higher the concordance = more similar they are going to be
The more the trait is determined by a genetic contribution, the greater the difference in concordance because monozygotic twins share 100% DNA, dizygotic twins share 50% DNA (essentially more genes involved = greater difference)
Once our heritabillity study has been carried out we identify the genes which contribute to that trait
What is concordance?
It’s how similar phenotype is
How can we find out which genes contribute to a trait/disease?
Through genetic association
- GWAS (genome wide association study)
How is a GWAS study carried out?
- Method for identifying gene variants (SNPs) involved in complex diseases by scored for hundreds or thousands of individuals who have the disease and controls (who do not)
- A typical GWAS collects data to find out the common variants in a number of individuals, bot with and without a common trait (e.g. a disease), across the genome, using genome wide SNP arrays.
How does a SNP microarray work?
- DNA sample prepared and fragmented
- DNA tagged/ labelled with fluorescent probe
- Mix DNA with slide, which contains oligonucleotides which match the region of the genome around each variant being tested
- If DNA sample contains a variant, then it will bind to specific matching oligonucleotides and fluoresce
- Signal is produced and detected
Encoding SNP chip data for analysis
After we have our data from the SNP chip we will convert it in a computer to get a code.
This works out what the genotypes of individuals are and converts them into a binary code that computer programmes deal with
Explain how we use SNP chips and avoid measuring all the genetic polymorphisms
- Rather than directly measuring genotypes at all genetic polymorphisms, we rely on the associations between SNPs we do assay and SNPs we don’t assay
- SNP-SNP association, or LINKAGE DISEQUILIBRIUM is the fundamental ability to sample a whole genome with few SNPs
What is linkage disequilibrium?
Non random association of alleles at two or more loci in a general population
Linkage disequilibrium between 2 SNPs decreases with physical distance as you are more likely to have recombination between them
If LD is strong (chance of variants in that region inherited together is higher), fewer SNPs are needed to capture variation in that region, therefore cheaper and easier way to analyse
What analysis is carried out to indicate how likely a variant is to be associated with a trait?
Statistical analysis (p- value indicates the significance of the association)
- Lower P-value = more significant
all of the p-values for all of the SNPs on the chip are then plotted on a Manhatten Plot
high peaks = high significance between a gene region and trait
What are the three possibilities if a SNP is associated with disease?
- There is a causal relationship between SNP and disease
- The marker is in linkage disequilibrium with a causal locus
- False posotive