Genetic Epidemiology Flashcards
What is genetic epidemiology?
- study of the role of genetic factors in determining health and disease in families and in populations, and the interplay of such genetic factors with environmental factors
- seeks to derive a statistical and quantitative analysis of how genetics work in large groups
- population dynamics alter the frequency and distribution of both genetic and environmental factors and thus their net effect on the phenotype of interest; gene pool in Canada evolves constantly, environment evolves (eg. cell phones), and as a result the findings we make at time t are not necessarily applicable to t plus 10 years
What is a genome?
-entire DNA of a species
Describe DNA composition and human genome
- A, G, T, C nucleotides
- DNA is double helix (A-T or C-G attached to sugar phosphate backbone)
- 3.2 million base pairs
- human genome is diploid; two DNA copies inherited from two parents
- 22 autosomal and 1 sexual chromosome pairs (women XX, men XY)
What is a gene?
- specific sequence that is transcribed into mRNA and translated to protein, and controls expression of traits
- 19000/20000 genes
- 2% coding, 98% not coding
- range from <1000 to >1000000 base pairs
- 99.9% of DNA sequence is identical in all humans
What are types of genetic variations?
Inversions: chunk of DNA inversed
Translocations: chunk of chromosome go on another chromosome (no loss of genetic info)
Fusions: relatively rare, chromosome becomes a ring and the telomeres stick together- functions the areas that stick together but the other genes in the ring are functional (growth is impacted)
Deletions: portion of chromosome that disappears
Duplications: 3 copies of the gene (compared to normal having 2 copies one from each parent)
Short/variable number-tandem repeats
Insertion: few base pairs are inserted but can mess up amino acid coding if its in a coding region
Single nucleotide variants: no structural variation, just variation in the sequence (accounts for 90% of human genetic variation)
What is an allele? What are the combinations of different alleles?
- state at a specific marker
- homozygous: 2 copies of same allele
- heterozygous: 2 different alleles
- haplotype: sequence of alleles along one chromosome (neighbouring mutations that are inherited)
What results from non-synonymous mutation (mutation leading to amino acid change)?
-50% has no effect on protein, 40% makes it less functional or not functional at all, 10% it increases function
When can a mutation in the coding region not have negative consequences? When does it impact amino acid structure of protein?
- can have a mutation but it doesn’t change the amino acid
- eg. your parents have TCT base pair combination, and you have a mutation of TCC but these both code for serine
- if there is a mutation that changes TAC into TAA, TAC leads to tyrosine but TAA is a stop codon so this will stop amino acid production and can lead to protein not being made
What is new regarding the intergenic region?
- geneticists usually interested in the coding region because it has the potential to change the structure of the protein and result in disease with mutations
- more recently people are realizing that there is a lot of gene regulation happening in intergenic region having an impact on other genes down further
What are linkage disequilibrium blocks?
- if you knew mutation 1, you could predict the allele for 100, etc. other mutations because they are always inherited together
- linkage disequilibrium is the non-random association of alleles at two or more loci
- human genome is composed of blocks of linkage disequilibrium
- extent of linkage disequilibrium blocks varies according to ethnic background
- areas of DNA that are most likely for mutation; as a result you have sections of DNA that are the same as your first ancestor
What are explanations for why the African population has smaller blocks of correlation between mutations compared to the European population that has large blocks of mutations inherited together?
- African population had more generations than the others (hypothesis that the human species started in Africa)
- they are more ancestral population so they have more opportunity to redistribute the DNA
- only 2 are coinherited together compared to 4 in the European population
What is inherited versus de novo genetic variation?
- most genetic variation we get from our parents; inherited
- de novo: DNA replication accidents (DNA repair methods are highly sophisticated but sometimes there is a de novo mutation that escapes attention of DNA polymerase)
- sometimes this happens at the cellular level and can lead to areas of the body having the mutation and some areas not having it
- de novo mutations can happen from reactive cellular metabolites, chemicals, radiation
What is a phenotype? What are important things about a phenotype?
- set of observable characteristics of an individual resulting from the interaction of its genotype with the environment
- can be a disease (endpoint), intermediary trait (appetite, physical activity, etc)
- binary (disease or no), categorical (smoker/non smoker), or quantitative (weight)
- clinically and biologically relevant
- easy and inexpensive to measure
- relevant in diverse ethnic groups
- minimal measurement error
- minimal misclassification and reporting biases
- heritable
How do we know if a trait is an heritable condition? How can it be studied?
- heritability: proportion of the total variation between individuals for the given trait in a given population that is explained by genetic factors
- between 0 and 100%
- if you have a heritability of 50%, 50% of the risk of disease/genetic variation is explained by genetic factors
- can be studied in twins, families, general population
How are traits compared in twins?
- compare the correlation between variable in identical twins and fraternal twins
- the difference in this correlation represents 50% of genome