Lecture 2: Human Genetic variation in Health and Disease Flashcards
What is the human genome and what does it contain?
13
1 * ~37 trillion cells in the body
– Many different types of cells & functions
2 * All cells with a nucleus contain DNA
(mature red blood cells; cornified cells in skin, hair, nails; no nucleus)
3 – 99.9% genes in nucleus
4 – mitochondrial DNA (mtDNA) = 37 genes
5 – Which genes are turned on or off, and when, determine cell type.
6.* 3,000,000,000 base pairs of DNA (X 2).
7 – A standard typist would take 34 years of non-stop typing to type this out!
8 – 1.8m long 1 cell nucleus (tightly wound)
9 * ~20,000 protein-coding genes
10 * ~98% of DNA is “junk DNA”
11 * Any 2 people are ~99.6 – 99.9% identical
12 – 0.4% variation = ~ 12 million base pairs
13 * Finding a molecular diagnosis is essentially searching for one or more typographical errors in 34 years of typing…
What is Human Genetic Variation?
Variation in structure or sequence of the human genome
Human Genetic Variation - whom/where is it among?
= 2
1 – Inter-individual (intra-individual)
2 – Inter-population
Multiple mechanisms contributing To Human Genetic Variation? =6
- Meiotic recombination
2 – DNA replication and repair
3 – Population effects
4—- * Random genetic drift
5 —- * Selection (adaptive advantage)
6 —-* Migration
Understanding Geographical Genetic Diversity
= 3
1 * Out of Africa theory
– H.sapiens evolved into modern humans solely in Africa, 100-200,000 years ago
– One African subpopulation of hominins (among several) was ancestral to all human
beings today
– Some members of that subpopulation left Africa by 60,000 yrs ago
2* Supported by study of present-day mitochondrial DNA, and Y
chromosome sequence variation.
3 * Genetic diversity decreases with migratory distance from Africa
– Due to bottlenecks during human migration (temporarily reduce population size)
Generally speaking ..genetic diversity in world…
- General similarities between neighbouring populations.
- Large differences between different parts of the world.
What are the two types of Variation?
- Structural (>1000bp)
- Sequence level (<1000bp)
Types of Variation
Understanding
* Structural (>1000bp) = 3
1 – Copy number (deletions & duplications)
2 – Positional (insertions, translocations)
3 – Orientational (inversion)
Types of Variation
Understanding Sequence level (<1000bp) = 3
1 – Single base substitutions
2 – Small insertions/deletions/duplications
3 – Repetitive sequence
What is Cytogenetics?
Structure, properties and behaviour of chromosomes
What is Molecular Genetics = 4
1 * Studies structure and function of genes at a
molecular level
2 * Uses molecular biology & genetics
3 * Inter-relationship between DNA, RNA and
synthesis of polypeptides
4 * Tests are typically DNA- or RNA-based
Spectrum of Variation in the Human Genome… SMALL VARIANTS = 3
VARIATION…REARRANGEMENT TYPE… SIZE RANGE
SMALL VARIANTS AT SEQUENCES
- MAY BE DIEASE CAUSING
- SINGLE BASE-PAIR CHANGES
- Single nucleotide polymorphisms, point mutations
- 1bp - SMALL INSERTIONS/DELETIONS
- Binary insertion/deletion events of short sequences (majority < 10bp in size)
- 1-50bp - SHORT TANDEM REPEATS
- Microsatellites and other simple repeats
- 1-500 bp
Spectrum of Variation in the Human Genome…LARGE VARIANTS … MAY BE DISEASE CAUSING OR NOT
…LARGE VARIANTS
… MAY BE DISEASE CAUSING OR NOT
- Large-scale structural variation
- deletions, duplications, large tandem repeats, inversions
- 50kb to 5 Mb - CHROMOSOMAL VARIATION
- Euchromatic variants, large cytogenetically visible deletions, duplications, translocations, inversions, and aneuploidy
– aprox 5Mb to entire chromosomes
Spectrum of Variation in the Human Genome…REPETITIVE ELEMENTS OF VARIOUS SIZES. === 8 features + examples
- Repetitive elements of various sizes.
2 * Tandem (vary in length) or interspersed (vary in copy number)
3 * Small (<5Mb) or cytogenetically visible (>5Mb)
4 * Non disease-causing but may predispose to
rearrangements or expansions/contractions. - FINE-SCALE STRUCTURAL VARIATION
- Deletions, duplications, tandem repeats, inversions
- 50bp to 5 kb - RETROELEMENT INSERTIONS
- SINEs, LINEs, LTRs, ERVs
- 300 bp to 10 kb - INTERMEDIATE-SCALE STRUCTURAL VARIATION
- deletions, duplications, tandem repeats, inversions
- 5-kb to 50kb - Large-scale structural variation
- deletions, duplications, large tandem repeats, inversions
- 50kb to 5 Mb - CHROMOSOMAL VARIATION
- Euchromatic variants, large cytogenetically visible deletions, duplications, translocations, inversions, and aneuploidy
– aprox 5Mb to entire chromosomes
Causes of Genetic variation? = 4
1 * DNA repair mechanisms contending with damage due to mutagens
– Ionising radiation
– UV
– Chemicals
2 * DNA replication errors
– Proof-reading errors
– Fork stalling
3 * Homologous DNA recombination during meiosis
– Allelic and non-allelic
4 * Retrotransposition
– Elements able to amplify themselves throughout the genome.
– Copy themselves to RNA and then back into DNA that integrates back
into the genome.
Causes pf Genetic variation?
Understanding DNA repair mechanisms contending with damage due to mutagens… 3
1 – Ionising radiation
2– UV
3 – Chemicals
Causes pf Genetic variation?
Understanding DNA replication errors = 2
– Proof-reading errors
– Fork stalling
Causes pf Genetic variation?
Understanding Homologous DNA recombination during meiosis
Allelic and non-allelic
Causes pf Genetic variation?
Understanding RETROTRANSPOSITION ..2
- Elements able to amplify themselves throughout the genome.
– Copy themselves to RNA and then back into DNA that integrates back
into the genome.
Defining Variation: Reference Genome
- The human reference genome is: 5
- Maintained by the Genome Reference Consortium
- – A product of multiple international contributions
- – Updated by new full releases and minor patches/fixes
- – Represents “normal”
- However we know that there are differences between “normal” populations
Defining Variation: Reference Genome
* The human reference genome is:
CREATED? WHEN NOT USEFUL? 3
- Created by sampling lots of individuals of varying genetic make-up.
- – Not necessarily appropriate when comparing large range of ethnicities (e.g.
Australia)?
—- 3* Probably less of an issue in Mendelian inheritance, however could be an issue when considering complex inheritance and epistasis (suppression of the effect of a gene by another).
Variation and Phenotype IN GENETIC VARIATION = 3
1 * Any individual will have 10s of thousands of structural and sequence variants when compared to
the reference genome.
2 – Any may contribute to that individual’s phenotype
3 – Phenotype = “set of observable characteristics of an individual resulting from the interaction of it’s genotype with the environment”.
Spectrum of consequences of genetic variation and Phenotype…4
1 – No change in phenotype
2 – Alternative phenotypes of no medical consequence
3 – Disease susceptibility
4 – Pathogenic
Continuous Vs Discontinuous Variation …IN POPULATION =6 GRAPH?
*1 Continuous variation has no limit on the value that
can occur within a population.
2 – Line graph is used to represent.
3 * Height
4 * Weight
5 * Heart rate
6 * Finger length
Continuous Vs Discontinuous Variation IN ORGANISIMS.. GRAPH? =7
1 * Discontinuous variation has distinct groups for
organisms to belong to
2 – A bar graph is used to represent
3 * Tongue rolling
4 * Finger prints
5 * Eye colour
6 * Blood groups
7* Detached ear-lobes
Databases of Variation… TYPES…3
1 * Range of web-based databases containing normal or disease-associated variants.
- – “Normal” variation
* Database of Genomic Variants - Copy number variants (CNVs)
* ExAC, Exome Variant Server, 1000 Genomes, dbSNP (sequence variants). - – Disease-associated variation
* Disease/gene specific databases
* Database of Human Gene Mutation Data (HGMD, sequence variants)
* DECIPHER (CNVs)
Databases of Variation… Generally speaking… 4
1 – Different levels of curation, evidence base
2 – Generally free to access, some require a fee
3 – In-house collection of variants.
….4 * International pressure to contribute local data to
international databases
What is a Reference genome?
normal vs Abnormal …5
1 * Caution needs to be taken when calling genetic variation “normal”
vs “abnormal” using a single reference
2 – Variation occurs between populations.
3 – Common variant = “normal”
* BRCA2, c.865A>C (p.Asn289His)
* ExAC frequency all people = 0.052 (e.g. not rare)
4 – Rare variant = “abnormal”
* DMD, c.10412T>A (p.Leu3471*)
– * = X or stop
5 * Does not exist in any benign variant databases
– most likely disease-causing
– also gene fits with phenotype, and type of mutation fits with previous
knowledge for this gene.