Genome Variation Flashcards

1
Q

What proportion of the genome is the exome - codes for protein?

How much of our genome do we share with someone else?

A

2%

99.7% - only 9 million bases out of 3 billion are different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What kind of differences in the genome are associated with disease?

A

Macro-level differences (e.g. trisomy 21) and micro-level, molecular-level differences (e.g. a single point mutation as in SCA/ a 3BP deletion in the CFTR gene leading to CF)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is special about mono-zygotic twins’ DNA?

A

They are identical at every base

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What in DNA terms is considered polymorphic?

A

Any base in the genome that varies between individuals is polymorphic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a reference sequence and a reference allele?

A

A sequence database which summarizes the base at that position that is present for the majority of people

The most common allele

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are variants/ polymorphic positions found?

A

By comparing someone’s sequence to a reference sequence and seeing that they are different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How was the referencing sequence generated?

A

4 anonymous individuals genomes were sequenced and averaged out in the human genome project

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How often does a SNV occur in the reference sequence and in one individual?

A

Once every 300 nucleotides in the reference sequence; once every 1000 nucleotides in an individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Where are the majority of SNVs found?

A

Not in the exome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How are SNVs generated?

A

By faulty mismatch repairing that occurs during DNA replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a biallelic site?

A

A site in DNA where there could be 2 possible alleles (2 variants, one of which is the reference sequence base)

“A biallelic site is a specific locus in a genome that contains two observed alleles, counting the reference as one, and therefore allowing for one variant allele. In practical terms, this is what you would call a site where, across multiple samples in a cohort, you have evidence for a single non-reference allele.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is a SNV formed?

A

In DNA replication the two strands separate and are templates to synthesise complementary strands, forming identical copies.

However, when synthesising this strand instead of incorporating an A, a G has been incorporated.
The mismatch repair mechanism will identify this mistake and correct it so that the bases are a standard Watson-Crick base pair
However, in this instance it hasn’t corrected the G, it’s replaced the T with a C.

If this change occurs in the gametes and isn’t deleterious then it will get passed on to the next generation and as time goes on it can spread through the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Where can SNVs be found?

A

In genes, promoters and non-coding regions

In genes, they can change (non-synonymous/ missense) or not change (synonymous) an amino acid, and could change the amino acid into a stop codon (nonsense). They also change where the splicing can occur in a sequence (the splice sites) and can occur in a UTR, affecting gene expression

In promoters they can affect protein expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When do SNVs disappear from the genome?

A

When they have a deleterious effect (causing harm/ damage) or cause population annihilation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What kind of mutation is SCA?

A

A point, missense mutation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How common is the SCA point mutation?

A

White European people
0.02%
2 in every 10,000 chromosomes

African people
4.5%
1 in every 20 chromosomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is heterozygote advantage?

A

When being heterozygous for a disease allele provides benefit e.g. having a sickle cell trait protects against malaria in heterozygotes

18
Q

Minor allele frequency figures:

  1. For a mutation
  2. For a polymorphism (but this does not imply no pathogenicity e.g. SCA)
  3. Rare polymorphism
  4. Common polymorphism
A
  1. <1%
  2. > 1%
  3. 1-5%
  4. > 5%
19
Q

What determines if a variant (that starts off rare) remains that way?

A

Evolutionary forces

20
Q

How do SNVs spread?

A
  1. Migration introduces the variant into another population, known as gene flow
  2. Random change in variant allele frequency between generations, known as genetic drift
  3. Selection in favour of the allele (non random change in variant allele frequency)
21
Q

What is negative and positive selection?

A

Negative selection is the selective removal of rare alleles that are harmful, whilst positive selection are the traits of a species that are selected for

22
Q

What must be considered when determining of a variant is pathogenic?

A

If it is occurring in a gene/ not and if so, is it a key developmental gene (e.g. HOXD1, in which case it could be pathogenic) or not (e.g. MC1R gene for pigmentation)

So it is not easy to determine this. It depends on both the type of variant and the environment.

23
Q

Why is every genome not exactly 3000Mbs?

A

Every individual has a different (and highly variable) number of microsatellites/ STRs/ SSRs (simple sequence repeats)

24
Q

What are the types of microsatellites?

A

Di/ tri/ tetra/ penta/ hexa nucleotides (tells how many bases are present in 1 repeat)

25
Q

What does the reference sequence show in terms of microsatellites?

A

The number of repeats the vast majority of the population have

26
Q

How are microsatellites unlike SNPs?

A

They are multiallelic (can have 7,9,12 repeats etc.) not biallelic (only A and C versions of the allele)

27
Q

Why are microsatellites so prone to differences in the number of their repeats?

A

During replication, the growing strand can become unattached and then re-anneal in the wrong place i.e. shifts forward (since the whole of that area is complimentary). This leaves a little bubble of unpaired bases, which the polymerase complex backtracks and repeats the insertion of deoxyribonucleotides at, leading to extra repeated units being incorporated into the sequence (and so multiallelism)

28
Q

Where are microsatellites found?

A

Anywhere in the genome!

Part of introns, UTRs; affects gene expression
Part of intergenic regions
Part of exons (could introduce a new amino acid in a protein e.g. in Huntington’s disease, an expansion disorder)

29
Q

What is a CNV?

A

Copy Number Variants are big chunks of DNA (e.g. 5MB’s long) that have either been duplicated to form more than 2 copies (so we are no longer diploid for that locus) or have had 1 copy/ both copies deleted. This does not cause harm

30
Q

What is the simplest type of CNV?

A

The presence/ absence of a gene; so an individual can contain from 0-4 copies of a generic

31
Q

What causes CNV?

A

Non allelic homologous recombination occurring in Meiosis - 2 pairs of homologous chromosomes misalign before exchanging genetic material, due to sequence similarity between the 2 chromosomes, causing duplication in one pair and deletion in the other and so a copy number change

32
Q

Can CNVs be found intergenically?

A

Yes but due to their large size they often affect one or more genes

33
Q

How much of the genome is a CNV?

A

Approx. 12%

34
Q

What is an example of a CNV being pathogenic?

A
Trisomy 21 (Down syndrome)
Microdeletion disorders (DiGeorge syndrome)
35
Q

What are the types of common genetic variation? Do they cause mendelian, monogenic disorders?

A

SNPs, microsatellites and CNVs - we all have them, just have a different genotype. Most common variants do not cause mendelian, monogenic disorders, the majority are actually neutral. Instead they impact on complex, non-mendelian disorders + contribute to general individual variation (e.g. looks, sporting ability)

36
Q

What is the minor allele frequency in biallelic variants?

A

Relatively high

37
Q

What are some common variants and their associations?

A
Height
Allergies
Haemochromatosis
Type 1 and Type 2 diabetes
Alzheimer’s
Anxiety
Dyslexia
Memory
Sexual desire
Aging
Nicotine dependence
Faithfulness
Age-related hearing loss
Gout
Sciatica
Sense of smell
HIV susceptibility
Anti-social behaviour
38
Q

What are some variant effects?

A

Beneficial/ pathogenic (but most are neutral)

39
Q

What can variants be used for?

A

As marks to help find disease causing genes/ mutations

Autozygosity mapping (dentification of recessively inherited disease genes using small inbred families)
Linkage studies
(Microsatellites, SNPs)

Association analysis (SNPs, CNVs)

40
Q

What is the book analogy?

A

Whole book = genome
Chapter = chromosome
Delete or duplicate chapter and you can really mess the story up
Paragraph = CNV
Delete or duplicate a paragraph and, as long as it’s not key, you can make do
Sentence = microsatellite
Accidentally repeat several words within that sentence, you elongate it, it’s annoying but not fatal to the plot
Letter = SNP
Typos often barely change the meaning at all