Genetic Variation I Flashcards

1
Q

What percent of protein-coding genes are polymorphic?

A

33%

Additional nucleotide diversity in introns, regulatory sequences, and flanking sequences

2
Q

What percentage of total genetic variation is found within populations?

A

~85%

3
Q

What causes genetic variation in humans?

A

Changes to the base sequence, in 2 categories:

Changes that do not affect DNA content (the number of nucleotides is unchanged; instead, bases are replaced or sequences are translocated/inverted)

Changes that cause a net gain or loss of DNA sequence (changes in copy number of a DNA sequence or abnormal chromosome segregation; deletions or insertions ranging from single nucleotides or short sequences up to megabases of DNA)

4
Q

Do all DNA changes affect phenotype?

A

No. Most DNA changes are small-scale, so they may or may not affect phenotype

5
Q

What are DNA variants caused by?

A

Mutations resulting in alternative forms of DNA

6
Q

What is a polymorphism?

A

For any locus, if more than one DNA variant is common in the population (frequency > 0.01), it is called a polymorphism.

A variant with a frequency < 0.01 is a rare variant.
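
The 0.01 threshold above can be sketched in a few lines of Python. This is only an illustration: the function name `is_polymorphic` and the example frequencies are made up for this card, not taken from any dataset.

```python
def is_polymorphic(allele_freqs, threshold=0.01):
    """Return True if more than one allele at a locus exceeds the threshold.

    allele_freqs maps each allele to its population frequency
    (hypothetical input format chosen for this sketch).
    """
    common = [allele for allele, freq in allele_freqs.items() if freq > threshold]
    return len(common) > 1


# Two common alleles: a polymorphism.
print(is_polymorphic({"A": 0.70, "G": 0.30}))
# One common allele plus a rare variant: not a polymorphism.
print(is_polymorphic({"A": 0.995, "G": 0.005}))
```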

7
Q

Where does knowledge of DNA variants come from?

A

From analysing DNA from complete genome sequencing of multiple individuals

8
Q

Where is most genetic variation located?

A

In non-coding regions of the human genome.

9
Q

What are single nucleotide polymorphisms (SNPs) and variants (SNVs)?

A

The most common type of variation, caused by single nucleotide substitution.

This type of change produces single nucleotide variants (SNVs). If 2 or more alternative variants at the site each exceed a frequency of 0.01 in the population, it is called a single nucleotide polymorphism (SNP).

10
Q

What is the “major allele”?

A

The allele that is more common in a population. Different populations can have different alleles as the “major allele”.
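
A small sketch of the point that the major allele depends on the population. The two populations and their allele counts are invented purely for illustration.

```python
from collections import Counter


def major_allele(alleles):
    """Return the most frequent allele in a list of observed alleles."""
    return Counter(alleles).most_common(1)[0][0]


# Hypothetical allele samples from two populations at the same SNP.
population_1 = ["A"] * 70 + ["G"] * 30
population_2 = ["A"] * 20 + ["G"] * 80

print(major_allele(population_1))  # major allele in population 1
print(major_allele(population_2))  # major allele in population 2
```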

11
Q

Why are SNVs not considered random?

A

Different regions have different mutation rates

The mutation rate of mtDNA is higher than that of nuclear DNA

C→T substitutions are the most common

12
Q

What do alternative SNPs tell us about evolutionary ancestry?

A

Alternative SNP alleles mark alternative ancestral chromosome segments that are common in the present-day population

13
Q

What do SNPs do to overall function of DNA?

A

They can cause the gain or loss of restriction enzyme sites, leading to restriction fragment length polymorphisms (RFLPs)
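
A minimal sketch of how a single base change can remove a restriction site and so change the fragment pattern. It assumes the EcoRI recognition sequence GAATTC (cut after the first G); the two short allele sequences are made up for illustration.

```python
def fragment_lengths(seq, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting seq at every occurrence of site.

    cut_offset is the cut position within the site (1 = after the first base).
    """
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    edges = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(edges, edges[1:])]


# Hypothetical alleles differing at one nucleotide inside the site.
allele_with_site = "AAAA" + "GAATTC" + "TTTT"
allele_without_site = "AAAA" + "GAGTTC" + "TTTT"  # A -> G destroys the site

print(fragment_lengths(allele_with_site))     # two fragments
print(fragment_lengths(allele_without_site))  # one uncut fragment
```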

14
Q

What do indels create?

A

Copy number variations.

For example, a person heterozygous for a deletion of a single nucleotide at a defined position on a chromosome has one copy of that nucleotide instead of 2.

15
Q

What does modern convention describe indels as?

A

Deletions/insertions up to 50 nucleotides long

16
Q

What is a change in copy number of a sequence described as?

A

Copy number variation (CNV): a change in the copy number of sequences, resulting in larger deletions/insertions (>100 nucleotides)

17
Q

How common are indels?

A

About 1/10th as frequent as single nucleotide substitutions

18
Q

What are the types of tandem repeat structures in DNA?

A

Satellite DNA

Minisatellite DNA

Microsatellite DNA

19
Q

What are satellite DNA structures?

A

Length: 20 kb to many hundreds of kb; located at centromeres and other heterochromatic regions

20
Q

What are minisatellite DNA structures?

A

Length: 100 bp to 20 kb; located primarily at telomeres and subtelomeric regions

21
Q

What are microsatellite DNA structures?

A

Length: fewer than 100 bp; located widely throughout euchromatin
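
The three size classes from the satellite, minisatellite, and microsatellite cards can be summarised as a small lookup. This is a sketch only: the function name is hypothetical, and because the cards give 20 kb as both the top of the minisatellite range and the bottom of the satellite range, the code arbitrarily assigns exactly 20 kb to "satellite".

```python
def tandem_repeat_class(array_length_bp):
    """Classify a tandem repeat array by its total length in base pairs."""
    if array_length_bp < 100:
        return "microsatellite"
    if array_length_bp < 20_000:
        return "minisatellite"
    # 20 kb and above (boundary choice is an assumption of this sketch)
    return "satellite"


print(tandem_repeat_class(60))
print(tandem_repeat_class(5_000))
print(tandem_repeat_class(200_000))
```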

22
Q

How stable are repeat sequences?

A

They are unstable: variants differ in the number of repeats

23
Q

What causes variation in copy number?

A

Replication slippage or unequal crossover

24
Q

How do microsatellites differ from SNPs in population genomics?

A

Microsatellites have multiple alleles, unlike SNPs, which typically have only 2 alleles

25
Q

How does slippage cause insertion?

A

A repeat on the newly synthesised strand loops out, so that strand no longer aligns perfectly with the template strand and replication adds an extra repeat.

The opposite occurs for deletions (the template strand loops out).

26
Q

How can unequal crossing over result in additional repeats of a sequence?

A

Chromatids of homologous chromosomes can misalign at the repeat sequences. When recombination occurs between the misaligned repeats, one product gains extra repeat units, resulting in expansion of repeat length, while the other product loses them.
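
The bookkeeping behind unequal crossing over can be sketched as simple arithmetic on repeat counts. The function and the example numbers below are hypothetical; the sketch assumes a single crossover inside the repeat array and ignores flanking sequence.

```python
def unequal_crossover(n_a, n_b, break_a, break_b):
    """Return repeat counts of the two recombinant products.

    n_a, n_b: repeat units on chromatids A and B before recombination.
    break_a, break_b: number of repeat units to the left of the
    breakpoint on A and on B (they differ when the arrays are misaligned).
    """
    product_1 = break_a + (n_b - break_b)  # left part of A + right part of B
    product_2 = break_b + (n_a - break_a)  # left part of B + right part of A
    return product_1, product_2


# Two 10-repeat arrays misaligned by 2 units.
print(unequal_crossover(10, 10, break_a=6, break_b=4))
```

One product is expanded and the other contracted, while the total number of repeat units is unchanged.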

27
Q

What are microsatellite markers used for? How are they used?

A

They are used to track inheritance of different chromosomes in a family.

Primers are designed to flank each repeat region, and PCR is used to amplify these regions.

The length of each amplified fragment is then determined by running it through a sequencing capillary column.
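
Because the primers flank the repeat, the fragment length reported by the capillary run can be converted into a repeat count. A minimal sketch, assuming the flanking length (primers plus any non-repeat sequence) and motif length are known; all numbers are invented for illustration.

```python
def repeat_count(fragment_length_bp, flank_length_bp, motif_length_bp):
    """Infer the number of repeat units from a PCR fragment length."""
    repeat_region = fragment_length_bp - flank_length_bp
    if repeat_region < 0 or repeat_region % motif_length_bp != 0:
        raise ValueError("fragment length is inconsistent with the assumed flanks and motif")
    return repeat_region // motif_length_bp


# Hypothetical heterozygote: two peaks at 150 bp and 158 bp,
# 110 bp of flanking sequence, 4 bp repeat motif.
print(repeat_count(150, 110, 4))
print(repeat_count(158, 110, 4))
```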

28
Q

What do peaks on a gene scan (after sample is run through a sequencing capillary column) tell us about the gene?

A

The size of the repeats and their frequency. This is more informative than SNPs for distinguishing between individuals or following chromosome segments through pedigrees.

29
Q

How important was the human genome project to understanding microsatellites?

A

The early years of the Human Genome Project were largely devoted to defining and mapping microsatellites; 150,000 were identified.

30
Q

What are the limitations to using repeats over SNPs?

A

Analysing repeats is much harder to automate than analysing SNPs.

31
Q

Where does DNA variation come from?

A

Some arise from errors in DNA replication or recombination

Errors in chromosome segregation result in abnormal gametes with fewer or more chromosomes than normal

Various natural errors, such as crossover errors, give rise to altered copy numbers of specific sequences within a DNA strand

Various endogenous/exogenous sources can damage DNA by altering its chemical structure

32
Q

What large-scale changes are important for genetic variation?

A

Balanced structural variation

Unbalanced structural variation

33
Q

What is balanced structural variation?

A

The DNA variants have the same DNA content but differ in that some DNA sequences are located in different positions in the genome. Chromosomes break and the fragments are incorrectly rejoined without loss or gain of DNA (inversions and translocations).

34
Q

What is unbalanced structural variation?

A

The DNA variants differ in DNA content. In rare cases a person has gained or lost a chromosomal region, which often results in disease.

Also includes commonly occurring copy number variation (CNV), in which variants differ in the number of copies of a moderately long to very long DNA sequence. Some CNVs contribute to disease and others are normal.

35
Q

What is the most common type of genetic variation?

A

SNPs (75% of DNA changes are SNPs)

36
Q

How many SNPs are there in a human genome?

A

38 million (1 per 100)

Vast majority are rare in any population

Most people are homozygous at any given SNP locus.

Personal sequencing: SNVs between maternal and paternal genomes = 1 per 1000 bp.

Structural variations = 1/4 of mutational events, dominated by CNV
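
A quick arithmetic check of the densities quoted above. The haploid genome size of about 3.1 billion bp is an assumption added here, not a figure from the card; with it, 38 million SNPs works out to roughly 1 per 80 bp, which the card rounds to 1 per 100.

```python
GENOME_SIZE_BP = 3.1e9   # assumed approximate haploid human genome size
CATALOGUED_SNPS = 38e6   # from the card

# Average spacing between catalogued SNP sites across the population.
bp_per_snp = GENOME_SIZE_BP / CATALOGUED_SNPS

# Expected maternal/paternal differences in one person at 1 per 1000 bp.
het_sites_per_person = GENOME_SIZE_BP / 1000

print(f"about 1 SNP per {bp_per_snp:.0f} bp")
print(f"about {het_sites_per_person:,.0f} heterozygous sites per person")
```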

37
Q

What is the human genome project good for?

A

For a consensus sequence. It is not good for capturing individual differences or genetic variation.

38
Q

What are the steps that took place historically to come up with variant maps?

A

Human genome project

Identifying genetic variants (anonymous with respect to traits)

Assaying genetic variants (verifying polymorphisms and cataloguing correlations among sites, anonymous with respect to traits) [HapMap Project]

39
Q

How were SNPs found and understood?

A

SNPs were first discovered (the goal was to identify 300k SNPs and to determine their allele frequencies)

Then SNPs were characterised

40
Q

What reference genome was used for SNPs?

A

Human Genome Project