L8: Tandem repeats & Repeat expansion disorders Flashcards

1
Q

What makes up the majority of our genome?

A

Repetitive DNA that includes TEs and related sequences (44%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two main categories of repeated elements?

A

tandem repeats and dispersed repeats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the three types of tandem repeats?

A

Tandem paralogues
Satellite DNA
rDNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the three types of satellite DNA?

A

(Macro)Satellites
Minisattelites
Microsattelites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the four types of dispersed repeats?

A

Paralogues
Transposons
tDNAs
Retro(pseduo-)genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can transposons be further categorised?

A

Class I:
LINEs
SINEs
LTR retrotransposons

Class II:
DNA transposons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between the different satellites?

A

Their motif length:
(Macro)Satellites: (>100bp)
Minisattelites (10- 100bp)
Microsattelites (1-9bp)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the max length of satellite DNA?

A

They are very large arrays of repetitive DNA – each repeat typically kilobases long. Satellite DNA can extend over megabases of DNA but its maximum length is unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Give some other names for tandem repeats

A

Variable number tandem repeats VNTR (most frequently used when referring to minisatellites)

  • Simple repeat
  • Short tandem repeat (STRs)
  • Simple sequence repeats (SSRs)
    (most frequently used when referring to microsatellites)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Where are macro satellites most commonly found?

A

They are most commonly found at centromeres and in heterochromatin (cytologically dense material that is typically found at centromeres and telomeres)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Typically, how long are minisatellites and how often do they repeat?

A

Range in length from 10-60 base pairs, typically repeated 5-50 times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How many minisatelites are there in the human genome?

A

More than 1000 locations in the human genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How variable are minisatellites?

A

Highly variable between individuals in terms of repeat length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why would a sequence be classified as a mini satellite?

A

Repeat sequence is 10-100, repeats 5-50 times and the number of repeats varies between people but the motif itself does not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Typically, how long are microsatellites and how often do they repeat?

A
  • Motifs range from one to ~six base pairs
  • Motif typically repeated 5-50 times
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the most common motif lengths for microsattelites and why?

A

Often 3-6 so no frame shift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How variable are microsatellites?

A
  • Very high mutation rate compared to other regions in the genome
  • Highly variable between individuals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the consequences of this variability in microsatellies?

A

Often pathological

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

In what ways can tandem repeats vary

A
  • Tandem repeats are highly variable in the number of repeat copies
  • They vary between individuals but can also vary within an individual – for example in different cell types or tissues
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

When was DNA fingerprinting first developed and how was it done?

A

Developed in the 1980s, Originally used restriction enzymes to fragment DNA and then a southern blot to detect fragment length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What do modern DNA fingerprinting techniques utilise?

A

PCR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What structural effects can short tandem repeats have on DNA?

A

STRs are able to form secondary DNA structures such as G quadruplexes. Tandem repeats are very complimentary and are likely to repeat these secondary structures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

G-quadruplex (G4) structures are only one of many (ten or more) non-B-form DNA secondary structures analysed to date. Briefly describe three well-studied structures

A

Z-DNA: In contrast to standard B-form DNA (B-DNA), Z-DNA is a left-handed helix. Z-DNA motifs (that is, sequences that form Z-DNA in vitro) are tracts of alternating purines and pyrimidines. Negative supercoiling stabilizes the formation of Z-DNA under physiological salt conditions130, and it is hypothesized that Z-DNA relieves transcription-induced torsional stress

Cruciform structures: Negative supercoiling can also cause B-DNA to adopt a four-armed, cruciform secondary structure. These structures require ≥6-nucleotide inverted repeats (cruciform motif) to form, and such motifs are located near replication origins, breakpoint junctions and promoters in diverse organisms

Triplex DNA: Three-stranded triplex DNA occurs when single-stranded DNA forms Hoogsteen hydrogen bonds in the major groove of purine-rich double-stranded B-DNA. Triplexes in which the third strand is antiparallel to the DNA duplex can form at physiological pH, and these structures are stabilized by negative supercoiling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What secondary structures can form in CAG, CTG, and CCG repeats?

A

Hairpins of the As Ts or Cs, meaning in this structure they have no pairing. This can happen in both odd and even repeats although the pattern differs slightly in odd repeats.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What secondary structure can form in GAA repeats?

A

Triple helice formed by (GAA)n repeats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What secondary structure can form in CCG repeats?

A

Tetraplex structures; G4s- unwinding this structure takes a lot of force

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Name three models for STR repeat expansion

A
  • Replication slippage model
  • Double strand break repair model
  • Transcription mediated model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Describe the replication slippage model

A

During replication of a repeat-containing sequence, the replication machinery may pause on the lagging strand, due to secondary structures or other kinds of lesions.

Partial unwinding of the lagging strand may lead to replication slippage when replication restarts, giving rise to an expansion or a contraction of the repeat tract, depending on what strand (template or newly synthesised strand) slippage occurred.

Alternatively, partial unwinding of the lagging strand may lead to lesion bypass by homologous recombination with the sister chromatid, also leading to contractions or expansions of the repeat tract

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Describe the double stranded break model

A

Following a DSB, gene conversion is initiated by strand invasion, forming a “D-loop.” (two strands of a double-stranded DNA molecule are separated for a stretch and held apart by a third strand of DNA)

DNA synthesis within the repeat tract may be faithful or associated with slippage. After capture of the second end of the break, DNA synthesis of the second strand may also be faithful or associated with slippage.

Slippage events will lead to expansions of the repeat tract or to contractions if slippage occurs on the template strand.

Alternatively, after capture of both ends followed by DNA synthesis, the two newly synthesized strands may unwind and anneal with each other in frame or out of frame, leading to expansions or contractions of the repeat tract, however it is more biased to expansion than contraction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Describe transcription mediated repair

A

Transcription through CAG·CTG repeats promotes the formation of slipped-strand structures, which subsequently stall RNA polymerase (RNAP) and lead to recruitment of the nucleotide excision repair (NER) machinery. Transcription-coupled NER removes the portion of the transcribed strand containing the RNAP-blocking hairpin; the resulting gap is filled in using the non-transcribed strand (NTS) as a template. Depending on the location of loops on the NTS relative to the removed hairpin, the repair event will either expand or contract the trinucleotide repeat.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Why are STRs difficult to study using PCR?

A

The underlying properties of STRs make them difficult to study using PCR based methods, they show up in laddering fragments during gel electroporesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What else can be problematic about studying STRs?

A

Mapping/ sequencing of STRs is also problematic, As the similarity between two copies of a repeat increases, the confidence in any read placement within the repeat decreases.

33
Q

What is making the studying of STRs easier?

A

New technologies like PacBio and Nanopore are making the study of tandem repeats more feasible as they can read a whole long strand of DNA

34
Q

Besides telomers and centromeres, where are tandem repeats often found in the vicinity of?

A

Tandem repeats are frequently found in the vicinity of genes; 10% to 20% of coding and regulatory sequences in eukaryotes contain an unstable repeat tract. 15% of human promoters have a TR in close proximity

35
Q

Name three things tandem repeat variation has been linked to in different species

A
  • Rapid variation in microbial cell surface
  • Tuning of internal molecular clocks in flies
  • Dynamic morphological plasticity in mammals.
36
Q

Where is there a particularly high density of microsattelites in the human genome?

A

Transcriptional start sites

37
Q

What relationship is seen in these TRs and TSS? (2)

A

There is a correlation between TR polymorphism and gene expression variation. Genes with a promoter-associated TR showed significantly higher variation in both expression and DNA methylation levels. This effect was more pronounced for genes with highly polymorphic promoter TRs- More variability in TRs, more variability in gene expression and methylation.

38
Q

What functional consequences have been observed for this TR-gene relationship?

A

Some TR lengths correlate with quantitative traits: Correlation shown between specific features of the dog snout (curvature and length) with the ratio of length between two TRs in Runx-2, a gene that regulates bone formation

39
Q

Give five potential mechanisms for gene expression regulation by TRs

A
  • Overlap with regulatory protein binding sites
  • Chromatin structure
  • Z-DNA formation
  • Spacing of promoter elements
  • RNA structure
40
Q

What relationship is seen with repeats and TFs?

A

Repeats surrounding TF binding sites increase TF binding, More repeats, higher TF binding; did not have to match binding site

41
Q

When do repeat expansion disorders (REDs) occur?

A

REDs occur when a simple repeat expands over a certain threshold. Different diseases have different pathogenic ranges. Some disorders have a pre pathogenic range where its not healthy, you might have some symptoms but it is not in the pathogenic range

42
Q

Name four repeat exapnsion disorders

A

Friedrich’s ataxia
Fragile X
Huntington’s
Spinocerebellar ataxia

43
Q

Name four pathogenic mechanisms of REDs

A
  • Gene silencing
  • RNA binding protein sequestration
  • Toxic gain of function
  • Repeat associated Non-AUG (RAN) translation
44
Q

How can gene silencing come about?

A

The expanded repeat causes a significant reduction in the expression of a nearby gene. This can happen through the formation of secondary structures or through the induction of epigenetic changes: methylation, promote the formation of heterochromatin etc

45
Q

Name two diseases in which repeat expansions are associated with gene silencing

A
  • Friedrich’s ataxia (When CGG over 200Bp, methylation occurs and it can silence genes )
  • Fragile X syndrome
46
Q

How can TRs result in RNA binding protein sequestration?

A

Transcripts containing the repeat expansion bind with greater affinity to RNA binding proteins (RBP) than those without. By binding to the RBP, they reduce the available RBP for functional transcripts

47
Q

What disease is associated with RBP sequestration?

A

Myotonic Dystrophy type 1 (DM1): The DM1 pathogenic state includes the sequestration of Muscleblind (MBNL) proteins by toxic DM Protein Kinase (DMPK) transcripts, CUG RNA-Binding Protein Elav-Like Family Member 1 (CELF1) protein activation, and fetal-like splicing patterns.

48
Q

How may there be a toxic gain of function?

A

In Huntington’s disease and other polyglutamine (polyQ) disorders, mutant proteins containing a long polyQ stretch are well documented as the trigger of numerous aberrant cellular processes that primarily lead to degeneration and, ultimately, the death of neuronal cells. However, mutant transcripts containing expanded CAG repeats may also be toxic and contribute to cellular dysfunction.

49
Q

Name two ways that repeats can expand (as a trend)

A

Intergenerational (anticipation)- can expand over generations

Somatic (mosaicism)- can expand with age

50
Q

What is meant by antagonistic pleiotropy?

A

When a gene or genomic element can have multiple roles which are both beneficial and detrimental to an organism

51
Q

Describe one case of antagonistic pleiotropy

A
  • Sickle cell anaemia - Homozygous: blood can’t carry oxygen; Heterozygous: bad at carrying oxygen but resistance to malaria- quite high in African regions
  • Huntington’s disease
52
Q

How could STRs have evolved?

A

Kept around because they’re useful but can also lead to pathology after a threshold

53
Q

Describe a RED in NOTCH2NL

A

NOTCH2NLC & neuronal intranuclear inclusion disease (NIID): a progressive neurodegenerative disease that is characterized by eosinophilic hyaline intranuclear inclusions in neuronal and somatic cells. It can also result in essential tremor and a large range of disease phenotypes.

Individuals can present with repeat expansion, some purer, some with interruptions. This can determine what kind of disease phenotypes based on the length and interruptions in the repeat. The phenotypes are linked to the size and purity of the repeat.

54
Q

How do the repeat types correlate with phenotypes?

A

Generally, the repeat size of muscle weakness-dominant is largest, and parkinsonism-dominant is smallest. Dementia-dominant and essential tremor-dominant usually have a purer GGC repeat.

55
Q

What paradoxical trend can be seen in repeats in NIID? What disease is this similar to?

A

Individuals with a repeat number over a certain threshold do not get ill – instead there is evidence of silencing once the repeat reaches a certain threshold; Similarities with FXS/FTAXS

56
Q

What is FXS/FTAXS?

A

Fragile X syndrome/ Fragile X-associated tremor/ataxia syndrome

57
Q

How prevalent is fragile X syndrome?

A

Fragile X syndrome affects ~1 in 7000 males

58
Q

What are some common physical symptoms of fragile x?

A

Prominent, broad forehead
Large ears
Long face
Strabismus (squint)

Prominent Jaw, Dental
Crowding high arched palate

Murmer/ mitral valve prolapse
Hollow chest

Hypotonia/ Joint Laxity
Scoliosis

Macro-orchidism

59
Q

What cognitive symptoms present with fragile X syndrome?

A

Autism
Intellectual disability

60
Q

What is FXS caused by?

A

Fragile X Syndrome is caused by a CGG repeat expansion in the FMR1 5’UTR

61
Q

What is the normal FMR1 gene responsible for? What is the normal amount of repeats?

A

5-40 CGG repeats

Brain development, dendritic function

Regulation of protein-translation in neurons

62
Q

What is seen when there are too many repeats? How many is too many?

A

> 200 CGG repeats

Induces hypermethylation and a loss of the FMR1 protein

Fragile X phenotype

Abnormal dendritic development

No translational inhibition

63
Q

Like NIID, there are a number of molecular phenotypes, what do these correspond to?

A

Healthy: x30 CGG- normal amount of FMRP

Premutation: x50-200 CGG- More transcripts, lower amount of FMRP (?)

Methylated, full mutation: >x200 CGG: No transcripts

64
Q

Name two disorders arising from premutation length alleles

A

Fragile X-Associated Primary Ovarian Insufficiency (FXPOI)

Fragile X-Associated Tremor/Ataxia Syndrome (FXTAS)

65
Q

What are the symptoms of Fragile X-Associated Primary Ovarian Insufficiency (FXPOI)?

A
  • Causes infertility & early menopause in adult women
  • Women stop having menstrual cycles before 40 years of age
  • Higher risk of having children with FXS
66
Q

What are the symptoms of Fragile X-Associated Tremor/Ataxia Syndrome (FXTAS)?

A
  • Neurodegenerative disorder of the nervous system
  • Can cause tremors and problems with walking, balance, memory and mood disorders
67
Q

How frequent is Fragile X-Associated Tremor/Ataxia Syndrome (FXTAS)?

A

More common in men (X-inactivation)
1 in 450 men have the mutation
1 in 3000 men over 50 affected

68
Q

What is Fragile X-Associated Tremor/Ataxia Syndrome (FXTAS) through to be caused by?

A

Thought to be caused by toxic RNA product

69
Q

What is another molecular phenotype not mentioned previously?

A

People who have the full mutation but it is not methylated and so it behaves like the premutation phenotype- even higher transcripts, lower protein. Some individuals escape methylation at the expanded repeat

70
Q

What is seen in mouse models?

A

Mice do not gain methylation at the expanded repeat but still get ataxia symptoms

71
Q

What did these findings lead Grace towards considering?

A

Is there a primate specific mechanism responsible for the methylation of the FMR1 5’UTR repeat expansion?

72
Q

How is Grace looking to investigate this methylation?

A

Using unmethylated (UME) carriers she is investigating the cause of the CGG methylation in FXS.

73
Q

What did Grace look at regarding these repeats?

A

She noted that KZNFs can bind disease associated simple repeats and that KZNFs are recruiters of histone and DNA modifying enzymes.

She then broadened the candidate list to include a range of epigenetic modifiers such as DNA-methyltransferases, gene promoter binding KZNFs, methylcytosine dioxygenases, histone demethylases and histone methyltransferases.

(Their observations kick started collaborations with FXS experts in Rotterdam and Rome)

74
Q

What did RNA-Sequing transcripts reveal?

A

RNA-Sequencing revealed truncated transcripts unique to the UME carrier- UME stringtie transcripts. They also observed a change in the transcriptional landscape; there was increased expression in FMR1 carriers.

75
Q

What else did they see regarding their earlier considerations?

A

Surprisingly we saw some differences in the expression of epigenetic modifiers, this was backed up by results from the second biological replicate from Italy. TET3 was dysregulated in both carriers- UME carriers have significantly reduced TET3 expression levels. qPCR backed this up in a third biological replicate

76
Q

So, is TET3 a candidate? Why?

A

TET3 is a strong candidate:
Robust evidence for a downregulation of TET3:
* Across 3 different cell lines
* Using both qPCR and RNA-seq

77
Q

What is the relevance for FMR1 methylation?

A

The TET enzyme family are able to convert 5mC to 5hmC; they can associate to the chromatin using a methylation sensing domain. Their core catalytic domain can sense (CG) domains with substrate preserence and can undergoe catalytic activity. 5hmC is a stable form of methylation found in the brain

78
Q
A