Dr. Robert Weinzierl (TF Focus) Flashcards

1
Q

Definition of a core promoter?

A

A core promoter is defined as the minimal DNA region that is sufficient to direct low levels of activator‑independent (basal) transcription by RNAP II in vitro

Core promoters typically extends approximately 40 bp up‑ and down‑stream of the start site of transcription (+1) and can contain several distinct core promoter sequence elements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Are core promoters very diverse in Higher eukaryotes?

A

Core promoters in higher eukaryotes are highly diverse in structure and each core promoter sequence element is only found in a subset of genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does an ideal TATA-box promoter sequence look like?

A

TATA-box is a core promoter element that is typically located 25-30 bases upstream from TSS and is defined by a A/T rich consensus that is flanked by G/C rich sequences (emphasising the position of the TATA box)

Example shown is derived from adenovirus major late promoter –> One of the best examples of highly active TATA-boxes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is one of the main problems around using a TATA box element as a core promoter element in higher eukaryotes?

A

Eukaryotic genomes are huge in comparison with bacterial genomes

Hence, given that a TATA motif can arise by chance every 4 Kb, it is too short/weak to confer specificity

Thus, normally TATA boxes are accompanied by other core promoter cis-regulatory elements + TATA boxes are only found in a small proportion of core promoters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What protein complex recognises the TATA box motif?

A

The TATA-Box is specifically recognised by the TFIID protein complex.

TFIID contains a TATA-binding protein (TBP) that recognises the DNA motif

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How large is TFIID and how does it recognises the TATA box?

A

TFIID is a large multiprotein complex (~10-12 different subunits; species-dependent)

Only one of these subunits is responsible for recognizing the TATA box: the ‘TATA-Binding Protein’ (‘TBP’)

Hence, TBP can specifically bind to TATA-boxes on its own –> none of there other proteins are involved in TATA recognition and binding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Outline the structure of TBP and what properties allow it to bind to DNA.

A

TBP is a crescent/U-shaped protein that is able to wrap around the DNA double helix

TBP contains +ive charged Lys and Arg residues which interact with the -ive charged backbone + conserved phenylalanine residues are placed in the DNA minor groove - allowing it to make hydrophobic contacts (reasonably strong binding)

Hydrophobic interactions cause the DNA to kink by 80o

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Is TBPs saddle structure highly conserved across species?

A

Sequencing of TBP-encoding genes from many organisms has shown that the ‘saddle’ structure is highly conserved

This saddle structure is present in TBPs from all species

On the DNA level → in the saddle region we see a drop in % conservation in distant species whereas the N-terminal region tends to be be more divergent.

In higher eukaryotes there is a substantial N-terminal extension present that may contain poly-glutamine (poly”Q”) stretches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why do bacteria promoters tend to be A/T rich?

A

In bacterial promoters the –10 element is also AT-rich, but this element is not functionally equivalent to eukaryotic TATA boxes

The bacterial –10 region facilitates the localized DNA strand-separation (‘promoter melting’; ‘transcription bubble formation’) to allow RNAP to initiate transcription. The eukaryotic TATA-box is located much further away from the transcription start site (-25 to –30!) and this part of the promoter is never part of the transcription bubble!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the functional importance of the TATA (A/T rich region) box in the promoter region?

A

In eukaryotic TATA boxes the deformability of the minor grove width in regions of A/T base-pairs is important

This increase deformability allows for more intimate contacts between the TBP saddle and minor groove → lack of protruding -NH2 in G bases allows for this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does physical data on DNA bending at TATA boxes reveal to us?

A

TATA region already has a tendency to bend

Oligo DNA strand with TATA region will bend on its own without the presence of TBP → upon addition of TBP the bending angle increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Recap - What are the two main factors that are important for TBP binding?

A

TATA-boxes work on two levels:

  1. Sequence-specific contacts between TBP and the TATA-box consensus sequence
  2. TATA-boxes on their own influence the higher-order structure of DNA by bending it slightly. This localised bend attracts the binding of TBP to the TATA-box (matches conformation).

Note that pre-bending can be achieved with a variety of A/T rich sequences, thus explaining (at least to a certain extent) the high degree of sequence heterogeneity of eukaryotic TATA-boxes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What important fact should you always keep in the back of your mind when thinking about TFIID/TBP binding to a TATA sequence?

A

Always important to consider DNA accessibility which depends on the chromatin environment

Most TATA-like sequences are ‘hidden’ by chromatin structures and are thus never accessible to TBP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

After TFIID binding to the TATA-box, what other two players bind to the protein complex?

A

TFIIB and A bind either side of TFIID → stabilising the complex (synergistic effect)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does X-ray crystallographic data reveal about TFIIB binding to the TFIID complex/DNA?

A

TFIIB is a protein made from a single polypeptide that folds to form two similar domains (Note - TBP is a dimeric protein fused together)

TFIIB contacts DNA either side of the TATA box → conveys additional specificity as it recognises GC elements upstream and downstream of the TATA box called BRE – B recognition elements upstream

Evidence → structural and biochemical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do TFIIA and TFIIB interact with TBP?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How is TBP activity regulated within the cell?

A

TBP negatively auto regulates it’s accessibility to promoters through homodimerization, meaning…

  1. TBP monomers can dimers together forming a homodimer that does not bind to DNA in turn regulating activity levels → dimerisation defective yeast mutants show high levels of activator independent gene exrpession
  2. Furthermore, TBP in it’s monomeric state is more vulnerable to degradation

Hence, dimerisation prevents excessive levels of TBP binding/activity and also protects it from degradation in the cell (safe storage)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is an example of a TBP antagonist?

A

Some transcription factors prevent formation of a productive transcription initiation complex

NC2 (Dr1-DRAP1), a negative cofactor, inhibits transcription initiation via direct interactions with the TBP-DNA binary complex

Function → prevents pervasive transcription (stops random transcription of genes) → streamlining the transcription machinery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the structure of NC2 and how does it prevent the transcriptional activity at the ?

A

NC2 is composed of small two subunits (alpha and beta) that are conserved among eukaryotes

NC2 bindings to TBP to form a stable complex → NC2 blocks binding and assembly of TFIIB (required for RNAP binding) in turn blocking RNAP binding → preventing initiation complex formation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the textbook description of RNAPII transcription initiation via a TATA motif?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Do most promoters have TATA boxes?

A

Bioinformatic analysis of metazoan genomes suggests that the prevalence of the TATA box has been overestimated in the past, and that the majority (>75%!) of core promoters do not contain TATA boxes

Many of the TATA-motifs even deviate form the consensus

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are examples of other promoter elements can be present in the promoter region?

A

>80% of RNAPII-transcribed genes lack TATA-boxes

There are further consensus motifs that may be present to various extents in different promoters:

Initiator Element (‘Inr’)

Downstream Promoter Element (‘DPE’)

Motif 10 Element (‘MTE’)

TCT motif (polypyrimidine initiator; in ribosomal protein gene promoters)

These other promoter elements are even less clear than the TATA box

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does promoter positional information reveal about TATA, Inr, DPE and Dref elements?

A

TATA and Inr → conserved dense peak at a specific position → conserved location

DPE and Dref (drosophila) → broader peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Summary picture of the proximal promoter elements?

A

MTE – minus ten element

Review - Kadonaga, J.T. (2012). Perspectives on the RNA polymerase II core promoter. Wiley Interdiscip. Rev. Dev. Biol. 1, 40-51.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What roles do other proteins in the TFIID complex play?

A

TBP is responsible for the sequence-specific binding to TATA-boxes

Some of the other subunits of the TFIID complex, ‘TBP-Associated-Factors’ (‘TAFs’) recognise the additional sequence elements TAF2, TAF6, TAF9

TAFs making sequence specific contacts with MTE and DPE

Inr element binding is not fully understood

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are the two main types of transcription initiation patterns?

A

The terms “focused” and “dispersed” refer to opposite ends of a continuum of transcriptional patterns

Focused initiation, transcription starts from a single nucleotide or within a cluster of several nucleotides → associated with regulated genes (on/off)

Dispersed initiation, there are several weak transcription start sites over a broad region of about 50 to 100 nucleotides → housekeeping genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Picture of the two different models of transcription initiation patterns?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

How does chromatin structure influence the transcriptional state (stable & labile)

A

Nucleosome positioning/density influences the type of transcription

Labile → focused transcription → all the core elements are recognized

Stable → dispersed → requires gene-specific transcription factors to drive transcription

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How does TBP poly-glutamine expansion change over evolution?

A

TBP Polyglutamine expansion takes place as move through the vertebrate evolutionary lineage → more complex organisms = longer Polyglutamine repeat tail

Evolution progresses → Polyglutamine expansion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What types of diseases are associated with the TBP polyglutamine tail?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is cerebellar ataxia?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What does DNA analysis of the TBP polyglutamine tail reveal about the healthy and pathological forms?

A

CAG - codes for glutamine

Most of us have around 35-36 glutamine residues at the TBP N-terminus

Some people have a pathological amount → corresponding to 47-55 residues

The number of CAG repeats is inversely correlated with the age of onset for cerebella ataxia → Meaning higher repeat number corresponds to a younger age of onset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Under the microscope, what does cerebella ataxia pathology look like?

A

Main message - TBP aggregates accumulating in cells.

  1. TBP containing an expanded polyQ stretch is expressed at the same level as its normal counterpart and forms neuronal intra-nuclear inclusions containing other proteins involved in protein folding or degradation
  2. polyQ expansion reduces in vitro binding of TBP to DNA
  3. polyQ expansion causes abnormal interaction of TBP with the general transcription factor TFIIB

Note - SCA17 is another gene that is responsible for driving disease progression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What do mouse models reveal about polyglutamate expansion and cerebella ataxia pathology?

A

All WT mice survived

But mice with glutamine expansion had significantly reduced lifespan → correlated with the level of polyglutamine expansion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

How does TBP binding differ between a healthy and pathological state (cerebella ataxia)?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What link was found between TBP and depression?

A

CAG repeat sizes in the TBP gene were investigated in two well-characterized Dutch cohorts → including 2165 depressed and 1058 non-depressed individuals aged 18–93 years

A CAG repeat length exceeding the median in both alleles was associated with an increased risk for lifetime depression → showing us that repeat polymorphisms act as complex genetic modifiers of depression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What are Super-core promoters?

A

Using knowledge on pre-existing promoters we are able to construct artificial ‘Super-core promoters’ that drive high levels of gene expression (higher than strongest natural promoters i.e. viral promoters)

Mediated by introduction of expression vectors → useful for a wide range of scientific and biotechnological applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What does the in-vitro assay using the detergent Sarcosyl reveal about super-core promoters?

A

In vitro transcription reactions can be limited to a single round of transcription by adding the detergent Sarcosyl → Sarcosyl prevents further initiation but does not interfere with transcript elongation (1 round of initiation - no re-intiation)

The super core promoter is ‘stronger’ than the two other promoters because it supports a higher frequency of transcript initiation → ~40% of available DNA templates are used in the in vitro assay (this is very efficient!)

Likewise, Footprinting studies show that the super core promoter bind TFIID with higher affinity than other promoters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What does the construction of the super-core promoter reveal about the nature of promoter regions?

A

Shows us that all of the core promoter motifs contribute to the strong binding of TFIID to the super core promoter → meaning higher specificity of each TF to the promoter elements contributes to an overall stronger interaction (additive)

Example → Super core promoter (SCP1) compared to viral promoters (CMV and AdML) in a Luciferase assay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Do the number of RNAP’s differ between species? Do mitochondria and chloroplasts have their own RNAP?

A

Eukaryotes

RNAP1 → exclusively transcribe ribosomal RNA

RNAP2 → very diverse → mRNA product

RNAP 3 → more diverse than 1 but still not as diverse as RNAP1

Archaea - RNAP similar to RNAP 2

Bacterial - RNAP – most distinctive → divergent evolution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What does the following image highlight about the evolution of RNAPs?

A

When examining the different RNAP structures we can start to understand how closely related they are based on structural similarities

Takeaways…

LUCA – Last universal common ancestor + Color categorizes similar subunits

Bacteria RNAP branched off earlier → evolved separately relative to Archaea and Eukaryotes

Even though bacteria subunits evolved separately there are still several homologs (as shown by color)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Characteristics of Eukaryotic RNAPII?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Outline the 3D structure of yeast RNAPII?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What are the two main conformations for RNAPII?

A

RNAPII has a ‘open’ and ‘closed’ clamp conformation.

The conformation is found prior to DNA binding whereas the closed conformation correlates to the DNA bound state.

X-ray crystallographic data suggests that there is a highly localised hinge mechanism required for RNAPII activity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What does the electrostatic potential of the RNAPII surface show us about it’s interaction with DNA?

A

Inside of RNAP is positively charged whereas outside is negatively charged → favours the bound state of RNAP to the negatively charged DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Outline how RNAPII catalyses the synthesis of a novel mRNA molecule?

A
  1. Downstream DNA enters RNAPII
  2. Inside the DNA molecule is unwound forming ssDNA, exposing the template strand for RNA synthesis
  3. Nucleotide triphosphate enter into RNAPII via the pore and binds to the complementary base present on the DNA template
  4. Mg2+ ion catalyses nucleophilic addition of the nucleotide to the growing RNA strand → results int the formation of a short (10B.p) DNA-RNA hybrid
  5. As RNAPII moves along, the nascent RNA strand dissociates and exists the protein complex + the template and non-coding strand bind back together
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What happens to the angle of the DNA strand as it moves through the RNAPII complex?

A

DNA that goes in and out is bent by 90 degrees → Strand separation allows the bending to be energetically favorable (difficult if it were DS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is a problem with the conventional NTP funnel that feeds the nucleotides into the RNAP active site?

A

Problem with the classical funnel structure that leads up to the catalytic site → Pore is very narrow (1 nucleotide wide) → if the wrong nucleotide diffuses into the active site (¼ chance of being correct) it would have to diffuse back out before another nucleotide can move in → inefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is the alternative model for NTP entry into the RNAPII active site?

A

Additional channel → wider & exposes 3 + 4 nucleotides at a time to incoming NTPs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What structure in RNAPII is responsible for the translocation of RNAPII across DNA?

A

Remember translocation is an active process that needs to be directed

Who’s responsible → Long alpha-helix - bridge helix → pushes RNA-DNA hybrid through the protein complex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Outline the bridge helix trigger loop translocation mechanism for RNAPII.

A

Bridge helix (green) bends (kinks), which pushes the DNA-RNA hybrid out of the catalytic site → subsequently, it returns to relaxed conformation

Trigger loop (blue also involved)

Further reading?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Are large quantities of abortive transcripts produced during transcription intiation?

A

The process of RNAPII breaking free from the initiation complex appears to be a difficult process → evident by the high levels of abortive (short) transcripts that are produced

1-5% of initiated transcript will end up as full length RNA but once RNA breaks free from the initiation complex the % of successful transcripts increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What has FRET analysis revealed regarding the cause of the large number of abortive transcripts?

A

Initiation → DNA comes together as shown by FRET distance decrease (scrunching) which leads to the creation of DNA Loops. This creates tension in RNAP which results in two outcome which are that RNAP continues or aborts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

Outline the domains in TFIIB and their importance in transcription initiation?

A
  • The N-terminal Zn-ribbon domain of TFIIB interacts with the RNAPII (via the ‘dock domain’ on RNAPII)
  • Helps to open up the DNA at the start site using the B-linker (N-terminal region of TFIIB enters the catalytic cleft of RNAP) → The DNA template strand slips into the cleft and is scanned for the transcription start site with the help of the ‘B-reader’ that approaches the active site (important for start site recognition and positioning in active site)
  • Synthesis of the RNA chain and rewinding of upstream DNA displace the B-reader and B-linker, respectively, to trigger TFIIB release and elongation complex formation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Do we understand how the transition from the intuition to elongation complex takes place?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What id the CTD tail in RNAPII and why is it important?

A

CTD grows with evolution → longest stretch in humans

2x Proline in consensus – high density → bad news for secondary structures → meaning that it doesn’t form regular structure (flexible tail)

Function → Y, S and T contain free -OH groups → targets for Post-translational modifications –> phosphorylation, ubiquitination → provides an opportunity for additional regulation i.e. phosphorylation by TFIIH kinase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Is the CTD tail present in all RNAPs?

A

The CTD is only present in eukaryotic RNAPII, not in RNAPI, III, IV and V!

Prokaryotic (bacterial, archaeal) RNAPs also lack an equivalent structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What phosphorylation states of the CTD tail are associated with initiation and elongation? What molecular processes is the phosphorylation of CTD associated with?

A

Degree of phosphorylation changes during the process of transcription → hypo to hyper

TFIIH is a versatile factor – helicase activity for DNA melting, kinase activity for phosphrylation of CTD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

How does DNA melting during transcription initiation take place?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Outline the structure of TFIIH.

A

Very large complex (multiple polypeptides) → uses ATP hydrolysis to unwind DNA start site

The TFIIH core forms a crescent-shaped complex spanning from Ssl2 to Rad3

Ssl2 binds downstream DNA consistent with its role in DNA opening → Ssl2 uses ATP hydrolysis to translocate on DNA. If the Ssl2 location is fixed, Ssl2 action results in a reeling of DNA into the active centre

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What does a ribbon structure of the initiation complex look like?

A

Huge molecular machinery → provides a decent explanation to why there is a high degree of abortive transcripts → a lot going on

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What other things are happening alongside transcription elongation that play an important role in mRNA formation?

A
RNAPII transcription and pre-mRNA processing are 
coordinated events (downstream processes are equally as complicated)

5’ Capping, splicing and 3’ poly-adenylation are occurring during transcription and RNAPII (and the CTD particularly) plays a role in the regulation of these events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Why is it important to understand the various experimental techniques for transcriptional assays?

A

To obtain insights into gene expression mechanisms a number of different technical approaches need to be applied → Biochemistry, molecular biology, structural methods, cell biological methods etc.

Every method has inherent limitations and artefacts, so a coherent picture can only be built up by looking for consistencies emerging from a number of techniques

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

What are eukaryotic In-vitro transcription assays?

A

This is when purified DNA templates are transcribed in the test tube with nuclear extracts and/or highly purified transcription factors

The ultimate goal is the re-construct regulated gene expression from recombinant (i.e.completely defined) transcription factors

But! This goal has up to now not been achieved with any eukaryotic system → For example, human RNAPII and TFIIH cannot currently be assembled from recombinant subunits in active form!

Hence, they need to be purified from eukaryotic cell line instead

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What does the overall strategy for in-vitro transcription assays look like?

A

Basically you are trying to get all the components required for transcription into a test tube so that you can create an experimental system whereby you can manipulate variables (i.e. presence of specific TFs).

Hence, it would be ideal if you all the required proteins using a recombinant system → not that this had not been achieved with RNAPII and TFIIH (large complexes).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

How were the different core TFs in the basal initiation complex named?

A

Nomenclature for names come come from the original experiments

Ion-exchange chromatography and increasing salt conc was used to remove different subunits from the phosphocellulose column → systematic names given based on the time they exited the column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

Things to consider when performing in-vitro transcription assays?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

What are the two most common ways of quantifying the level of transcripts produced from in-vitro transcription assays?

A

Requirement for techniques that allow quantitation of RNA originating from a specific region of the DNA template

Two major methods are used for quantitating in vitro transcripts: primer extension assay & ‘G-less Casette’ assay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

What is the primer extension assay?

A

A radiolabelled DNA oligonucleotide primer is hybridized to the RNA and extended by reverse transcriptase

This will result in the appearance of a single-stranded, radiolabelled cDNA product of a specific size

Gel electrophoresis allows quantitation + allows identification of transcription start site (Reverse transcriptase will polymerise in the 5’ direction to the start of the DNA strand)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

What is the G-less Cassette assay?

A

Theory

G-less cassette → template strand has all C removed (mutagenesis) → RNA will not have any G nucleotides

Hence, when we introduce RNase T1 which requires a G residue to cleave → all other RNAs that initiated at other sites will be removed (removing background transcription) → giving us an accurate idea of the desired transcript which can be quantified after running on a gel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

What are the dis- & ad-vantages of using a in-vitro assays?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

How are in-vivo transcription assays performed?

A

In-vivo transcription assays → carried out inside living cells (cell line, organism, organdie, etc.)

Basics → Transfer DNA of interest into cell → see whether it is transcribed by coupling our gene of interest to a reporter gene to tell us that the DNA is being transcribed (GFP, luciferase, etc. )

Luciferase → sensitive + good for quantification

Can be transient or stable assays

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

What is the difference between in-vivo transient and stable assays?

A

Transient vs stable assay

Transient → introduction of plasmid à resulting in rapid expression – quick

Stable → transfection with stable integration into the genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

Advantages and disadvantages of transient in-vivo transcription assays?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

What is an example of a commercial plasmid used for transient transfection?

A

pGL3-Control Vector

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

How can these plasmid constructs be used for in in-vivo transcription assays?

A

Both cases we are examining the ability of a given inserted DNA sequence in promoting/enhancing levels of luciferase expression

Looking for promoter sequence (Left) → Insert DNA into restriction site → if the inserted contains a DNA promoter then elevated levels of Luc are expressed which can be visualised/quantified

Looking for enhancer sequence (Right) → No enhancer → we have a low basal level of expression but we can introduce our DNA and if expression increases, we know that we have an enhancer element

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

What do we need to perform stable transfection assays?

A

Early stages identical to transient transfection assay (transfection using plasmid), but plasmid must contain a dominant selectable marker → creates selective pressure for integration in the genome

Example: Drug resistance gene (G418 drug resistance)

Result - DNA becomes stably integrated into genome and is expressed in chromosomal environment

Note → process of stable integration takes much longer than transient transfection assay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

Advantages and disadvantages of in-vivo transcription assays?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

How could one use performing identification of control sequences in in-vivo systems?

A

Basically, incorporate gene (with reporter) into in-vivo system and examine the gene is expressed

If proper expression/regulation is obtained → methodically delete regions of the sequence and repeat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

How could one use performing identification of control sequences in in-vivo systems?

A

Basically, incorporate gene (with reporter) into in-vivo system and examine the gene is expressed

If proper expression/regulation is obtained → methodically delete regions of the sequence and repeat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

What experimental method is used to investigate the human transcriptome?

A

Transcriptome → collection of all the transcripts in a cell → use RNAseq

Counting frequency → indicates abundance in samples

RNAseq → can be used to detect transcript isoforms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

Why is analysing the transcriptome of cancer useful? What problem are we faced with when trying to do so?

A

Compare cancers to normal tissue → provides insight to transcript dysregulation (over/under-expression of specific proteins)

Problem → cancer cells are not homogenous cell populations - meaning that healthy and cancerous tissues are mixed together

Consequence → When performing RNAseq we end up with a mixture of transcriptomes

Ideally we need to isolate the cancer tissue → Laser capture microdisection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

What is Laser capture microdissection (LCM)?

A

Laser capture microdissection → dissect out cells based on specific characteristics – i.e. morphology

Using IR to dissect out the cancerous cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

What needs to be performed after laser capture microdissection when analysing the transcriptome of the selected cell?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

How is mRNA amplification carried out?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

What are the different technologies for performing transcriptomics and their relative throughput & capacity?

A

Number of Samples - Capacity

Number of genes queried - throughput

RNAseq → has a high capacity & throughput

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

What is qPCR with Taqman?

A

TaqMan is a quantitative PCR technique (qPCR) → It can be used to quantitate transcripts, if coupled with RT-PCR

  1. In the intact TaqMan probe, energy is transferred (via FRET) from the short-wavelength fluorophore on one end (green circle) to the long-wavelength fluorophore on the other end (red circle), quenching the short-wavelength fluorescence. After hybridization, the probe is susceptible to degradation by the endonuclease activity of a processing Taq polymerase.
  2. Upon degradation, FRET is interrupted, increasing the fluorescence from the short-wavelength fluorophore and decreasing the fluorescence from the long-wavelength fluorophore
  3. The rate at which fluorescence increases is directly correlated to the initial transcript level
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

Does transcription elongation impact levels of gene expression?

A

Initially, initiation was considered the driver behind differential gene expression but it turns out that transcript elongation has also an influence (pausing/arrest site)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

What are the different ‘decisions’ that can be taken by RNAPII during transcription elongation?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

Outline the role of the bridge helix in RNAPII translocation.

A

Nano-mechanical process – using bridge helix to push DNA-RNA hybrid along

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
91
Q

What are the two main models that are used to explain RNAPs translocation during transcription elongation?

A

Powerstroke → rNTP hydrolysis drives movement

Brownian Rachet → kinetic energy from aq environment provides the energy for the kinking of the bridge helix (random kinetic input) → more passive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
92
Q

Explain both the powerstroke model and the brownian ratchet model using the attached diagram.

A

Possible to have a mixture of both → Not mutually exclusive

In the Brownian model RNAP is initially quite mobile until NTP moves in favoring forward movement → then allows for condensation + pyrophosphate release

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
93
Q

Why do elongation rates differ between in-vivo and in-vitro systems?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
94
Q

Using the example of c-myc explain how elongation in different stages of the cell cycle influence it’s expression?

A

Basically…

  • C-myc (oncogene) regulates 100’s of genes for cell proliferation
  • Cell needs to regulate intracellular myc expression
  • How? Make it hard to transcribe the gene - excessive levels can lead to cancer formation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
95
Q

What does the attached image show us (relation to C-myc)?

A

Pausing and arrest sequences in c-myc

P (pause) → does not need Efs to continue

P/A → arrest site → requires elongation factors to continue

A/T → Arrest + possible termination

Past the first 500 nucleotides the DNA transcript lacks the P and A sites → large initial hurdle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
96
Q

What basal factors are able to stimulate transcription elongation?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
97
Q

What do the large number of stalling events near the initiation site tell us?

A

In drosophila → stalled sites at the +25-50 site quickly after initiation sites → prevents other RNAPs latching onto DNA and initiating transcription (blockage) → mechanism to downregulate expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
98
Q

What do large scale studies in Drosophila reveal about RNA polymerase stalling?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
99
Q

What is required to convert stalled RNAPs into actively elongating enzymes?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
100
Q

What two main roles do elongation factors perform to keep the transcription party going?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
101
Q

What is transcriptional arrest?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
102
Q

What does transcriptional arrest look like diagrammatically?

A

RNAP takes a couple steps back once arrested (slides back) → mechanism not fully understood

Consequence? → Some of the nascent RNA hangs out of the catalytic site (into the cone shaped channel) due to the backwards movement → the 3’ end moves away from the catalytic site (not in close contact with Mg2+) → impossible for the RNAP to add RNAPs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
103
Q

What protein is required to rescue an arrested RNAP?

A

What needs to happen → How can we rescue this? Turn Mg2+ into a nuclease to cleave off the hanging 3’ end

Proteins responsible → TFIIS (GreB - bacterial homolog)

TFIIS and GreB produce a profound conformational change in RNA polymerases by converting the active site from a “polymerizing” to a “ribonucleolytic” function

104
Q

What is the structure of TFIIS?

A

TFIIS has a zinc binding domain (domain III) which contains a loop with acidic residues → this slides into RNAP funnel coming in close contact with Mg2+ → moves Mg2+ atom into a slightly different position allowing it to initiate nucleolytic activity

105
Q

How does TFIIS/GreB bind to RNAPII?

A

Universal activity in eukaryotes and bacteria – encounter similar problems

106
Q

How TFIIS/GreB loop interacts with Mg2+ in RNAPII?

A
107
Q

Are there any other transcription factors that can restart an arrested polymerase?

A
108
Q

What are the 4 different catalytic activities that RNAPII has been shown to exhibit?

A
109
Q

Outline the 5 different catalytic activities that can be performed by RNAPII.

A

Just because they can be performed does not mean that they commonly occur in-vivo → require the right conditions

110
Q

Are smaller RNAPs able to read through nucleosome bound DNA?

A

Chromatin (i.e. nucleosome-packaged DNA) is the natural substrate for RNAPII

Nucleosomes can be displaced by RNAPII in highly transcribed genes, but are not usually removed during active expression of a gene

Single-subunit RNA polymerases from bacteriophages T7 and SP6 were used for some of the earliest studies → Surprisingly, these small polymerases could transcribe completely through a nucleosome, albeit at a reduced rate compared with free DNA

111
Q

What happens when longer arrays of nucleosomes are present, can RNAPII read through this?

A

More physiological templates, containing long arrays of 15 or more nucleosomes, slow down the progress of prokaryotic T7 RNA polymerase substantially → nucleosome contacts significantly repress elongation

Most surprising: RNAPII is brought to an almost complete halt by arrays of nucleosomes! → RNAPs recruits special helpers

112
Q

What are some examples of elongation factors that allow RNAP to read to nucleosome bound DNA?

A
113
Q

Outline the principal for Spt6 and Fact activity?

A

Spt6 → associates with nucleosomes à holds on to it à returns it when it is finished

Fact → displaces H2A/H2B dimer → creates a nucleosome that is highly unstable allowing DNA to peal off from the nucleosome → once cleared Fact returns the dimer

114
Q

How does FACT and Nhp6 open the chromatin structure

A

Increased atomic detail → but the exact information about how the mechanism works is not clear

115
Q

What are pioneering RNAPs?

A

The nucleosome is only the first level in chromatin compaction, but is unclear whether these activities are sufficient for elongation through higher order chromatin fibers in vivo

It is possible that the first polymerase to transcribe a gene is a specialized ‘pioneer’ polymerase equipped with additional tools that break down higher-order chromatin structures

Note → This is a concept but evidence is not concrete

116
Q

What experimental setup was used in order to study the mechanism of RNAPII translocation during transcription?

A

Technique is known as Laser Trapping

Setup have two beads that are connect by a DNA strand. One bead is fixed in position whereas the other bead is able to move in the horizontal plane → all done using lasers i.e. Laser trapping

Hence, as RNAPII moves along the DNA strand it exerts a force on the mobile bead → this can be quantified by examining the level of input laser is required to hold the mobile bead in a fixed position.

This allows us to determine the kinetics and force of transcription translocation

117
Q

What does the laser trapping experiment reveal about the nature of RNAPII translocation?

A

The new results appear to be inconsistent with the ‘power-stroke’ model, favouring the view that RNAP translocation occurs by the ‘brownian ratchet’ mechanism

RNAP moves in steps of 3-4 A → equivalent to single nucleotide at a time

Problem à small number of nucleotides incorporated à not reflective of in-vivo experiments

118
Q

Is the basal transcription machinery considered to be rate-limiting step in transcription?

A

The basal transcriptional machinery (RNA polymerase and accessory factors) is largely invariant between different genes and is generally considered as not rate-limiting

The basal transcription rate is controlled by additional gene-specific transcription factors

119
Q

Generally speaking, how do gene-specific TFs differ between eukaryotes and prokaryotes?

A

Eukaryotic gene-specific transcription factors are often transcriptional activators → this is because the chromatin packaged genome is already in a naturally repressed state

Note - gene-specific TFs normally act in a combinatorial manner to regulate the expression of particular genes

Prokaryotic gene-specific transcription factors are often transcriptional repressors → this is because the DNA is more accessible as the DNA is not compacted in chromatin.

120
Q

How are gene-specific TFs normally regulated?

A

Normally regulated by…

Post-translational modifications and/or by differential intracellular localization

121
Q

What are the two main types of enhancer elements that you find in the eukaryotic genome?

A

Distinct regulatory elements, often called ‘enhancers’, act as binding sites for gene-specific transcription factors

Note that Enhancers are typically several hundred base pairs in length and contain multiple transcription factor binding sites

  1. Some enhancers increase the transcription rate of nearby promoters in a ‘non-specific’ manner → bound by generic TFs such as Sp1
  2. Some enhancers are ‘tissue-specific’ and allow genes to achieve and maintain tissue-specific expression patterns → bound by specialized transcription factors that are only expressed in particular cell-types
122
Q

What does the attached image show?

A
123
Q

What TF binds to the SV-40 enhancer?

A
124
Q

In terms of distance, how far can enhancer elements be from the core promoter site?

A

Some enhancer modules are located close to the core promoter (typically within 1 kb)

In vertebrate promoters’ ‘proximal’ enhancers often contain Sp1 sites

Other enhancers can act over long distances (10 – 1000 kb!) and may therefore be located anywhere with respect to the transcribed region

‘distal’ enhancers can be located anywhere: 5’ end, introns, 3’end

125
Q

What experimental setup can be used to study the distribution of tissue specific TFs?

A

Using Enhancer traps

Basic concept → Make transgenic animals with reporter genes (b-galactosidase) that is linked to a known enhancer.

This is injected into mouse genome and the pattern of b-galactosidase is examined to figure out the tissues that express the specific TF of interest.

126
Q

What property of DNA allows for interaction of distal enhancers with the basal initiation complex?

A

Long-distance action of enhancers is facilitated by the DNA’s flexibility

127
Q

What experimental evidence supports the idea of DNA looping in order to bring together TFs?

A
128
Q

What is a problem that arises when using DNA loops for long range TF interaction? What is a probable solution to this problem?

A

Enhancers can regulate the expression of genes over long distances by looping.

Hence raising the question, how can we get specific regulation of certain genes without interference by other enhancers? Basically, how can we ensure that our enhancer is acting on the correct gene?

Solution → insulator (barrier) proteins block long range interactions

129
Q

What are the three rules for enhancer-promoter interactions that one should consider?

A

Proximity: When multiple genes are compatible with and relatively close to a shared enhancer, the most proximal gene is preferentially activated over the distal gene → This competitive advantage disappears when both genes are located far away from the shared enhancer

Compatibility: Enhancers ignore the ‘first-come, first-served’ rule when the proximal promoter is incompatible with the enhancer → Result: activation of the distal gene

Insulation: The presence of an ‘insulator’ can block enhancer function across its binding site and prevent a compatible gene from being activated

130
Q

How do insulators block unintended interactions?

A

Physically block/prevent long range interactions between enhancers and core promoter’s that reside outside DNA loop

131
Q

How can insulator sites be used to increase expression of transgene?

A

If we try to incorporate a transgene into a new organism it is possible that it randomly integrates in a heterochromatic region → in turn yielding low levels of expression

But if we were to flank our DNA segment with two insulator regions we can create a region that is transcriptional active as the insulators isolate/loop the transgene from the surrounding chromatin

132
Q

How are enhancers normally organised in the genome?

A

Enhancers have a modular nature

Meaning that each enhancer typically contains dozens of binding sites for gene-specific transcription factors which carries out a specific function

i.e. Embryonic development → one of the modules may switch a specific gene on in neural cells, whereas another module may be responsible for the expression of the gene in epidermal cells

133
Q

How does the drosophila even skipped gene act as evidence for the modular nature of enhancers?

A
134
Q

How were the enhancer modules of the even skipped gene in drosophila analysed experimentally?

A

The existence of enhancer modules can be detected by linking different portions of the eve promoter to a reporter gene (e.g. b-galactosidase encoded by E. coli lacZ) → create different constructs containing different regions of the eve promoter

These constructs are then used to create transgenic Drosophila flies → If a given enhancer module is active in a specific region it will also drive the production of the reporter b-galactosidase

Subsequently, X-gal staining (blue color) of the embryos derived from such flies will reveal the expression of the reporter gene in the pattern directed by specific enhancer modules

This technique can also be used to map the location of enhancer modules

135
Q

Do enhancer modules mostly act independently of each other?

A
136
Q

What biochemical techniques can be used to localise important enhancer motifs as well as characterisation of unknown gene-specific transcription factors?

A
137
Q

How to study the importance of enhancer motifs for expression levels?

A

Use different deletion mutants in a transient transfection of cells → examine how levels of reporter gene changes

138
Q

How is the Eve expression concentrated into stripes during drosophila development?

A

Bicoid + Hunchback expression follows a gradient

Giant and Kruppel expression repress → resulting in narrow gene expression

139
Q

What function roles do gene-specific TFs need to carry out?

A
140
Q

Apart from direct activation of the basal initiation complex, how else can gene expression be enhanced?

A
141
Q

What are examples of true activation mechanisms (direct interaction wit B.I.C)?

A
142
Q

What should you keep in mind when considering the different tools available to enhance gene expression?

A
143
Q

Generally speaking, what is the role of gene-specific TFs?

A

The sequence-specific binding of gene-specific transcription factors to these modules allows genes to achieve and maintain controlled levels of (tissue-specific) expression patterns.

Remember that eukaryotic gene-specific transcription factors are usually transcriptional activators → chromatin compacted DNA is naturally repressed.

144
Q

Are there examples of gene-specific TFs that are present in virtually all different cell types and used by many different genes? Is this common?

A

Some gene-specific transcription factors are used in essentially all cell type by almost every gene (‘classic’ example: Sp1 in human cells)

But note that many gene-specific transcription factors are only expressed in particular cell types and thereby convey tissue-specific expression of genes (E.g. Eve)

145
Q

What are the generic components of a gene-specific transcription factor?

A

Generic representation of a gene-specific TFs

DNA binding domain → binds to DNA

Activation domain → interacts with other proteins/TFs/RNAP etc.

Domains are modular, meaning that they are made of different independent modules that carry out specific functions

146
Q

What sorts of physical and chemical interactions underpin Protein-DNA binding?

A
147
Q

How are sequence-specific contacts made? How can TFs recognise and target particular sequences?

A

Electrostatic interactions between proteins and the DNA backbone provide stabilising energy, but not sequence specificity → Given the TF binding does not lead to the DNA denaturation/melting, they have to be able to read the sequence from outside the double helix

How?

Boils down to the pattern of hydrogen bond donors and acceptors on each base in both the major and minor groove.

148
Q

How can Hydrogen bond donor and acceptor patterns in the DNA grooves be used for sequence specific recognition?

A

Absolute recognition of the four different base pairs is only possible via the major groove

Transcription factors binding via the minor groove can only distinguish (A-T; T-A) from (C-G; G-C) - AT from GC

What TF mainly look for?

  1. Asymmetric hydrogen donor/acceptors → pattern of Hydrogen bonds which can be recognised and use to identify the bases
  2. Bulky methyl group on T residues → in blue circle is often used as a point of recognition

Minor Groove - Hydrogen bonds acceptors/donors in minor groove are symmetrical but the G-C bases have an extra donor group → allowing TFs to distinguish A-T from G-C

Major Groove - Asymmetrical Hydrogen donors and acceptors + presence of methyl group allows to recognise each base pair

149
Q

What are the standard DNA-binding motifs?

A

Only a small number of ‘standard’ DNA-binding motifs (they mostly use the ‘a-helix in major groove’ method) → most TFs use these or a variant thereof

  1. Helix-turn-helix domain
  2. Leucine Zipper domain
  3. Helix-loop-helix domain
  4. Zinc-finger domains
150
Q

What is the Helix-turn-helix DNA binding domain? Where are they normally found?

A

Helix-turn-helix (‘H-T-H’) motifs are DNA-binding motifs used in many bacterial transcription factors

e.g. lac repressor, CAP protein, bacteriophage lrepressor

H-T-H motifs are also present in eukaryotic transcription factors that are specifically expressed during embryogenesis and determine regional differentiation along the body axis (‘homoeotic genes’)

151
Q

Do H-T-H domains normally dimerise?

A
152
Q

What is the Ubx/Exd Homoeodomain Complex (example of H-T-H)?

A

Ubx (‘Ultrabithorax’, red) and Exd (‘Extradenticle’, cyan) homeodomains bind in a tandem manner on opposite faces of the DNA → Ubx recognises sequence specific targets in DNA whereas Exd aids in Ubx binding

These TFs play an important role in the proper development of body sections during Drosophila development.

Extra - The dashed red line represents the disordered linker between the Ubx homeodomain and its YPWM motif. The YPWM motif reaches into a hydrophobic pocket on the surface of the Exd homeodomain.

153
Q

What happens when the homeotic genes are expressed in the correct spatial-temporal order?

A

We get the incorrect the development of body segments - e.g. sacral vertebrate converter to lumbar in mice

Note - Homeotic genes are master regulator genes that direct the development of particular body segments or structures in a the correct spatial-temporal manner

154
Q

Why is an alpha helix used to make DNA specific contacts?

A

The side-chains of amino acids present in the recognition helix make extensive and sequence-specific contacts with the base pairs via the major groove

Basically, good surface complementarity!

This becomes really evident when examining the Van Der Waals radii of structures.

155
Q

Are there H-T-H variants?

A

Yes, there is a lot of variation in H-T-H motifs

Some with Beta-sheets, multi-helical bundles, etc.

But in each case we have a core recognition helix, whereas any other region is responsible making other stabilising contacts with the DNA → regardless it is still considered a H-T-H

156
Q

Are there H-T-H variants?

A

Yes, there is a lot of variation in H-T-H motifs

Some with Beta-sheets, multi-helical bundles, etc.

But in each case we have a core recognition helix, whereas any other region is responsible making other stabilising contacts with the DNA → regardless it is still considered a H-T-H

157
Q

What is the Leucine zipper domain?

A

Leucine Zipper Domain

  • The binding of the a-helices to form the Y-shaped DNA-binding domains
  • The Y shape is held together by hydrophobic interactions between regularly spaced leucine residues on both helices ( i.e. ‘leucine zipper’)
  • From a structural perspective, the leucine-zipper is the simplest DNA-binding motif
158
Q

What are some examples of TFs that use the leucine zipper domain?

A

Several important transcription factors controlling cell proliferation contain ‘leucine-zipper’ DNA-binding domains

  • The yeast gene-specific transcription factor GCN4 uses a homodimeric leucine zipper motif to form a DNA-binding motif
  • The c-JUN/c-FOS heterodimer (‘AP-1’) uses a heterodimeric leucine zipper motif to form a DNA-binding domain → c-jun/c-fos have been identified as proto-oncogenes, i.e. they can be involved in converting a normal cell into a cancer cell if they are mutated
159
Q

On the primary structure level, what does a leucine zipper consist of?

A

Regular spaced hydrophobic residues that stick out on at a given position in the helix in turn allowing for extensive non-polar contacts between two helices - periodicity of 3/4 residues for the hydrophobic amino acids

Allows for the formation of a slightly coiled super-helix with leucine residues interacting with each other

160
Q

What residues are normally found near the DNA in a leucine zipper?

A

DNA binding → We typically find highly positively charged residues allowing for interactions with negatively charged DNA backbone

161
Q

In the case for jun/c-fos binding to DNA, what other protein gets involved?

A

NFAT à TF that aids and stabilizes the binding of C-JUN and C-FOS → cooperative binding → increase the binding half life

162
Q

What is the Helix-loop-helix DNA binding domain?

A

Helix-Loop-Helix → Remember that it is different to H-T-H

The H-L-H motif is a slightly more complex version of the ‘leucine zipper’ motif because it includes a loop interrupting the alpha-helices → this reverses the direction of the helix

The binding of the alpha-helices to form the Y-shaped DNA-binding domains still involves hydrophobic interactions between regularly-spaced leucine residues on both helices (-> ‘leucine zipper’)

163
Q

What are some examples of TFs that use a H-L-H DNA binding domain?

A

Several important transcription factors acting as oncoproteins contain helix-loop-helix (‘H-L-H’) DNA-binding domains

  • The c-MYC/MAX heterodimer and MAX/MAX homodimer uses a helix-loop-helix motif to form a DNA-binding domain
  • Several other proteins regulating various developmental and physiological processes in cells - the bHLH-PAS proteins (PAS: Per-ARNT-SIM, the first three transcription factors containing such a domain arrangement)

Regardless, DNA binding region contains positively charged residues → e.g. Max homodimer has a high density of arginine and lysine residues

164
Q

What are Zinc-finger DNA binding domains?

A
  • Zinc finger domains contain a zinc atom that is coordinated by cysteine and/or histidine residues
  • The zinc-finger motif is made from a beta-turn and an alpha-helix held together by
    a Zn atom, which is coordinated by cysteine (C) and histidine (H) residues → Forms a compact DNA binding domain (zinc acts as a molecular glue between the two 2o structures)
  • Only found in eukaryotes, not in bacteria
  • One of the most widely used DNA-binding motif!
  • BUT: some zinc fingers-containing proteins bind to RNA or proteins
165
Q

Why are zinc fingers a popular motif in eukaryotes?

A
  1. Short sequence/Small motif that is folded in a compact manner → Zinc acts as glue bringing the helix and B-turn together
  2. Can be used in tandem repeats → Each Zinc-finger helix recognises a 3-nucleotide target site in the DNA

Hence, longer & more complex DNA sequences can be recognised by arranging Zinc-fingers with different sequence recognition ability in tandem - allows for a higher level of sequence specificity!

166
Q

What structure in the Zinc-finger is responsible For DNA contacts?

A
167
Q

Are there any examples where a Alpha-helix is not used for sequence specific contacts?

A

Ribbon-Helix-Helix motifs places b-sheet into the major groove

Attached example → uses B-sheet to bind and interact with major groove → not as relevant in humans but present in bacteria

This functionally diverse protein superfamily regulates the transcription of genes that are involved in the….

Uptake of metals, amino-acid biosynthesis, cell division, the control of plasmid copy number, the lytic cycle of bacteriophages and, other cellular processes in bacteria.

168
Q

Why is p53 TF unique?

A

P53 TF → no other protein that is similar to p53 → a lot of effort to put the central helix into the major groove

Complex structure that is prone to mutation for a simple helix contact

169
Q

What does the attached image, derived from a paper published about p53, show us?

A

Paper → mapping frequency of mutations in DNA binding domains + table showing frequency of each specific mutation in tumours

High frequency of arginine mutations (DNA contacts)

Two types of mutations found:

  1. Structural → not direct effecting the DNA binding domain
  2. DNA-contact → direct mutation on DNA binding
170
Q

Does DNA binding of TFs in pairs (2 or more) influence the sequence specificity of each respective TF?

A

Yes, binding of transcription factor pairs alters their binding specificity

Binding of two TF next two each other can change specificity → combinatorial effect

Building a sequence logo for binding independently and together we can observe some differences

It is possible that binding alters the local structure/flexibility of DNA which changes the binding recognition of the other TF

Hence, it is important not not ro view the TFs in isolation

Example → ELK1 is more conservative in the duo pair whereas GCM1 more liberal in it’s choices (recognition changes)

171
Q

Do gene-specific TFs normally make direct contacts with RNAPII or basal factors?

A
172
Q

What is the mediator complex?

A
173
Q

What is the subunit composition of the mediator complex? Does it exhibit activity in-vitro after purification?

A

Mediator is a large complex consisting of ~20 different subunits

Purified Mediator has several biochemical activities in vitro:

  • it enables transcriptional activation (‘coactivator function’)
  • it stimulates the TFIIH-mediated phosphorylation of the RNAP CTD → stimulating the transition from initiation to elongation
174
Q

What triggers the release of mediator from the RNAPII complex?

A
175
Q

Are there other Co-activator complexes that have been identified?

A

Mediator → one of many examples → TFIID and Mediator probably only act as coactivators for transcription factors bound close to the transcription start site (proximal)

Other transcription factors binding further away use other coactivator complexes, such as CBP/p300, P/CAF, TRAP/DRIP/ARC

A lot of the co-activator complexes have histone acetylation activity → relax chromatin structure → open up DNA to other elements of the TF machinery

Problem → In Higher eukaryotes these complexes appear quite fragile → not stable for biochemical analysis

176
Q

What are the characteristic properties of activation domains?

A

Activation domain in solution is unstructured but the coactivator subunits are ordered protein surfaces →

Initial association between AD and CA is mainly driven by electrostatic complementarity → results in somewhat chaotic/disorganized binding

Slower step → we obtain a more ordered structure formed driven by hydrophobic interactions

Why is this useful? Allows activation domains to interact interact with a variety of co-activators → flexibility + promiscuity of binding partners

177
Q

What are some examples of common activation domains?

A
178
Q

What message is the following image trying to portray?

A
179
Q

Does the transcription factor Myc also exhibit a high degree of disorganisation in it’s activation domain?

A

Yes, Myc does exhibit a intrinsically disordered protein structure in it’s activation domain.

When observing the DNA sequence, we observe that the first 88 residues contain Myc boxes which are islands containing higher levels of conservation, in particular the bulky amino acid.

When observing the secondary structure, we see a relatively low propensity of secondary structures forming (highest is 40%) → indicating a very transient formation

180
Q

In eukaryotes, what proteins show the highest level of disorder in their secondary structure?

A

Looking at the number of Intrinsically disordered proteins in human, yeast and E. Coli for proteins associated with binding, catalysis and transcription

When examining proteins involved in transcription regulation…

E. Coli → low levels of disordered proteins → right skew

Yeast → normal distribution

Human → left -skew distribution

This trend clearly doesn’t not hold up in catalytic proteins → desire higher levels of conservation

181
Q

What is an example of a well-understood TF factor in yeast, specifically with regards to it’s activation domain?

A

The transcription factor GCN4 (from yeast cells) contains a central “acidic activation domain” (cAD) that binds to the MED15 (GAL11) subunit of the Mediator Complex.

Co-activator that recognizes the acidic activation domain is Gal11 → which is a subunit of the mediator complex → ABD1, 2 and 3 can bind to the activation domain on Gcn4

182
Q

What conformational change does the acidic activation domain in GCN4 undergo when interacting with Med15?

A

Upon addition of GAL11 we can visualize (NMR) the transition towards a more organised structure → alpha helical domain forms

183
Q

How is the GAL11-ABD1 domain structured?

A

The GAL11-ABD1 domain is made up of four a-helices linked by short flexible connections

There is a distinct cleft containing a “hydrophobic floor” made up of various amino acids with extensive hydrophobic side chains

(V=valine; Y=tyrosine; W=tryptophan; M=methionine; A=alanine; T=threonine)

The edges of the cleft are mostly positively charged

184
Q

How does the GCN4 complex interact with Gal-11?

A

The initially unstructured GCN4-cAD, when combined with GAL11-ABD1, takes up a polymorphic series of a-helical structures within the GAL11-ABD1 hydrophobic groove

The authors refer to this phenomenon as a “fuzzy” interaction, because it cannot be uniquely defined other than as a family of related structures that interconvert over time

Unlike other molecular interactions, this mode of binding combines order with a degree of chaos

185
Q

What residues are conserved in the GCN4 activation domain?

A
186
Q

What experiment was performed in order to highlight the importance of hydrophobic residues in the activation domain of GCN4?

A

Examine importance of residues using mutagenesis studies – Examined how mutations in the AD influenced ARG3 expression

Table

Top – primary amino acids in AD → Central portion is highlighted

Wild-type → induces a 7-fold induction of ARG3 → indicating activation → in the lanes we have residues that have been substituted and their effect on ARG3 expression

E.g. Alanine → replace by hydrophobic residues → improve ARG3 expression

WT large hydrophobic residues → can NOT improve activity → Look at W, L and F in red → highlighting importance

Graph

Alpha helical propensity à correlates with stimulation propensity à stabilization of alpha helix à improve stimulation potential

187
Q

What tools can be used to study molecular dynamics, both experimentally and computationally?

A
188
Q

Why are molecular dynamic simulations very useful?

A
189
Q

What are the principles of molecular dynamics?

A

Molecular dynamic → fill box with water molecules and ions (Brownian motion) to simulate real physiological conditions using coordinates + predict the forces

Basically, you start with you initial coordinates for our protein(s) of interest and give each atom a specific coordinate in our system → then we examine the forces of the moving molecules and move each atom accordingly → move time forward and repeat process → step-wise calculations of the dynamics of the system

190
Q

In terms of molecular timescales, why are computational simulations extremely useful?

A

Molecular timescales - some events are difficult/not captured experimentally → occurring extremely quickly

Hence, it is useful to adopt MD as it allows us to simulate interactions on this smaller timescale

BUT there is an upper limit → longer timescales extremely computationally demanding

191
Q

What are the computational requirements for running molecular dynamics?

A
192
Q

What did molecular dynamics simulations of GCN4 and Gal11 reveal about the relative contribution of specific residues to binding?

A
193
Q

What is one of the main advantages of using bioinformatics analysis over experimental data?

A

We currently have numerous experimental tools to study in depth the structure and function of any gene of interest in any organism

This, however, can take a lot of time! → months, years, decades …!

During the last few years, especially with the emergence of whole genome sequences and high-throughput technologies, the bioinformatic analysis of gene expression mechanisms has become both feasible and worthwhile

194
Q

What are some gene regulatory elements that can be analysed in-silico?

A
  1. Target sites for gene-specific transcription factors
  2. Identification of possible core promoter sequences
  3. Identification of other regulatory elements, such as nucleosome phasing sequences, chromatin structure, etc.
195
Q

In theory why is in-silico predictions of gene-specific TF binding sites possible? What are some problems with the predictions?

A

Gene-specific transcription factors recognise specific target sequences → hence, it should be possible to identify binding sites by analysing the DNA sequence

But there are some factors that make such predictions difficult….

  1. The majority of potential target sites within a genome are not available for binding because they are packaged into inaccessible chromatin
  2. Gene-specific transcription factors do not have precise requirements for recognizing particular DNA target motifs → i.e. there are some invariant residues (must be present) but others can vary substantially
  3. Naturally occurring sites differ in TF binding affinity → not a black or white, but grey!
  4. Presence of decoy binding sites in the genome
196
Q

What are decoy binding sites?

A

Recently, additional complications, such as ‘decoy sites’ have been identified → these tend to be genuine, but biologically non-functional transcription factor binding sites in the genome → these sites compete with regulatory enhancer modules/promoter regions for binding of transcription factors

Furthermore, decoy sites protect surplus transcription factors from degradation (binding to DNA protects?) → minimises fluctuation of transcription factor concentrations in the nucleus and thus increase stability of gene expression program

197
Q

Experimentally, how is DNA foot printing used to identify the sequences bound by transcription factors?

A
  1. Obtain genomic DNA
  2. Labelled with a fluorescent/radioactive probe/atom
  3. Expose some of the DNA to the TF of interest
  4. Expose the DNA to a restriction enzyme that cleaves DNA independently of the sequence
  5. Run on gel and visualise
  6. When comparing the control to the sample exposed to the protein, we should expect to see regions on the gel that lack DNA cleavage products → TF binding protected this region from degradation.
  7. Given enough data for a specific TF from foot printing analysis (different binding sites) we can align the sequences to gain a consensus.
198
Q

What are sequence logos?

A
199
Q

What does the bit-score in sequence logos tell us?

A

Scale in bits

Bits – unit of information

2 bits give 4 different combinations that allows us to identify 4 bases

Higher bit value → increased conservation

1 bit → half of the sequences have this specific base in the specified position?

When working with amino acids → we need 4 bits as there are more combinations possible

199
Q

What does the bit-score in sequence logos tell us?

A

Scale in bits

Bits – unit of information

2 bits give 4 different combinations that allows us to identify 4 bases

Higher bit value → increased conservation

1 bit → half of the sequences have this specific base in the specified position?

When working with amino acids → we need 4 bits as there are more combinations possible

200
Q

What is an example of an advanced sequence logo that can be included into the analysis?

A
201
Q

What is an example of an ‘alternative’ sequence logo that is also used?

A
202
Q

What the the terms ‘input’ and ‘output’ refer in the world of pattern recognition of sequences?

A

Input: a series of short DNA sequences each containing a common pattern (such as a binding site for a transcription factor)

Output: a ‘weight matrix’ or ‘scoring matrix’ description of the common sequence pattern

203
Q

What are Position Specific Scoring Matrices (PSSMs) or PWMs?

A

Basically PSSMs are trained by given a input data of many different sequences which it scans through.

When scanning it counts the frequency of nucleotides at specific positions and converts the overall frequency into log odds in order to normalise the scores (correcting for the ¼ change of having nucleotides there)

This PWM can then be used to scan through sequences of interest and if a high score is produced (matching TF consensus binding site), this indicates the possibility of a potential binding site.

204
Q

What are some problems associated with simple match algorithms (PWM)?

A

Low specificity due to redundancy of PWMs → a lot of noise in the data

Many PSSMs for typical eukaryotic gene-specific transcription factors will be pick up a signal at a frequency of approximately one site per 4 kb of DNA

This is an unrealistic number; many bioinformatically identified sites will not represent biologically active target sequences!

205
Q

What advanced feature can be used to increase PWM accuracy?

A

Additional specificity/biological relevance can be gained by considering that transcription factors often bind cooperatively

Programs can look for clusters of transcription factor target sites → acts as a filter to remove any false positive hits that were identified by only considering TF binding in isolation

206
Q

Example of using TF clustering to eliminate false positive results in in-silico TF binding site predictions?

A
207
Q

On top of PWMs, what other tool can be implemented in order to increase the accuracy of predictions?

A

Bioinformatic approaches can identify potential target site sites for a variety of known transcription factors in genes of interest, but d_o not considering the presence of chromatin and/or higher order structures_ that may have a substantial influence on the regulation → some attempts to address this issue (predicting nucleosome positioning, etc.) → unclear how reliable this is.

But we can use phylogeny in order to compare target sites between species → if a specific site is conserved, it underscores a potential biological function

This is based on the assumption that regulatory elements are more highly conserved than non-functional sequences

208
Q

Under what condition is using phylogenetic tools to identify TF binding sites most powerful?

A

Phylogenetic tools are likely to be most powerful if closely related (‘sensu strictu’) species are used.

Why? Go to far down the evolutionary line the sequences diverge too much → harder to pick up a signal

Example - S. Cerevisiae and S. Pombe are commonly used in labs but are quite distant in terms of evolutionary lineage → Useful to use more closely related species

209
Q

What is an example of an online tool that combines PWMs and phylogenetic foot printing?

A

OOOOoooo BBBy ConTra v3

Keep grinding my brother!

‘If you don’t believe in yourself, who else will?’ Fouad Abiad

210
Q

What did the Drosophila genome comparison study (Birney, E. et al., 2007) reveal about TF binding sites?

A

Genomes of 12 Drosophila species have been analysed comparatively

  1. Regulatory motifs tolerate local movement and nucleotide substitutions consistent with their degeneracy patterns (show variation in the expected regions)
  2. A large number of motifs were depleted in coding sequence and in 3’ UTRs suggesting specific exclusion
  3. Many of the intergenic or intronic motifs occur in clusters → confirms the concept of enhance modules
211
Q

What are Polyglutamine diseases?

A

At least 8 fatal neurodegenerative diseases are due to expanded polyglutamine tracts present in some proteins

All these diseases result in progressive loss of motor and cognitive functions due to neuronal degeneration in the central nervous system

212
Q

Outline what happens in the polyglutamine exranpsion disease, Kennedy’s Disease?

A

Normally the androgen receptor (male hormone receptor) binds to testosterone, resulting in hormone translocation into the nucleus à where it acts as a gene specific TF

There is normally a polyglutamine stretch in the healthy copy of the gene but in Kennedy’s disease we get excessive expansion → 2/3x as long

What’s the consequence of this?

Protein’s aggregate together to form inclusion bodies, resulting in a neurotoxic phenotype.

213
Q

What are the Cellular and Molecular Effects of polyglutamine expansion i.e. why is it pathological?

A
214
Q

How does a protein with polyglutamine expansion create inclusions? What are the toxic effects of having high levels of misfolded/aggregated proteins?

A

PolyQ → misfolded fragments form oligomers, which associates together to form inclusion bodies

Various toxic effects

  1. Transcriptional alteration
  2. Metabolic dysfunction
  3. Proteasome impairment
  4. Stress response abnormalities

Note - Toxic effects are most apparent in neuronal cells

215
Q

What is Huntington’s disease? What pathology do we observe in Huntington’s Disease?

A

Huntington’s disease (HD) is an inherited disorder that causes neurons to die in various areas of the brain → Caused by DNA mutation leading to CAG (glutamine) expansion

Post-mortem brain from a patient with Huntingdon’s Disease showing massive atrophy of the striatum and corresponding expansion of the lateral vesicles

216
Q

How is the transcriptional affected by PolyQ expansion?

A

The transcriptional machinery is affected in polyglutamine diseases

Many aberrant interactions between expanded polyglutamine proteins and transcriptional factors/co-factors have been described CREB-binding protein (CBP), p300/CBP-associated factor, (p/CAF), p53, Sp1, and TAF4

This causes depletion of transcription factors from normal nuclear locations and disruption of regulation of target genes

217
Q

How have Yeast models been useful to study the polyglutamine phenotype?

A

Yeast models have been a useful tool in understanding the influence of PolyQ expansion

Proteins containing 23 or 75 glutamine would be expressed in cytoplasm or nucleus (Either with or without nuclear localization sequence)

Subsequently, gene expression profiles were determined using DNA microarrays. The mutant expression profiles would be compared with the WT.

Results

Key - Dark color (decreased expression) / White/pale color (expression not significantly effected)

23 + 75 → cytoplasm → doing something but not a lot

N23 + N75 → nuclear localisation → significant downregulation in gene expression → specifically effecting genes involved in transcriptional and replicative processes → results match SAGA complex (HAT) mutants which is required to keep the chromatin structure open.

Note - Induced genes mimic a mild heat-shock response → normally happens when cells are stressed

218
Q

What important protein does the polyglutamine-containing domain of huntingtin bind to? (hint- linked to transcriptional regulation)

A
219
Q

What is an important link between Polyglutamine expansion and HATs? What consequences is this having for transcriptional regulation?

A

Polyglutamine-containing aggregates bind histone acetyltransferases (HATs) and inactivate them

HATs are responsible for the acetylation of histones (Lysine’s on histone tails) and basal transcription factors

The altered balance between protein acetylation and deacetylation may be a key factor underlying expanded polyglutamine-induced pathogenesis

The restoration of this balance may be possible by the genetic or pharmacological reduction of the opposing enzyme group, i.e. the histone deacetylases (HDACs) → image highlights potential HDAC inhibitors

220
Q

Is there evidence to suggest that HDAC inhibitors may be effective at rescuing cells expressing polyglutamine expanded peptides?

A

Experiments conducted in Drosophila Eye (contains neurons)

Photoneurons shown in image - normally 7 neurons in each group

Q48 → regular neuronal patterns becomes distorted – cytotoxicity

Q48 + butyrate and SAHA → normal pattern somewhat regained

221
Q

What is one practical problem with using HDAC inhibitors to tackle polyglutamine diseases?

A

Wide variety of HDACs (classes) present in humans → various different inhibitors → not one size fits all

222
Q

What are other safety considerations associated with inhibiting HDACs?

A

HDAC inhibition is associated with a range of disorders, ranging from cancers to neurodegenerative diseases.

For example, in mice a loss of HDAC3 is behaviourally deleterious: progressive neurodegeneration

223
Q

How would you summarise the research on polyglutamine diseases so far?

A
224
Q

Outline the basic mechanism by which the oestrogen receptor modulates gene expression.

A
  1. Estrogen receptors is located in the cytoplasm
  2. Estrogen diffuses across cell membrane (hydrophobic) → binds to the receptor
  3. Proteins covering (heat shock and cyclophilins) the NLS signal are displaced
  4. NLS drives movement of ER into the nucleus
  5. ER receptor undergoes PTMs (Phosphorylation) & dimerization
  6. ER binds to target sites in the genome → acting as a transcriptional activator
225
Q

What are estrogen’s biological effects?

A
226
Q

Outline the structure of the estrogen receptor

A

The estrogen receptor consists of 2 activation domains, hormone binding domain, DNA binding & dimerisation domain and the NLS signal

227
Q

What are the four members of the estrogen family? Which member is most predominant?

A
228
Q

What is tamoxifen? (Estrogen receptor)

A
229
Q

Why can tamoxifen help prevent/delay breaks cancer occurrence?

A
230
Q

How exactly does tamoxifen binding and interact with the ER?

A

Basically…

When estrogen binds it gets buried inside the estrogen receptor → placing the activation loop (‘domain’) in the active state

However, when tamoxifen binds → the signaling loop gets placed in the inactive conformation → no longer functionally active

231
Q

How does tamoxifen specifically influence the two activation loops/domains in the ER?

A

Inhibits expression of TGF-a expression → AF-2 is no longer active → limiting cell growth & proliferation

Stimulatory effect on TGF-B3 expression → AF-1 can still bind and function as an activation domain → positive effect on bone maintenance

Changing equilibrium of Activation domains → changing TF activity

232
Q

What are some problems associated with tamoxifen?

A
233
Q

What can we do to identify patients that are suitable for anti-estrogen treatment using tamoxifen?

A

Tamoxifen resistance arises due to mutations in ER + loss ER expression → Hence, since the effects of tamoxifen are primarily mediated through the ER, and the degree of ER expression is a strong predictor of responses to tamoxifen, loss of ER expression (or mutations in ER) result in resistance to therapy

Hence, measurement of ER levels in breast cancer tissues (common used in practise) or sequencing for mutations in the ligand binding domain (common mutation) can indicate whether tamoxifen treatment is applicable

Highlights the importance of real-time measurements & sequencing → shed light on resistance + guide therapies (personalised approach)

234
Q

What have recent advances in transcriptomic and genomic sequencing allowed us to do for human disease detection?

A

Large-scale integration of genetic variation data, extensive annotation of functional genomic elements, along with the ability to measure global transcription, allows the impacts of genetic variants on gene expression to be detected

Genomic variation includes…

  1. Single-nucleotide
  2. Multinucleotide variants
  3. Short insertions or deletions (indels)
  4. Larger copy number variants
  5. Similarly sized copy neutral inversions and translocations

But importantly there is also an abundance of cis-regulatory variation in the human genome. Regulatory variation may be the key primary effect contributing to phenotypic variation in humans

235
Q

What are the two ways in which variation in gene expression can manifest itself?

A

Variation in gene expression can be manifested in two ways:

  1. Changes to transcript sequences and isoforms by coding variants
  2. Changes to transcript abundance by dosage or regulatory variants
236
Q

What are the two main ways in which variation can contribute towards disease?

A

Variation can be divided into…

  1. Single mutations that play a central role in disease pathogenesis - i.e. a single mutation in a transcription factor which results in clear disease phenotype → Duffy Blood Group & Antigen Hemophilia B Leyden
  2. Variation alters the susceptibility for a particular diseases as well as an individuals response to drugs and treatment - more common
237
Q

What is the Duffy antigen receptor?

A
238
Q

What mutation is found in the DARC promoter region blocks DARC expression?

A
239
Q

What is the Hemophilia B Leyden disorder?

A

Factor IX (F9) is an essential serine protease that participates in the intrinsic pathway of coagulation.

The enzyme is exclusively expressed in the liver

(Rare) mutations in the X chromosome-linked gene encoding Factor IX results in hemophilia B

In a variant form, Hemophilia B Leyden, affected males have less than 1% of normal Factor IX activity until puberty, but after puberty levels rise to around 60% of normal activity - Why? → during puberty androgen receptor binds helps drive expression

All Leyden patients have point mutations located within 40 bp of the transcription initiation site (promoter region)

240
Q

Where are factor 9 mutations in haemophilia B Leyden located?

A

Note - androgen receptor has some overlap with mutant sites but the majority lies outside

241
Q

How do mutations in coding regions of basal and gene-specific transcription factors manifest on a disease phenotype level?

A

Mutations within the coding region of transcription factors

Mutations in genes encoding components of the basal transcriptional machinery are likely to be lethal

Many gene-specific transcription (more likely to arise than basal mutations) factors play a key role during development and/or are involved in physiological maintenance → Mutations affecting their function will in most cases have pathological consequences, ranging from mild to severe

Example - PAX family of transcription factors

242
Q

What are PAX transcription factors?

A

PAX factors are a highly conserved family of transcription factors containing a helix-turn-helix DNA-binding domain

‘Paired axial’ (‘Pax’) genes belong to the homeobox gene family of transcription factors → The first gene of this class, paired, was originally discovered in Drosophila where it causes segmentation defect

In higher vertebrates nine members of the PAX family have been isolated which are classified into four groups

The expression of Pax genes is temporally and spatially restricted during development of CNS and various other organs

243
Q

How many PAX genes are present in the human genome? In what tissues is PAX active?

A

The human genome contains 9 different PAX gees. Heterozygous loss-of-function mutations in PAX genes are associated with disease phenotype.

PAX influence gene expression in a wide range of tissues/cells across the human body - these cells share a common developmental origin

244
Q

How could you organise the different developmental pathways influenced by PAX activity?

A
  1. Proliferation and differentiation
  2. Hormonal regulation
  3. Organogenesis
  4. Cell adhesion and migration
245
Q

What are the three PAX genes in humans that are known to be mutated?

A

Pax genes with common mutations - PAX 2, 3 and 6

What does this tell us about the other genes?

Mutation in the other genes most likely results in lethality

246
Q

What is Waardenburg syndrome?

A

Not a serious condition → differences in pigmentation

247
Q

What mutations result in Waardenburg syndrome?

A

Mutations in the DNA binding domain (Paired domain)

Paired domain is basically two H-T-H domains that are linked together by a flexible linker

248
Q

What do mutations in PAX6 cause?

A

Not a serious condition → differences in pigmentation

249
Q

Where is PAX6 expressed in the eye development?

A
250
Q

Outline the organisation of the Pax6 gene?

A
251
Q

Does Pax6 have a large & complex interaction network?

A

Yeahhhhh BUDDDYYYY

252
Q

Where are the deleterious mutations located in the Pax6 gene?

A
253
Q

What is an example of a functional assay that can be carried out to study WT and mutant forms of Pax6 (or any other TF really)?

A

Transfection of Plasmids

  1. One encoding Pax6 gene (or TF of interest)
  2. One encoding a reporter gene with the Pax6 binding site
  3. Examine expression levels of reporter gene (Luciferase or GFP)

Old example - Using Chloramphenicol acetyltransferase (CAT) - assay with thin-layer chromatography → more CAT reporter expression more acetylation of Chloramphenicol which appears as a separate dot during chromatography

254
Q

What type of Pax6 mutation leads to Aniridia (loss of iris)?

A
255
Q

What do homozygous Pax6 mutations result in?

A