W1 - Genome Structure Flashcards

1
Q

What is DNA?

A

DNA is deoxyribonucleic acid
* It is a macromolecule consisting of a linear strand of nucleotides
* Single linear strands bind to complementary strands to form double-stranded DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a single strand of DNA consist of?

A

5’ and 3’ carbons are indicated – numbering starts at the carbon
closest to the base.
* Sequence is 5’->3’ by convention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does a double stranded DNA consist of?

A

Antiparallel, complementary base pairing.
The 5’ and 3’ go in diagonal and opposite directions making it antiparallel.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can DNA be seen in a 3D format?

A
  • Two antiparallel strands of DNA
  • Bases “stacked”
  • Two grooves
  • Major
  • Minor
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How is there genome variation?

A
  • Human genome is 3 x 10^9 base pairs – 3Gbp
  • It contains ~20 000 genes.
  • There is a trend for simpler organisms to have fewer genes,
    for example, flies have 10,000, yeast 4,000, bacteria 1,000
  • However, genome size is not strongly related to complexity of an organism - marbled lungfish is 130Gbp and Paris japonica is 149Gbp
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the solution to the DNA packing problem?

A

Nuclear DNA = 2m per cell

Solution to packing:
* Basic (+vely charged) proteins that binds the negatively charged DNA
* Eight histones 2x(H2A+H2B+H3+H4) form the nucleosome
* Histone 1 binds the linker DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the nucleosome structure with the DNA packed?

A

After being packed into histones, the DNA then wound around themselves to form fibres. They then wound further to form an extended chromosome. This is again wound to form loops of chromatin fibre to then produce the metaphase chromosome - the densest form of DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the definition of an exome?

A
  • The exome is made up of gene sequences
  • Some definitions use all of the coding sequences (~37 Mbp – 1.2% of genome)
  • Some definitions use all of the gene sequences (~60Mbp – 2% of genome)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a gene?

A
  • All of the DNA that is transcribed into RNA plus all of the cis-linked (local) control regions that are required to ensure quantitatively appropriate tissue-specific expression of the final protein
  • It is NOT just the bits that encode the final protein, regulation of the gene is
    very important.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the structure of a gene?

A

Structure of chromosome is: Genes, Intergenic region (98%), Genes.

These genes can be very different sizes.
Intergenic regions contain sequences of unknown function, such as repetitive DNA, endogenous retroviruses, pseudogenes. They may contain many regulatory elements.

Genes often cluster in families – e.g. globin clusters:
- allows for co-ordinated gene regulation
- may just reflect evolutionary history

Structure of a gene: Promotor (CAATboxes and TATA boxes) and within the transcriptional unit, multiple exons and introns. Transcriptional initiation and termination. There are also 5’ UTR and 3’ UTR (untranslated region).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are introns and what is their purpose?

A
  • Vary in number – from 0 to over 300
  • Vary in size - 30bp to 1Mbp
  • Some introns contain other genes

There is always one less intron than exons - none exist after final exon.

Purpose: Allow for splicing - multiple different forms of the same gene.

They do not exist in the matured RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a promoter?

A
  • Promoters recruit RNA polymerase to a DNA template
  • RNA polymerase binds asymmetrically and can only move 5’ to 3’
  • Regulation occurs via transcription factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a regulatory element?

A

Regulatory element
(needed to regulate recruitment
of RNA polymerase)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a TATA box?

A

“TATA box”
(needed to recruit general
transcription factors and RNA
polymerase)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are enhancers?

A

Enhancers upregulate gene expression – they are short sequences that can be in the gene or many kilobases distant. They are targets for transcription factors (activators).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are silencers?

A
  • Silencers downregulate gene expression. They are also position-independent and are also targets for transcription factors (repressors).
17
Q

What are insulators?

A
  • Insulators are short sequences that act to prevent enhancers/silencers influencing other genes
18
Q

What is transcription?

A
  • Messenger RNA synthesis (transcription) is catalysed by RNA Polymerase II
  • Transcribes in 5’ to 3’ direction
  • Transcribes everything after the transcription start site
    (exons and introns)
  • mRNA is post-transcriptionally modified
19
Q

What are the detailed stages of transcription?

A

RNA polymerase ii recognises promoters efficiently with the assistance of many other transcription factors.

1) The promotor and transcription unit exists.
2) RNA polymerase comes in and unwinds local DNA helix
3) RNA synthesis begins by unwinding from the 5’ to 3’ direction
4) Elongation
5) Termination
6) RNA polymerase dissociates with the pre-mRNA made.

20
Q

What is the post-translational modification of DNA?

A

*Capped at 5’ end
*Spliced - introns removed
*Polyadenylated at 3’ end

21
Q

What is the 5’ cap?

A

After 25-30nts are synthesised, a methylated cap is added to the 5’ end by three enzyme activities:
* RNA 5’-triphosphatase
* Guanylyltransferase
* N7G-methyltransferase
The first two activities are carried out by a bifunctional capping enzyme (CE)
RNA Pol II is also required

22
Q

How are introns spliced?

A

This is done through a spliceosome, which loops out the intron and connects the two ends of the exon.
2’-5’ linkage is made in a lariat structure.

23
Q

What is the 3’ Poly A tail?

A
  • CPSF (Cleavage and Polyadenylation Stimulating Factor) recognises the PAS (Polyadenylation signal) and acts on cleavage site
  • CSTF (Cleavage Stimulating Factor) recognises GU-rich Downstream Elements (DSE)
  • PAP (Poly-A polymerase) is recruited and adds multiple A bases after cleavage site
  • PAB is Poly-A Binding Protein. Other proteins appear to be required for this process – CFIm (Cleavage Factor Im), CFIIm and Simplekin
24
Q

What are the steps of translation?

A

The mRNA will be transported to a ribosome. The codons on the mRNA will code for anticodons carried by tRNA. Each different tRNA is covalently linked to an amino acid.

AUG (methionine) initiate translation - aka start codon. When this binds to the mRNA within the ribosome, this creates an initiation complex. Another tRNA will come to join the adjacent codon and the first will leave. This continues until a stop codon is reached.

25
Q

What is alternative splicing and how is it useful?

A

Exons can be skipped or added, so many variations of a protein (called isoforms) can be produced from the same gene. Proteins created are very complex as a result.

26
Q

What is the 3D genome structure?

A
  • Most of the time, DNA is not organised into chromosomes
  • In somatic cells the nuclear DNA is arranged non-randomly
  • Organisation has been identified using Hi-C (detects genomic DNA sequences in close proximity) and high-throughput microscopy
  • Involves CTCF protein and Cohesin protein complex, as well as transcription machinery
27
Q

What are the different compartments of a 3D DNA structure?

A
  • The genome can be separated
    into compartments:
  • Compartment A –
    transcriptionally active with active histone modifications
  • Compartment B –
    transcriptionally repressed with repressive histone modifications
  • These are interspersed
    throughout the 2D sequence but
    the same compartment types are
    brought close together in the 3D
    genome
28
Q

What are the Topologically Associated Domains (TADs)?

A

Topologically means close together in 3D space.
* Individual compartments are made up of several non-interacting sub-compartments
* These are Topologically-Associated Domains (TADs)
* They are usually separated by the Transcriptional Repressor CTCF protein

29
Q

What is 3D transcription control (Loop extrusion model of control of transcription)?

A

We have DNA bound by CTCF on either end of it.
A Cohesin Complex comes in and pulls the DNA through itself - this is loop extrusion.
The loop extrusion model enables the DNA to bring together an enhancer and promoter sequence - pulled together from the primary structure for them to interact in 3D space.