Genome structure Flashcards

1
Q

How do we know the difference between a deoxyribose and a ribose?

A

Ribose sugar has a Hydroxyl group on the 2’ carbon. This is absent in deoxyribose. (Both have OH group on 3- carbon)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How many bp does the human genome contain, and how many genes?

A

Human genome is 3 x 10^9 base pairs – 3Gbp.

It contains ~20,000 genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the basis of the ‘Packing problem’?

A

In a nucleated cell there is around 2m of DNA. There is an estimate of >37 trillion cells in your body, which would be 7.44x10^13 metres of DNA. However, the avg cell is 50um in length. Thus, there is a huge packing problem- fitting 2m of DNA in a 50um cell.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the solution to the packing problem?

A

Histones. They are basic (positively charged – perfect as DNA is negatively charged hence they can strongly associate) proteins that bind DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the structure of a histone?

A

Eight histones 2x (H2A+H2B+H3+H4), hence a histone octamer forms the nucleosome. (DNA wraps around a histone octamer twice to make a nucleosome). Histone 1 binds the linker DNA. Nucleosomes referred to as ‘beads on a string’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the order of DNA packing? Smallest to largest.

A

Starts off with DNA double helix, which winds around histones to form nucleosomes. This is then wound further to form chromatin fibre. This is further wound to form extended section of chromosome. Which are from Loops of chromatin fibre, that of which forms part of the metaphase chromosome – the densest form of packaged DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three types of chromosome structures?

A
  • Metacentric. Short arm and long arm equal in length
  • Submetacentric. Contains a short, short arm and long, long arm (unequal in length)
  • Acrocentric. Does not contain short arm. Instead, has satellites.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the primary DNA sequence?

A

The primary DNA sequence encodes all the gene products necessary for a human. It also includes a large number of regulatory signals (non-coding)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the exome?

A

The exome is made up of gene sequences. Some definitions use all of the coding sequences only (~36Mbp – 1.2% of genome). Other definitions use all of the gene sequences (i.e., the whole gene) (~60Mbp – 2% of genome)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a gene?

A

All of the DNA that is transcribed into RNA plus all of the cis-linked (meaning these regions are physically close to the exons On the DNA strand compared to trans regulatory regions that can be on different chromosomes) local controlled regions that are required to ensure quantitatively appropriate tissue-specific expression of the final protein. The gene promoter is usually the source of most of these signals. It typically influences/controls where, when and how much the genes are expressed.
A gene is NOT just the coding sequence (the bits that encode the final protein). This is why regulation of the gene is very important. Gene needs to have all the regulatory signals that allow it to be properly expressed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are intergenic regions?

A

Between genes, there are big gaps which are known as the intergenic reason (~98% of the genome). Intergenic regions contain sequences of no known function, such as repetitive DNA, endogenous retroviruses, pseudogenes, as well as many regulatory elements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the structure of a gene?

A

Genes are often very different in size. They often cluster in families -e.g. globin clusters. This allows for co-ordinated gene regulation. A single gene is broken up into two parts:
Promoter and transcription unit (both containing exons).
The transcription unit is transcribed into RNA. The exons contain coding sequences. Between exons, are Introns. They don’t code, they are initially transcribed but aren’t in the final mRNA. For every n exons you have, you have n-1 Introns e.g 4 exons, 3 introns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the structure of a gene (from beginning to end)?

A

5’ end, Promoter (Basal promoter sequences -CAAT Box , TATA Box – which are transcription factor binding sites), 5’UTR within the first exon (transcription initiation, followed by translation initiation at ATG) , introns and exons, translation termination within last exon, followed by transcription termination, 3’UTR (within last exon)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is within the promoter region?

A

Regulatory element 5’..GGGAAATTCC..3’ ( 3’..CCCTTTAAGG..5’) – Needed to regulate recruitment of RNA polymerase

TATA box 5’..ATATAAATA..3’ (3’..TATATTTAT..5’) – Needed to recruit general transcription factors and RNA polymerase.

Everything after this (Starting with 5’UTR and ending at 3’UTR) is the transcription unit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the promoter?

A

Promoters recruit RNA polymerase to a DNA template. RNA polymerase binds asymmetrically and can only move 5’ to 3’. Regulation occurs via transcription factors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the other regulatory regions associated with gene expression?

A

Enhancers: Upregulate gene expression. They are short sequences that can be in the gene or many kb distant. They are targets for transcription factors. They are ACTIVATORS

Silencers: Downregulate gene expression. They are also position independent and are also targets for transcription factors. They are REPRESSORS

Insulators: Short sequences that act to prevent enhancers/silencers influencing other genes.

17
Q

What are transcription factors?

A

Proteins that enhance or repress the interaction of RNA polymerase II with a particular promoter, as they recognise different parts of the promoter (different signals)

18
Q

Give a brief description of transcription

A

RNA Polymerase is recruited to gene /DNA (closed complex)
DNA helix locally unwound to form open complex so that
RNA synthesis begins in 5’ to 3’ direction
Elongation – polymerase moves along creating a longer complementary RNA strand
Termination
RNA pol dissociates from DNA

19
Q

What are the post-transcriptional modifications of mRNA?

A

Capping at 5’ end

Spliced – introns removed

Polyadenylated at 3’ end.

20
Q

What is the 5’ cap?

A

After 25-30 nucleotides are synthesised, a methylated cap is added to the 5’ end (to ensure that it looks like the 5’ end of the message and to protect it from nuclease activity) by three different enzyme activities:

-RNA 5’-Triphophatase
-Guanylyltransferase
-N7G-methyltransferase.
Guanosine is added by guanylyltransferase and is methylated at the N7 position by N7G-methyltransferase and hence the molecule is called 7-methyguanosine. The first two activities are carried out by a bifunctional capping enzyme (CE). RNA pol II is also required as its involved in positioning enzymes and proteins in right place.

21
Q

How does splicing of introns occur?

A

150 Proteins form the mature spliceosome. The spliceosome catalyses the connection of the A in intron to P at end of exon. Frees up OH at end of blue exon. Once spliced, proteins that are still associated with exon junctions help recruit TREX complex (TRanscription-EXport). This allows for targeting of the mRNA molecule for nuclear export to ER.

22
Q

How does polyadenylation occur?

A

CPSF (Cleavage and Polyadenylation Stimulating Factor) recognises the PAS (polyadenylation signal-AAUAAA) and acts on cleavage site – primary transcript.
CSTF (Cleavage Stimulating Factor) recognises GU-rich downstream Elements (DSE - UGUGU) and helps recruit PAP
PAP (Poly-A-polymerase) is recruited and adds multiple A bases after cleavage site.
PAB is Poly-A Binding Protein. Other proteins appear to be required for this process – CFlm (cleavage factor Im), CFllm and simplekin. These proteins either regulate the process or stabilise the poly-A tail in case of PAB

23
Q

What are 5’ and 3’ based on?

A

5’ and 3’ are numbered based on the carbon atoms of the sugar – Numbering starts at the carbon closest to the base. The base is attached to the first carbon, the phosphate links between the 3’ of one sugar and the 5’ of the adjacent sugar. By convention, single stranded DNA is listed 5’ to 3’

24
Q

What compartments is the genome separated into?

A

The genome can be separated into two compartments:
Compartment A – transcriptionally active with active histone modifications
Compartment B- Transcriptionally repressed with repressive histone modifications.

25
Q

What are TADs?

A

Topically associated domains. They are several non-interacting sub-compartments that make up individual compartments. They are usually separated by the Transcriptional repressor (CTCF protein)