Lecture 1 revision Flashcards

1
Q

How many base pairs in human genome

A

3 billion over 23 chromosomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How many base pairs in mitochondrial genome

A

16,569bp, circular DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

C-value paradox

A

Explains that the size of the genome of an organism does not correlate with its actual size and complexity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain DNA Melt-Reassociation

A

Denatured ssDNA ->

  • Highly repeated reannealed dsDNA after rapid melt association
  • Moderately repeated reannealed dsDNA after intermediate melt-reassociation
  • Unique reannealed dsDNA after slow melt-association
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What types of molecules are responsible for eukaryotic organisation

A

Single copy

Gene families

Tandem gene arrays

Intermediate repeats (transposable elements)

Simple sequence repetitive DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Overview of human genome groups (pie chart)

A

26% introns

20% LINES

13% SINES

12% Misc, unique sequences

8% misc. heterochromatin

8% LTR retrotransposons

5% segmental duplications

3% DNA transposons

3% simple sequence repeats

2% protein coding genes (20,441)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What percentage of protein coding genes form genome

A

Around 25%, but only 1% of those are exons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Smallest vs largest protein coding gene

A

Smallest: SRY (0.9kb with one 850bp exon)

Largest: DMD (2400kb with 79 180bp exons, and a 30,770bp intron size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explain the DMD gene in ENSEMBL

A

> 2 million base pairs on the X chromosome

Main mRNA is 14kb containing 79 exons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

SRY gene in ENSEMBL

A

~0.7kb on Y chromosome

727bp single exon mRNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What percentage is non-protein coding ?single copy DNA

A

26% of genome is introns

15% is single copy but not part of a protein-coding gene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Functions of single-copy non-coding DNA

A
  • Most is ‘functional’ - Over 80% has >1 biochemical activity
  • Majority can be transcribed
  • 22,219 non-coding genes
  • rRNAs, tRNAs, snRNAs encoded for
  • miRNAs involved in gene regulation (2588 identified)
  • Long non-coding RNAs (14727) - some known to be functional that target regulatory proteins, disease markers, causative agents in disease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Human genes families

A

a-globins- 4

beta-globins - 5

Actin - 15

Keratin type 1 - 19

Beta-tubulins - 19

alpha-tubulins - 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Beta cluster

A

-> epsilon

-> Gy

-> Ay

-> (pesudoB)

-> delta

-> Beta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

alpha cluster

A

-> zeta

-> Pseduozeta

->Pseudoalpha

->Pseudoalpha

-> a2

-> a1

-> theta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explain tandemly arrayed genes

A
  • Gene clusters created by tandem duplications
  • Gene duplication = copies next to original
  • Encode large numbers of genes at a time (2-100s)

-14-17% of rat, mouse and human coding genome to allow faster transcription

17
Q

Tandem clusters of rRNA encoding genes

A

human embryo has 5-10 million ribosomes

Cell number doubles in 24 hours

single RNA gene not enough to provide RNA, but tandem repeats allow RNA production

RNA polymerases for transcription required

18
Q

Retrotransposons

A

Transpose via RNA intermediate:

Viral: retrovirus-like e.g. endogenous retroviruses

Non-viral - SINEs and processed pseudogenes

19
Q

DNA-DNA transposable elements

A

Transpose directly from DNA to DNA

Similar to bacterial transposons - none active in human genome

20
Q

Why are eukaryotic transposable elements important

A

Important in genome evolution

source of regulatory elements

Recombination sites

Insertions can cause disease

21
Q

Retrovirus/retrotransposon life cycle

A

Retrovirus enters cell

-> RNA

-> Provirus

-> RNA

-> Retrovirus

22
Q

Viral retrotransposons

A

LTR-gag-pol-int-env-LTR

gag - Group antigens
pol - Reverse transcriptase
Env - Envelope protein

23
Q

LINE-1 element

A

> 500,000 copies in genome

  • 1-6kb in length
  • 40-50 are active
  • ORF1 - 1137bp - homology to gag
  • ORF2 - 3900bp - homology to pol
  • NO LTRs

Target-site direct repeat -> Multiple stop codons (1kb) -> Coding region (1kb for ORF1, 4kb for ORF2) -> A/T rich region

24
Q

Timing and tissue specificity of L1 transposition

A
  • Mostly repressed (methylation)
  • Demethylation and increased transposition in tumours
  • Germ cells (many unique new insertions)
  • Early embryos (somatic cells)
  • Neural progenitor cells during childhood - each human is a unique mosaic
25
Q

Transposable element composition in human genome is…

A

30%

Of the 30 percent:

Non-ME sequence - 33%
ME and repeat remnants - 21%
17.6% - LINE1
Alu - 10.7%
ERV - 8.9%
LINE2 - 3.5%
DNA - 3.4%
Non-ME repeats - 2%
Others MEs - 0.7%
SVA - 0.1%

26
Q

Non-viral elements

A

SINEs (13% of genome):
- Genomic copies of small RNAs
- Most belong to Alu family (7SL RNA)
- Also copies of snRNAs and tRNAs

Processed pseudogenes (genomics copes of mRNAs)

27
Q

Alu sequences

A

150-300bp

1 million copies (10% human genome)

Found in other vertebrates

occur every ~6kb

transcribed to RNA

transpose using LINE reverse transcriptase

Sites of recombination

28
Q

Explain SINE-VNTR-Alu

A

SVA

non-autonomous hominid specific retrotransposons

Don’t exist in old world monkeys

Several subtypes

Can be transcribed

Mobilize by LINE L1 retrotransponase

Associated with human disease

29
Q

SVA associated diseases (deletions)

A

Leukaemia (2kb insertion in HLA-A, SVA/+, deletion)

Neurofibromatosis 2 (1.7kb insertion into NF2, SVA/+, deletion)

X-linked agammaglobulinemia (XLA) (0.25kb insertion into BTK, SVA/Y, exon skipping)

30
Q

Which transposon types are capable of jumping

A

LINEs, Viral retrotransposon (except LTR-gag-LTR), DNA transposon repeats that contain transponase

31
Q

Tandem repeat DNA (simple sequence DNA)

A

~8% of genome

Repeat unit 2-200bp

  • Array length up to 5,000,000bp (alphoid DNA)
  • Short tandem repeats
  • Mini/microsatellite DNAs
32
Q

Short tandem repeats

A

5% genome

repeat length 1-6bp

Total array length - 100bp

Length variation can affect gene expression ins hereditary conditions like schizophrenia by maybe directly binding transcription factors

STRs enriched in TFs

33
Q

Mini and microsatellite DNA

A

Mini:
15-100bp repeats
Total array length is 0.5-30kbp

Micro
2-5bp repeats
60-200bp array length

Array length is variable - VNTRs or STRs

Used in gene mapping and paternity tests