L1, Genome Organisation Flashcards
Human genome vs Mitochondrial Genome: Details
Human…
- 3 millon bps
- 23 pairs of linear chromosomes
Mit…
- 16,569 bps
- Circular DNA
Prokaryotes: Number of protein coding genes examples
- Mycoplasma genitalium (not free-living): 480
- E.coli (free-living): 4000
- S. cerveisiae (Brewer’s yeast): 6000
Eukaroytes: Number of protein coding genes, examples
- Often a lot of redundancy in mammals
- Arabidopsis Thaliana: 15000
- Fruit flies: 13000
- Mice: 23000
- Human: 20000 (approx. 1% of human genome)
C-value paradox
- ‘Lack of correlation between biological complexity and the intuitively expected protein-coding genomic information or DNA content’
- DNA-complement
- Proportion of junk DNA found to be higher in salamander than human
- In salamander, total DNA is around 5x greater than humans
DNA Melt-Reassociation aka Reassociation Kinetics
- Techinque for establishing broad types of DNA
- Able to separate into highly repeated, moderately repeated and unique fragments
- Measuring how much ssDNA remains and how much dsDNA has formed at given times
- More repetition = more rapid reassociation, easier to find a match
Demonstrate the cot curve for DNA melt-reassociation, comment
- See slide 10
- Fraction reassociated against cot (initial concentration x time for reassociation)
Current Understanding: Broad classes of DNA sequences
- Single copy
- Gene families
- Tandem Gene Arrays
- Intermediate repeats (mostly transposable elements)
- Simple sequence repeat DNA
Single copy DNA: % of genome and exon content
- Makes up about 25% of genome
- Only 1% contained in exons
- Average gene 27kb with 9 exons
Functions of non-coding DNA
- Majority can be transcribed
- 22,219 non-coding genes
- Structural RNAs -rRNAs, tRNAs, snRNAs
- miRNAs - involved in gene regulation
- lncRNA: Target regulatory proteins, disease markers, possible causative agents in disease (e.g. BACE1)
Human Gene Families: What are they? Give 6 examples with no. members
Similar sequences:
- alpha-globins (4)
- beta-globins (5)
- actin (15)
- keratin type I (19)
- beta-tubulin (19)
- alpha-tubulin (10)
What is a pseudogene?
Inactive copy within a cluster
TAGs: What are they, proportion of genomes
- Gene clusters created by tandem duplications
- One gene is duplicated, the copy is next to the original
- Can encode large numbers of genes at a time
- 14-17% of the human, mouse and rat coding genomes
-> faster transcription
TAGs in the human embryo: Why are they particularly useful?
- Human embryo has 5-10 million ribosomes
- Embryonic cell number doubles within 24 hrs; single RNA gene may not be sufficient for RNA demands but tandem repeats of rRNA encoding genes allow a higher output (needs multiple RNA pols transcribing simultaneously)
Transposable elements in the human genome: Proportion, MEs, LINEs
Class length, acronyms
- IM class length (see: Melt Curve Study)
- Make up around 30% of the human genome
- ME: Mobile element
- LINE: Long interspersed nuclear element
Outline the two key types of transposable element in eukaryotes (with examples)
By transposition route
Retrotransposons
Transpose via and RNA intermediate;
- Viral (retrovirus like e.g. Endogenous retroviruses or LINE-like e.g. LINE1, LINE2)
- Non-viral (e.g. SINEs, processed pseudogenes)
DNA-DNA transposable elements
Transpose directly from DNA to DNA. Similar to bacterial transposons
- Non active in human genome