Eukaryotic Genomes (4-7) Flashcards
How much human DNA is non-coding?
More than 50%
→ 1.5% is exons - coding part
→ as genomes get larger, an increasing proportion of the DNA is non-coding
What makes human genomes so large?
- Gene duplication → families of genes and pseudogenes with often coordinated regulation
- Large introns → often containing retrotransposons
- Transposons → ability to change position in genome - LINES, SINES, retroviruses, retrotransposons
- Repetitive DNA → simple sequence repeats, segmental duplications
- Non-repetitive DNA
What does gene duplication lead to?
Protein-coding genes have relative with which they share common ancestry
→ some exist in families and super-families
→ within a genome, families can be dispersed or clustered
→ maintenance of clusters implies functional co-ordination/regulation
→ expands the amount of DNA
What is the globin gene family?
New genes formed by duplication during evolution
→ transcriptionally regulated gene cluster
→ duplicates can mutate individually - start getting divergent proteins
One ancestral globin → duplication → divergence → haemoglobin & myoglobin (different functions)
How does haemoglobin gene expression change from foetal to adult?
Foetal → express alpha and gamma
Adult → changes to alpha and beta
Demonstrates one of the functions of expanding genome
→ gives the opportunity for developmental regulation - temporal transcription
What are retrotransposons?
Type of genetic element found in genomes - capable of copying themselves into RNA then back to DNA which can be inserted at different locations within the genome
LINEs → long interspersed nuclear elements
SINEs → short interspersed nuclear elements
→ regulate gene expression by affecting chromatin structure, gene transcription and pre-mRNA processing
→ make up >40% of human genome
What is the ‘beads on a string’ observation?
Chromatin first observed in 1974
→ visualised nucleosomes separated by linker DNA
→ with the removal of H1 by low salt conditions
What is a nucleosome?
145-147 base pairs of DNA wrapped tightly ~1 2/3 turns around a globular protein complex
→ histone octamer: 2 copies of H2A, H2B, H3, H4
What are the 5 major types of histone proteins
H2A, H2B, H3 and H4
H1 → linker histone
→ rich in +ve basic aa which interact with the -ve phosphate in DNA
What did in vitro reconstruction of the nucleosome reveal?
H3 and H4 dimerise then form a tetramer
→ the stable tetramer wraps DNA in a soluble form leaving holes
→ enough space for H2A and H2B to dimerise and join nucleosome
→ N terminal tails protrude from the nucleosome
How is DNA packed?
dsDNA
→ beads on a string
→ packed nucleosomes
→ extended nucleosomes
→ extended chromatin
→ compacted chromatin
→ metaphase chromosome
Each DNA molecule is packaged into a chromosomes 10,000-fold shorter than its extended length
How can you transcribe DNA with the high levels of organisation and nucleosomes present?
Nucleosomes must be removed or remodelled to allow for transcription and replication
What is DNA acetylation?
Reversible modification of histone tail lysine
→ +ve charge of lysine neutralised - proteins involved in transcription can access more easily
→ creates more open form of chromatin
How does the cell choose which histone to be modified by acetylation to open the DNA for transcription?
Gene activator proteins bind chromatin remodelling complex
→ leads to remodelled nucleosomes, histone removal or histone replacement
Can also bind histone-modifying enzymes
→ read predominant chemical modification and copy it to other histones - spread of modification down the length of DNA
What is DNA methylation?
1/2/3 methyl group added to aa
Specific lysine and arginine residues can be modified by methylation
→ can relax or compact chromatin depending on: which residue is methylated and the degree of methylation (context dependant)
What are some other histone code modifications?
Phosphorylation → some serine and threonine resides - promotes transcription
Ubiquitylation → some lysine residues on H2A are recruitment sites for DNA repair, some are activating and on H2B are repressive
Citrullination → some arginine residues on H3 and H4
→ complex histone code determines gene expression - not fully understood
How is the histone code maintained during transcription?
RNA Pol II stalls at nucleosome A
→ unwinds half of the DNA from nucleosome A
→ transcribed DNA is looped starts to wind around nucleosome A, while unwinding continues
Same nucleosome is transferred from in-front to behind the polymerase (from downstream to upstream as transcription proceeds)
How is the pre-replicative complex formed?
The origins of replication are remarked with an ORC (origin recognition complex)
→ Cdc6 binds DNA at each side of ORC and recruits Cdt1
→ they recruit Mcm helices - opens strands of DNA
In G1 the pre-RC is formed but not active
How is the pre-replicative complex activated in S phase?
Cdk2 phosphorylates Cdc6 → falls off and doesn’t recruit Cdt1
→ opens the origin for one round of replication
→ allows recruitment of primes clamp loader and DNA polymerases forming the active replication complex
How is the histone code maintained during replication?
The replication machinery displaces H2A-H2B dimers
→ H3-H4 tetramers displaces and reused?
→ not clear but both new strands inherit them
Another set of histones is made to fill in the gaps
→ immediately acetylated
→ NAP-1 and GAF-1 chaperones guide histones into the available spaces
→ produces rebuilt histones - acetylated keeping structure ope
Acetylation is removed then histone-modifying enzyme/reader-writer complex copies epigenetic information
→ maintenance of histone code
→ transfer of epigenetic history
What is epigenetics?
Heritable changes in gene expression that are not mediated at the DNA sequence level
Chromatin modifications
→ DNA methylation
→ histone modification
How can SINEs regulated gene expression?
Non-epigenetic → act as enhancers or alternative promoters
→ recruit transcription factors and promote expression of nearby genes
Epigenetic → GC-rich so are hot spots for DNA methylation
→ can silence nearly genes stimulating chromatin condensation
Why is transcriptional regulation needed?
Allows development of different tissues
Transition from childhood to adult
Deregulation can result in uncontrolled growth (cancers)
Allows reaction to environmental cues
What 3 ways is transcription controlled?
- Chromatin structure
- RNA polymerase (and general TF) binding specificity
- Additional binding and activation factors
When is chromatin expressed/silenced?
Condensed chromatin → genes with highly packed heterochromatin are usually not expressed - silencing
De-condensed chromatin → expressed genes found in this ‘open’ formation
→ we can open/close chromatin shifting bias between expression and silencing
What is methylation of lysine at position 9 in the histone code associated with?
The formation of heterochromatin - gene silencing
→ H3K9me
→ type of heterochromatin depends on whether its tri- or uni- methylated
What is H327me associated with?
H3 → histone H3
K27me → lysine position 27 methylated
Associated with inactivation of HOX genes
→ developmental genes arrayed in the same order as the segments int he body - key role in organising body plan of embryo
What are some histone code modifications associated with gene expression?
H3S10p H3K14ac → gene expression
H3K4me H3K9ac → gene expression
→ relax DNA, open chromatin structure allowing for gene expression
What is the centromeric histone code modification?
H3K4me2 → double methylation of lysine at position 4
→ keeps structure open
→ important for centromere as must be open at all times
What is theorised about the scaffolding holding interphase chromosome in loops?
The scaffold is poorly understood but looped DNA assumed to be held by many scaffolding proteins
→ contains lots of topoisomerases for the winding and unwinding of DNA
Modification extends loops for high level of gene expression
Where is chromatin within the nucleus?
Heterochromatin → genes tuned off, close to the nuclear membrane associated iwht the lamina where its formed
Active regions → for genes that need to be turned on, the loop is opened up and relaxed
→ then moved to an area where all the requirements for transcription are concentrated
→ there is dynamic movements of genes within the nucleus
What are the two major types of heterochromatin?
Facultative heterochromatin → cell-type specific, can switch to euchromatin following developmental cues, characterised by a specific histone code mark
→ H3K27me3 that binds ‘polycomb’ proteins
Constitutive heterochromatin → regions that are consistently silenced in all cell type of an organism - centromeres, telomeres
→ H3K9me3 - carried out by histone methyltransferases (HMT)
How is the human centromere organised?
Centric chromatin flanked by peri centric chromatin
Centric chromatin → long highly repetitive chromatin structures
Double methylation of lysine 4 (H3K4me2) → gives centromere open structure allows for kinetochore attachment
→ important for function
What are telomeres?
Repetitive DNA sequence found at the end of each of our chromosomes
→ vary in length and repeated DNA sequence
→ vertebrate sequences at over several kb
→ yea several hundred bp
What is the end replication problem?
DNA is synthesised 5’ to 3’
→ leading strand - continuous synthesis
→ lagging strand - Okazaki fragments
Gaps usually repaired by DNA ligase
→ but end gap on lagging strand can’t be repaired because DNA synthesis required 3’-OH - there is no primer
→ means one of the strands made in shorter than the template
→ so telomeres get shorter over time and fray
What is the Hayflick limit?
The number of times a normal somatic, differentiated human cell population will divide before cell divisions stops
→ between 40 and 60 divisions
→ after the cell becomes senescent
→ in-built ageing mechanism
What is the compensatory mechanism for telomere shortening?
Telomerase binds the sing-stranded G-overhang
→ extends the 3’ end of the parental strand using its own RNA subunit as a template (RNA directed DNA synthesis)
→ this provides more room for DNA primate to lay down a primer - which can be extended
(extends our telomeres, gives partial not full compensation for reduction in length of our telomeres)