Genes, Genomes and Gene Expression Flashcards
The evolving concept of the gene
Different representations of a gene which are all inter-related
•The gene as a unit of inheritance that determines a trait/phenotype
•The gene as physical entity carried on a chromosome
•The gene as a stretch of DNA sequence that is expressed as a gene
product
•The gene as one component of an individual’s genetic make-up
(genome)
genome definition
the genetic material that provides a set of hereditary instructions to construct and maintain an organism
Primary requirements of a genome
A genome is arranged into chromosomes, which encode sequences for:
§synthesis of RNA and cellular proteins
§DNA replication
§segregation of chromosomes during cell division
§compaction of chromosomes
Variation in the size of genomes
C-value: amount of DNA (pg)/haploid cell or number of kilobases (kb)/haploid cell
genome size of an organism does scale with the complexity of an organism
But….correlaNon is not precise especially in eukaryotes –C-value paradox
Prokaryotes <10-Mb
Simple eukaryotes <50-Mb
Complex eukaryotes >50-Mb
Parasitic genomes
Parasites generally have small genomes
Viruses are obligate parasites and use the host’s molecular machinery to reproduce
Viral genomes typically do not encode genes for:
energy production
genome replication
protein synthesis
cell division
Viral genomes typical encode genes for:
coat polypeptides
different life history traits – lysogeny vs lytic pathways
Bacteriophage genomes as an example
Enormous range in genome structure: Nucleic acids: RNA or DNA Strandedness: single or double Shape: linear or circular molecules, sometimes segmented (multiple pieces) Size: 2-kb to 2.5-Mb
Organization of genes within the viral genome
Many viruses have genomes in which genes overlap
-First DNA-based genome to be sequenced -Fred Sanger (1977) fX174 viru
-Two different transcripts from the same genomic region
Transcripts are translated from different start positions
Location of DNA in bacteria
Genome is composed of chromosomal DNA located in a membrane-less region of the cytoplasm –nucleoid Extrachromosomal DNA (plasmids) located in the cytoplasm
Features of bacterial chromosomes
Chromosomes are usually circular, but some are linear
Number of chromosomes may vary between species –unipartite vs multipartite
Genetically haploid –1 copy of each chromosome
Chromosome number does not correlated with gene number
Structure (shape) of bacterial chromosomes
Chromosome needs to be condensed approx. 1,000-fold to fit into the cell
looping (10-fold compaction), supercoiling (100-fold compaction)
looping:
-Base of loops anchored to a protein core (NAPs)
-60 bp wound around a protein tetramer
Supercoiling:
•Reduced/additional turns introduced into DNA helix
•Torsional stress reduced by DNA winding around itself (supercoils)
Gene organization in bacterial genome
§Highdensity of genes
§Genes are frequently arranged into operons
§Non-coding sequences are typically associated with gene regulation
§Low level of repetitive DNA (<1% of the genome)
Origin of replication
Chromosomal DNA needs to be replicated prior to cell division
Replication is initiated at the origin of replication (oriC)
oriCis also required for chromosomal segregation at cell division
DNA replication is bidirectionaland occurs at two replication forks that form a bubble
Eukaryotic chromosomes must have sequences for:
1. DNA replication –origin (ori)
2. Replication for the ends of linear chromosomes oriC occur every 100-kb in eukaryotic chromosomes
Plasmids
Autonomously replicating extrachromosomal DNA molecules found in a wide range of bacteria
Features:
Non-essential
Encode a range of functions
Generally circular double-stranded DNA molecules 2-200kb
Low (1-2) to high (>500) copies per cell
Transferred from cell to cell – cross species boundaries
Traits encoded by plasmids
Bacteriocins
Plasmid encodes toxins that kill other bacteria (bacteriocin) but provide resistance in the host bacterium
Pathogenicity
Plasmid encodes functions required for infection – cholera toxin, food-poisoning, crown gall disease
Catabolism
Plasmid encodes enzymes for degradation of organic molecules – camphor, toluene
F – Fertility
Plasmid encodes genes required for conjugation, formation of Hfr by integration into chromosome
R – Resistance
Plasmid encodes genes for conjugation and resistance to antibiotics, heavy metals, antiseptics
the troublesome R plasmid
- Antibiotic resistant genes are located within transposons on R plasmids
- R plasmids are transmitted between bacteria via conjugation (horizontal gene transfer)
Location of DNA in a eukaryotic cell
Eukaryote genomes are composed of nuclear and organelle DNA
Phenomenon of uniparental/cytoplasmic inheritance suggests that some genetic material resides outside the nucleus
Organelle genomes
Mitochondrial (Mt) and chloroplast (Cp) genomes are found in nucleoids
Multiple copies of chromosomes within each nucleoid
Multiple nucleoids per organelle
All Cp genomes and most Mt genomes are circular, double-stranded DNA and supercoiled
Chromosomes do not encode all the proteins found in the organelle
Cp genomes similar in size and gene content (120 genes, land plants)
Mt genomes display greater variation in size but similar gene content (37 genes, metazoans)
Nuclear genomes
Chromosomes contained within a nuclear membrane
Typically multipartite –multiple linear chromosomes
Chromosomes may be present in one, two or more copies (haploid, diploid, polyploid)
Chromosome number is NOT related to organism complexity or genome size
Chromatin
Nuclear DNA extensively bound to proteins - chromatin
Nucleosomes(DNA wound around a histone) associate with each other to form a more compact structure – 30-nm fibre
The 30-nm fibre is anchored into radial loops through attachment to proteins – nuclear matrix
Higher-order compaction of radial loops to form the metaphase chromosome in a dividing cell
Chromatin structure is dynamic
-Euchromatin –regions where chromatin is less condensed Gene-rich, transcriptional active, unique DNA sequences, histone acetylation
-Modification of histone tails to make them less charged
Heterochromatin –regions where chromatin is more condensed Constitutive - common to all cells – gene poor, transcriptional inactive, repetitive DNA sequences
Facultative – variable between cells – mechanism of gene regulation
Histone tail unmodified, highly charged
cHROMATIN STRUCTURE ARE NOT UNIFORM
Gene organisation in eukaryote vs prokaryote
§Significant increase in gene size (presence of introns)
§Significant increase in the amount of DNA between between genes (intergenic region)
§Amount of intergenic DNA in the genome varies between eukaryotes –explanation for the C-value paradox
(see later lecture)
Telomeres
Specialised structures at the ends of chromosomes
Contain multiple G-rich repeats of short DNA sequence e.g. TTAGGG
Prevent loss of genomic information after each round of DNA replication (see lecture on DNA replication)
Hide the chromosome ends from DNA-repair machinery
Centromeres
Condensed chromosomes have to be separated during cell division which involves spindle microtubule attachment to a specific region of the chromosome called the centromere via a kinetochore
Specialised nucleosome/s at the centromere composed of centromere
protein A (CENP-A)
Centromere sequences –typically located at a single site on a chromosome
DNA replication – a recap of first year knowledge unwinding
DNA replication requires localized unwinding and separation of the DNA strands to produce the single-stranded DNA template – replication fork (a region where the DNA molecule is being unwound)
Key steps in DNA replication – Strand separation
- Helicase separates the two strands of DNA at the replication fork
- Binds to lagging strand and breaks hydrogen bonds between the bases
- Single-stranded (SSb)binding proteins prevent the parental strands from annealing
- Helicase travels in a 5’ to 3’ direction
Priming for DNA synthesis
A 10-12bp single-stranded RNA primer is required to initiate DNA synthesis
Primase synthesizes primers for both the leading and lagging strand
Primase recruited by the helicase – ‘primosome’
DNA synthesis with DNA pol
DNA replication requires the action of two DNA polymerases III enzymes
DNA pol III has 3’ to 5’ proof-reading activity – removal of mismatched nucleotide
RNA primers are required continuously for DNA synthesis of the lagging strand
DNA polymerase extends from multiple RNA primers to generate the Okazaki fragments
Replication in the lagging strand (fill up the gap)
DNA polymerase I removes the RNA primer through 5’-3’ exonuclease activity
DNA pol I fills gap with DNA using the upstream Okazaki fragment as primer
Nicks in the newly synthesized lagging strand are repaired through the action of DNA ligase
The replisome
Proteins associated with the replication fork form a molecular machine – called a replisome
DNA pol III adds 10nt before falling off the template
bclamp + DNA pol III adds 50,000nt before falling off template
Initiation of DNA replication in bacteria
Formation of a replication fork at the oriC begins with strand separation
•DnaA protein complex recruited to oriC– induces localize melting and strand separation within the AT-rich region
•Helicases recruited to the unwound region and rapidly denatures the DNA using ATP
•Replisome recruited
Single-stranded binding (SSBs) proteins protect the ssDNA from nucleases
Replication bubbles
Two replisomes are recruited to oriC
DNA replication proceeds in two directions
Accessory protein – DNA gyrase
DNA stand separation induces torsional stress ahead of the replication fork - supercoiling
DNA gyrase (topoisomerase II) removes supercoiling
-It sever the DNA strand, removing the torsinal stress, unwind the dna and reseal it
Segregation of the bacterial chromosome during cell division
ParB proteins bind to parental oriC and newly replicated oriC mParB associates with ParA
ParA is associated with the pole
ParB dragged to the poles along to a ParA concentration mgradient
Chromosomes are separated prior to cell division
DNA replication in eukaryotes
Principle exactly the same as prokaryotic DNA replication with some added complications:
- Multiple oriper chromosome – need to coordinate their activity within a chromosome and across multiple chromosomes
- End replication problem associated with linear chromosomes
Primer missing at the end of the lagging strand
Terminal gap cannot be filled
Internal gap
Chromosome will get progressively shorter over time
The eukaryotic cell cycle
Mitosis -> G1 (Cell growth and preparation for cell cycle entry) -> S ( DNA REPLICATION)-> G2(Cell prepares for
the M phase) ->M
Initiation of DNA replication in eukaryote
Late M:
Origin Recognition Complex (ORC) binds to origin
Start of G1:
Cdc6 recruited
Helicase and Cdt1 recruited – formation of a prereplicative complex
Start of S phase:
Cdc6, Cdt1, ORC released
Cdc6/Cdt1 destroyed
Localized unwinding of DNA at the origin
DNA pol recruited (blue), replisome formed
Linking DNA replication with cell cycle control
DNA synthesis is regulated by the availability of Cdc6 and Cdt1
Cdc6 phosphorylated by Cyclin E-Cdk2 (only found in G1)
Cdc6/Cdt1 destroyed