4. Genomes, diversity and habitats Flashcards
Types of bacterial shapes
Coccus Rod Spirillum Spirochete Budding and appendaged bacteria Filamentous bacteria
Bacterial genomes
Small, circular, and information-packed.
General size 150kbp (mycoplasma) to >10Mbp (Actinobacteria).
Supplemented by plasmids and other ‘mobile elements’
E.coli Bacterial genome
4.6Mbp 4300 genes Average gene ~1000bp (1kbp) Single circular molecule Haploid No introns Many genes organised in operons.
Plasmids
Extrachromosomal but can be integrated.
Huge size range (<5kbp to >1Mbp)
Often ‘mobile’ (conjugation)
Confer ‘accessory’ functions - resistance, metabolic, virulence and many others.
Common mobile genetic elements
Plasmids
Bacteriophage
Transposons and insertion sequences
Integrons
Bacteriophages
Bacterial virus
Lytic
Temperature (lysogenic) - Stable insertion in host chromosome.
Can transfer genes - transduction
Transposons and insertion sequences
‘Jumping genes’
Hop in and out of chromosomes and plasmids
Often carry resistance genes
Integrons
Can pick up and accumulate ‘useful’ genes.
EG Antibiotic resistances
‘Mosaic’ bacterial chromosomes
Core genome: ‘Housekeeping’ genes possessed by all strains of a species.
Pan genome: Much larger, totality of genes found across different isolates of a species
‘Pathogenicity’ islands
Cluster of genes of 'foreign' appearance. EG altered G&C content. Present only in certain strains. Correllated with virulence. Various kinds of mobile elements.
Genome sequencing
Each spot contains a short DNA fragment.
Coloured tag incorporated at each step.
Massive number of short sequence reads.
Genome sequencing - method
- Prepare genomic DNA
- Fragment
- Large scale, random ‘shotgun’ sequencing.
- 8-10 billion reads of ~150bp
- Completer genome sequences (96 human genomes in only 48 hours)
Sequence assembly of bacterial genome
Overlapping sequences aligned.
Repetitive sequences interfere with assembly.
Complete genome assembly:
Complete with known sequence.
Use long-read sequencing methods (Oxford Nanopore)
Bioinformatics
Complex process of interpreting DNA sequence data in terms of gene function.
Computerised algorithms identify sequence features.
Amino acid coding regions (Open reading frames)
ORFs converted to amino acid sequences according to the genetic triplet code.
Amino acid sequence comparisons identify likely protein functions by homology to known sequences in other organisms.
Specialised websites collate information about specific genomes.
Impact of genome sequencing
Revolutionise bacterial genetics.
Use comparative genomics to understand bacterial phylogenetics and evolution.
Infectious disease and epidemiology.