Module 4 Flashcards
Genomics
Analysis of all the genetic material in organism including genes, regulatory regions, interactions, functions, expression
Genome
Includes all Chromosomes and plasmids that constitutes and organisms DNA
Short protein coding regions
Microbial genomes
Smaller genomes
Restricted ecological niches
Small genomes
Carsonellaruddii, 160kba
Large genome
Sorongiumcellulosome, 13Mb
First genome and genome sequenced
Bacteriophage MS2 (1972 and 1976) - RNA genome
First sequenced DNA genome
Bacteriophage X174 (1977)
First sequenced bacterial genome
H. influenzae (1995)
Genome Steps
Strain –> Strategy –> Chemistry –> Assembly –> Closure and finishing –> data release –> annotation –> publish
Genome Annotation
Find ORFs and ORF function (BLASTX) and other features
Domain Comparisons
Pfam/Prosite
Functional Genomics
Ascribing gene function across a genome
Size involved in
Adaptation
Adaptive Capabilities
Biosynthetic, Stress resistance, Structure, Regulation (sensing and responding)
Mycoplamsa genitalium
Smallest self replicating genome
Borrelia burgdorferi
Small genome with few cellular biosynthetic pathways
Bigger Genomes
LABs, H. influenzae, H. pylori, C. jejuni
Limited Environment
Bigger Genomes
Biggest Genomes
Commensals
Bigger Genomes
E. coli, B. subtilis, P, aeruginosa, Y. pestis, Clostridium, MbT, S. coelicolor
Secondary metabolites
Big Genomes –> antibiotics
Secondary Metabolites for antibiotics
Streptomyces coelicolor
Linear Chromosomes
Bb , S. coelicolor
Multiple chromosomes
V. cholerae
Lots of plasmids
Bb (17 total and 50% genome)
Plasmids show
Replication, decaying genes/mutations, antigenic variation
Has no transposons, IS elements, phages
Campylobacter
Important for organization
Repeats
Replication
OriC (dnaA) and terC
dnaA
Replication initiation protein
Why Mtb vs Bs growth
Strand bias is 55 and 75%
Gene Content
Annotation, Paralogues, Orthologoues
Annotation
Sequence similarity + domain matches
Sequence similarity
Gene families, regulators, transporters, biosynthesis
Paralogue
Same family (homologs) in same genome with different function likely
Orthologue
Homologoues (same familty) in different genomes that might have identical funtions
Most genome sizes
1kb
Conserved hypothetical
ORFans in many genomes
Hypothetical genes
Unique to one genome
Core regions
Shared by close relatives
From mobile genetic elements, plasmid integration, and phages
Flexible regions (genomic islands)
Gain of GI
Gene loss (obligate IC pathogens)
Correlates with microbial lifestyle
Genome organization and content
First genome LAB sequenced
1999 L. lactis –> 2003 L. plantarum
First sequenced megaplasmid
L. salivarious
General LAB features
Protein coding gene varies (1700 to 2000), Genome reduction, all have transposons, Many essential growth plasmids
Clusters of Orthologous Genes of Lactobacillales (LaCOGs)
Genes that cluster as orthologs
Orthologs
Genes from same ancestor
LaCOGs numbers
86% genome, 11% LAB specific, 18% conserved among 12 genomes
Conserved Core regions
The 18% LaCOGs with biochemical activity, uncharacterised genes, markers
LysM and LaCOG012
Markers of Lactobacillales
Bacilli to Lactose ancestor
Changes related to transition to nutritionally rich environments
Gene loss reflects
Similar environmental Pressures
Oenoe, Leume, Pedpe, Strth
Genome Losers
Presence/absence patterns of key enzymes in lactate fermentation
Poor correlation to phenotype observed
Types of transporters
Channel (VIC), 2 carrier (MF), 1 carrier (ABC), Group translocation (PTS)
PTS
Membrane proteins transfer sugar to phosphatase
Dominant transport types in LAB
2 carriers (9) and 1 carriers (2)
Bidirectional Transporters
Nonspecfic or low (5%)
Uptake transporters
aa»_space; sugars > cations/anions > peptides (55%)
Efflux Transporters
Drugs»_space;> peptides > macromolecules (40%)
More of these genes in LAB
Carbons sources and macromolecules
Anions > Drugs
Bli
0 Sugar transporters
Sth
Most transporters (MFS)
Efflux of drugs
Efflux
Drugs then lipids then peptides but excepetions
Most Uptake
amino acids then sugars then inorganic cations but exceptions
Most LABs use
PTS
All genes that can potentially be in a species
Pan or supragenome
Conserved set of genes in every member of that species
Core genome
Core genes
Identity genes essential for life
Dispensable genes
Not essential for survival
Unknowns of genomes
20% or core and 45% of Pan