Microbial Genomics Flashcards

Question

Methods similar to those used for uncovering the core genome were used to uncover the E. coli pangenome. What did we find?

Answer 1

The trend line does not plateau; instead it approaches a straight line sloping upwards.
This is because E. coli (and S. agalactiae) have "open pangenomes", which are effectively infinite in size.

Answer 2

Closed pangenomes (e.g. Yersinia pestis) are finite in size, as the organisms don’t pick up additional DNA as easily.
- With such organisms it is possible to comprehensively characterise all of the genes in the pangenome by sequencing just a few strains.
Open pangenomes are effectively infinite in size, as the organisms pick up DNA easily.

Answer 3

Estimate how many new genes are discovered with each genome sequenced.
For E. coli this plateaus at a non-zero value of around 300 genes, meaning that you can continue to sequence even large numbers of E. coli genomes and will keep on identifying new genes indefinitely.
- No matter how many independent isolates you sequence, on average you will find ~300 new genes in each.
- E. coli has the ability to pick up genes from anywhere and everywhere, so you will always find new genes.
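The "new genes per genome" estimate can be illustrated with a minimal sketch. It assumes each genome has already been reduced to a set of gene names; the gene sets and the function name `accumulation_curve` are made up for illustration. Real analyses usually average over many random orderings of the genomes, which this toy version does not do.

```python
def accumulation_curve(genomes):
    """Return (pangenome_size, new_genes) after adding each genome in order."""
    seen = set()
    curve = []
    for genes in genomes:
        new = set(genes) - seen      # genes not seen in any earlier genome
        seen |= new                  # pangenome so far
        curve.append((len(seen), len(new)))
    return curve

# Toy gene sets (illustrative only, not real gene content)
genomes = [
    {"gyrB", "recA", "lacZ"},
    {"gyrB", "recA", "stx1"},
    {"gyrB", "recA", "eae", "bfpA"},
]
curve = accumulation_curve(genomes)
for n, (pan_size, new) in enumerate(curve, start=1):
    print(f"genome {n}: pangenome = {pan_size}, new genes = {new}")
```

For an open pangenome the "new genes" column settles at a non-zero value; for a closed pangenome it falls towards zero.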

Answer 4

Illumina is also an example of sequencing by synthesis.
Illumina can sequence millions of molecules simultaneously (massively parallel sequencing).
Reads aren’t very long: ~100bp for each read vs ~800bp in Sanger sequencing.
- This makes it harder to piece together the genome.

Answer 5

The adaptor end of a single template molecule hybridises to a primer sequence attached to the surface, and the opposite end can fold over to hybridise to an adjacent primer.
Addition of DNA polymerase allows the production of a second copy of the template.
Both of these copies can repeat the process, and this continues through multiple cycles until there is a cluster of identical molecules.
Across the surface there will be millions of clusters, each representing a different fragment of the original genome.

Answer 6

Synthesising the complementary strand using fluorescently-labelled nucleotides.
Reversible terminator nucleotides: the fluorescent label is cleaved off after imaging to allow chain extension to continue.

Answer 7

Chunks (contigs), which still require finishing.
Contigs can be placed into order by comparison with a closely-related complete genome.

Answer 8

Finishing is more expensive than generating a draft

Answer 9

The advent of third generation sequencing (Oxford Nanopore/PacBio)

Answer 10

A bacterial genome-wide association study (GWAS) identifies genes associated with particular phenotypes.
By looking for genes which were over-represented in strains from a particular host, they were able to identify vitamin B5 biosynthesis as a host-specificity factor.
Strains from chicken will only grow in the presence of vitamin B5, since they lack the genes necessary to synthesise it; it was suggested that this was an adaptation to the diet of the host.
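One simple way to test whether a gene is over-represented in strains from one host is a one-sided Fisher's exact test on a 2x2 table. This is a minimal sketch rather than the method used in the study: the counts are invented and the function name `fisher_enrichment_p` is hypothetical. Real bacterial GWAS also has to correct for population structure and multiple testing.

```python
from math import comb

def fisher_enrichment_p(a, b, c, d):
    """One-sided Fisher's exact test for the 2x2 table

                 gene present   gene absent
        host 1        a              b
        host 2        c              d

    Returns the probability of seeing at least `a` gene-positive strains
    in host 1 if the gene were unrelated to host.
    """
    host1 = a + b                    # strains sampled from host 1
    present = a + c                  # strains carrying the gene
    n = a + b + c + d
    total = comb(n, host1)
    p = 0.0
    for k in range(a, min(host1, present) + 1):
        p += comb(present, k) * comb(n - present, host1 - k) / total
    return p

# Invented counts: gene found in 5/5 strains from host 1 and 0/5 from host 2
p_strong = fisher_enrichment_p(5, 0, 0, 5)
# Invented counts with no obvious association
p_weak = fisher_enrichment_p(2, 3, 3, 2)
print(p_strong, p_weak)
```

A small p-value flags the gene as a candidate host-specificity factor that then needs experimental follow-up, such as the growth tests described above.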

Answer 11

Basis of motility, metabolic profile and clinical manifestation

Answer 12

Serotyping involves raising antibodies against particular features on the cell surface, and seeing whether they cross-react between different strains.
If they cross-react (recognise both strains) then the strains are taken to be closely related, i.e. the same serotype.
The O (lipopolysaccharide) antigen, H (flagellar) antigen and K (capsular) antigen are useful for distinguishing between strains.

Answer 13

Particular serotypes were associated with outbreaks.
Enteropathogenic E. coli (EPEC).

Answer 14

Interactions with host cells.
- Characteristic attaching and effacing (A/E) lesions in the ileum.

Answer 15

Enterohaemorrhagic E. coli (EHEC).
Enterotoxigenic E. coli (ETEC).
Two other pathovars are defined as distinct from EPEC based on their adherence pattern on HEp-2 cells:
- Enteroaggregative E. coli (EAEC)
- Diffusely adherent E. coli (DAEC)

Answer 16

EPEC - Forms tight clusters.
EAEC - Forms a "stacked-brick" pattern, with cells adhering to each other.
DAEC - Defined by a diffuse adherence pattern; bacteria are scattered across the host cell surface rather than clustered with each other.

Answer 17

DNA-DNA hybridisation.
DNA strands from 2 strains are hybridised together, and by measuring the temperature required to dissociate (melt) the hybrid DNA into separate strands, it is possible to estimate the degree of relatedness.
- More similar sequences mean more base pairing, so a higher melting point.

Answer 18

If an enzyme is closely related between organisms, it will show similar electrophoretic mobility; variations in sequence may affect enzyme mobility. This is the concept behind multilocus enzyme electrophoresis (MLEE).

Answer 19

Involves assessing the electrophoretic mobility of a series of purified enzymes; the mobility of the different bacterial enzymes is compared to distinguish strains.
Produces quantitative molecular data which can be used to understand evolutionary relationships between strains.
Early studies showed that serotyping doesn’t correlate well with genetic diversity as measured using MLEE: genetically similar strains can have different serotypes, and distantly related strains can share the same serotype.

Answer 20

Standard reference collection of 72 E. coli strains, developed via MLEE.
Chosen to represent the full diversity of the species:
- Electrophoretic diversity
- Geographical distribution
- Host range; many of the selected strains originate from animals

Answer 21

A, B1, B2, D and E

Answer 22

Some strains showed different evolutionary relationships when different genes were analysed.
This suggested the possibility of recombination (horizontal gene transfer) between different lineages of E. coli.

Answer 23

Convergent evolution of their defining characteristics

Answer 24

Amplification and sequencing of ~400bp sections of 7-8 housekeeping genes distributed around different chromosomal regions.
Housekeeping genes are involved in ‘day-to-day’ functions like metabolism and energy production; they are less likely to undergo recombination.
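The typing step that follows sequencing can be sketched as a pair of lookups: each locus sequence maps to an allele number, and the combination of allele numbers (the allelic profile) maps to a sequence type. Everything below is a toy assumption: three loci instead of 7-8, 5bp "fragments" instead of ~400bp, and invented allele numbers and sequence type labels.

```python
# Toy subset of loci; real schemes use 7-8 housekeeping genes
LOCI = ("adk", "gyrB", "recA")

# Hypothetical allele database: sequence -> allele number, per locus
ALLELES = {
    "adk":  {"ATGCA": 1, "ATGCT": 2},
    "gyrB": {"GGCTA": 1, "GGCTT": 2},
    "recA": {"TTAGC": 1, "TTAGG": 2},
}

# Hypothetical profile database: allelic profile -> sequence type label
SEQUENCE_TYPES = {(1, 1, 1): "ST-A", (2, 1, 2): "ST-B"}

def type_strain(fragments):
    """Return (allelic profile, sequence type) for one strain.

    An unrecognised sequence gives None in the profile, and an
    unrecognised profile is reported as "novel".
    """
    profile = tuple(ALLELES[locus].get(fragments[locus]) for locus in LOCI)
    return profile, SEQUENCE_TYPES.get(profile, "novel")

profile, st = type_strain({"adk": "ATGCT", "gyrB": "GGCTA", "recA": "TTAGG"})
print(profile, st)
```

Because each distinct sequence is just a numbered allele, a single base change and a recombined segment count equally as "a different allele" when profiles are compared.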

Answer 25

Both showed evidence of multiple origins.
2 separate clades of EHECs and EPECs.
- These had parallel acquisition of virulence determinants, e.g. virulence plasmids and toxin genes.
Suggests that the genetic requirements for each type of pathogenesis (e.g. genes for the type III secretion system) can be acquired independently on multiple occasions.

Answer 26

Genomes all showed large numbers of gene deletions.
Some of the gene deletions characteristic of Shigella had occurred by different mechanisms in the different species.
This suggested convergent evolution towards a Shigella phenotype: obligate pathogens of humans.

Answer 27

Core genome phylogenetics

Answer 28

5 "cryptic clades" of E. coli; C-I to C-V

Answer 29

Clade C-I was closely related to, but outside the divergence of, existing E. coli strains.
C-II, C-V and the sister clades C-III/C-IV were more divergent, showing a similar evolutionary distance from E. coli as other species.
- Later studies determined that clades C-II, C-III/IV and C-V were sufficiently divergent to be defined as new Escherichia species.

Answer 30

Most of the 16S rRNA gene is very highly conserved, meaning it is possible to reliably amplify it by PCR with primers binding the conserved regions.
The variable “V-loops” (yellow in the diagram) can change sequence without disrupting the function of the RNA, so they show lower levels of conservation.
- Primers in conserved regions are used to amplify and sequence the variable regions, which are phylogenetically informative and useful for species identification.

Answer 31

Woese applied 16S rRNA sequencing to define a new kingdom; “Archaebacteria” (archaea)

Answer 32

99% of bacteria, which cannot be cultured in the laboratory.
These were only uncovered because of 16S sequencing.
They are likely to include strains which produce novel antimicrobial compounds or enzymes of potential biotechnological interest.
Studying these organisms can also give us insight into the biodiversity and ecology of different environments.

Answer 33

Single cell genomics.
Individual cells are isolated, e.g. by laser capture microdissection (cutting out an individual cell), or separated into different containers via microfluidics or cell sorting (FACS).
Genomic DNA is extracted from each cell and amplified.
The amplified DNA is sequenced and assembled.
- However, amplification is usually uneven, and the assembled genomes will often have patchy coverage, with some regions underrepresented.

Answer 34

It is a way of culturing organisms within their natural environment.
Separates individual cells into separate wells.
Wells are filled with molten agar and covered with a semi-permeable membrane (prevents contamination), and the device is placed back into the environment the sample was obtained from.
The environment provides essential nutrients and allows a colony to grow from a single cell, providing enough material for DNA sequencing.

Answer 35

Requires deep sequencing; need to sequence lots and lots of PCR products.
PCR primers may not be truly universal.
PCR bias may result in inaccurate quantification; depending on the sequence, some templates may amplify better or worse.
Contamination can be a problem, as PCR is sensitive.
Sequencing errors may result in over-estimation of the diversity of organisms present; we may mistakenly think we’ve discovered new species.
Some organisms have multiple distinct copies of the 16S rRNA gene, again leading to over-estimation of the number of species present.
Only the 16S gene is examined, which only tells us about species diversity; we don’t know whether an organism is pathogenic, commensal, etc.

Answer 36

Sequence all genomic DNA obtained from an environment, rather than just targeting the rRNA genes as in 16S rRNA sequencing.
DNA is extracted from an environmental sample, fragmented and sequenced on e.g. an Illumina instrument.

Answer 37

Attempt to identify the species from individual reads using software such as Kraken, which allows us to quickly compare individual sequence reads to a database and identify the species.
De novo metagenome assembly: try to assemble our reads into larger contigs.
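The idea of comparing a read to a database can be sketched with exact k-mer matching. This is a heavily simplified illustration and not Kraken's actual algorithm or interface: the reference sequences, the species labels, the value of k and the voting rule are all assumptions made for the example.

```python
from collections import Counter

def kmers(seq, k):
    """All distinct substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def build_index(references, k):
    """Map each k-mer to the set of species whose reference contains it."""
    index = {}
    for species, seq in references.items():
        for km in kmers(seq, k):
            index.setdefault(km, set()).add(species)
    return index

def classify(read, index, k):
    """Assign a read to the species with the most species-specific k-mer hits."""
    votes = Counter()
    for km in kmers(read, k):
        owners = index.get(km, ())
        if len(owners) == 1:         # ignore k-mers shared between species
            votes[next(iter(owners))] += 1
    return votes.most_common(1)[0][0] if votes else None

# Toy reference "genomes" (invented sequences)
references = {
    "species_A": "ATGGCGTACGTTAGC",
    "species_B": "TTGACCGGATCCAAT",
}
K = 5
index = build_index(references, K)
hit = classify("GCGTACGTT", index, K)
miss = classify("CCCCCCC", index, K)
print(hit, miss)
```

Reads with no matching k-mers stay unclassified, which is one reason de novo assembly is used alongside read-level classification.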

Answer 38

There are fewer fragments that need assembling

Answer 39

Genome sequences assembled from microbiome samples

Answer 40

The polymerase is immobilised at the bottom of a small aluminium well.
Fluorescently-labelled nucleotides are incorporated into the chain.
- However, the well is so small that light can only penetrate a small zone at the bottom (the well is known as a zero-mode waveguide, ZMW).
A fluorescently-labelled nucleotide being incorporated is held within the illuminated zone for a prolonged period, producing a stronger fluorescent signal than free nucleotides in solution, which diffuse into and out of the ZMW.
This allows the incorporated base to be identified by the colour of its label; the action of the polymerase cleaves off the fluorescent label and allows the chain to be extended.

Answer 41

A motor protein unwinds the two strands of a DNA molecule and feeds one through a pore protein embedded in an artificial, electrically insulating membrane.
The electrical current through the pore changes (with each base) as the DNA strand passes through, and the signal is characteristic of the bases in the pore; the resultant “squiggle” can be converted into a DNA sequence.
Reads can be as long as the DNA molecule itself; if we keep the DNA intact then we can generate very long sequence reads.

Answer 42

Large collaborative effort to characterise the composition of the human microbiome.
Different micro-environments show considerable variation in the composition of the bacterial populations present.
The microbiome has been shown to influence non-infectious conditions, e.g. obesity and asthma.

Answer 43

Strong link between obesity and the gut microbiome in both mice and humans.
The trait is transmissible.
- Germ-free mice transplanted with “obese microbiota” show a significantly increased level of total body fat.

Answer 44

Higher rates of allergies and other immune conditions.
Early colonisation of infants during the first few months of life is important for future health.

Answer 45

Because these studies involve PCR amplification, they are highly sensitive and prone to contamination

Answer 46

Sequencing a blank water control revealed that bacterial DNA is commonly found as a contaminant in DNA extraction reagents.
If there is not much DNA in the actual sample, then the contaminant DNA can be amplified and sequenced, and could be misinterpreted as evidence for the presence of microbial life in a sterile sample.

Answer 47

Perform appropriate controls to ensure that the conclusions of a study are not influenced by contamination.
It is sensible to put blank controls through the same DNA extraction and sequencing processes as the experimental samples.
It would also be sensible to use kits from different suppliers to confirm any conclusions about the microbial content of the samples.
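One way a blank control might be used downstream is to flag taxa whose read counts in a sample are not clearly higher than in the blank. The sketch below is only an illustration: the taxon names, the counts, the function name `flag_contaminants` and the 10-fold threshold are all assumptions, not an established standard.

```python
def flag_contaminants(sample_counts, blank_counts, min_ratio=10.0):
    """Split taxa into (kept, suspect) using a blank control.

    A taxon is treated as a suspected contaminant if it appears in the
    blank and its sample count is less than min_ratio times its blank count.
    """
    kept, suspect = {}, {}
    for taxon, count in sample_counts.items():
        blank = blank_counts.get(taxon, 0)
        if blank > 0 and count < min_ratio * blank:
            suspect[taxon] = count
        else:
            kept[taxon] = count
    return kept, suspect

# Invented read counts per taxon
sample = {"taxon_A": 5000, "taxon_B": 40, "taxon_C": 12}
blank = {"taxon_B": 35, "taxon_C": 1}
kept, suspect = flag_contaminants(sample, blank)
print(kept, suspect)
```

In a low-biomass sample most taxa may end up in the suspect group, which is itself a warning that the sample could be effectively sterile.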

Answer 48

Closed - Small accessory genome.
Open - Large accessory genome.
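Given gene content per strain, the core and accessory genomes can be separated with set operations. This is a minimal sketch assuming each strain is already represented as a set of gene names; the gene sets and the function name `split_pangenome` are invented for illustration.

```python
def split_pangenome(genomes):
    """Return (core, accessory) gene sets for a list of per-strain gene sets."""
    pan = set().union(*genomes)              # every gene seen in any strain
    core = set.intersection(*map(set, genomes))  # genes present in every strain
    return core, pan - core

# Toy strains (illustrative gene content only)
strains = [
    {"gyrB", "recA", "lacZ"},
    {"gyrB", "recA", "stx1"},
    {"gyrB", "recA", "eae", "bfpA"},
]
core, accessory = split_pangenome(strains)
print(sorted(core), sorted(accessory))
```

An open pangenome shows up as an accessory set that keeps growing as strains are added, while the core shrinks slowly towards a stable set.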

Answer 49

Provide functions that are context specific, i.e. beneficial in specific environments.
- Resistance
- Pathogenicity

Answer 50

Gene gain, e.g. through mobile genetic elements and horizontal gene transfer.
Gene maintenance.
Gene loss.

Answer 51

Uptake of DNA from the environment into bacteria.
DNA is incorporated via homologous recombination and used to ‘overwrite’ the cell’s own copies.
Not a major contributor to novelty; mainly picks up genes similar to ones the bacterium already has.

Answer 52

Bacterial DNA packaged into phage particles which then infect a new host, transferring bacterial DNA into recipient

Answer 53

DNA encoded on conjugative elements is copied and transferred between bacteria via a pilus.
The elements are plasmids or ‘integrative and conjugative elements’ (ICEs).

Answer 54

Genes are more likely to be functional if the new bacterium is related to a previous host bacterium.
Maintaining novel genes can be costly:
- Regulatory disruption
- Negative epistatic interactions

Answer 55

Over time the gene adapts to its new environment/host via compensatory mutations; the gene becomes regulated and integrated into the new host, reducing its cost.
There are methods of silencing genes so that they cost less, e.g. proteins that bind and repress certain genes.

Answer 56

Very beneficial/essential in specific environments, e.g. resistance.
In other environments they are costly, e.g. in the absence of the antibiotic/toxin.

Answer 57

Fitness increases as the resistance kicks in

Answer 58

Genes may be very beneficial in one place, while in other places they have no benefit or are even costly.
Therefore they may not be lost immediately upon becoming costly.

Answer 59

Neutral theory: genes are gained and lost at random (genetic drift).
Adaptive theory: the genome is shaped by selection for environmentally specific genes.

Answer 60

Drift is a process by which variation is lost due to random chance events.
A gene is more likely to be lost to drift where populations are bottlenecked to small sizes.
The more diverse the organism (diversity correlates with population size), the more accessory genes it has (genome fluidity).

Answer 61

Variability correlates with metabolic capability; open-pangenome organisms can easily pick up genes, making them more adaptable.
As an organism acquires more accessory genes, it gains additional biosynthetic potential and is more likely to survive in a wide range of environments.

Answer 62

Changes what cells can gain, but also what is dispensable in that environment.
Causes great variation.

Answer 63

Diet can have both short- and long-term influences on the microbiome.
Prebiotic (promote bacterial growth) and probiotic (contain bacteria) drinks can promote changes.
Antibiotics can have an impact, with broad-spectrum antibiotics killing many bacteria in the body.
- C. difficile can then cause opportunistic infection.
- Faecal transplant can help reduce these infections by replacing the microbiome.


(87 cards)