The Metagenome Flashcards

1
Q

As a recap, define:

  • genomics
  • transcriptomics
  • proteomics
  • metabolomics.
A
  • Genomics: the whole cell gene content
  • Transcriptomics: the whole cell gene expression
  • Proteomics: the whole cell protein content
  • Metabolomics: the whole cell metabolite content
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define metagenomics.

A

Metagenomicsis the study ofgeneticmaterial recovered directly fromenvironmentalor biological systems/compartments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define microbiota.

A
  • the ecological community of commensal and pathogenic microorganisms
  • this includes bacteria archaea, protists, fungi and viruses
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define microbiome.

A

The collective genomes of the micro-organisms in microbiota.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the human microbiome.

A

The microbiome is unique to each individual, even between twins.

Changes in the microbiome have been associated with multiple human illnesses (e.g. Irritable Bowel Syndrome, depression, cancer).

The gut microbiome can classify individuals as lean or obese with >90% accuracy. Early-life gut microbiomes are linked to the development of allergic conditions (e.g. asthma).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Specifically, describe the human stool microbiome.

A

The stool microbiome during Clostridium Difficile Infection (CDI), quite different from healthy stool microbiome. CDI has a greater effect on the stool microbiome than host genetic factors.

Faecal microbiota transplant is able to cure CDI. Restoration of the stool microbiome to that of healthy state is rapid following transplantation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

List the technological approaches to metagenomics.

A
  • Targeted PCR amplification
    (16S rRNA, bacteria - 18S rRNA, eukaryotes): this includes using a single marker that we know has some variation within a population between species. We use this as a ‘proxy’ to find out about the species and organisms in a sample.
  • whole genome shotgun sequencing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe the 16S targeted PCR amplification workflow.

A
  • sample collection
  • DA extraction
  • 16S PCR amplification
  • sequencing
  • analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe the variable regions.

A

The variable regions are conserved within a phylum (genetic group), but they converge within a species.

These regions are used to separate species based on these sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which variable region do we choose (V1-V9)?

A

You would decide it based on the:

  • phylogenetic signal: does the variable region contain enough information to separate the species/genera that you’re interested in separating from you sample?
  • amplicon length: how big can the PCR product be?

Thus, it depends on the sample you’re analysing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What could go wrong with 16S targeted PCR amplification?

A

The method is very sensitive to contamination from the:

  • environment
  • operator
  • reagents

This is especially important for low biomass samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How would you mitigate the potential contamination of 16S targeted PCR amplification?

A
  • randomise the samples: make sure all the conditions and the controls are random on the plate, and that it’s all done at the same time
  • note the batch numbers of the reagents
  • sequence to negative controls: this is to make sure that it’s from your kit
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does 16S targeted PCR amplification cope with hard reads?

A

The choice of variable region determines the resolution. It is less reliable below genus level.
Thus, 16S is hard with long reads.

There is new long read technology that enables full length 16S sequencing:

  • PacBio
  • Nanopore

However, the higher error rates of long read technologies introduce noise. The development for this technology is ongoing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe whole genome shotgun sequencing.

A

Instead of doing a PCR, you’re using the whole DNA - so there is no bias.

By finding out which genes are present, we infer which metabolic pathway is present in the sample. As you compare with other samples, you’re able to look for enrichment, decreasing metabolic pathways between diff environments, niches, patients, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

List some characteristics of whole genome shotgun sequencing.

A

The host cells are often in excess in the sample. There is no amplification step to enrich for bacterial DNA.

It is sample dependent, and the typical yields of contaminating human reads are:

  • faecal: <10% human reads
  • saliva, nasal, skin samples: >90% human reads
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do we enrich the sample without amplification in WGS?

A

PRE-EXTRACTION:

  • differential lysis of mammalian cells
  • enrichs for intact microbial cells
  • potential bias towards gram-positive bacteria

POST-EXTRACTION:

  • enzymatic degradation of methylated nucleotides targets mammalian DNA
  • bias against AT rich bacterial genomes