The Metagenome Flashcards

1
Q

What is metagenomics?

A

The study of genetic material recovered directly from environmental or biological systems/compartments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define the term “Microbiota”

A

Ecological community of commensal and pathogenic microorganisms - Includes bacteria archaea, protists, fungi and viruses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define the term “Microbiome”

A

Collective genomes of the microorganisms in these communities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the variation in the human microbiome between individuals mean for microbiome studies?

A
  • Varuation makes it very hard to conduct microbiome studies of patients with a particular condition
  • Means you will need 100’s of people involved in any particualr study just to observe differences in the microbiome that might be associated with the particualr disease you’re studying
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give some examples of diseases/conditions associated with changes in the microbiome?

A
  • Irritable Bowel Syndrome
  • Depression
  • Cancer
  • Asthma
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the theory behind chnages in the microbiome leading to emotional changes in a person?

A
  • Idea of the gut-brain axis in which bacteria in the gut produce neurochemicals via neurotransmitters which travel up the gastrointestinal tract and can have an effect on the brain
  • These effects can include changes in emotion/mood
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Is the relative abundance of bacteria or the genetic content of those bacteria thought of as more important on the effect of the human microbiome on health?

A
  • Genetic content of the bacteria within the microbiome more important
  • Certain bacteria will have certian metabolic pathways which will contribute to the health of that person
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 2 technological methods used to study metagenomics?

A
  • Targeted PCR amplification
  • Whole genome shotgun sequencing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are some of the characteristics of the 16S rRNA gene?

A
  • Is around 1500 bases long
  • It’s split into about 9 variable regions interspaced with conserved regions
  • You can determine the different species present within a bacterial sample by looking at what variable regions are present within the sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explain the process of 16S PCR amplification?

A
  1. You first collect your bacterial sample
  2. Then you extract the DNA from the 16S rRNA gene of all the bacteria present within the sample
  3. Then you conduct a 16S PCR amplification on the DNA extracted - Each amplified gene sequence will be represented by a different colour
  4. You then place the products of the PCR reaction into a sequencing machine which will generate sequences that are complementary to the amplified products of the PCR reaction
    * ​​These generated sequences should also match the bacterial content of the original sample.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can data from 16S PCR amplification be used to identify the quantity of each species of bacteria within a particular sample?

A
  • The generated short-read sequences are compared to a 16S database - this is a database containing 16S genes from all types of bacterial strains
  • By matching the generated short-read sequences with the sequences from the database you can determine how many of any particular bacteria was present within the sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do the read lengths of the sequencing technologies used for 16S PCR amplification mean for process of sequencing the 16S rRNA gene.

A

The read lengths mean that these technologies aren’t able to sequence the entire length of the 16S rRNA gene which means only certain parts of the gene can be amplified and sequenced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some of the common regions of the 16S rRNA gene that are sequenced during PCR amplification?

A

Commonly sequenced regions include:

  • V1-V2
  • V1-V3
  • V3-V4
  • V3-V5
  • V4
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are some of the factors that decide which variable regions of the 16S rRNA gene are sequenced during 16S PCR amplification?

A
  • Phylogenetic signal
  • Amplicon length
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can phylogenetic signal be used to decide which variable regions are sequenced during 16S PCR amplification?

A
  • Phylogenetic trees can be created based on different variable regions of the 16S rRNA gene within different species of bacteria
  • They can be used to decide which variable regions you sequence based on how distinct the different variable regions are for a set of bacteria.
  • If you’re looking for a particular type of bacteria within a microbiome smaple you would sequence the variable regions in which that type of bacteria has a distinct difference in phylogenetic tree compared to other bacteria so it’ll be easier to identify
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can amplicon length (read length) be used to decide which variable regions are sequenced during 16S PCR amplification?

A
  • Different sequencing technologies produce different read lengths which may lead to errors being present within the final PCR product depending on the length of the variable regions sequenced.
  • E.g. Illumina MiSeq technology produces 2 x 300bp reads from each end of the sequence
  • If you sequenced V1-V3 regions of rRNA gene because of its length you would end up with a small section of that V1-V3 sequence will be amplified twice during PCR
  • During PCR amplification DNA polymerase will produce errors in the amplified DNA.
  • Only in the part of V1-V3 sequence that’s amplified twice will those errors be corrected because the polymerase has a template (section that’s been amplified twice) that can be used to correct the errors so they don’t end up in final PCR product
  • If you amplified V4 using Illumina MiSeq because it’s a shorter sequence than V1-V3 the whole sequence is amplified twice during PCR
  • This means any errors throughout the whole sequence can be corrected by DNA polymerase so no errors will be present within PCR product.
17
Q

What are the advantages and disadvantages of long read technologies compared to short read ones in reference to 16S PCR amplification?

A
  • Advantage - Unlike short read technologies these technologies are now being used to sequence the entire 16S rRNA gene (V1-V9)
  • Disadvantage - They have have higher error rates so more sequencing errors likely to be present in final PCR product
18
Q

What does the fact that all bacteria contain the 16S rRNA gene mean for potential contamination of a smaple used for 16S PCR amplification?

A
  • It means that the method used for sequencing the 16S rRNA gene is very sensitive to contamination.
  • This is because any bacteria that contaminate the microbiome sample extracted for 16S PCR amplification will have DNA from the 16S rRNA gene that wasn’t supposed to be in the sample
19
Q

Why is contamination particularly a concern for low biomass samples?

A
  • Because the contaminated part of the sample will make up a much larger proportion of the sample as a whole.
20
Q

What are some ways of mitigating the risk of contamination?

A
  • Randomise samples - prepare test samples and control samples randomly over a no. of different days
  • Note batch numbers/contents of reagents
  • Sequence negative controls
21
Q

Explain the process of whole genome shotgun sequencing?

A
  • Process is very similar to that of 16S rRNA sequencing with the differences being that instead of extracting DNA from the 16S rRNA gene you extract DNA from the entire genome
  • Also once you’ve extracted the DNA instead of doing 16S PCR amplification you just sequence the entire genome
22
Q

How can data from whole genome shotgun sequencing be analysed?

A
  • You can assemble the short sequence reads produced to figure out how the genomes of each of the bacteria sequenced look like
  • You can then use the sequence reads to create phylogenetic trees which look at taxonomic diversity between species (how genetically different 2 species of bacteria are)
  • You can also look at which genes are present within the sample as you are sequencing the entire genome - this can then be used to look at metabolic pathways
23
Q

What problem is caused by the fact that during whole genome shitgun sequencing you don’t amplify the bacterial DNA that’s extracted?

A
  • During DNA extraction from a sample you extract the DNA of the bacteria within that sample as well as the DNA of the host cells of the person that the sample came from
  • No amplification step means that the bacterial DNA isn’t enriched so the host cell DNA is often in excess in the sample
  • This means that you would just sequence the DNA of the host cells present rather than sequencing the DNA of the bacteria present within the sample
24
Q

What are some of the ways that bacterial DNA can be enriched without amplifying it?

A

Pre-extraction

  • Differential lysis of mammalian cells - lyse human cells and don’t lyse bacterial cells as they have different cell wall compositions.

Post-extraction

  • Enzymatic degradation of methylated nucleotides (these are present in mammalian DNA but not bacterial DNA)
25
Q

What are the issues with the methods used to enrich bacterial DNA in whole genome shotgun sequencing?

A
  • Differential lysis - Bias towards gram-positive bacteria compared to gram-negative bacteria which have different cell wall compositions which mean the gram-positive bacteria are better able to survive the differential lysis process
  • Enzymatic degradation of methylated nucleotides - Bias against AT rich bacterial genomes