Lecure 20 Flashcards

1
Q

Proire to genetic sequencing technologies, _____ discovery took a long time

A

Pathogen

E.g HIv was first discovered in 1981 but we know it existed before that

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How did we know HIV existed before it was discovered

A
  • look at the root of the tree root
    Can calibrate the time of divergence by looking at the time the samples were taken and estimating the mutation rate - the branch of the phylogenetic trees representing divergence over time
  • draw linear regression from clusters and where it crosses the X axis is when HIV had emerged into the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Pathogen discovery was revolutionaised when gene sequencing was introduced

A
  • used to have to grow the virus in the lab to see what was causing the disease (but most viruses are not culturable - dont know the right conditions)
  • can use consensusPCR to test for presence of a particular virus, consesus PCR is when you can take a conserved region of that viruses genome and test for its presence in an assay (but can’t discover new pathogens, only when u know what ur looking for)
  • metagenomics allows you to look at the whole tree

This is kinda yap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why use genomic sequencing ?

A
  • microbes can be difficult to culture
  • a (sort of) unbiased way to study ALL microbes in a sample, not just those u can culture (our ability to recognise what is in there is hindered)
  • allows us to understand the structure and function of microbial communities
  • it is realivelt cheap
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the infectome

A

What we call the way of studying an entire microbial community

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the contents of the infectome

A

Bacteria + archaea + fungi + viruses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What kind of sequencing can u use to target sequencing the bacteria+ archea part of the infectome

A

16s rRNA sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What kind of sequencing can u use to target sequencing the fungi part of the infectome

A

18S rRNA sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What kind of sequencing can u use to target sequencing the whole infectome

A

Metagonomic / metatrasncriptomic sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the difference between metagenomics and metatranscriptiomics?

A

Metagenomcis refers to DNA sequencing where as metatransctiptiomics refers to RNA sequencing

Metagenoics will tell u anything that had a DNA genome that is there
But a lot of DNA don’t transcribe DNA- only RNA so if we use metatransciptomics it will tell us anything there that is transcribing - including RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does infectome analysis using metagenomics/ metatranscriptomics tell us?

A

Who’s there
- toxonomic classification
- population analysis

What can they do?
- functional analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What sequencing gives us an understanding of community structure - if its bacteria

A

16S rRNA sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What gives us a better understanding of the metabolic potential of a community?

A

Metagenomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is metatranscriptomics?

A
  • total RNA sequencing (RNASeq)
  • it gives us the gene expression of microbes within natural environments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does metatranscriptomics allow us to do?

A

Allows us to obtain whole gene expression profiling of complex microbial communities

17
Q

What is the difference between metagenomics and metatranscriptomics?

A

Metagenomics = genetic content, identifying microbes present within a community

Metatranscriptomics = diversity of the active genes within such community, quantify their expression levels and monitor how these levels change in different conditions

18
Q

What is the advantage of metatranscriptomics over metagenomics

A

It can provide information about the differences in the active functions of microbial communities which appear to be the same in terms of microbe composition

(This is important as u may have a diseases and heathy population and they may have the same community composition however if you do metatransciptomics diseased may have a much higher gene expression of a particular thing)

19
Q

Metatransciptomics can be used to study the…

A

Virosphere (but also all the parasites, bacteria, fungi = infectome) as well as the host gene expression

20
Q

Things to keep in mind - caveats

A

There is no standard way to produce metatranscripomical date
- thus if we want to compare studies of different samples we have to understand the caveats:

E.g
- sewuwncing platforms may vary
- coverage/depth may vary
- sample collection and preservation (RNA is very degradable)
- RNA/DNA extraction methods
- enrichment or depletion steps
- one sample is representive of a single time point

21
Q

Sequence analysis : assembling the metagenomics - De novo assembly…

A

putting short sequence reads together to reconstruct longer sequences (contigs)

22
Q

What are contigs?

A

Continuous sequences from shorter, overlapping reads

23
Q

Once you’ve done de novo assembly, what do you do next?

A

Annotating: once assembled, ideniftying where the contigs came form using sequence homology

OR

Map sequence reads to a reference sequence (if you know what your looking for)

24
Q

De novo assembly vs mapping

A
25
Q

Pros and cons of sequence assembly methods

A
26
Q

Challenges for assembly of microbial communities

A
  • communities are more biologically complex then individual microbes
  • the presence of different strains of the same species can result in fragmented reconstructions
  • some sequence segments are repeated within the same organism or shared between distinct organisms e.g endogenous viral elements
  • when assembling genomes, we typically assume that sequence coverage is uniform
  • but the coverage of each community member depends on the abundance of its genome in partial genomes
  • low abundance community members may result in partial genomes
  • coverage/ depth will determine how well you can characterise low abundance community members (ie. need enough overlapping reads)
27
Q

The benifet of using metagenomics / metatranscriptomics approach is the at we dont have to know what is in the sample….

A

But this means we dont know from what organism each sequence is derived

28
Q

The benifet of using metagenomics / metatranscriptomics approach is the at we dont have to know what is in the sample….

A

But this means we dont know from what organism each sequence is derived

29
Q

Limitations of metagenomic approaches

A
  • many genetic sequences are left not annotated because we don’t know what protein they encode

(The info in databases are biased towards viruses that infect humans rather than wildlife)

(This is the biased part - you can generate them unbiasdly but we we don’t know what most of them are - thus making it biased)

  • available microbial genomes are biased towards model organisms, pathogens and cultivable microorganisms meaning that our discovery of novel things will likely also be biased
30
Q

The dark matter of metagenomic data

A

(Contigs we have produced and dont know what they are)

  • annotating metagenomes relies on a reference database but we can only annotate sequences that share sequence homology above a certain threshold
  • there are still many transcripts that do not share any sequence homology to anything known

(Grey don’t match host or anything we knows genome - probs virus as they mutate so fast)

31
Q

The dark matter of metagenomic data

A

(Contigs we have produced and dont know what they are)

  • annotating metagenomes relies on a reference database but we can only annotate sequences that share sequence homology above a certain threshold
  • there are still many transcripts that do not share any sequence homology to anything known

(Grey don’t match host or anything we knows genome - probs virus as they mutate so fast)

32
Q

The metagenomic identification of viruses is currently limited to those with sequence similarity to known virus the rest is identified as dark matter

A

Highly divergent viruses that comprise the “dark matter” of the virus-here remain challenging to detect
- AI related approaches - deep learning algorhytims have had a huge impact on protein structure prediction

Can we use AI to deft dark matter?

33
Q
A

Virus closely related to to human rubella virus

34
Q

The more microbes we discover, the more we understand about their potential to cause disease

A
35
Q
A
36
Q
A
37
Q
A
38
Q
A