Metagenomics and binning Flashcards

1
Q

Functional metagenomics

A

Look for one gene in an environment.

Fragment DNA, ligate into vectors, transform into host, screen, amplify and sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sequence based metagenomics

A

MGWGS. All organisms and viruses can be found, sensitive to strand variation. Fragment, libprep and seq.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

16s rRNA amplicon seq

A

Only look at rRNA to see what you have. Amplify rRNA and seq.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe the MG workflow

A

Preprocess
Align or assemble
After assembly, one can predict genes, bin and align to ref

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the problems of MG seq?

A
Extreme depth is needed
Low coverage
Kmer correction not possible since many different things are present. 
Optimal kmer varies between species 
Read errors cannot be corrected
Large data sets
No N50
Gene prediction is difficult since different organisms have different fingerprints
What even is bacterial species?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is binning?

A

Sorting of reads that “belong” together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How to bin?

A
Alignment based (does it look like something we know from a reference? Remove low-quality hits)
Composition based (small fingerprints in DNA, fast and easy but not so accurate, needs long DNA to do statistics, no-one knows how it works)
Co-abundance based ("copy number", needs cn variation)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly