Module 7 - Microbial Genomics Flashcards
What is genomics?
Methods to study the entire genome of a microbe
Why is genomics important?
It provides insight into evolutionary relationships and detection of unknown organisms
What does introduction of DNA into microbes often lead to?
Gene disruption, allowing for studying loss-of-function on phenotype
What is a genome?
An organism’s complete set of DNA (including all genes)
What is genomics?
The collective characterization and quantification of genes (entire genome)
What does genomics focus on?
The structure, function, evolutionary mapping, and editing of genomes
What needs (3) were created based on genomics?
- Improved DNA sequencing techniques
- Formats for storage of very large data sets
- Tools for analysis of large data sets
When was DNA sequencing first developed?
1970
What is sequencing?
The process of determining nucleic acid sequence
What did Walter Gilbert do?
Developed a chemical degradation method of DNA sequencing
When was Sanger sequencing developed?
Around 1977 (the same time as Gilbert sequencing)
What did Frederick Sanger do?
Developed an enzymatic method using DNA polymerase
What is another name for Sanger sequencing?
Dideoxy sequencing, or chain termination sequencing
What are the three steps of Sanger sequencing?
- Clone a gene or fragment of DNA of interest
- Synthesize DNA with DNA polymerase
- Use electrophoresis to separate fragments of DNA at different lengths
What is Sanger sequencing based on?
The fact that DNA polymerase requires a free 3’ OH group to continue DNA synthesis
What holds the two strands of DNA together?
Hydrogen bonds
What holds two consecutive nucleotides together?
Phosphodiester bonds (between 5’ phosphate group and 3’ hydroxyl group)
What is the significance of the 3’ OH group?
It is essential for chain elongation
What is a nucleotide with a 3’ H called?
A dideoxyribonucleoside triphosphate (ddNTP)
What is the principle of dideoxy sequencing?
ddNTPs will stop chain elongation
How is the DNA sequence determined in Sanger sequencing?
By detecting labeled nucleotide at the end of each fragment separated by gel electrophoresis
What are the components (4) of the reaction mixture for Sanger sequencing?
- A template DNA to be sequenced
- A short oligonucleotide primer
- DNA polymerase enzyme
- All 4 dNTPs
How is the primer designed in Sanger sequencing?
To be complementary to the vector sequence
Why is the primer labeled in Sanger sequencing?
To allow for ease of detection of DNA pieces
How is the Sanger sequencing reaction carried out?
In 4 tubes
What is different in each tube in Sanger sequencing?
The specific ddNTP used (ddATP, ddTTP, ddCTP, and ddGTP)
What happens when a ddNTP is incorporated in a reaction in Sanger sequencing?
DNA synthesis is terminated
What happens when a dNTP is incorporated in a reaction in Sanger sequencing?
The DNA chain elongation will continue
What determines how many products will be in each reaction tube in Sanger sequencing?
Based on the number of times a specific nucleotide appears in the template DNA sequence (number of incorporation points)
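The four-tube idea can be sketched in a few lines of Python. This is a simplified illustration, not a lab protocol: it assumes the input is the newly synthesized strand written 5' to 3', ignores the primer length, and the function names are hypothetical.

```python
def termination_fragments(new_strand):
    """Return, per ddNTP tube, the lengths of fragments that end in that base."""
    tubes = {"A": [], "C": [], "G": [], "T": []}
    for position, base in enumerate(new_strand, start=1):
        # A fragment of this length appears in the tube whose ddNTP matches the base.
        tubes[base].append(position)
    return tubes


def read_gel(tubes):
    """Read the gel from the shortest fragment (bottom) to the longest (top)."""
    lanes = {length: base for base, lengths in tubes.items() for length in lengths}
    return "".join(lanes[length] for length in sorted(lanes))


tubes = termination_fragments("AGTCT")
print(tubes)            # number of products per tube = occurrences of that base
print(read_gel(tubes))  # the synthesized strand, recovered fragment by fragment
```

Each tube holds one product per occurrence of its base, which is why the product count depends on how often that nucleotide appears.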
How does a gel separate DNA fragments?
Larger fragments are at the top, while smaller fragments are at the bottom
What types of labels can be used in Sanger sequencing?
Radioactive labels, or fluorescent labels
How come fluorescent labels are used more frequently than radioactive labels?
Fluorescent labels are safer, cheaper, and easier than radioactive labels
How many bases can be sequenced by Sanger sequencing in a day?
700-1000 bases
How is a gel read in Sanger sequencing?
From bottom to top (Guitar Hero)
If a gel from Sanger sequencing reads 5’-AGTCT-3’, what is the template DNA sequence from 5’ to 3’?
5’-AGACT-3’ (the reverse complement of the strand read from the gel)
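The gel shows the newly synthesized strand, so the template is its reverse complement. A minimal Python check of that answer (the helper name is an assumption for illustration):

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Complement every base, then reverse so the result reads 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


print(reverse_complement("AGTCT"))  # AGACT
```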
What is primer walking?
Designing primers such that the 5’ end complements the end of the last DNA segment sequenced
What is pyrophosphate?
Two phosphates bonded together
How is pyrosequencing similar to Sanger sequencing?
It uses DNA polymerase
What happens if a dNTP is incorporated in pyrosequencing?
A pyrophosphate is released, which can be detected
What reaction does pyrophosphate undergo in pyrosequencing?
APS + PPi –> ATP + sulfate
What does APS stand for?
Adenosine phosphosulfate
What enzyme catalyzes the reaction of pyrophosphate in pyrosequencing?
ATP-sulfurylase
What does ATP-sulfurylase do?
Converts APS and pyrophosphate into ATP
What does ATP do in pyrosequencing?
It can be used in a luciferase reaction to produce detectable light
What does CCD stand for?
Charge-coupled device
What happens in pyrosequencing after a wash?
The reaction is repeated with a different dNTP base
When is light detected in pyrosequencing?
Only when the dNTP is complementary to the template base
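The flow-and-wash cycle can be simulated roughly as follows. This is a sketch under stated assumptions: the template is written 3' to 5' (the order the polymerase reads it), the flow order "ACGT" is arbitrary, and the light signal is treated as simply proportional to the number of bases incorporated in one flow.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def pyrogram(template_3to5, flow_order="ACGT", max_cycles=20):
    """Simulate dNTP flows; return (dNTP, light units) per flow and the read."""
    position = 0
    signals = []
    read = []
    for _ in range(max_cycles):
        for dntp in flow_order:
            incorporated = 0
            # Polymerase extends only while the flowed dNTP pairs with the template.
            while (position < len(template_3to5)
                   and COMPLEMENT[template_3to5[position]] == dntp):
                incorporated += 1
                position += 1
            signals.append((dntp, incorporated))  # 0 means no light this flow
            read.append(dntp * incorporated)
            if position == len(template_3to5):
                return signals, "".join(read)
    return signals, "".join(read)


signals, read = pyrogram("TCAAG")
print(signals)
print(read)
```

Flows that return 0 correspond to a non-complementary dNTP: no pyrophosphate is released, so no light is detected.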
How many bases can be sequenced by pyrosequencing in a day?
300-500
What is pyrosequencing used for?
Resequencing or sequencing genomes for which a close relative is already available
What are the steps of whole-genome shotgun sequencing?
- Shear the DNA into short pieces
- Sequence the fragments
- Use a computer algorithm to reconstruct
How can the fragments of DNA be sequenced in whole-genome shotgun sequencing?
- Cloning fragments, then Sanger sequencing
- Directly on fragments using high-throughput sequencing
What do computer programs do in whole-genome shotgun sequencing?
Identify regions of sequence overlap from the fragments
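A toy version of overlap-based reconstruction is sketched below. It is a greedy merge chosen for brevity, not the algorithm used by any particular assembler, and it assumes error-free reads from a single strand; the function names and the `min_len` parameter are illustrative.

```python
def overlap(a, b, min_len=3):
    """Length of the longest suffix of a that equals a prefix of b."""
    for size in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:size]):
            return size
    return 0


def greedy_assemble(reads, min_len=3):
    """Repeatedly merge the pair of reads with the largest overlap."""
    reads = list(dict.fromkeys(reads))  # drop exact duplicate reads
    while len(reads) > 1:
        best = (0, None, None)
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    size = overlap(a, b, min_len)
                    if size > best[0]:
                        best = (size, i, j)
        size, i, j = best
        if size == 0:
            break  # no overlaps left; remaining reads stay as separate contigs
        merged = reads[i] + reads[j][size:]
        reads = [r for k, r in enumerate(reads) if k not in (i, j)] + [merged]
    return reads


print(greedy_assemble(["ATGGCGT", "GCGTACC", "TACCTGA"]))
```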
How come ~10x genome size is needed for whole-genome shotgun sequencing?
There is a random distribution of fragments generated
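One way to see why random fragments call for roughly 10x coverage is a Poisson approximation, which is an assumption added here and not part of the cards: if reads land at random, the chance that a given base is never covered is about e^(-c), where c is the coverage depth.

```python
import math


def coverage(num_reads, read_length, genome_size):
    """Average depth: total bases sequenced divided by genome size."""
    return num_reads * read_length / genome_size


def fraction_uncovered(depth):
    """Poisson estimate of the fraction of bases hit by zero reads."""
    return math.exp(-depth)


for depth in (1, 5, 10):
    print(depth, fraction_uncovered(depth))
```

At 1x depth roughly a third of the genome is expected to be missed, while at 10x the expected uncovered fraction falls below 0.01%.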
How many microorganisms have currently been sequenced?
3,000
What is another name for high-throughput sequencing?
Next-generation sequencing
How many base pairs can high-throughput sequencing sequence at a time?
25-500
How many reads can high-throughput sequencing generate?
Hundreds, thousands, or millions of reads
What is the consequence of high-throughput sequencing having many reads?
High coverage, but a more computationally intensive assembly process
What are some examples of high-throughput sequencing?
Illumina (Solexa), Nanopore DNA sequencing, single molecule real time (SMRT), DNA nanoball, and SOLiD sequencing
What DNA sequencing is currently under active development?
Third-generation sequencing
How does third-generation sequencing work?
By reading nucleotide sequences at the single molecule level
What is bioinformatics?
An interdisciplinary field that develops software and methods to understand large and complex biological data
What is the purpose of annotation?
It allows for researchers to predict ORFs
What does ORF stand for?
Open reading frame
What is the importance of ORFs?
They allow researchers to better determine the start and stop points for a given gene
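ORF prediction can be illustrated with a deliberately naive scanner. It assumes ATG as the only start codon and searches only the three forward reading frames; real annotation tools also consider alternative start codons, the reverse strand, and other signals. The names below are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end, sequence) for ATG...stop stretches in forward frames."""
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                # Keep the ORF only if it has enough codons before the stop.
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, seq[start:i + 3]))
                start = None
    return orfs


print(find_orfs("CCATGAAATTTTAGGG"))
```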
How can functions for newly discovered proteins be suggested?
Based on observed similarities
What can researchers speculate about a gene by looking at its sequence?
Whether it is a transcriptional factor, transport protein, or some enzyme
True or false: many genes have known proteins associated with them
False: many genes predicted by sequencing data encode gene products whose functions remain unknown
What is functional genomics?
The study of finding out the biological role of unknown genes
What types of experiments are part of functional genomics?
Using mutagenesis to study phenotypes
How is gene expression generally regulated?
At the transcriptional level
What is a genomic library?
A collection of all the genes in a genome (cloned DNA fragments)
What can a genomic library be used for?
Whole genome sequencing
What does the method for obtaining a genomic library depend on?
The desired outcome
How can an mRNA library be formed?
By using cDNA generated from reverse transcriptase
What does reverse transcriptase do?
Convert RNA into DNA
What does cDNA stand for?
Complementary DNA
What does cDNA do?
It is the complement to the expressed mRNA
How can a true genomic library be formed?
By shearing the genome and cloning the fragments
How are DNA fragments prepared for a genomic library?
Through restriction enzyme digestion
How are clones generated for a genomic library?
By ligating the DNA fragments into plasmids, and transforming the cells
What determines how many clones are needed to represent the whole genome in a genomic library?
The size of the genome, and the average size of each cloned fragment
What equation can be used to calculate how many clones are needed for a genomic library?
N = ln(1-p)/ln(1-f)
In the equation for a genomic library, what is N?
The number of cloned fragments
In the equation for a genomic library, what is p?
The probability of generating a complete library
In the equation for a genomic library, what is f?
The average size of a fragment divided by the total genome size
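The library-size equation above can be evaluated directly. The genome and insert sizes in the example are illustrative values, not figures from the cards.

```python
import math


def clones_needed(p, fragment_size, genome_size):
    """N = ln(1 - p) / ln(1 - f), with f = fragment size / genome size."""
    f = fragment_size / genome_size
    return math.ceil(math.log(1 - p) / math.log(1 - f))


# Hypothetical example: 4.6 Mb genome, 10 kb average insert, 99% probability.
print(clones_needed(0.99, 10_000, 4_600_000))
```

Raising p or shrinking the average fragment size both increase N, which matches the earlier card on what determines the number of clones.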
What is the difference between a cDNA library and a true genomic library?
A true genomic library has all the genes, while the cDNA library only has genes that were expressed as mRNA
What understanding do expression patterns allow for?
How organisms function under different conditions
What is a transcriptome?
A set of transcribed mRNA molecules in a cell
How can a transcriptome be measured?
Through northern blots
What is the procedure for a northern blot?
- Total RNA is isolated from cells of interest, and separated with gel electrophoresis
- RNA is transferred to a membrane
- RNA is probed with labeled DNA fragments
What happens after gel electrophoresis in a northern blot?
RNA is transferred to a membrane made of nitrocellulose or nylon
How is RNA linked to the nitrocellulose membrane?
Through exposure to UV light
What probe is used in a northern blot?
A labelled DNA probe which is complementary to the mRNA being examined
How can the location of the signal on the membrane be measured in a northern blot?
Through autoradiography or photography
What is the disadvantage of a northern blot?
It is cumbersome to track the expression of multiple genes at once
What is a DNA microarray?
A collection of microscopic DNA spots attached to a solid surface
How does the procedure of a microarray compare to northern blotting?
A microarray is essentially the reverse procedure
What are the steps of the microarray procedure?
- A glass slide is spotted with synthesized oligonucleotides
- Total mRNA is extracted from the cell
- mRNA is converted into cDNA, incorporating a fluorescent label
- The cDNA is passed over the chip
In the microarray procedure, what happens if more mRNA copies are present in a sample?
More fluorescently labeled cDNA corresponding to that mRNA is produced
How are the fluorescent labels used in a microarray?
They are captured by a scanner and analyzed by software to quantify the amount of binding to each individual spot
How can a microarray be used with two different samples?
The cDNA from two different samples can be labeled differently, and hybridize to a single microarray
How is the relative amount of cDNA in a microarray seen?
Through color output
What does Yersinia pestis do?
Infects a variety of rodents, and can replicate within fleas
What is the primary transmission of Yersinia pestis between rodents?
Flea bites
How can Yersinia pestis be transmitted to humans?
Via fleas or an infected animal
What is special about the hosts for Yersinia pestis?
They differ greatly in internal body temperature
What is the internal body temperature of a human?
37 °C
What is the internal body temperature of a flea?
26 °C
What conditions were used in the microarrays studying Yersinia pestis?
Two different temperatures, corresponding to the flea and human
What can be done to the genes identified from the Yersinia pestis microarray?
They can be studied further to identify function, role in pathogenesis, or potential as targets for therapeutic applications
What is the most recent technology in studying transcriptomes?
RNA-sequencing
What is another name for RNA-sequencing?
RNA-seq
What are the steps for RNA-seq?
- RNA is isolated from cells, and the rRNA is removed
- Remaining RNA is converted to cDNA by reverse transcriptase
- The cDNA can be sequenced through high throughput sequencing
In RNA-seq, before cDNA is sequenced, what is done?
Sequencing linkers are attached to cDNA fragments
What does the cDNA represent in RNA-seq?
The transcripts within a cell
What happens after the cDNA is sequenced in RNA-seq?
The resulting sequences are compared to known RNA sequences (bioinformatics)
How can the abundance of RNA transcripts be confirmed in RNA-seq?
Through qPCR
What was RNA-seq first used for?
To examine the transcriptome of Saccharomyces cerevisiae
Besides microorganisms, what can RNA-seq be used on?
The genome of RNA viruses
Which is used more frequently: RNA-seq or microarrays?
RNA-seq
What will cause RNA-seq to evolve?
Advancements in DNA sequencing and analysis methods
What is a proteome?
A collection of all the proteins present in a cell under specific conditions
What do differences in protein types and abundance reflect?
Changes in gene expression and/or protein stability
Why is the study of proteomes important?
Proteins can vary due to stability or post-translational modifications, which cannot be detected by analyzing gene expression or mRNA
What are some methods to study proteomics?
2D-PAGE, mass spectrometry, X-ray crystallography, and NMR
What does NMR stand for?
Nuclear magnetic resonance
What does 2D-PAGE stand for?
2D-polyacrylamide gel electrophoresis
What is 2D-PAGE?
A 2D separation method to separate proteins based on two properties
What properties are proteins separated by in a 2D-PAGE?
Isoelectric point and mass
What does pI stand for?
Isoelectric point
What is an isoelectric point?
The pH where a protein has no net charge
What determines the isoelectric point of a protein?
The amino acid sequence
What will a protein do in a pH gradient?
Migrate to the pH that matches pI
What do proteins in a 2D-PAGE look like?
Spots on a 2D matrix
What does the pattern of spots represent on a 2D-PAGE?
The protein composition of a cell
What does the different pattern of spots in a 2D-PAGE suggest?
A measure of gene expression in different conditions
When are many proteins in a cell expressed?
Under conditions where they function or are needed by the cell
How can the identities of proteins in an individual spot in a 2D-PAGE be determined?
By using mass spectrometry
What makes mass spectrometry possible for identifying proteins?
Being able to compare amino acid sequences with corresponding genome sequence databases
What are the steps to identify a protein from a 2D-PAGE?
- Spots are extracted from the gel
- The protein is digested into smaller fragments
- The fragments are analyzed by mass spectrometry to determine amino acid sequence
- These sequences are compared to a sequence database
How is a protein digested into smaller fragments?
Through proteases
What do proteases do?
Break down proteins into smaller fragments
How is the amino acid sequence determined from mass spectrometry?
By considering the individual mass/charge ratio
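A rough sketch of the digest-and-weigh idea follows. Several things here are assumptions not stated in the cards: trypsin is used as the protease (cutting after K or R unless the next residue is P), the residue masses are standard monoisotopic values, and real identification compares measured spectra against a database.

```python
import re

# Monoisotopic residue masses in daltons (standard reference values).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.007276


def tryptic_digest(protein):
    """Split after K or R, except when the next residue is P (Python 3.7+)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", protein) if p]


def mass_to_charge(peptide, charge=1):
    """m/z of a peptide carrying `charge` extra protons."""
    mass = sum(RESIDUE_MASS[aa] for aa in peptide) + WATER
    return (mass + charge * PROTON) / charge


for peptide in tryptic_digest("AKGRPLK"):
    print(peptide, round(mass_to_charge(peptide), 4))
```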
What is comparative genomics?
The study of evolutionary processes using the tools of genomics
What is the goal of comparative genomics?
To determine relationships between species
What can comparative genomics identify (in terms of genes)?
Genes associated with virulence and pathogenicity
Where does genetic variability come from?
Mutations in the DNA sequence
What are homologs?
Genes in a given genome that belong to related gene families that share a common ancestral DNA sequence
What are the two types of homologs?
Paralogs and orthologs
How can homologs arise?
From a gene duplication event
What happens (in terms of evolution) when a gene is duplicated?
One copy of the gene can evolve novel functions, while the other copy performs the original function
What are paralogs?
Homologous genes that arise from a duplication event
What is an example of a paralog?
Malate dehydrogenase and lactate dehydrogenase
What is an example of a family of paralogs?
ABC transporters
True or false: paralog families are tiny
False: they can be quite large
What is important in the evolution of genomes?
The duplications of genes
What are orthologs?
Homologs that evolved from the same ancestor with the same function in different organisms
What is an example of an ortholog?
Malate dehydrogenase in two different genomes
What does HGT stand for?
Horizontal gene transfer
What is horizontal gene transfer?
The movement of DNA between organisms other than transmission from parent to offspring
What is another name for HGT?
Lateral gene transfer
True or false: the genome of every organism contains foreign genes
True: HGT is important in all organisms
True or false: plasmids are the only way HGT can occur
False: foreign DNA can also be introduced in other ways, such as by bacteriophages or transposable elements
What is an indication that a gene is foreign?
If the base pair composition differs significantly from the rest of the chromosome
What is a characteristic of a specific genome (specific to a particular organism)?
The GC content
What is the GC content of E. coli?
50%
What is the GC content of Streptomyces coelicolor?
72%
What is the GC content of S. cerevisiae?
38%
What is some evidence for HGT?
Gene phylogeny and differences in nucleotide pair patterns
What was found when comparing genome sequences of related microbes?
Large segments of DNA exist in one genome, but not in a closely related genome
What do large segments of DNA that exist in one genome, but not a closely related one, suggest?
Introduction or removal of large stretches of DNA occur through one or more gene transfer mechanisms with the assistance of transposable elements
What are genomic islands?
Introduced DNA segments greater than 10 kb, up to 200 kb
How are genomic islands detected?
By comparison of genomic sequences and analysis of nucleotide composition
What is a characteristic of genomic islands?
They often contain GC content different from the majority of the genome
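Detection by nucleotide composition can be sketched as a sliding-window GC scan. The window size, step, and threshold below are arbitrary illustrative choices, and a real analysis would combine this with sequence comparison and other signals.

```python
def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def flag_atypical_windows(genome, window=1000, step=500, threshold=0.10):
    """Return windows whose GC fraction differs from the genome-wide value."""
    genome = genome.upper()
    overall = gc_fraction(genome)
    flagged = []
    for start in range(0, len(genome) - window + 1, step):
        local = gc_fraction(genome[start:start + window])
        if abs(local - overall) > threshold:
            flagged.append((start, start + window, round(local, 3)))
    return flagged


# Toy genome: an AT-rich background with a GC-rich insert in the middle.
toy = "AT" * 1000 + "GC" * 250 + "AT" * 1000
print(flag_atypical_windows(toy, window=500, step=500, threshold=0.2))
```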
What are genomic islands usually associated with?
tRNA genes, transposable elements, plasmids, or bacteriophages
What is metagenomics?
A process where DNA is extracted directly from microbial communities and analyzed as a composite mixture
What happens to extracted DNA in metagenomics?
It can be sequenced directly or cloned into vectors to make libraries for further analysis
What is another name for metagenomics?
Environmental genomics
What is the consequence of having many organisms in microbial communities?
It is difficult to complete DNA sequences of each individual genome
How come only a subset of microbial communities are targeted for metagenomics?
There are many organisms in microbial communities
In what environments is metagenomics used?
Freshwater, wastewater treatment systems, acid mine drainages, and deep-sea thermal vents
What can metagenomics lead to?
The discovery of new genes encoding novel enzymes and antibiotics, or evidence of new organisms
What are the steps for metagenomics?
- Obtain DNA from a sample
- Sequence using rapid next-gen sequencing
- Analyze sequences using a computer
What must be done to analyze sequences in metagenomics?
Sequences from already known organisms must be eliminated
In Sanger sequencing, which dNTP is radiolabeled?
dCTP (32P-dCTP)
What is the importance of radiolabeling dCTP?
It is important for collecting data
True or false: radiolabeled dCTP can be added to the DNA strand
True: it still has the 3’ OH group
What helped increase the number of nucleotides that can be read from Sanger sequencing?
Improvements in gel electrophoresis technology
What were some improvements of Sanger sequencing over the years?
Thermostable polymerases, fluorescent labels for each ddNTP, and base-calling software
How are thermostable polymerases advantageous for Sanger sequencing?
They allow multiple rounds of synthesis from a single template strand
How are fluorescent labels for each ddNTP advantageous for Sanger sequencing?
The reaction can be done in one tube instead of four
How is base-calling software advantageous for Sanger sequencing?
They can interpret the raw data automatically and provide a direct sequence output
What is automated sequencing?
Using longer electrophoresis in the Sanger reaction to lead to more sequencing data
What is another name for automated sequencing?
Cycle-sequencing
When was high-throughput sequencing developed?
2004
What does apyrase do?
Removes unincorporated dNTPs and ATP from the system
Where is apyrase used?
In pyrosequencing
What is 454-pyrosequencing?
An adaptation of pyrosequencing that dramatically increased throughput
Why is it called 454-pyrosequencing?
“454” is the code name of the sequencing technology project
What are the steps of 454-pyrosequencing?
- DNA is fragmented into short pieces, then ligated onto beads
- Each bead can be separated into its own PCR reaction
- The pyrosequencing reagents are added to each well
What is the advantage of 454-pyrosequencing?
It increases sequencing throughput
What is emulsion PCR?
PCR carried out on the surface of a bead in an oil emulsion
Where is emulsion PCR used?
In 454-pyrosequencing
What is Ion Torrent?
A high-throughput sequencing method
What is Ion Torrent related to?
454-pyrosequencing
How does Ion Torrent work?
Like 454-pyrosequencing, but it measures protons instead of pyrophosphate each time a nucleotide is incorporated
What is another name for Ion Torrent?
Ion-semiconductor sequencing
How does Illumina sequencing work?
- DNA polymerase adds fluorescently labeled nucleotides
- An image is taken, and the fluorescent label is cleaved
- Paired-end reads can be analyzed separately
What is PacBio?
A system for genome sequencing
What is the advantage of more coverage in shotgun sequencing?
It helps reduce gaps in the final assembly that would need to be filled in with time consuming methods like primer walking
How can gaps be reduced in shotgun sequencing?
By combining long, error-prone reads with short, accurate reads
In shotgun sequencing, which method produces long, error-prone sequences?
PacBio
In shotgun sequencing, which method produces short sequences?
Ion Torrent or Illumina
What does the long error-prone sequence do in shotgun sequencing?
Gives a backbone (scaffold) for the overall genome
What do the short sequences do in shotgun sequencing?
Ensure that the final genome does not contain errors
What is a pan-genome?
The full complement of all genes within a species of bacteria (in all different strains)
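With gene lists per strain, the pan-genome is a set union. The sketch below also reports the genes shared by every strain and the remainder; the "accessory" label and the strain and gene names are illustrative assumptions.

```python
def pan_and_core(strain_genes):
    """Return (pan-genome, genes in every strain, genes in only some strains)."""
    gene_sets = [set(genes) for genes in strain_genes.values()]
    pan = set().union(*gene_sets)
    core = set.intersection(*gene_sets) if gene_sets else set()
    return pan, core, pan - core


# Hypothetical strains and gene names.
strains = {
    "strain_1": ["geneA", "geneB", "geneC"],
    "strain_2": ["geneA", "geneB", "geneD"],
    "strain_3": ["geneA", "geneE"],
}
pan, core, accessory = pan_and_core(strains)
print(sorted(pan), sorted(core), sorted(accessory))
```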
What fundamental technique is used to study the transcriptome?
Nucleic acid hybridization
What does a Southern blot do?
Detects specific DNA sequences
What are the steps for a Southern blot?
- Generate DNA fragments through restriction enzymes
- Separate the fragments through gel electrophoresis
- Transfer the DNA fragments to a membrane for further analysis
What advances led to DNA microarrays?
Advances in photolithography and the availability of complete genome sequences
What does photolithography refer to?
The technique of creating computer chips
In the Y. pestis experiment, which genes could be important in pathogenicity?
The green and red genes (not the yellow genes)
How come the green and red genes may be important in Y. pestis pathogenicity?
They respond differently according to the different host temperatures
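The red, green, and yellow calls can be thought of as a ratio of two fluorescence channels. In this sketch the assignment of channels to temperatures, the two-fold cutoff, and the intensity values are all assumptions for illustration.

```python
import math


def classify_spot(red, green, fold=2.0):
    """Call a spot by the log2 ratio of its two channel intensities."""
    ratio = math.log2(red / green)  # intensities must be positive
    cutoff = math.log2(fold)
    if ratio >= cutoff:
        return "red"     # more cDNA from the red-labeled sample
    if ratio <= -cutoff:
        return "green"   # more cDNA from the green-labeled sample
    return "yellow"      # similar amounts from both samples


# Hypothetical intensities for three spots.
for red, green in [(800, 100), (100, 800), (500, 450)]:
    print(red, green, classify_spot(red, green))
```

Yellow spots bind similar amounts of both samples, so those genes do not appear to respond to the temperature shift.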
What does SDS-PAGE stand for?
Sodium dodecyl sulfate polyacrylamide gel electrophoresis
What does an SDS-PAGE do?
Separates denatured proteins based on mass
How does an SDS-PAGE work?
It mitigates charge, so proteins can be separated based on mass alone
How come most of the proteome would not resolve on an SDS-PAGE?
Many polypeptides have similar mass
What are the steps of a 2D gel?
- Apply a protein sample to a pH gradient on a polyacrylamide strip
- Use a current to separate by pI
- Use this strip in electrophoresis to separate by mass
What does LC-MS stand for?
Liquid chromatography-mass spectrometry
What is the purpose of LC-MS?
To separate proteins by LC before analyzing them with MS
What methods can be used to study the structures of proteins?
X-ray crystallography and NMR
How does X-ray crystallography work?
Crystallized proteins are subjected to X-rays. The scattering patterns can be detected and analyzed
How easy is protein crystallization?
It is a delicate and often difficult process
What are the disadvantages of protein crystallization?
Not all proteins crystallize well
Which proteins are especially difficult to crystallize?
Hydrophobic proteins that normally associate with cytoplasmic membranes
What is the advantage of NMR?
It is able to determine the structure of proteins while in solution
How does NMR work?
It measures the distances between atomic nuclei
What is the disadvantage of NMR?
It has a maximal size limit of about 30 kDa (the size of an average bacterial protein)
How are the different proteins in a paralog family related?
They probably carry out similar functions, but on different substrates
How are the different proteins in an ortholog family related?
They probably carry out the same function (and same substrate) in different organisms
What does the evolutionary relationship of orthologs mirror?
The evolutionary history of their respective genomes
What do dehydrogenases do?
They carry out NAD- or NADP-dependent reactions
What does malate dehydrogenase do?
Converts malate into oxaloacetate
What does lactate dehydrogenase do?
Converts lactate into pyruvate
Why can genomes be considered as mosaics?
They have arisen from evolutionary changes and horizontal gene transfer
How come scientists first believed that HGT had no evolutionary advantage to microbes?
- There were different gene regulatory elements involved
- Expression was needed for the new gene
- The new gene could be detrimental
Besides GC content, what are some clues that HGT has taken place?
Differences in patterns of nucleotide base pairs, codon usage patterns, and presence of repetitive sequences
How can gene phylogeny imply HGT?
If evolutionary relationships from the gene do not match the core genome, this is evidence that HGT has taken place
When is HGT most successful?
When two organisms have similar DNA and GC content
What are some examples of advancements due to metagenomics?
- The discovery of proteorhodopsins
- The discovery of ammonia-oxidizing archaea
- The discovery of vitamin B12 producers in the ocean
What do proteorhodopsins do?
Harness light energy for metabolism in marine environments
Who pioneered functional metagenomics?
Jo Handelsman
How can sequences from metagenomics be linked to specific microbes?
Through databases or SSU rRNA genes
What are some limitations of metagenomics?
Researchers cannot make firm predictions based solely on available sequence information
What does FACS stand for?
Fluorescence-activated cell sorting
How does FACS work?
A DNA probe for a SSU rRNA is used in a permeabilized cell to recover individualized uncultivated cells
What is single-cell genomics?
Understanding the genome of one cell through FACS
What are some challenges of metagenomics?
Co-extracted contaminants and community complexity
What is metatranscriptomics?
The study of RNA transcripts directly extracted from a community
What is metaproteomics?
The study of environmental proteins
How do metatranscriptomics and metaproteomics work?
Through bioinformatics (similar to individual organisms)