Microbial Genomics Flashcards
Genome
entire complement of genetic info including genes, regulatory sequences and noncoding DNA
Genomics
discipline of mapping, sequencing, analyzing and comparing genes
Sanger Dideoxy Sequencing Method
- uses radioactively labeled bases in PCR
- uses gel electrophoresis to sequence the DNA
Illumina Sequencing Method
- 200 million reads/run; 300 bp length
- Steps:
- put on double stranded adapters
- have complimentary strands on flowell (600 million clusters)
- creates build up/bridges on flowell
- bridge amplification, stationary phase PCR
- clusters are identified w/ fluorescence
PacBio Sequencing
capable of 30,000 reads/run; 2000 bp length
454 Sequencing
- 1 million reads/run; 700 bp length
- generates data 100x faster than Sanger method
- relies on 2 major advances
- use of robotics
- pyrosequencing
- Steps:
- chop up DNA, attach things w/ ligase, undergo emulsion PCR which attaches DNA to bead
- everything placed in picotiter plate (1.7 million wells)
- nucleotides are run over DNA and will attach
- releases PPi which get transferred to ATP which then admits light
- light identifiable for each base
How do you assemble the information from sequencing into a genome?
- align all reads w/ software to generate longer sequencing (contigs)
- gaps remain btwn contigs when don’t match up = “draft” genomes
- orientation can be assumed if know related organisms
- can amplify gaps by sequencing PCR amplicons
- most laborous and expensive part of sequencing
Annotation
- coverting raw sequence data into a list of genes present in genome
- majority of genes encode mRNAs (open reading frames)
- some encode functional RNAs (rRNAs, tRNAs, ncRNAs)
- Functional ORF = Open Reading Frame (encodes proteins)
- computers search for start/stop codons and shine-dalgarno sequences
- identify ORF
- computers search for start/stop codons and shine-dalgarno sequences
What is the correlation of gene size to genome size?
- gene = 1,000 bp
- 1,000 genes = 1 genome
- 1 Mbp = 1,000 genes = 1,000,000 bp
What is the fractino of noncoding DNA in microbes?
- Eukaryotic microbes = >80%
- Bacteria = <15%
How many kbp does Nanoarcheaum equitans have?
139 kbp
How many kbp does *Mucobacterium tuberculosis *have?
4,400 kbp
What is the estimated minimum # of genes needed for cell viability?
250 - 400 genes
Which prokaryote has the larges genome to date at 13.0 Mbp?
Sorangium cellulosm
Do archaeal genomes tend to be smaller or larger than bacterial ones?
smaller at only an estimated ~5 Mbp