Chapter 9 Flashcards
genome
entire complement of genetic info including genes, regulatory sequences, and noncoding DNA
genomics
discipline of mapping, sequencing, analyzing, and comparing genomes
First genome sequenced in 1976
RNA virus MS2; 5386 bp
First cellular genome sequenced in 1995
Haemophilus influenzae
The human genome contains…
3 billion bp and 25000 protein coding regions
sequencing
determining the precise order of nucleotides in a DNA or RNA molecule
generation
successive major changes in sequencing technology that confer
-increase in speed, drop in cost of sequencing
Sanger method
first generation sequencing
Presently, most labs access ______ generation sequencing
second
shotgun sequencing
entire genome is cloned, and resultant clones are sequenced
-sequencing is redundant
genome assembly
connecting the DNA fragments in the correct order and eliminating overlaps
annotation
converting raw sequence data into a list of genes present in the genome
bioinformatics
science that applies powerful computational tools to DNA and protein sequences for the purpose of analyzing, storing, and accessing the sequences for comparative purposes
Majority of genes encode…
proteins
functional open reading frame (ORF)
encodes a protein
How do computer algorithms search for ORFs?
look for start/stop codons and Shine-Dalgarno sequences
hypothetical proteins
proteins that likely exists, but whose function is currently unknown and encode nonessential genes
noncoding RNA
RNA that does not code for protein; lack start codons and have multiple stop codons (tRNA, rRNA)
Unlike prokaryotes, eukaryotic genomes contain…
a large fraction of noncoding DNA
On average, a prokaryotic gene is ______ bp long
1000
As genome size increases, gene content…
proportionally increases
smallest cellular genomes belong to..
parasitic or endosymbiotic prokaryotes
estimates suggest minimum # of genes for a viable cell is…
250-300 genes
Many genes can be identified by..
comparative analysis
comparative analysis
identifying sequence similarities to genes found in other organisms
most abundant class of genes
metabolic genes and genes coding for protein sequences