Chapter 18 Flashcards
Genomics, Bioinformatics, and Proteomics
Genomics allows sequencing of entire genomes
Genome: - Complete set of DNA in a single cell of an organism Genomics - the study of genomes - Structural genomics - Functional genomics - Comparative genomics - Metagenomics
Structural genomics
- sequencing genomes
- analyzing nucleotide sequences to identify genes and sequences such as gene regulatory elements
Shotgun cloning
whole genome sequencing (shotgun cloning)
- the most widely used strategy for sequencing and assembling an entire genome
1. Genomic DNA is cut into fragments called contigs
2. entire chromosome is assembled by computer program
3. fragments are aligned based on identical DNA sequences
Contigs
contigs - continuous fragments
- overlapping fragments adjoining segments that collectively form one continuous DNA molecule within chromosome
Bioinformatics
uses computer based approaches to organize, share, and analyze data related to:
- gene structure
- gene sequence and expression
- protein structure and function
Applications for bioinformatics
- comparing DNA sequences
- finding gene regulatory regions (promoters and enhancers)
- predicting amino acid sequences
- deducing evolutionary relationships between genes
Hallmark Characteristics of a Gene Sequence Can Be Recognized by Bioinformatics Tools
- several hallmark characteristics of genes (prokaryotes and eukaryotes) can be searched for using bioinformatics software
- Gene-regulatory sequences found upstream are marked by identifiable sequences such as promoters, enhancers, and silencers.
Open reading frames (ORF)
- sequences of triplet nucleotides translated into amino acid sequences of a protein
- suggestive of protein- encoding gene
- typically begin with initiation sequence ATG
- end with termination sequence (TAA, TAG, TGA)
Functional genomics
- Study of gene functions based on resulting RNAs or possible proteins they encode as well as regulatory elements
- Attempts to Identify Potential Functions of Genes and Other Elements in a Genome
Predicting Gene and Protein Functions by Sequence Analysis
Similarity searches
–Screen databases and compare sequence to known sequence
–Genome sequence statistically similar to gene with known function likely encodes for protein with similar function.
- Compares portions of human leptin gene (LEP) with homolog in mice (ob/Lep)
Homologous genes
- Genes that are evolutionarily related
- Similarity searches are able to identify homologous genes.
Orthologs - Genes from different species thought to have descended from common ancestor
HumanGenomeProject (HGP)
- Coordinated effort to sequence and identify all genes of human genome
- Illustrates that humans and all other species share common set of genes essential for cellular function and reproduction
Major features of human genome project
Two biggest surprises discovered from HGP
–There are less than 2% genome codes for proteins.
–There are probably only about 20,000 protein-coding genes.
The number of genes is lower than the number of predicted proteins.
Alternative splicing
Many genes code for multiple proteins through alternative splicing.
- Alternative splicing produces incredible diversity of proteins beyond number of human genes
- Alternative splicing patterns can generate multiple mRNA molecules = multiple proteins.
Human genome
Human genomic sequence is 99.9% the same, with most genetic differences resulting from
- Single-nucleotide polymorphisms (SNPs)
- Single-base changes in genome
- Variations associated with disease conditions - Copy number variations (CNVs)
- Segments of DNA duplicated or deleted
Comparative genomics
- Compares genomes of different organisms to answer questions about genetics and other aspects of biology
–Incorporates study of gene and genomic evolution–Explores relationship between organisms and environment–Studies differences and similarities between organisms and how differences contribute to phenotype, life cycles, and so on.