Lecture 21 Flashcards
the complete set of DNA within a single cell of an organism
genomes
is a discipline in genetics that applies recombinant DNA, DNA sequencing methods, and bioinformatics to sequence, assemble, and analyze the function and structure of genomes
genomics
Human genome project sequenced human genome, model system genomes, and –
EST (expressed sequence tags) = cDNAs
map the fragments, sequence the fragments, and piece it together = slow and careful (public)
clone-by-clone sequencing approach
Take a lot of copies of the genome
Break it up into little fragments
Sequenced all the little fragments
Computer puts it all together
shotgun sequencing
next generation sequencing: after each nucleotide is added, its identity is determined by –
fluorescence
Once a sequence is obtained, it must be –
annotated
annotating a sequence includes – the genes, promoters, exons/introns, and function of genes
locating
process of attaching biological functions to DNA sequences
- describes both structural and functional features of a gene
- goal = identify known genes, regulatory sequences, etc as well as to identify sequences that are likely to be genes through their function
annotation
– cDNAs allow designation of all introns and exons
complete
– cDNAs (referred to as expressed sequence tags (EST)) allow expressed genes to be identified
incomplete
same info from all cell types from the same individual
includes: cis-elements (promoter, enhancer, etc.)
introns, repetitive sequences, etc.
genomic DNA
– is expressed genes that is derived from mRNA contains coding regions (no introns or regulatory sequences)
cDNA
What type of library do you need to piece the genome together?
genomic
What type of library do you need to determine the exon/intron complete sequence and structure of a gene?
genomic and cDNA
expression levels are based on frequency of –
cDNA clones
exon/intron structure are based on
cDNA clones
alternative splicing are based on –
cDNA clones
databases contain info on function which is based on –
homology to known genes
Where experimental data is not available, – is used to predict gene structure from ORFs, splice sites, promoter sequences
bioinformatics
Bioinformatics: the use of – approaches to decipher DNA-sequence information
computational
sequences without a stop codon, appear to possibly code for polypeptides
open reading frames
DNA – analysis can analyze mRNA concentrations – a cheaper alternative to sequencing.
microarray
Sequence analysis of all living organisms in an environment
metagenomics
in metagenomics – are over represented
most common species