reverse Flashcards
Identification of the functionally important sections of a sequenced genome: coding genes and noncoding parts (e.g. regulatory elements).
Annotation
Can codon usage bias be explained by tRNA abundance (a higher % of certain tRNAs in the cell)? Measured with Relative Synonymous Codon Usage (RSCU). Note the excess of A-ending synonymous codons.
Translation efficiency
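A minimal sketch of how RSCU can be computed for one amino acid, assuming the usual definition (observed count of a codon divided by the count expected if all synonymous codons were used equally). The function name `rscu` and the alanine counts are made up for illustration.

```python
def rscu(codon_counts):
    """Return the RSCU value of each codon in one synonymous family.

    codon_counts maps every synonymous codon of a single amino acid
    to its observed count in the coding sequences being analysed.
    """
    total = sum(codon_counts.values())
    if total == 0:
        raise ValueError("no codons observed for this amino acid")
    # Count expected for each codon if all synonyms were used equally.
    expected = total / len(codon_counts)
    return {codon: n / expected for codon, n in codon_counts.items()}


# Hypothetical counts for the four alanine codons (illustrative only).
alanine = {"GCA": 40, "GCC": 20, "GCG": 10, "GCT": 10}
values = rscu(alanine)
print(values)  # GCA gets 2.0: used twice as often as expected
```

An RSCU of 1.0 means no bias; values above 1.0 mark codons used more often than expected, as with the A-ending codon in this made-up example.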
Alignment searches: retrieves sequences in a database that are similar to a query. Sacrifices alignment quality (scores) for speed and the ability to retrieve as many matches as possible. Not a good tool for producing alignments to compare sequences.
BLAST
For each cloned cDNA, sequence a short segment from the 5’ or 3’ end.
ESTs (Expressed Sequence Tags)
All DNA content/information in a haploid cell.
Genome
The study of the information content of genomes.
Bioinformatics
Assembly of contiguous stretches of (chromosomal) DNA. (Getting the DNA, sequencing it, and putting it together into something that can be read.)
Structural genomics
Characterizes the role (level of expression, biological function) played by transcripts and proteins. (What does this tell me? ABCD? What's the language? Like reading a book.)
Functional genomics
Comparing genomes from different organisms (sequence conservation, nucleotide composition bias). (Learn lessons from comparing genomes. What is conserved, i.e. where have mutations not been allowed over millions of years? What is changing rapidly?)
Comparative genomics
Use random markers; organize (map) segments of DNA; choose the minimum number of overlapping clones (tiling path); sequence. (Use random markers to make a map, then sequence the minimum number of clones needed to cover the whole thing.)
Hierarchical genome shotgun (HGS)
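The "choose the minimum number of overlapping clones" step can be sketched as a greedy interval cover. This is only an illustration: it assumes each clone has already been mapped to start and end coordinates, and the function name `tiling_path` and the clone coordinates are hypothetical.

```python
def tiling_path(clones, region_end):
    """Pick a minimum set of clones covering positions 0..region_end.

    clones is a list of (name, start, end) tuples giving each clone's
    mapped position. Raises ValueError if the map has a gap.
    """
    ordered = sorted(clones, key=lambda c: c[1])  # sort by start
    path = []
    covered = 0  # everything up to this position is already covered
    i = 0
    while covered < region_end:
        best = None
        # Among clones that start inside the covered region,
        # keep the one that reaches farthest to the right.
        while i < len(ordered) and ordered[i][1] <= covered:
            if best is None or ordered[i][2] > best[2]:
                best = ordered[i]
            i += 1
        if best is None or best[2] <= covered:
            raise ValueError(f"gap in the map at position {covered}")
        path.append(best)
        covered = best[2]
    return path


# Hypothetical mapped clones (name, start, end); coordinates are made up.
clones = [("A", 0, 50), ("B", 10, 40), ("C", 30, 90),
          ("D", 45, 120), ("E", 80, 150), ("F", 110, 150)]
path = tiling_path(clones, 150)
print([name for name, _, _ in path])  # ['A', 'D', 'E']
```

Only the clones on the returned path would then be sequenced; clones B, C and F are redundant in this example.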
Fragment the entire genome into pieces, sequence all the pieces, and assemble them into contigs that span each chromosome.
Whole genome shotgun (WGS)
The average number of times a base is sequenced in a genome project.
Coverage
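A quick worked example of this average, assuming the common estimate coverage = (number of reads × read length) / genome size. The function name and the numbers are made up for illustration.

```python
def coverage(num_reads, read_length, genome_size):
    """Average number of times each base is sequenced."""
    # Total bases sequenced, divided by the number of bases in the genome.
    return num_reads * read_length / genome_size


# Hypothetical project: 100,000 reads of 500 bp over a 5 Mb genome.
depth = coverage(100_000, 500, 5_000_000)
print(depth)  # 10.0, i.e. "10x coverage"
```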
A universal file format used to report sequences. A FASTA file has a description header starting with >, followed by the actual sequence.
FASTA
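A minimal sketch of reading this format, assuming plain text with one or more records. The function name `parse_fasta` and the two example records are hypothetical.

```python
def parse_fasta(text):
    """Return a dict mapping each header (without '>') to its sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]  # description header
            records[header] = []
        elif header is not None:
            records[header].append(line)  # sequence may span several lines
    return {h: "".join(parts) for h, parts in records.items()}


# Made-up records for illustration.
example = """>seq1 hypothetical example record
ACGTACGT
ACGT
>seq2 another made-up record
GGCC
"""
records = parse_fasta(example)
print(records["seq1 hypothetical example record"])  # ACGTACGTACGT
```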
Measured by a score that reflects the number of times a base was identified by the automated sequencer and the quality of the chromatogram for each base (i.e. peak height and even spacing).
Quality
A four-coloured graph produced by the sequencer (chromatogram).
Trace