Lecture 18 - Sequence Similarity Flashcards
What is an open reading frame?
A stretch of sequence in one reading frame, running from a start codon to a stop codon, that has the potential to be translated.
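A minimal sketch of how open reading frames could be located, assuming ATG as the only start codon, the standard stop codons (TAA, TAG, TGA), and the forward strand only. The function name `find_orfs` and the `min_codons` parameter are illustrative choices, not part of the lecture.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def find_orfs(dna, min_codons=2):
    """Return (start, end, frame) tuples for ATG...stop stretches.

    Only the three forward-strand frames are scanned (an assumption
    made to keep the sketch short). 'end' is exclusive and includes
    the stop codon; min_codons counts codons before the stop.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None  # index of the currently open ATG, if any
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # close this ORF and look for the next ATG
    return orfs


if __name__ == "__main__":
    print(find_orfs("CCATGAAATTTTAGGG"))
```

Real ORF finders also scan the reverse complement and usually apply a much larger minimum length.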
How can the function of unknown proteins be inferred?
By similarity of sequence to known proteins.
What was the first protein database?
Protein Information Resource (PIR)
Give some protein sequence databases.
- Swiss-Prot
- TrEMBL
- UniProt
Which protein database is manually annotated?
Swiss-Prot
Give three DNA sequence databases in order from early to late.
- GenBank
- European Molecular Biology Laboratory (EMBL)
- DNA Data Bank of Japan (DDBJ)
What is sequence alignment?
A way of arranging the primary sequences of DNA, RNA or protein to identify regions of similarity.
What is pairwise alignment?
Comparing two sequences at a time.
In a database search, a query sequence is compared pairwise to every sequence in the database to find the best match.
What is global alignment?
An attempt to match every residue in two sequences.
When is global alignment most useful?
When the sequences are similar and of roughly equal length.
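Global alignment is commonly computed with dynamic programming (the Needleman-Wunsch approach). The sketch below returns only the optimal score, and the scoring values (match +1, mismatch -1, gap -1) are assumptions chosen for illustration rather than values from the lecture.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Best end-to-end alignment score of a and b (linear gap penalty)."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against nothing costs one gap per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
    # Every residue of both sequences is accounted for in this cell.
    return score[-1][-1]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACT"))
```

Because the score is read from the final cell, every residue in both sequences must be matched, mismatched, or placed against a gap, which is why large length differences are penalised heavily.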
When is local alignment more useful?
For dissimilar sequences that are suspected to contain regions of similarity or similar sequence motifs within their larger sequence context.
What is local alignment?
Trying to find the best-matching regions within two sequences, rather than aligning them end to end.
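Local alignment can be sketched with the Smith-Waterman variant of dynamic programming, where scores are never allowed to fall below zero and the best cell anywhere in the table is reported. The scoring values (match +2, mismatch -1, gap -1) are illustrative assumptions, and only the score is returned.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-1):
    """Best score of any pair of subsequences of a and b."""
    best = 0
    prev = [0] * (len(b) + 1)  # previous row of the score table
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                   # start a fresh local alignment here
                prev[j - 1] + pair,  # extend diagonally
                prev[j] + gap,       # gap in b
                curr[j - 1] + gap,   # gap in a
            )
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # The shared GATTACA motif is found despite unrelated flanks.
    print(local_alignment_score("TTTTGATTACATTTT", "CCCGATTACACCC"))
```

The zero floor is what lets dissimilar flanking sequence be ignored, so a conserved motif still scores well inside otherwise unrelated sequences.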
What alignment method does BLAST use?
Local alignment
What has BLAST been designed for?
Speed
What are the following BLAST programmes for?
a) blastn
b) blastp
c) blastx
d) tblastn
e) tblastx
a) blastn = nucleotide query vs. nucleotide database
b) blastp = protein query vs. protein database
c) blastx = translated nucleotide query vs. protein database
d) tblastn = protein query vs. translated nucleotide database
e) tblastx = translated nucleotide query vs. translated nucleotide database
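The query and database combinations above can be summarised as a small lookup. This is a memory aid only: `choose_blast_program` and its arguments are hypothetical names and are not part of the BLAST software.

```python
# (query type, database type, compared at the translated protein level?)
_PROGRAMS = {
    ("nucleotide", "nucleotide", False): "blastn",
    ("protein", "protein", False): "blastp",
    ("nucleotide", "protein", True): "blastx",
    ("protein", "nucleotide", True): "tblastn",
    ("nucleotide", "nucleotide", True): "tblastx",
}


def choose_blast_program(query, database, translated=False):
    """Return the BLAST program name for a query/database combination."""
    # Comparing nucleotide with protein always requires translating
    # the nucleotide side in all six reading frames.
    if query != database:
        translated = True
    try:
        return _PROGRAMS[(query, database, translated)]
    except KeyError:
        raise ValueError(
            f"no BLAST program for {query} query vs. {database} database"
        ) from None


if __name__ == "__main__":
    print(choose_blast_program("nucleotide", "protein"))
```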