Protein Function Flashcards
What is protein function?
The specific role or task that a protein performs within a cell or organism.
How is the functional aspect of a protein investigated?
Gene Ontology (GO) - a standardized machine-readable vocabulary
Allows researchers to know which aspect of function is being investigated and to annotate their findings in a computationally processed format.
What are the three ontology trees described by GO that describe the different aspects of protein function?
- Molecular function
- Biological process
- Cellular location
How does the standard vocabulary provided by GO help with functional predictions of proteins?
- Allows annotations to be computationally processed
- Provides a standard approach for programs to output their functional predictions.
Makes it easier to compare and analyze the functions
What is homology-based transfer?
- Computatuional funciton-prediction method
- Assigning unannotated proteins with the function of their annotated homologs
Why is homology-based transfer commonly used for protein function prediction?
Based on the assumption that two sequences with a high degree of similarity most likely evolved from a common ancestor and therefore must have similar functions.
This approach is based on the relationship between proteins that are homologous.
What is BlastP and how does it work?
- A program used for comparing an unknown protein sequence against a sequence database.
- It is the fastest and most widely used heuristic tool for pairwise protein sequence comparison.
Basic Local Allignment Search Tool
What is the difference between FASTA and BLAST and when would you use each one?
Two commonly used programs for comparing an unknown sequence against a sequence database.
FASTA
* Better at DNA searches
* Sometimes misses weak protein matches
BLAST
* Better at protein searches.
* The fastest and most widely used heuristic tool for pairwise protein sequence comparison.
What is a graphic display in BLAST and what does the horizontal axis represent?
Provides an overview of the alignments.
The horizontal axis corresponds to the query sequence.
What do the color codes in a BLAST graphic display indicate?
Indicates the quality of the match.
* Red = good
* Green = acceptable
* Black = bad
What information is included in a Hit List?
Provides:
* The sequence accession number and name,
* Description of potential function based on annotation
* Bit score
* E-value
What does the bit score measure in BLAST search results?
Measures the statistical significance of the alignment.
High bit score = good match
match between the query sequence and database sequence.
What is the E-value in BLAST and what does a low E value indicate?
- The expected number of chance alignments.
(Likelihood that the similarity occured by chance) - The lower the more significant the hit
- E-value < 10-4 = good match.
What information is provided in the Alignments section of BLAST search results?
- % identity
- Positives
- Gaps
- Length of the alignment.
Top line is the query sequence
Bottom line is the subject sequence
The numbers to the right of the sequences indicate the coordinates of the match on the query and subject sequence.
What does the % identity in BLAST represent?
The number of identical residues divided by the number of matched residues, ignoring gaps.
What does the positives field measure in BLAST search results?
Gives a measure of the fraction of residues that are either identical or similar, represented by a +.
What does the gaps field show in BLAST search results?
Shows residues that were not aligned.
What are low-complexity segments in BLAST search results?
Regions containing many identical residues
masked by BLAST in the query sequence with the letter “X”.
What is Identity in sequence alignment?
Identity is a measure made on an alignment
E.g. Sequence A can be “32% identical to” Sequence B.
What is similarity in sequence alignment?
A measure of how close two amino acids are to being identical.
E.g. isoleucine and leucine are similar.
What does homology imply in sequence alignment?
Implies an evolutionary relationship between sequences
Either exists or does not exist.
For example, Sequence A IS or IS NOT homologous to Sequence B,
What is the difference between Homology and Similarity in sequence alignment?
- Homologous implies they share an evolutionary relationship and have a similar 3D structure and function.
- Similar merely implies that their sequences are similar.