Shane - Lecture 1 Flashcards
What can you infer if two sequences are similar?
They probably have the same ancestor, share the same structure and have a similar biological function
What qualifies as a homologue?
(2)
An amino acid sequence that is more than 100 amino acids long/nucleotides long
Where there is 25% identical amino acids or 70% identical nucleotides
What is the twilight zone?
(2)
Protein sequences with between 0 and 20% identical amino acids
It is not significant -> this could have arisen by chance
What is E-value?
Expectation value
What does BLAST do?
Searches a database of your choice for sequences that have homology or a shared ancestry with the sequence you have entered
What does an E-value quantify?
The chance of the match happening by chance
What does an E-value close to zero mean?
It is very unlikely that the similarity arose due to chance
What does an E-value close to 1 mean?
It is very likely that the similarity arose due to chance
Write a note on BLAST
(4)
Quick heuristic alignment algorithm
Found on the national centre for biotechnology information (NCBI)
Matches DNA sequences to other DNA sequences
Uses either a gene sequence or a protein sequence
How does BLAST work?
(4)
It tries to find a small match first then expands on this match
It divides the sequence up into shorter parts e.g. 11 nucleotides and tries to match them (heuristic)
There might be some mismatches when the match extends
BLAST will keep extending the match until mismatches become too significant
What does BLAST stand for?
Basic
Local
Alignment
Search
Tool
List some uses of BLAST
To identify an unknown sequence by trying to match it to something known
Get clues about the function/structure of a protein by finding similar proteins
Map a sequence in a genome
What are the two types of blast searches?
Nucleotide BLAST
Protein BLAST
Which type of BLAST is better to use?
Protein BLASTS are more sensitive and biologically significant
How do you decide what type of BLAST to use?
(2)
Do you have a nucleotide sequence or a peptide sequence
Do you want a close match or something identical
What tool can you use to translate a nucleotide sequence into a peptide sequence?
ExPASy Translate tool
Why is it recommended to use a protein search instead of a gene search?
Gene sequences might be identical but have different functions