Sequence alignment Flashcards
Explain global alignment
Global alignment aims to maximize the alignment quality over the whole sequences. Gaps are typically penalized.
Explain local alignment
Local alignment tries to match sub-regions of the sequences. Gaps typically not penalized.
List the four possible components of a scoring function
- Match
- Mismatch
- Gap
- Gap extension
What can you do with a substitution matrix?
Substitution matrix can be used to score sequence alignments. It holds all the possible values for a scoring function: points given for each possible match, mismatch, gap and gap extension.
Describe the traceback procedure in Smith-Waterman algorithm
- Start with the highest value in the matrix.*
- Trace back until a zero is found
*To break ties, pick the one that yields the longest local alignment.
Explain the affine gap penalty scheme
In the affine gap penalty scheme, there is a different penalty for opening a gap and extending a gap.
How would you check that a high alignment score you just calculated for two sequences did not happen by chance?
- Create randomized versions of sequence 1.
- Align sequence two against the randomized versions.
- The fraction of randomized alignment scores that are bigger than the original gives the p-value.