Part 2 - Lecture 2 - Local Pairwise Sequence Alignment Flashcards
Why local alignment?
we often do not know which parts of two sequences we should align
Why might parts of interesting sequences be more important to certain functions?
- genic vs intergenic parts of a traditional gene locus
- exons vs introns within a gene - exons are what remains after splicing since some exons are noncoding
- proteins have disordered parts vs domains with secondary structures
Is there such a thing as junk DNA?
tough to claim any part of genome has no function so NO
If we have two sequences which are i.i.d., what amount can we expect for the two sequences to match?
25%
What is the local alignment?
the best glocal alignment taken from all substrings
What is the smith-waterman recurrence?
instead of putting a negative penalty value put a zero
-this makes all base cases start at zero
In organizing mismatch scores why do we penalize transitions less than transversions?
cause purine to purine and pyrimidine to pyrimidine is more common(transition) than purine to pyrimidine or pyrimidine to purine (transversion)
How does the Smith-waterman algorithm depend on sensible scoring?
- the zero point is critical
-positive scoring alignments must be good
-negative scoring alignments assumed not useful
In local alignment, what is added?
- a zero to appear for a rest which changes the base cases
- start the traceback at the highest value
- stop the traceback once you hit zero