Lecture 3 DA Flashcards
What is the purpose of an alignment (4)?
- Find homologues
- See if a homologue has an associated protein structure
- Determine function
- Determine evolutionary relationships
What is the purpose of multiple sequence alignments (2)?
- Elucidate functional informarion within protein sequences
- Perform evolutionary analysis
In a pairwise alignment, what does positioning of two amino acids at the same point imply? What is the best way to determine the implication, and can this always be done?
They perform the same role in homologous proteins.
Can be determined by performing structural alignment, where the amino acids are aligned in 3D. May not be possible if no 3D structure is available.
What can be done to increase the accuracy of pairwise alignments? What does this reveal? What alignment to perform is usually worthwhile?
Adding more sequences to an alignment. Can reveal patterns that aren’t obvious in a pairwise alignment. Worthwhile to perform MSA.
Describe how an MSA is done (6).
- Find sequence of interest (ie. BLAST).
- Prune if necessary.
- Run multiple alignment algorithm.
- Inspect output.
- Remove disruptive sequences, and repeat.
- Identify key conserved amino acids.
How is scoring done in MSA (3)?
- Alignment is arranged so a maximum number of characters in each sequence is matched.
- Scores are accorded to the sum of pairs.
- Each column is scored by summing all possible matches, mismatches, and gaps.
What are some disadvantages of MSA (2)?
- MSA is computationally expensive.
- Difficult for 4 sequences, more than 20 is impossible.
How does the CLUSTAL algorithm work (4)?
- Begin by pairwise alignment.
- Build a phylogenetic guide tree.
- Take most closely related sequences and align them, forming a consensus.
- Repeat with the next most closely related sequence.
What is an advantage and disadvantage of the CLUSTAL algorithm?
Advantage
- Results in near optimal alignment.
Disadvantage
- If an early error is made, it is preserved.
What is a major problem with the CLUSTAL algorithm?
Selection of an appropriate matrix for alignments consists of divergent and closely related sequences.
From an MSA, what do highly conserved residues suggest?
Correspondence to an active site.
From an MSA, where are insertions and deletions often found?
In surface loops.
From an MSA, what do conserved patterns of hydrophobicity with a spacing of 2 indicate? What about 4?
B-sheet.
4 indicates a-helix.
What is the terminal node?
End point.
What is an internal node?
Hypothetical ancestor.