Introduction to sequence analysis Flashcards
What is a global sequence alignment?
- sequence comparison along the entire length of the two sequences being aligned
- best for high-similar sequences of similar length
- as the degree of sequence similarity declines, global alignment methods tend to miss important biological relationships
Character based methods for building a phyologenic tree
- ML (maximum likelyhood)
- MP (maximum parsimony)
Define the terms homology and homologs
Homlogoly - The presence of a similar feature because of descent from common ancestor (defines evolutionary relationships)
Homologs - Genes either are or are not homologous (not measured in degrees)
3 widely used MSA programs
- Claustal-W
- T-COFFEE
- MAFFT
Why do we perform sequence analysis?
- discover function
- study evolution
- find crucial features
- identify cause of disease
What is a speciation event?
Speciation is a lineage-splitting event that produces two or more separate species
What does the p-value of an alignment mean?
It tells us about the probability that we get an alignment with this score by chance. Should be close to zero
What is a taxon?
A set or group of organisms, most often species, at the end of a branch
What is taken into consideration when scoring two aligned sequences?
- The kind of AA
- the chemical properties of the AAs
Why do we do multiple alignments?
- to identify conserved regions, patterns, and domains
- to identify new members of protein families
- to predict structure and function of new protein sequences
- as a preliminary step in molecular evolution analysis using phylogenetic methods for constructing phylogenetic trees
What is a clade?
A group of organisms that includes an ancestor and all descendants of that ancestor, irrespective of how closely they may or may not resemble one another
What kind of alignment does BLAST perform?
A local sequence alignment
Distance based methods for building a phylogenic tree
- UPGMA (unweighted pair group method with arithmetic mean)
- NJ (neighbor joining)
I want to compare sequences of different lenghts, which alignment should I use?
Local sequence alignment
My two sequences are really similar and also have about the same length. Which alignment should I use?
Global sequence alignment