evolution of protein function Flashcards
1
Q
sequence similarity networks
A
- use of phylogenetic relationships to interpret function
- summarise relationships between proteins
- blast info creates dot for each protein
- link dots if score greater than threshold
- dot colour indicates function
2
Q
protein families
A
- family expansion and subfamily identification possible in large phylogenies
- some residues strongly associated with 1 binding site in a family and not another
- M13 peptidases
- expose different family members to different ligands
- classify subfamilies by ligand interactions and specificity-determining residues
3
Q
M13 peptidases
A
- 2 binding sites
- S1’ and S2’
- require different residues
- analyse residue characteristics
- S1’ hydrophobic
- S2’ polar/charged
- use to classify subfamily
4
Q
specificity profiles
A
- show specificity at each position of a binding domain
- e.g. PDZ domain
- can study evolution of function
- higher column = increased binding
- small column = little binding
- no letter = no binding at that position
- single letter indicates only that letter present and binding
5
Q
profile clusters
A
- cluster based on similarity
- create objective classification system
- not the same as phylogeny
- corresponds well to binding classes
- e.g. PDZ domains - highly conserved specificity profiles
6
Q
ortholog vs paralog
A
- ortholog = related via speciation
- similar function, especially if an essential function
- strong selection pressure to retain function
- paralog = related via gene duplication
7
Q
paralogs
A
- indicate a gene duplication becomin fixed in a populaiton
- indicates selection pressure to retain both paralogs
- assuming a cost to maintain gene in the genome
- cna mutate and adopt novel functions
- multiple copies of essential gene
8
Q
glucose transporter genes
A
- s. cerevisiae
- artifical selection pressure from humans
- duplication of glucose transporter genes
- c. albicans
- exploits host glucose
- many paralogs
- some transporters similar to humans
- immune evasion
9
Q
fate of paralogs
A
- maintain original function
- increase dosage
- subfunctionalisation
- more specialised function
- division of ancestral function
- neofunctionalisation
- evolve a new function
10
Q
tree reconciliation
A
- labelling internal nodes of trees as duplication or speciation
- need to understand context
- can’t assign by looking at node on its own
- important for developing correct phylogeny
11
Q
tree reconciliation
parsimony
A
- maximally parsimonious
- finds a solution that minimises the number of duplications and losses by node labelling
12
Q
bootstrap support values
A
- at each node
- indication of degree of confidence in separation of parts of a tree
- some branches more reliable than others
- investigate weaker branches
- collapse and rearrange to find a better solution with smaller cost
13
Q
cost of a tree
A
- involves duplication and losses
- genes easily lsot from genomes
- smaller cost
- assuming cost of keeping gene in genome:
- duplication is costly
14
Q
ancestral states
A
- can label gene tree nodes with names of corresponding ancestral species using reconciled phylogenies
- can infer attributes of extinct ancestors
- infer most likely ancestral protein sequence using maximum likelihood
- synthesise protein e.g. rhodopsin and observe properties