lecture 14 - bioinformatics: homologous evolution Flashcards
how do we use bioinformatics info? (4)
(1) for determining 3D structures
(2) can’t predict 3D structure from sequence - need additional info, but can predict similar structures
(3) can predict tertiary structure quite well
(4) to analyze evolutionary relationships
define homologous
- 2 sequences that are derived from a common ancestor
- commonly >25% sequence identity in aa sequence
what do homologous sequences imply?
an evolutionary relationship
sequences can be either
homologous or not
how do we describe sequence similarity for homologous sequences?
as a % identity
define paralogous
homologous within one species
define orthologous
homologous within different species
mechanism for paralogous genes
- gene encoding a specific enzyme undergoes a mutation
- now have enzyme 1 and enzyme 2 that have evolved to have different, often related, functions
example of paralogous proteins
trypsin and chymotrypsin
mechanism for orthologous genes
- dna encoding a specific protein in a species undergoes speciation
- now have enzyme w identical or similar functions in two species (e.g. mouse and rat)
how do homologues arise? what is the result?
- divergent evolution
- resulting in similar structure and function
define convergent evolution
common or similar function but unrelated sequences
example question:
calpain-1 and calpain-2 = 65% identity in human genome
calpain-2 in rat, mouse, cow, sheep and humans = 93%-98% = identical
what does this imply?
this implies calpain-1/calpain-2 diverged before those species appeared
comparing homologues with similar function and discvoering conserved residues tells us what?
implies those residues are important for function
what are 2 possible reasons conserved residues are important for function?
(1) they play a direct role in function (e.g. enzyme catalysis or binding ligand)
(2) they play an indirect role needed to maintain structure of the protein (function depends on structure)
can some residues be substituted conservatively? if so which ones?
yes
- Leu –> Ile
- Asp –> Glu-
- Arg –> lys+
where are less conserved regions found? why are they found here?
- loops
- link the “important” parts together