Bioinformatics 3 Flashcards
How do Hidden Markov models expand on scoring matrices (BLOSUM) and Profiles (PSSM)?
The include Markov chains - amino acid at one position can influence what will come next. (this allows the probability of the next AA to be calculated based on the previous one).
It also introduces gaps.
How do you create a HMM?
Build a MSA of homologous sequences (using e.g. ClustalO).
There is a probability at each position for a match, mismatch, deletion or insertion.
The HMM is then built by traversing the alignment and calculating the probability for each possible transition between alignment positions.
Each transition possibility has a probability score.
the overall score is calculated by multiplying transition scores together.
Overall score is then converted to E-value.
What program can be used to create and/or search HMM databases?
HMMER.
Which methods can be used for experimental structure prediction?
X-ray crystallography
NMR
Cryo-EM.
What is the aim of secondary structure prediction?
to identify local structure - alpha helix, beta sheet and random coil.
What is the name given to 3 state prediction (alpha, beta, coil)?
Q3
Why is it possible to predict structure from sequence?
Because the theory is that to a large extent the local sequence determines the local structure.
Name a secondary structure prediction program?
Jpred
How is the accuracy (Q3) of secondary structure prediction calculated?
Accuracy = (no. of residues correctly predicted)/(total no. of residues)
What are the parameters of Q3?
Q3 is given a value between 0 and 1.
What does Q3=1 indicate?
A perfect prediction.
What does the Q3 result of a random prediction depend on?
The percentage of the different states.
e.g. Equal amounts in each state Q3 =33%.
Why is Q3=1 unrealistic?
Because secondary structure assignment in protein structure is uncertain up to about 10% - so perfect Q3 =0.9
What is the similarity between Jpred software and SNP prediction software?
Both of their algorithms are trained.
Jpred - Trained on sequences with known structures.
SNP - trained on sequences with known SNPs.
What does the Jpred algorithm use to predict secondary structure?
Uses PSI-BLAST, MSA and HMM