W5 Structure-based methods (CADD 2) Flashcards
What are the 4 main structure-based methods?
- Docking
- Grid-based methods
- Molecular Dynamics (MD)
- De novo
What is docking?
Given a target site, the software takes each given virtual compound →explores its possible conformations (docking poses) within the target site →identifies those ones with the best predicted binding→ scored and ranked according to a mathematical evaluation (scoring function) of the predicted free energy change upon binding (docking score)
What are Molecular Dynamics (MD)?
Mainly for hit optimisation; allows to “relax” the
system over time→ consider the system flexibility and including solvent (water) molecules
What is De Novo?
Either user or computer specifies a starting atom or chemical group (seed) in the active/target site: new atoms or fragments are then added randomly onto the seed, to “fill” the site
What are the different methods to determine the 3D structures of proteins (targets)? (4)
o X-ray crystallography
o NMR spectroscopy
o Cryo-EM
o Homology modelling
–> Enable techniques such as: site analysis, molecular docking (NB virtual screening!), molecular dynamics, de novo drug design
What are the steps in X-Ray crystallography?
Different steps are required for target 3D-structure determination:
1-obtain the protein (extract from cells or let bacteria to produce for you)
2-crystallise the protein (obtain the crystal of your protein in the lab)
3-diffraction by X-rays (use a X-ray machine)
4-mathematical solution of X-ray diffraction pattern using a computer and final 3D- structure
What are the limitations of X-Ray crystallography?
- requires milligram quantities of proteins (big amount!)
- gives the structure in the “solid state” (whilst most proteins are in solution!)
- requires few months to solve structure
- generally, less successful for receptors
Resolutions of X-Ray data: (for info)
> 3.5Å à cannot determine position of main chain (peptide backbone of the target) 2.5Å to 3.5Å à α-helices and β-sheets resolved, but the positions of loops is uncertain and solvent molecules are not seen
2.0Å to 2.5Å à orientations of amino acids side chains are resolved, and solvent molecules can be seen
<2.0Å à conformations of side chains and solvent molecules are resolved
What is NMR Spectroscopy?
- Uses chemical shifts, coupling constants and interatomic interactions to determine the structure of the protein
- Uses nuclear Overhauser effects (NOE) to calculate how close protons are in space
- Solves structures in solution: closer to a physiological situation!
What are the limitations of NMR Spectroscopy?
- requires milligram quantities of proteins (big amount!)
- only applicable to relatively small proteins
- takes a long time to solve the structure, as many NMR experiments have to be interpreted
What is Cryo-EM: cryogenic electron microscopy?
Freezes biomolecule samples into a glassy state and probes them with beams of electrons→ generate a 3D-representation.
Stringing thousands of these snapshots together into stop-action movies and virtual reality flythroughs, we can watch biology in action.
The Protein Data Bank (Brookhaven Protein Database)
What does it contain?
- Contains X-ray, NMR and Cryo-EM structures of proteins and other biomacromolecules (>150000 structures):
- In this archive, each catalogued structure is assigned to an identifying label (“PDB ID”),
What is Homology modelling?
When is it used?
- Determining the structure of a protein by X-ray crystallography or NMR spectroscopy : time-consuming→ relatively few structures have been solved
- Gene sequencing is very fast: many protein sequences have been established
- Homology modelling: in silico methodology that allows to predict the structure of a protein based on its gene sequence.
Used when: primary sequence of a protein (its amino acids) of unknown structure shows good similarity to the sequence of a homologous protein (proteins that
have evolved from a common ancestor) of known structure
=A 3D-model of the target protein is created based on existing structural data: possible to get reliable models as structure is more conserved than amino acid sequence
Generating a homology model is a multi-step process:
What are the 5 steps?
1-template selection
2-sequence alignment
3-building the model
4-optimisation
5-validation
Homology modelling: template selection
Homology modelling works best for similar classes of proteins. Using a template not homologous is possible but the resulting model might not be that accurate:
the closest homologue available should be chosen
Homology: evaluated in terms of percentage primary sequence identity between target and template:
=At least 25% sequence identity needed 50% identity recommended for a good model