Docking and virtual screening Flashcards
Pharmacophore complementary to HIV protease active site
Active site has a backbone carbonyl oxygen (Gly27), carboxyl oxygen (Asp25), backbone amide NH and alkyl R side chain (Ile50)
Therefore the complementary pharmacophore would have a H-bond donor to interact with each of the oxygens, a H-bond acceptor to interact with the NH and a hydrophobic group to interact with R
Then all appropriately positioned to fill the available space
Methods for identifying suitable drug candidates
- Database screening
2. De novo design
Database screening
When known chemicals from a computational database are docked into the receptor target site in order to find a best fit
A scoring function for docked structures is used to rank the compounds and identify potential hit candidates
De novo design
Involves generating novel compounds computationally and docking them into the receptor target site in order to find a best fit
Molecular docking
Refers to a prediction of the binding mode of a compound to the active site
Also predicts the strength of association
What must be done before virtual screening/docking can be used?
Need to assign the charge, tautomeric state and conformation of the ligand (will also need to be aware that these may be different in the protein environment)
Need to assign protonation state of protein
Also need to consider the flexibility of both the ligand and protein - although simple methods often neglect protein flexibility because it is time consuming
Steps involved in the first stage of a typical docking process
- Because ligands often change their conformation when binding to proteins, many programs take the torsional flexibility of the ligand into account by allowing specific bonds to be able to rotate. Alternatively, the program may perform rigid docking of a predetermined set of ligand conformations
- Many conformations of the ligand-protein complex are generated for each molecule being considered - each generated ‘snapshot’ of the ligand-protein complex is referred to as a pose
- Each pose is assessed using a scoring function
- The orientations that score well are kept and the remaining low-scoring poses are disregarded
Why can some of the large number of potential poses be immediately rejected?
Due to clashes with the protein
Ideal scoring function
Would take the binding mode produced from molecular docking and give an accurate estimate of the free energy of binding (DeltaGbind)
However, in practical applications of virtual screening, a crude scoring function is used (such as molecular mechanics functions or empirical scoring functions)
Empirical scoring functions
Developed to reproduce experimental binding affinity data
Based on relatively simple descriptions of intermolecular interactions
The scores can be computed quickly to rank large libraries of diverse compounds that have been docked into the active site of the target protein
Example of an empirical scoring function
AScore
AScore predicts DeltaGbind using terms that describe:
- Van der Waals interactions: non-covalent interactions described by similar equations to those used in molecular mechanics but with summations over ligand-protein and ligand-ligand atom pairs
- Hydrophobic effects: summation over hydrophobic ligand-protein atom pairs
- Hydrogen bonding: H-bonding is perhaps the most important factor for the specific binding of a ligand to its receptor. The interaction occurs when 2 atoms are in close enough proximity to form a donor-acceptor pair. The geometry of the H-bond (D-H—A) is typically described by the bond angle and the bond length (i.e. distance between H and acceptor A)
- Deformation: upon binding, both the ligand and protein are constrained in conformation compared to their free states in solution. This leads to adverse entropic changes, which is a negative effect that must be overcome during the binding process
The parameters can then be modified to improve the correlations between the predicted and experimental DeltaGbind values for a test set of related protein-ligand structures
Steps involved in the second stage of a typical docking process
- Different poses, belonging to either the same or different ligands, are ordered according to their computed score
- The energy of the top scoring poses are recalculated using more accurate, computationally more intensive scoring functions
SkelGen
A sophisticated program used for de novo drug design
Has a library of 1678 fragments generated by fragmentation of known synthetically accessible structures
A ligand is initially created by linking randomly chosen fragments from the fragment library and placing the ligand at a random position in the binding site
The structure evolves and is modified in a stochastic process - the impact of each modification is determined by feedback from docking and scoring
Run multiple times to produce a series of chemical scaffolds designed to fit into the active site
Advantage of SkelGen
Provides access to around 1 trillion low molecular weight, drug-like molecules, compared to only a few million structures in a typical chemical library