Molecular modelling Flashcards

1
Q

Molecular modelling - definition

Use of a ………………………….. to study ………….., …………., …………….. and ……………. of molecules.

A

Use of a theoretical model to study structure, energy, dynamics and reactivity of molecules.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Molecular modelling - types (examples)

  • …………………….. (………………..)
  • …………………… (…………, ………….., …………….)
  • …………………….. (……………………………………………).
A
  • Visual/physical (stick model)
  • Mathematical (physics, kinetics, reactivity)
  • Computational (as mathematical but larger scale)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Molecular modelling - uses

  • ……………………………………
  • …………………………………
  • ………………………………………………………….
  • ………………….
A
  • Structural determination
  • Structure visualisation
  • Calculating forces between molecules
  • Experimental
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Potential energy surface

A

U - energy associated with a configuration of a mechanical system which can be converted to work

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Potential energy and molecule shape

Most likely shape of a molecule has …………………………………………., ie. exists at …………………… …………… of plot of ………………………………………………

A

Most likely shape of a molecule has the lowest potential energy, ie. exists at a global energy minimum of plot of potential energy vs geometry.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Potential energy functions - most detailed ⇒ least detailed

A

Required to calculate PES.

Ab initio QM, QM; semi-empirical QM; molecular mechanics/force fields; scoring functions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Ab initio QM - basis set

A

Fitted functions correlated to orbitals for use in the Schrödinger equation.

Neglects ee interaction (assumes e moves under average influence of all other e)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ab initio QM - Limitations

A
  • Takes a long time
  • Calculation complexity scales M4 for M AOs
  • Supercomputers can manage systems ∼1-2k atoms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ab initio QM - Advantages

A
  • Few approximations/parameters
  • Accuracy improves systematically
  • Can be applied to reactions as well as structure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Ab initio QM - Methods

(most accurate ⇒ least)

A

With electron correlation: Configuration interaction; CASSCF; DFT

With no electron correlation: Unrestricted Hartree Fock; Hartree Fock

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Semi empirical QM - basis set

A

Similar to AB but uses modified Hamiltonian operator, neglecting some interactions and replacing others with parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Semi empirical QM - Limitations

A
  • Accuracy cannot systematically improve
  • Some cases may need correction (common for peptide bonds)
  • Reactions can be studied but may lack accuracy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Semi empirical QM - Advantages

A
  • Includes some electron correlation
  • Much faster than AB
  • Systems can be 2-3k atoms
  • Using distributed processing systems (eg. fold@home), large structures >10k atoms like proteins can be modelled
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Semi empirical QM - Methods

(most accurate ⇒ least)

A
  • CNDO (1965)
  • MNDO (1977)
  • PM3 (1989)
  • SAM1 (1993)
  • PM6 (2007).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Molecular mechanics (“force field”) - basis set

A

Treats nuclei and electrons as atoms, connected by compressable springs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Molecular mechanics (“force field”) - Bonded terms

A
  • Applies up to 3 covalent bonds apart
  • bond length (d), angle stretching (θ)
  • Rotations about single bonds/dihedral angle (ϕ)
  • Ubond (d) + Uangle (θ) + Udihedral (ϕ)
17
Q

Molecular mechanics (“force field”) - Nonbonded interactions

A
  • Operates over a longer range, rab
  • Electrostatic energy (r-1) – uses Coulomb’s Law. Treats charges as fitted constants (qa/b)
  • Uelec (rab) = atomsa,b qaqb / rab
  • Van der Waals – close-range attraction (r-6) & repulsion (r12).
  • Uvdw (rab) = atomsa,b (A / rab12 − C / rab6)
18
Q

Molecular mechanics (“force field”) - Total potential energy

A

Utotal (d, θ, ϕ, rab) = Ubond (d) + Uangle (θ) + Udihedral (ϕ) + Uelec (rab) + Uvdw (rab)

19
Q

Molecular mechanics (“force field”) - Approximations/parameters

A
  • MM treats as atoms, not independent electrons/nuclei
  • Inputs: d, θ, ϕ, rab
  • Paramaters: A/C/q. Accuracy of calculation depends on accuracy of paramaters
20
Q

Molecular mechanics (“force field”) - Limitations

A
  • Needs new parameters for each molecule ID
  • Cannot treat chemical reactions
21
Q

Molecular mechanics (“force field”) - Advantages

A
  • Can treat v large systems (up to 1m)
  • Can be accurate if used correctly (ie. accurate paramaters)
22
Q

Scoring functions - types

A
  • First Principles
  • Empirical
  • Knowledge-based
23
Q

Scoring: first principles

A

Physics based approach, considering molecular mechanics force field (VdW, electrostatic effects).

Utotal (rab) = atomsa,b qaqb / rab + atomsa,b (A / rab12 − C / rab6)

24
Q

Scoring: first principles - Limitations

A
  • Can be slow to calculate
  • Depends on reliable paramaters
  • Force fields provide potential energy, not free energy
25
Q

Scoring: empirical

A

Computes binding free energy ΔG

C are constants, f are functions of distance ΔR & angle Δα between interacting groups on protein & ligand.

26
Q

Scoring: empirical - Parametrising functions

A
  • Training set: LR complexes with crystal structures and experimentally measured binding affinities ΔGbind
  • Structures from training set are used to predict ΔGbind using a scoring function and constants C are modified until this matches experimental values
  • Performance can be tested using other sets of known data (test sets)
27
Q

Scoring: empirical - Advantages

A
  • Fast
  • Calculates free energy
28
Q

Scoring: knowledge-based potentials

A
  • Generates a free energy function ΔGbind for each X-Y pair of protein-ligand atom IDs X/Y
  • Extracted from set of known protein-ligand X-ray structures from PDB
  • Assumes a Boltzmann distribution of protein atoms around ligands
  • Averaged over all structures in database, eg. N,O pairs can be expected to be distance r apart:
  • ΔN-OGbind (rN-O) = - RT ln g(rN-O) where g = distribution/variance in database
  • Total ΔGbind is sum of all pairs of ΔGbind energies
29
Q

Scoring: knowledge-based - Limitations

A
  • Boltzmann statistics for distribution of atom pairs not realistic
  • Difficult to define reference state
30
Q

Scoring: knowledge-based - Advantages

A
  • Fast
  • Calculates free energy
  • More general than empirical approaches
31
Q

Computational docking

A
  • Predicts how ligand L binds to a given 3D structure of receptor R.
  • Geometry of bound ligand = binding mode/docked pose.
32
Q

Computational docking - conformational search methods

A
  • Systematic/grid based
  • Genetic algorithm
  • Graph theory
  • Extant knowledge can provide a starting point/bias predictions.
33
Q

Computational docking - search process

A
  1. Characterise active site (where, shape etc)
  2. Conformational search of ligand in active site (= one generation)
  3. Score ligand poses using scoring functions
34
Q

Computational docking - genetic algorithm

A
  1. Conformation data stored on “chromosome” (eg. side-chain dihedral angles, coordinates in active site)
  2. Initial population ranked using scoring function. Lowest energy conformations retained, highest removed.
  3. Conformations not removed share information by cross-over (AA/BB → AB/BA swap) or mutation (random data point change).
  4. New conformations scored.

(eg. pop = 7. 1,2 retained. 6,7 removed. 3-5 share information with 1,2 producing new 3-7. Pop = old 1,2, new 3-7.)

Conformations will get progressively lower in energy and so more accurate to experimental data.

35
Q

Computational docking - testing docking

A
  1. RL complex with experimentally known 3D structure selected from database
  2. L removed from R and redocked
  3. Root-mean-square-distance determines if geometry reproduced:

rmsd = √ [(Natomsi=1 di2)/Natoms]

  • where di = distance between equivalent atoms i between superimposed structures
  • rmsd < 2Å acceptable for small ligands
36
Q

Computational docking - limitations

A
  • Proteins are flexible and often induce fit upon binding - cannot be accounted for
  • Water in/around binding site may change entropy/enthalpy values