Key notes Flashcards
What type of systems cannot be simulated usind MD? Give examples of biological systems we can simulate and how the free energy there is related to the processes.
- In MD, classical forcefields can’t be used for reactions involving bond breaking/making.
- Protein folding and ligand binding are ideal applications for this
- ΔE and ΔS define whether a protein fold is favourable enough to form and how strongly a ligand binds to a protein.
Discuss changes in free enrgy, binding and structure as well as solvent reorganisation in a ligand binding system
- Binding of the ligand will cause a change in enthalpy/internal energy as a result of intermolecular interactions (e.g. electrostatic interaction associated with vdW)
- Loss of conformational freedom in binding site causes a decease in entropy, that counterbalances an increase due to water around free ligand having more freedom.
- The total free energy is a net effect of all these different changes
Experimentally we can find the free energy of a system using ΔA = -RTlnKbind where Kbind is the ratio of the time a ligand is bound compared to unbound. Why does this approach not work for simulation and what is an alternative approach?
- To sufficiently sample a system in both states, and directly calculate the free energy would involve simulation times that are too unrealistic.
- Instead relate the free energy to a microscopic description of the system through statistical mechanics
- A = -kbTlnQNVT, (QNVT is the canonical partition function)
What are some problems with using a partition function approach to calculate free energy directly? What is an alternative approach
- Sampling entire phase space and integrating QNVT directly is impractical
- Alternatively, could take an ensemble average, and give A in terms of potential energy, U. (A ∝ e-U(r))
- However low energy samples would contribute very little average (?) and high energy samples take a long time to reach.
- This leads us to not being able to calculate A directly.
- QNVT is a function of the …, which depends on the sum of … (…) and … (…) energy as a function of … and …
- … can be solved analytically, leaving … as an excess which can be calculated in a simulation at each step giving overall …
- QNVT is a function of the Hamiltonian, which depends on the sum of kinetic (K) and potential (U) energy as a function of position and momenta.
- K can be solved analytically, leaving U as an excess which can be calculated in a simulation at each step giving overall ΔU.
How is our method of calculating ΔA as a function of ΔU implemented?
- Carry out simulation of state 0, calculating PE at each step (U0)
- At each step also, take configuration (snapshot of trajectory) and apply PE function corresponding to state 1 to calculate U1, resulting in ΔU.
- This is known as thermodynamic perturbation theory
How might we compute the difference in hydration free energy between two ions?
- Define forcefield parameters for Lithium in a box of water, as well as for rubidium in water.
- Water terms are identical, Li and Rb will have different LJ/coulombic terms
- Run MD simulation of system in state 0 (Li(aq)+), calculating U0 via PE eqn and terms defined in FF.
- Simultaneously, take same configuration and calculate PE of system in state 1 (Rb(aq)+), U1
- Same coordinates, only difference is parameters used to calculate U1 and therefore ΔU at that step.
- At end of simulation take average of ΔU and use to find ΔA.
What is the problem ignored so far in free energy calculation using thermodynamic perturbation theory with no windows? relate your answer to the free energy of the system
- If state 0 and 1 are very different (state 0 has a low probability of being in state 1) then ΔU is large
- In the case of Li/Rb this would occur due to unfavourable interactions of Rb overlapping the water molecules closer to Li, causing a high energy LJP term
- A large ΔU results in the large exponential term becoming negligibly small, giving low weight in ensemble average
- This causes ΔA to converge slowly, meaning our errors in our finite simulation will be large.
What is an example of the solution to energy perturbation theory?
- The free energy change in mutating AA glycine to Alanine might be an important system to study for active site manipulation.
- As λ increases from 0 to 1 we switch off interaction of glycine and turn on interaction of alanine.
- This shows the power of simulations as this is impossible experimentally
- A technical issue with this is that as we switch LJ interaction changes as charges are switched on/off gradually, causing atoms to shift to unfavourable locations.
What is the solution to our free energy perturbation theory problem?
- Break down calculation into windows where there is good overlap between states and ΔU is small.
- This is done through a coupling parameter, λ, which gradually increases from 0 to 1 through multiple simulations (equation), then sum the free energy changes outputed
- There is an increase in cost for these additional simulations
- In Li/Rb case first window would be change from Li to 10% Rb etc
What is biased sampling?
- Where a biasing potential is used that can be used to force the system to explore unfavourable configurations, leading to enhanced sampling of phase space
- This means we are more likely to overcome kinetic barriers that trap us in local minima of our PES for our entire simulation time
What is a key condition of biased sampling?
Need to know something about pathways as a starting point as this is what we are defining
What are reaction coordinates and briefly give two examples? (also known as collective variables and order parameters)
- Characterise a process in terms of a small set of properties of a system that are a function of atomic coordinates.
- Also known as collective variables and order parameters
- Distance/separation, r
- Dihedral angle
- Give an example of how distance can be used as a reaction coordinate
- Potential mean force (PMF), which is the free energy along a chosen reaction coordinate, can be simulated using the distance between an Na+-Cl- ion pair in electrolyte solution.
- Give an example of how radius of gyration (Rgyr) can be used as a reaction coordinate. What must be assumed?
- Rgyr gives an indication of the expansion/contraction of a globular structure through the average of the distance each atom is from the centre of mass. More expanded = higher Rgyr
- The transformation of a β hairpin peptide to unfolded random coil state’s free energy landscape can be investigated
- Choosing appropriate set of reaction coordinates is difficult so must guess generally
Give an advantages and disadvantages of using multiple reaction coordinates
Pros
- A combination of variables allows important structures across high energy barriers to be sampled, giving a larger indication of the greater free energy landscape.
- If our single reaction coordinate output poorly maps experimental results, a second coordinate can be introduced to form a 2D plot that may give a different minimum energy pathway to before.
Cons
- However, in combination, outputs of these reaction coordinates can lead to many different structures which must all be considered
- Certain structures may even be resritcted via specific choice of a given set of coordinates
- Large computational cost
- Give an example of how root mean squared distance (RMSD) can be used as a reaction coordinate
- WHat must one be careful of?
- RMSD is the difference between atomic positions at time t and the starting positions of the simulation, t0.
- Can be averaged over all atoms of interest, e.g. carbons in a protein backbone chain
- Similarly, with Rgyr, must be careful with choice of reaction coordinate to pair with as may not be unique function of rN.