Key notes final Flashcards

1
Q

What do computer simulations do and what do they allows?

A

Allow the study of properties of a many body system by creating a mathematical model (e.g a molecular mechanics forcefield) to describe the system behaviour which can be run to sample configurational space of that system (main families are MS and MC)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why bother using comupter simulations?

A
  • Gain better understanding of the system at a molecular level which allows better design of experiments and help interpret data.
  • Predict properties/behaviour from molecular structure.
  • Simulate unrealistic experiments that explore extreme conditions or toy models (e.g turn off charges to investigate hydrophobic driven reactions)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an MD simulation and what does it allow us to do?

A
  • Enable us to simulate the time evolution of a system of particles.
  • Can extract thermodynamic and kinetic information through simulation of particles at a position at the beginning and end of many steps.
  • Trajectories are generated through classical dynamics (ignore electronic effects) by solving newtons equations of motion.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are newtonian principles employed in MD simulations?

A
  • The force acting on an atom i due to all other particles in a system is linked to the potential experienced by atom I and its position
  • Using newtons second law (F=ma) we can also find the particles acceleration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why can MD trajectories not be solved analytically? What must be done instead

A
  • Cannot be solved analytically for a many-body system as force acting on each particle depends on the position of all other particles at that moment in time.
  • Instead F=ma is solved numerically using small time steps Δt (~fs) where the force is calculated at the end of each of these time steps then following on to the next one.
  • Most commonly done with the Verlet algorithm
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What defines the transferability of a force-field?

A
  • The ability to re-use fore-field parameters across a range of different functional groups.
  • More parameters may be accurate for a specific system but transfer poorly to another.
  • Forcefields have been paramaterised to reproduce properties of proteins, lipids e.g CHARMM, AMBER. Same functional form but differing parameters (different torsion angle for amino acids)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ergodicity is not always true in practice, why is this?

A
  • A barrier that is too high can cause system to be kinetically trapped in a certain region of phase space, leaving the global minimum unexplored.
  • If one is doing a grid-based lattice simulation getting stuck in local minima when starting on certain set of lattice coordinates can be a problem.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are periodic boundary conditions (PBC)?

A

Convenient way of representing bulk solution without having to simulate every molecule where molecules are placed in a simulation cell replicated in all three directions via minimum image convention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the time scale problem? What is a chemcial example that suffers from this?

A
  • Larger than that of length scale, avoiding getting stuck in local minima can use up most simulation time without sampling desired configurational space.
  • Can occur if waiting for a rare event to occur, but instead getting stuck behind a kinetic barrier.
  • No way to know if this has occurred as the PES is not available prior to the simulation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are edge effects and how are they overcome?

A
  • Many small boxes have many atoms, some of which are close to edge.
  • Results will be influenced by particles interacting with the edge of the system.
  • Edge effects are overcome by having molecules that pass through the cell walls appear on the other side.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Assuming MM suffices over QM, what is the length scale problem?

A
  • System must be a manageable number of particles; however systems are growing larger (millions of atoms).
  • One must choose the system carefully to get desired output.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Briefly discuss and compare three different force-fields, using the alcohol octanol as an example.

A
  • All-atom: An all atom/atomistic force-field treats each atom explicitly. At every step, the simulation must input all bonding/non-bonding interactions in a system. In long molecules like octanol, this can lead to many dihedral angles needing to be specified for all differing ~-OH chains. Very precise however, computationally costly.
  • United-atom: cheaper way to describe the system that saves computational time, but still expensive. It does this by grouping less chemically involved hydrogen atoms into CH­2 groups. Good for lipid descriptions
  • Coarse graining: 4-5 heavy atoms with their H’s are grouped into a single interaction site, saving large amount of computational effort. Octanol is split into a polar (hydrophilic) and non-polar (hydrophobic) half. Lose a degree of chemical detail as cannot capture hydrogen bonding explicitly. However partial charges assigned to these blocks to describe electrostatic and non-bonding interactions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are three ways in which coarse graining allow simualtion on larger systems for longer timescales?

A
  • Decrease in total number of particles N
  • Larger time step
  • Smoother energy landscape
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How does larger time-step ‘speed up’ a coarse grain simulation?

A
  • A ​time step for integration needs to be smaller than the highest frequency motion in a system.
  • The verlet algorithm can be used to move an atom through calculation of the force, with respect to all other atoms, to apply in one timestep, which should be constant.
  • A time step may be suitable for bond stretch of a low frequency (left), resulting in it being closer to its equilibrium value. However, the same force acting on a higher frequency motion (right) results in a high energy final position, leading to assumption of force being constant over our chosen timestep to be invalid.
  • Timesteps should be 10x smaller than highest frequency motion, which are generally bond stretches of light hydrogen atoms.
  • By removing bonds involving them, a larger time step can be used, as particles of higher mass oscillate slower. Each time step now propagates us further forward in time.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How does a decrease in total number of particles N ‘speed up’ a coarse grain simulation?

A
  • A decrease in total number of particles reduces the number of force calculations required at each step.
  • Resulting in less computational time needed to move the system forward one-time step, as less forces and energies must be calculated each step.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

How does a smoother energy landscape ‘speed up’ coarse graining?

A
  • Rugged energy landscape of an atomistic model has many local minima.
  • Averaging that occurs through coarse-graining results in a smoother energy landscape with less local trapping/friction and faster dynamics.
  • Beads can slide over each other much easier
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Why must we develop new interaction potentials for coarse grained systems?

A
  • Coarse-grained particles do not interact in the same way that atoms do.
  • New coarse-grained potentials must be developed that are unique to each system.
  • Parameterising is a large challenge as takes a lot of time to predict the correct output.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are two methods of generating course-grained interaction potentials?

A
  • Use an analytical functional form with associated parameters; like atomistic interaction potentials but with CG beads
  • Numerically fitting data from atomistic simulations using tabulated potentials (list of r’s between atoms with corresponding F’s/E’s – useful for complex systems where don’t have analytical function)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is mapping, and why is it essential for developing a CGP? (PPQ)

A
  • The first stage in developing a CG interaction potential (CGP) is to define mapping of the atomistic system in to a CG one.
  • Essential for new molecules and there are many ways to assign atomistic structure depending on what interactions are expected to be the focus.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is an RDF?

A
  • A radial distribution function g(r) shows the probability of finding a particle at distance r from a reference particle.
  • At small distances there is a low probability of finding another atom due to coulombic repulsion (LJP?)
  • Shells form around the reference particle as other bands of particles appear, with depletion zones between them.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are 2 general rules of mapping?

A
  • Each bead will inherit the mass/charge of its constituents
  • Must maintain the overall shape of the molecule (otherwise may not replicate the correct self-assembled structure)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Outline how you would produce a CG trajectory from an atomistic simulation using a bottom up approach

A
  • First, run a reference atomistic simulation to use as our benchmark (however does not necessarily equal experimental/ab initio PES)
  • Extract effective CG potentials using a trajectory of a system (a series of snapshots over time).
  • Each snapshot in the atomistic simulation will have positions for all atoms at each timestep which will be used to find their corresponding Force/Energy.
  • A CG pseudo trajectory can be attained by grouping atoms in to beads using the beads centre of mass based on an atomistic average.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q
  • How does our improved guess for the intermolecular CG potential [VCGi+1(r)] change when gi(r) = gref(r)? – 1 on graph
A

kBTln…= 0, no change to interaction potential needed as CG simulation produces same pairwise structure as atomistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q
  • How does our improved guess for the intermolecular CG potential [VCGi+1(r)] change when gi(r) ref(r)? – 2 on graph
A
  • kBT… is -ve
  • probability of finding particle (correlation between CG beads) at that r underestimated.
  • Equation will lower pairwise CG energy at that r, making interaction more favourable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q
  • How does our improved guess for the intermolecular CG potential [VCGi+1(r)] change when gi(r) ref(r)? – 3 on graph
A
  • kBTln… is +ve
  • probability of finding particle (correlation between CG beads) at that r overestimated.
  • Equation raises pairwise CG energy at that r, making interaction more appropriately favourable (less than before).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How does an RDF used as a target reference functions help form non-bonded potentials from the trajectory of a system?

A
  • Goal is to form a CGP that reproduces the RDF obtained from atomistic simulations.
  • The CG RDF, g(i) is linked to the trajectory of our system CG trajectory,
  • Which in turn is linked to an atomistic RDF, g(ref), derived from an atomistic trajectory.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

How is VPMF used in Iterative Boltzmann Inversion iteratively?

A
  • VPMF =kBTln[gref(r)] instead is used as a starting guess for VCG (intermolecular CG potential)
  • Vi+1CG(r) = ViCG(r) + kBTln[gi(r) / gref(r)] is our improved guess after each iteration
  • gref(r) is our target probability distribution from atomistic simulation
  • gi(r) is the probability distribution from a CG simulation using the ith potential
  • Difference between the two moves us closer of further to result
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q
  • Why does it take so long to produce a CG potential through IBI that is like an atomistic trajectory?
A
  • Must iteratively calculate each pairwise interaction in your system using the IBI method until a full CG tabulated potential that is like the original atomistic trajectory is gained.
  • However once this is done, CG allows us to reach much higher simulation times.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q
  • What is a problem with the IBI discussed in lectures as well as an example of this being solved?
A
  • It takes only a singular atomistic simulation into account as a reference that will be under certain set conditions. This is a problem if one wanted to study phase transitions, which are dependent temperature changes.
  • Moore et al (2016) developed a CGP constructed using many different atomistic simulations at varying temperatures to account for this, known as multi-state IBI.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q
  • What is force matching and how does it differ to Iterative Boltzmann inversion?
A
  • IBI tried to match the structure of an atomistic simulation where the RDF is used to reach our target data.
  • Force matching aims to match forces on the CG interaction sites with the forces at the atomistic level.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q
  • What are some advantages of both bottom-up and top-down approaches?
A
  • Bottom-up: capable of capturing fine detail of interaction as CG interactions are extracted from reference atomistic simulations
  • Top-down: provide potentials that are more easily transferable and applicable to experimental data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q
  • What is an example of a top-down approach?
A
  • The MARTINI force field, a CG potential fitted to reproduce experimental target data, using few parameters and standard interaction potentials to maximise transferability.
  • A choice of building blocks calibrated against experimental oil/water partition coefficients are available to build a system
41
Q
  • What is a disadvantage of the MARTINI force-field?
A
  • No systematic way for developing potentials and introducing new molecules unavailable in building blocks.
42
Q
  • What are some disadvantages of CG models?
A
  • A model can be too biased, and very untransferable to another system
  • A model may only be parameterised for a specific class of molecules, showing a lack of compatibility
  • Model may be too coarse to capture desired behaviour
43
Q

What type of systems cannot be simulated usind MD? Give examples of biological systems we can simulate and how the free energy there is related to the processes.

A
  • In MD, classical forcefields can’t be used for reactions involving bond breaking/making.
  • Protein folding and ligand binding are ideal applications for this
  • ΔE and ΔS define whether a protein fold is favourable enough to form and how strongly a ligand binds to a protein.
44
Q

Discuss changes in free enrgy, binding and structure as well as solvent reorganisation in a ligand binding system

A
  • Binding of the ligand will cause a change in enthalpy/internal energy as a result of intermolecular interactions (e.g. electrostatic interaction associated with vdW)
  • Loss of conformational freedom in binding site causes a decease in entropy, that counterbalances an increase due to water around free ligand having more freedom.
  • The total free energy is a net effect of all these different changes
45
Q

Experimentally we can find the free energy of a system using ΔA = -RTlnKbind where Kbind is the ratio of the time a ligand is bound compared to unbound. Why does this approach not work for simulation and what is an alternative approach?

A
  • To sufficiently sample a system in both states, and directly calculate the free energy would involve simulation times that are too unrealistic.
  • Instead relate the free energy to a microscopic description of the system through statistical mechanics
  • A = -kbTlnQNVT­, (QNVT is the canonical partition function)
46
Q

What are some problems with using a partition function approach to calculate free energy directly? What is an alternative approach

A
  • Sampling entire phase space and integrating QNVT directly is impractical
  • Alternatively, could take an ensemble average, and give A in terms of potential energy, U. (A ∝ e-U(r))
  • However low energy samples would contribute very little average (?) and high energy samples take a long time to reach.
  • This leads us to not being able to calculate A directly.
47
Q

How is our method of calculating ΔA as a function of ΔU implemented?

A
  • Carry out simulation of state 0, calculating PE at each step (U0)
  • At each step also, take configuration (snapshot of trajectory) and apply PE function corresponding to state 1 to calculate U1, resulting in ΔU.
  • This is known as thermodynamic perturbation theory
48
Q

How might we compute the difference in hydration free energy between two ions?

A
  • Define forcefield parameters for Lithium in a box of water, as well as for rubidium in water.
  • Water terms are identical, Li and Rb will have different LJ/coulombic terms
  • Run MD simulation of system in state 0 (Li(aq)+), calculating U0 via PE eqn and terms defined in FF.
  • Simultaneously, take same configuration and calculate PE of system in state 1 (Rb(aq)+), U1
  • Same coordinates, only difference is parameters used to calculate U1 and therefore ΔU at that step.
  • At end of simulation take average of ΔU and use to find ΔA.
49
Q

What is the problem ignored so far in free energy calculation using thermodynamic perturbation theory with no windows? relate your answer to the free energy of the system

A
  • If state 0 and 1 are very different (state 0 has a low probability of being in state 1) then ΔU is large
  • In the case of Li/Rb this would occur due to unfavourable interactions of Rb overlapping the water molecules closer to Li, causing a high energy LJP term
  • A large ΔU results in the large exponential term becoming negligibly small, giving low weight in ensemble average
  • This causes ΔA to converge slowly, meaning our errors in our finite simulation will be large.
50
Q

What is the solution to our free energy perturbation theory problem?

A
  • Break down calculation into windows where there is good overlap between states and ΔU is small.
  • This is done through a coupling parameter, λ, which gradually increases from 0 to 1 through multiple simulations (equation), then sum the free energy changes outputed
  • There is an increase in cost for these additional simulations
  • In Li/Rb case first window would be change from Li to 10% Rb etc
51
Q
  • QNVT is a function of the , which depends on the sum of … (…) and … (…) energy as a function of and
  • can be solved analytically, leaving as an excess which can be calculated in a simulation at each step giving overall
A
  • QNVT is a function of the Hamiltonian, which depends on the sum of kinetic (K) and potential (U) energy as a function of position and momenta.
  • K can be solved analytically, leaving U as an excess which can be calculated in a simulation at each step giving overall ΔU.
52
Q

What is an example of the solution to energy perturbation theory?

A
  • The free energy change in mutating AA glycine to Alanine might be an important system to study for active site manipulation.
  • As λ increases from 0 to 1 we switch off interaction of glycine and turn on interaction of alanine.
  • This shows the power of simulations as this is impossible experimentally
  • A technical issue with this is that as we switch LJ interaction changes as charges are switched on/off gradually, causing atoms to shift to unfavourable locations.
55
Q

What is biased sampling?

A
  • Where a biasing potential is used that can be used to force the system to explore unfavourable configurations, leading to enhanced sampling of phase space
  • This means we are more likely to overcome kinetic barriers that trap us in local minima of our PES for our entire simulation time
56
Q

What is a key condition of biased sampling?

A

Need to know something about pathways as a starting point as this is what we are defining

57
Q

What are reaction coordinates and briefly give two examples? (also known as collective variables and order parameters)

A
  • Characterise a process in terms of a small set of properties of a system that are a function of atomic coordinates.
  • Also known as collective variables and order parameters
  • Distance/separation, r
  • Dihedral angle
58
Q
  • Give an example of how distance can be used as a reaction coordinate
A
  • Potential mean force (PMF), which is the free energy along a chosen reaction coordinate, can be simulated using the distance between an Na+-Cl- ion pair in electrolyte solution.
59
Q
  • Give an example of how radius of gyration (Rgyr) can be used as a reaction coordinate. What must be assumed?
A
  • Rgyr gives an indication of the expansion/contraction of a globular structure through the average of the distance each atom is from the centre of mass. More expanded = higher Rgyr
  • The transformation of a β hairpin peptide to unfolded random coil state’s free energy landscape can be investigated
  • Choosing appropriate set of reaction coordinates is difficult so must guess generally
60
Q

Give an advantages and disadvantages of using multiple reaction coordinates

A

Pros

  • A combination of variables allows important structures across high energy barriers to be sampled, giving a larger indication of the greater free energy landscape.
  • If our single reaction coordinate output poorly maps experimental results, a second coordinate can be introduced to form a 2D plot that may give a different minimum energy pathway to before.

Cons

  • However, in combination, outputs of these reaction coordinates can lead to many different structures which must all be considered
  • Certain structures may even be resritcted via specific choice of a given set of coordinates
  • Large computational cost
61
Q
  • Give an example of how root mean squared distance (RMSD) can be used as a reaction coordinate
  • WHat must one be careful of?
A
  • RMSD is the difference between atomic positions at time t and the starting positions of the simulation, t0.
  • Can be averaged over all atoms of interest, e.g. carbons in a protein backbone chain
  • Similarly, with Rgyr, must be careful with choice of reaction coordinate to pair with as may not be unique function of rN.
62
Q
  • Free energy is a function so the free energy change is independent of the . This means we can create unrealistic if …/… are not of importance to us.
A
  • Free energy is a state function, so the free energy change is independent of the path. This means we can create unrealistic pathways if mechanism/kinetics are not of importance to us.
63
Q

What factors control the overlap sampling between adjacent simulations and how fine grained our sampling of our free energy profile is in umbrella sampling?

A
  • Force constant
    • Too low: biasing insufficient to explore high energy regions (wide harmonic)
    • Too high: insufficient overlap between windows (narrow (harmonic)
  • Frequency of window spacing
  • Choosing these values is largely trial and error
64
Q

What is the WHAM algorithm?

A
  • The weighted histogram analysis method is used to stitch simulations together iteratively in umbrella sampling.
  • Unbiased distribution solved with arbitrary values of free energy associated with that potential.
  • Values fed back in to each other until FEP is converged and best estimate for unbiased distribution is obtained.
65
Q

What is metadynamics and how does it differ from umbrella sampling?

A
  • In umbrealla sampling we forced the system to explore unfavroubale regions of phase space with a biasing potential, which restrained us in places difficult to sample, but penalised when too high/unfavourable
  • Metadynamics instead adds biasing potentials to penalise the system from visiting already sampled regions (i.e low energy phase space), forcing it to move to less favourable positions
66
Q
  • Where umbrella sampling used as its potentials, instead, metadynamics uses … … , which are added to the potential as the simulation proceeds. Metdynamics is …, meaning we don’t need to estimate the underlying … … (and biasing potential) in advance with metadynamics as we did in … …. However, we do still need a reaction coordinate.
A
  • Where umbrella sampling used harmonics as its biasing potentials, instead, metadynamics uses Gaussian functions, which are added to the potential as the simulation proceeds. Metdynamics is adaptive, meaning we don’t need to estimate the underlying energy landscape (and biasing potential) in advance with metadynamics as we did in umbrella sampling. However, we do still need a reaction coordinate.
67
Q
  • Outline the basic principles of umbrella sampling
A
  • System is restrained (through tethering to a spring) to a small region along the reaction coordinate ξ using a biasing potential.
  • If the system deviates too far from this small region, an energy penalty restores the region.
  • This is repeated at different target values of ξ. The system is forced to explore small unfavourable regions along a certain channel until full reaction coordinate is explored.
  • All simulations are stitched together to produce an unweighted underlying free energy profile.
68
Q

How does metadynamics allow sampling of the full reaction coordinate?

A
  • Start at some configuration, depositing gaussians as we sample
  • Eventually will be pushed out into a new local minimum
  • We can tweak how often these depositions occur as well as the height and width of them.
69
Q
  • Metadyanimcs can be … … … but is useful for getting a quick scan of the … …
A
  • Metadyanimcs can be slow to converge but is useful for getting a quick scan of the energy landscape.
71
Q

Describe the Monte Carlo (MC) method and how does it contrast to MD method?

A
  • MC simulations are an alternative method to sampling accessible microstates of the ensemble, generally used for smaller systems. Uses orthogonal techniques
  • MD used Netwon EOM to predict positions of atoms at a future time, taking a time average to find property of interest
  • MC generates configurations through random numbers that are unconnected in a timescale, using an ensemble average to investigate a property
72
Q

How is MC implemented?

A
  • Move a random particle in a predefined way (e.g along z-axis), with an acceptable dr
  • Calculate U of new configuration. If lower than original, new configuration is accepted and replaces old.
  • A trajectory-esque profile forms as we move to lower energy configurations.
73
Q

How does the size of ΔU affect the acceptance ratio?

A
  • Unew­ > Uold : Boltzmann factor of energy difference closer to 1. More likely to be greater than random # from 0 -> 1. High probability uphill likely favourable) move accepted and added to growing ensemble of microstates
  • Unew >> Uold (e.g a steric clash): less random #’s likely to be lower than Boltzmann factor of ΔU. Low probability large uphill move accepted.
  • Low energy states are general preferred in this algorithm
74
Q

What are some advantages of MC? Give examples and comparisons to MD

A
  • MC does not need to follow realistic pathways, so can explore conformations more rapidly. E.g. a protein folding event in MD would have to physically fold realistically – slow process. MC could randomly rotate an important dihedral to quickly generate states on interest
  • MC doesn’t require calculation of force, whereas in MD, the differentiation of the potential (=F), using Verlet, can be very costly. In MC this isn’t required, allowing for more unphysical models to be used.
75
Q

What is a disadvantage of MC?

A
  • Sampling efficiency depends strongly on move set choice. Poor choice may limit transfer to other unknown configurations, preventing access to all regions of phase space (poor sampling).
  • For example, the choice of certain dihedral to sample be our change to sample around, but it may in fact prevent certain configurations from being explored due to unknown factors.
76
Q

What is an example of an MD-MC combined simulation? Why must they be used toghether? How? What is the advantage or combining both methods?

A
  • A mixed lipid-membrane of chains differing by 4C’s in their tail.
  • MD would be too slow to see lipid diffusion
  • MC would converge too slowly with a system this complex
  • Instead a trail MC move follows each MD step, removing/adding 4C atoms, evaluating E of exchange to see if favourable
  • Sampling more efficient as removed dependency on starting configuration, which can trap system in very low energy configurations surrounded by high.
77
Q

Why is it difficult to enhance sampling with a biasing potential?

A
  • The timescale problem in MD makes in difficult to overcome kinetic barriers in our simulation time.
  • To enhance the sampling of this phase space we could use a biasing potential to explore unfavourable configuration but requires knowledge of important factor in pathways.
78
Q

What is Replica Exchange Molecular dynamics (REMD)?

A
  • REMD is an alternative method of overcoming (or removing) kinetic barriers, allowing more rapid sampling of phase space.
  • Can either change the potential via alteration of the forcefield to change the curve we are sampling (H-REMD)
  • Or change the temperature we sample at (T-REMD).
79
Q

What is the problem with simply changing the temperature we sample at?

A
  • The conditions in which we have chosen to investigate are now different, which can complicate the system of interest, if an event is temperature dependent (e.g a phase change)
80
Q
  • How can we change the temperature more suitably in T-REMD?
A
  • Use Temperature Replica Exchange Molecular Dynamics
  • Run an MD simulation of different replicas of the system at different temperatures in parallel (parallel tempering = replicas are generated through MC instead)
  • Our temperature of interest is the lowest.
  • Exchange configurations between replicas using MC (in both methods)
  • Continue simulation
  • Repeat step 2 until converged.
81
Q

What is the process of the T-REMD method?

A
  • As in MC, interested in the free energy difference between states.
  • If E lower make swap
  • If not random number decided if accepted or not
  • If accepted take high temperature coordinate simulation and exchange with low temperature configuration
  • Overtime, periodic exchanges between replicas allow inaccessible regions of phase space, at high T initially, are now accessible (takes a long time for this to occur though)

N.B. These are not unfavourable regions, they are merely blocked by high E barriers from being sampled -may be lower energy minima

82
Q

What is a disadvantage of T-REMD?

A
  • States are not connected in time
  • Can’t use to calculated time dependent properties (e.g diffusion coefficients) as timescale used is unrealistic
83
Q

Discuss the practical considerations that need to be taken in a T-REMD experiment in terms of computational power.

A
  • Many more processors are required to run many replicas in parallel. This along with the size of the system will be tied to the computational resource available.
84
Q

What is the general principle in Metropolis MC?

A
  • The probability of a transition from configuration 1 to 2 is a function of the change in ΔU between those states.
  • The probability of a configuration can be written in terms of the Boltzmann distribution where the ideal partition function is a normalizing factor
85
Q

What if the energy of the new configuration is greater than or equal to (≥) original configuration?

A
  • Boltzmann probability is compared to a random number between 0 and 1
  • If #rand < p(new), confuguration accepted, allowing an increase in energy
  • If #rand > p(new), new configuration rejected and have another copy of original microstate set
86
Q

What is the process of H-REMD? Give a biological example of where this may be useful

A
  • Temperature across simulations is constant
  • Instead a soft-core potential is used to across replicas, where Lennard Jones potential gradually softened meaning that atoms can sit on top of one another while still remaining in certain conformations
  • Useful in ligand binding, where certain bound conformations can be locked in deep minima.
  • H-REMD allows ligand to rotate in pocket and explore different orientations while still being bound.
  • These can then be swapped in to a correct LJP, giving an indicator to the transitions in between them
87
Q

Discuss the practical considerations that need to be taken in a T-REMD experiment in terms of temperatures used.

A
  • Set of temperatures used such that largest temperature will enable rapid exploration of phase space.
  • Finding this temperature can be difficult when don’t know the PES
  • Spacing between temperatures must also be optimised to minimise convergence time
88
Q

Discuss the practical considerations that need to be taken in a T-REMD experiment in terms of acceptance probability.

A
  • Must run a test to see if exchange if often/probable enough to reach convergence.
  • Otherwise, may just switch back and forth between two states of similar energy.
89
Q

Discuss the practical considerations that need to be taken in a T-REMD experiment in terms of the chemistry of the system.

A
  • If one is simulating a system involving a phase change due to increased temperature, this will result in a high energy change (e.g a protein unfolding.
  • Must have a lot of replicas around phase transition of configurations that suddenly differ largely.