Key notes final Flashcards

Question

* How does our improved guess for the intermolecular CG potential, V_i+1^CG(r), change when g_i(r) > g_ref(r)? (3 on graph)

Answer 1

* k_BT ln[g_i(r)/g_ref(r)] is +ve * The probability of finding a particle (correlation between CG beads) at that r is overestimated. * The equation raises the pairwise CG energy at that r, making the interaction appropriately less favourable than before.

Answer 2

* Goal is to form a CGP that reproduces the RDF obtained from atomistic simulations. * The CG RDF, g_i(r), is linked to the CG trajectory of our system * This is in turn compared with the atomistic RDF, g_ref(r), derived from an atomistic trajectory.

Answer 3

* V_PMF = -k_BT ln[g_ref(r)] is instead used as a starting guess for V_CG (intermolecular CG potential) * V_i+1^CG(r) = V_i^CG(r) + k_BT **ln[g_i(r) / g_ref(r)]** is our improved guess after each iteration * **g_ref(r)** is our target probability distribution from atomistic simulation * **g_i(r)** is the probability distribution from a CG simulation using the i-th potential * The difference between the two shows whether we have moved closer to or further from the target result
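
The update rule above can be sketched in a few lines. This is a minimal illustration, not a production IBI code: the function names, the `g_min` floor (used to avoid taking the log of zero) and the three-point toy RDFs are all assumptions made for the example, and the starting guess uses the usual Boltzmann-inversion sign, V = -k_BT ln g_ref.

```python
import math

K_B = 0.0083144626  # Boltzmann constant in kJ/(mol K)

def boltzmann_inversion(g_ref, temperature, g_min=1e-8):
    """Starting guess: V_PMF(r) = -k_B T ln g_ref(r), tabulated on the RDF grid."""
    kt = K_B * temperature
    return [-kt * math.log(max(g, g_min)) for g in g_ref]

def ibi_update(v_current, g_current, g_ref, temperature, g_min=1e-8):
    """One IBI step: V_{i+1}(r) = V_i(r) + k_B T ln[g_i(r) / g_ref(r)]."""
    kt = K_B * temperature
    return [
        v + kt * math.log(max(gi, g_min) / max(gr, g_min))
        for v, gi, gr in zip(v_current, g_current, g_ref)
    ]

# Toy data (hypothetical): three grid points of a target and a trial CG RDF.
g_ref = [0.5, 2.0, 1.0]
g_i = [0.5, 3.0, 0.8]

v0 = boltzmann_inversion(g_ref, 300.0)
v1 = ibi_update(v0, g_i, g_ref, 300.0)
```

Where the CG simulation over-populates a separation (g_i > g_ref, second grid point) the potential is raised; where it under-populates (third grid point) the potential is lowered; where the two agree it is left unchanged.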

Answer 4

* Must iteratively calculate **each pairwise interaction** in your system using the IBI method until a full tabulated CG potential that reproduces the structure of the **original atomistic trajectory** is obtained. * However, once this is done, CG allows us to reach much longer simulation times.

Answer 5

* It takes only a single atomistic simulation into account as a reference, run under one set of conditions. This is a problem if one wanted to study **phase transitions, which depend on temperature changes.** * Moore et al (2016) developed a CGP constructed using many different atomistic simulations at varying temperatures to account for this, known as multi-state IBI.

Answer 6

* IBI tries to match the **structure** of an atomistic simulation, where the RDF is used as our target data. * Force matching aims to match **forces** on the CG interaction sites with the forces at the atomistic level.

Answer 7

* Bottom-up: capable of capturing fine detail of interaction as CG interactions are extracted from reference atomistic simulations * Top-down: provide potentials that are more easily transferable and applicable to experimental data.

Answer 8

* The MARTINI force field is a CG potential fitted to **reproduce experimental target data,** using few parameters and standard interaction potentials to maximise transferability. * A choice of building blocks calibrated against experimental oil/water partition coefficients is available to build a system

Answer 9

* No systematic way for developing potentials and introducing new molecules unavailable in building blocks.

Answer 10

* A model can be too biased, and poorly transferable to another system * A model may only be parameterised for a specific class of molecules, showing a lack of compatibility * Model may be too coarse to capture desired behaviour

Answer 11

* In MD, classical forcefields **can’t** be used for reactions **involving bond breaking/making.** * **Protein folding** and **ligand binding** are ideal applications for this * ΔE and ΔS define whether a protein fold is **favourable** enough to form and how strongly a ligand binds to a protein.

Answer 12

* Binding of the ligand will cause a **change in enthalpy/internal energy** as a result of intermolecular interactions (e.g. electrostatic and vdW interactions) * **Loss of conformational freedom** in the binding site causes a decrease in entropy, which counterbalances an increase due to water around the free ligand having more freedom. * The total free energy is a **net effect** of all these different changes

Answer 13

* To sufficiently sample a system in both states and directly calculate the free energy would involve **unrealistically long simulation times.** * Instead, relate the free energy to a microscopic description of the system through statistical mechanics * **A = -k_BT ln Q_NVT** (Q_NVT is the canonical partition function)

Answer 14

* Sampling the entire phase space and integrating **Q_NVT directly is impractical** * Alternatively, could take an ensemble average, and give A in terms of potential energy, U. **(A ∝ e^-U(r))** * However, low energy samples would contribute very little to the average (?) and high energy samples take a long time to reach. * This means we cannot calculate A directly.

Answer 15

* Carry out simulation of state 0, calculating PE at each step (U₀) * At each step **also**, take configuration (snapshot of trajectory) and apply PE function corresponding to state 1 to calculate U₁, resulting in ΔU. * This is known as **thermodynamic perturbation theory**

Answer 16

* Define forcefield parameters for lithium in a box of water, as well as for rubidium in water. * **Water terms are identical**, Li and Rb will have **different LJ/coulombic terms** * Run MD simulation of system in **state 0 (Li_(aq)⁺),** calculating U₀ via PE eqn and terms defined in FF. * Simultaneously, take the same configuration and calculate the PE of the system in **state 1 (Rb_(aq)⁺), U₁** * Same coordinates, only difference is the parameters used to calculate U₁ and therefore ΔU at that step. * At the end of the simulation take the **ensemble average of exp(-ΔU/k_BT)** and use it to find ΔA.
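
The final step of the recipe can be sketched with the standard thermodynamic perturbation (Zwanzig) estimator, ΔA = -k_BT ln⟨exp(-ΔU/k_BT)⟩₀, where the average runs over configurations sampled in state 0. The function name and the sample ΔU values below are hypothetical; in practice ΔU would come from re-evaluating each saved Li⁺ configuration with the Rb⁺ parameters.

```python
import math

K_B = 0.0083144626  # Boltzmann constant in kJ/(mol K)

def fep_delta_a(delta_u, temperature):
    """Zwanzig estimator: dA = -k_B T ln < exp(-dU / k_B T) >_0.

    delta_u holds U1 - U0 (kJ/mol) for configurations sampled in state 0.
    """
    kt = K_B * temperature
    # Factor out the smallest dU so the largest exponential is exactly 1
    # (avoids overflow/underflow without changing the result).
    shift = min(delta_u)
    mean_exp = sum(math.exp(-(du - shift) / kt) for du in delta_u) / len(delta_u)
    return shift - kt * math.log(mean_exp)

# Hypothetical per-frame energy differences in kJ/mol.
samples = [1.0, 2.0, 3.0]
delta_a = fep_delta_a(samples, 300.0)
mean_delta_u = sum(samples) / len(samples)
```

Because of the exponential weighting, the estimate is dominated by the frames with the smallest ΔU and always lies at or below the plain average of ΔU.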

Answer 17

* If state 0 and 1 are very **different** (configurations sampled in state 0 have a low probability in state 1) then **ΔU is large** * In the case of Li/Rb this would occur due to unfavourable interactions of Rb overlapping the water molecules that sit closer to Li, causing a **high energy LJP** term * A large ΔU makes the exponential term negligibly small, giving those configurations low weight in the ensemble average * This causes ΔA to converge slowly, meaning the errors from our finite simulation will be large.

Answer 18

* **Break down calculation into windows** where there is good overlap between states and **ΔU is small.** * This is done through a coupling parameter, **λ,** which gradually increases from 0 to 1 through **multiple simulations** (equation); the free energy changes output by each window are then summed * There is an increase in cost for these additional simulations * In the Li/Rb case the first window would be the change from Li to 10% Rb, etc.
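
The equation referred to on this card is not reproduced in the notes. One commonly used form, assumed here for illustration rather than taken from the source, couples the two end states linearly in λ and sums the perturbation result over N windows, with λ₀ = 0 and λ_N = 1:

```latex
U(\lambda) = (1-\lambda)\,U_0 + \lambda\,U_1,
\qquad
\Delta A = \sum_{k=0}^{N-1} \Delta A_{k \to k+1}
         = -k_B T \sum_{k=0}^{N-1}
           \ln \left\langle
             \exp\!\left[ -\frac{U(\lambda_{k+1}) - U(\lambda_k)}{k_B T} \right]
           \right\rangle_{\lambda_k}
```

Each average is taken in a simulation run at λ_k, so neighbouring windows only need to overlap with each other, not with the far end state.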

Answer 19

* Q_NVT is a function of the **Hamiltonian**, which depends on the sum of **kinetic** **(K)** and **potential (U)** energy as a function of **position** and **momenta**. * **K** can be solved analytically, leaving **U** as an excess which can be calculated in a simulation at each step giving overall **ΔU**.

Answer 20

* The free energy change in mutating the amino acid glycine to alanine might be an important system to study for active site manipulation. * As λ increases from 0 to 1 we switch off the interactions of glycine and turn on the interactions of alanine. * This shows the power of simulations as this is impossible experimentally * A technical issue is that, as the LJ interactions and charges are switched on/off gradually, atoms can shift to unfavourable locations.

Answer 21

* A **biasing potential** is used to force the system to explore unfavourable configurations, leading to enhanced sampling of phase space * This means we are more likely to **overcome kinetic barriers** that would otherwise trap us in local minima of our PES for our entire simulation time

Answer 22

Need to know something about pathways as a starting point as this is what we are defining

Answer 23

* Characterise a process in terms of a small set of properties of a system that are a function of atomic coordinates. * Also known as collective variables and order parameters * Distance/separation, r * Dihedral angle

Answer 24

* **Potential of mean force (PMF),** which is the free energy along a chosen reaction coordinate, can be computed using the distance between an Na⁺/Cl⁻ ion pair in electrolyte solution as the reaction coordinate.

Answer 25

* R_gyr gives an indication of the **expansion/contraction** of a globular structure through the (root-mean-square) average distance of each atom from the centre of mass. More expanded = higher R_gyr * The free energy landscape for the transformation of a β-hairpin peptide to an unfolded random coil state can be investigated * Choosing an appropriate set of reaction coordinates is difficult, so one must generally **guess**
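
As a concrete illustration of this reaction coordinate, the sketch below computes a mass-weighted radius of gyration as the root-mean-square distance from the centre of mass. The function name and the four-atom toy coordinates are made up for the example.

```python
import math

def radius_of_gyration(coords, masses=None):
    """Mass-weighted RMS distance of the atoms from their centre of mass."""
    if masses is None:
        masses = [1.0] * len(coords)  # assumption: equal masses if none are given
    total_mass = sum(masses)
    com = [
        sum(m * c[k] for m, c in zip(masses, coords)) / total_mass
        for k in range(3)
    ]
    mean_sq = sum(
        m * sum((c[k] - com[k]) ** 2 for k in range(3))
        for m, c in zip(masses, coords)
    ) / total_mass
    return math.sqrt(mean_sq)

# Toy structure: four atoms at unit distance from the origin, then the same
# structure uniformly expanded by a factor of two.
compact = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, -1.0, 0.0)]
expanded = [(2.0 * x, 2.0 * y, 2.0 * z) for x, y, z in compact]

rg_compact = radius_of_gyration(compact)
rg_expanded = radius_of_gyration(expanded)
```

The expanded structure gives twice the R_gyr of the compact one, which is the "more expanded = higher R_gyr" behaviour described above.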

Answer 26

Pros * A combination of variables allows **important structures** across **high energy barriers** to be sampled, giving a better picture of the wider free energy landscape. * If our single reaction coordinate maps poorly onto experimental results, a second coordinate can be introduced to form a 2D plot that may give a **different minimum energy pathway** to before.

Cons * However, in combination, these reaction coordinates can lead to **many different structures** which must all be considered * Certain structures may even be **restricted via specific choice of a given set of coordinates** * **Large computational cost**

Answer 27

* RMSD is the root-mean-square deviation between atomic positions at time t and the starting positions of the simulation, t₀. * Can be averaged over all atoms of interest, e.g. carbons in a protein backbone chain * As with R_gyr, must be careful with the choice of reaction coordinate to pair it with, as it may not be a unique function of r^N.
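
A minimal sketch of the RMSD calculation follows. It assumes the two frames are already superimposed (no rotational or translational fitting is done here, which real analysis tools usually perform first), and the function name and coordinates are hypothetical.

```python
import math

def rmsd(coords_t, coords_ref):
    """Root-mean-square deviation between two frames with matching atom order."""
    if len(coords_t) != len(coords_ref):
        raise ValueError("frames must contain the same number of atoms")
    sq_sum = sum(
        (a - b) ** 2
        for p, q in zip(coords_t, coords_ref)
        for a, b in zip(p, q)
    )
    return math.sqrt(sq_sum / len(coords_ref))

# Reference frame at t0 and two different later frames (toy data).
frame_ref = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
frame_a = [(0.0, 3.0, 4.0), (1.0, 3.0, 4.0)]
frame_b = [(0.0, -3.0, -4.0), (1.0, -3.0, -4.0)]

rmsd_a = rmsd(frame_a, frame_ref)
rmsd_b = rmsd(frame_b, frame_ref)
```

The two later frames are different configurations but give the same RMSD, which illustrates why RMSD is not a unique function of r^N.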

Answer 28

* Free energy is a **state function**, so the free energy change is independent of the **path**. This means we can create unrealistic **pathways** if **mechanism/kinetics** are not of importance to us.

Answer 29

* Force constant * Too low: biasing insufficient to explore high energy regions (wide harmonic) * Too high: insufficient overlap between windows (narrow harmonic) * Frequency (spacing) of windows along the reaction coordinate * Choosing these values is largely trial and error

Answer 30

* The weighted histogram analysis method (WHAM) is used to stitch simulations together **iteratively** in umbrella sampling. * The unbiased distribution is first solved using arbitrary values of the free energy associated with each biasing potential. * Values are fed back into each other until the **FEP is converged** and the best estimate for the unbiased distribution is obtained.

Answer 31

* In umbrella sampling we forced the system to explore unfavourable regions of phase space with a biasing potential, which restrained us in places difficult to sample, but **penalised when too high/unfavourable** * Metadynamics instead adds **biasing potentials to penalise the system** for visiting **already sampled regions** (i.e. low energy phase space), forcing it to move to less favourable positions

Answer 32

* Where umbrella sampling used **harmonics** as its **biasing** potentials, metadynamics instead uses **Gaussian functions,** which are added to the potential as the simulation proceeds. Metadynamics is **adaptive**, meaning we don’t need to estimate the **underlying energy landscape** (and biasing potential) in advance as we did in **umbrella sampling**. However, we do still need a reaction coordinate.

Answer 33

* System is restrained (through tethering to a spring) to a small region along the reaction coordinate ξ using a **biasing potential.** * If the system deviates too far from this small region, an **energy penalty** restores it to the region. * This is repeated at different target values of ξ. The system is forced to **explore small unfavourable regions** along a certain channel until the full reaction coordinate is explored. * All simulations are stitched together to produce an **unweighted (unbiased) underlying free energy profile.**
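
The "spring" restraint can be sketched as a harmonic bias, w(ξ) = ½ k (ξ - ξ_target)², applied in a series of evenly spaced windows. The function names, the force constant of 100 and the window range are illustrative assumptions; as noted elsewhere in these cards, real values are chosen largely by trial and error.

```python
def umbrella_bias(xi, xi_target, force_constant):
    """Harmonic biasing potential tethering the system to xi_target."""
    return 0.5 * force_constant * (xi - xi_target) ** 2

def window_centres(xi_min, xi_max, n_windows):
    """Evenly spaced target values of the reaction coordinate, one per simulation."""
    step = (xi_max - xi_min) / (n_windows - 1)
    return [xi_min + i * step for i in range(n_windows)]

# Hypothetical set-up: five windows spanning xi = 0 to 1.
centres = window_centres(0.0, 1.0, 5)

# Energy penalty for straying 0.2 units from the centre of the last window.
penalty = umbrella_bias(1.2, centres[-1], 100.0)
```

A larger force constant makes each window narrower (less overlap with its neighbours), while a smaller one lets the system slide back towards low-energy regions.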

Answer 34

* Start at some configuration, depositing **Gaussians** as we sample * Eventually the system will be pushed out into **a new local minimum** * We can tweak how often these depositions occur as well as their height and width.
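
The effect of depositing Gaussians can be illustrated without running any dynamics. The sketch below uses a toy double-well energy, U(x) = (x² - 1)², and a hand-picked list of deposition centres; the energy function, the height of 0.5 and the width of 0.2 are all assumptions for the example, not values from the notes.

```python
import math

def double_well(x):
    """Toy underlying energy: minima at x = -1 and +1, barrier of height 1 at x = 0."""
    return (x * x - 1.0) ** 2

def bias(x, centres, height, width):
    """History-dependent bias: sum of Gaussians centred on previously visited positions."""
    return sum(
        height * math.exp(-((x - c) ** 2) / (2.0 * width ** 2))
        for c in centres
    )

def biased_energy(x, centres, height, width):
    """Energy the system actually feels: underlying landscape plus accumulated bias."""
    return double_well(x) + bias(x, centres, height, width)

HEIGHT, WIDTH = 0.5, 0.2

# Three depositions made while the system sits in the left-hand minimum.
centres = [-1.0, -1.0, -1.0]

well_level = biased_energy(-1.0, centres, HEIGHT, WIDTH)
barrier_level = biased_energy(0.0, centres, HEIGHT, WIDTH)
```

After three depositions the bottom of the left-hand well sits above the top of the barrier, so the system is pushed out towards the other minimum. Taller or more frequent Gaussians fill the well faster but give a coarser picture of the landscape.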

Answer 35

* Metadynamics can be **slow to converge** but is useful for getting a quick scan of the **energy landscape**.

Answer 36

* MC simulations are an alternative method for **sampling accessible microstates of the ensemble**, generally used for smaller systems. It uses techniques orthogonal to MD * MD uses Newton's **EOM** to predict positions of atoms at a future time, taking a time average to find the property of interest * MC generates configurations through **random numbers** that are **unconnected** in time, using an ensemble average to investigate a property

Answer 37

* Move a random particle in a predefined way (e.g along z-axis), with an acceptable dr * **Calculate U** of new configuration. If lower than original, **new configuration is accepted and replaces old.** * A trajectory-esque profile forms as we move to lower energy configurations.

Answer 38

* U_new > U_old: the Boltzmann factor of the energy difference is close to 1, so it is likely to be greater than a random # between 0 and 1. High probability that the small uphill move is accepted and added to the growing ensemble of microstates * U_new >> U_old (e.g. a steric clash): few random #’s are likely to be lower than the Boltzmann factor of ΔU. Low probability that a large uphill move is accepted. * **Low energy states are generally preferred** in this algorithm
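
The acceptance rule described on these cards is the Metropolis criterion, sketched below. Energies and k_BT are in the same arbitrary units, and the function names are chosen for the example only.

```python
import math
import random

def acceptance_probability(delta_u, kt):
    """Metropolis probability min(1, exp(-dU / k_B T)) of accepting a trial move."""
    if delta_u <= 0.0:
        return 1.0  # downhill (or equal-energy) moves are always accepted
    return math.exp(-delta_u / kt)

def metropolis_accept(u_old, u_new, kt, rng=random):
    """Accept the move if a uniform random number in [0, 1) is below the probability."""
    return rng.random() < acceptance_probability(u_new - u_old, kt)

p_downhill = acceptance_probability(-1.0, 1.0)     # lower energy
p_small_uphill = acceptance_probability(0.1, 1.0)  # slightly higher energy
p_clash = acceptance_probability(50.0, 1.0)        # e.g. a steric clash
```

If a move is rejected, the old configuration is counted again in the ensemble average rather than simply being skipped.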

Answer 39

* **MC does not need to follow realistic pathways, so can explore conformations more rapidly.** E.g. a protein folding event in MD would have to physically fold realistically, which is a slow process. MC could randomly rotate an important dihedral to quickly generate states of interest * **MC doesn’t require calculation of force**, whereas in MD, differentiating the potential to obtain the forces at every integration step (e.g. with Verlet) can be very costly. In MC this isn’t required, allowing for more unphysical models to be used.

Answer 40

* Sampling **efficiency depends** strongly on **move set choice**. Poor choice may limit transfer to other unknown configurations, **preventing access to all regions of phase space** (poor sampling). * For example, we might choose a certain dihedral as the degree of freedom to sample around, but this may in fact prevent certain configurations from being explored due to unknown factors.

Answer 41

* A mixed lipid membrane of chains differing by 4 C’s in their tail. * MD would be too slow to see lipid diffusion * MC would converge too slowly with a system this complex * Instead a trial MC move follows each MD step, removing/adding 4 C atoms, evaluating E of exchange to see if favourable * Sampling is more efficient as we have **removed dependency on starting configuration**, which can trap the system in very low energy configurations surrounded by high energy barriers.

Answer 42

* The **timescale problem** in MD makes it difficult to overcome kinetic barriers in our simulation time. * To enhance the sampling of this phase space we could use a biasing potential to explore unfavourable configurations, but this **requires knowledge of important factors** in pathways.

Answer 43

* REMD is an alternative method of **overcoming (or removing) kinetic barriers**, allowing more rapid sampling of phase space. * Can either **change the potential** via alteration of the forcefield to change the curve we are sampling (H-REMD) * Or **change the temperature** we sample at (T-REMD).

Answer 44

* The conditions we have chosen to investigate are now different, which can complicate the study of the system of interest if an event is **temperature dependent** (e.g. a phase change)

Answer 45

* Use Temperature Replica Exchange Molecular Dynamics * Run an MD simulation of different replicas of the system at **different temperatures _in parallel_** (parallel tempering = replicas are generated through MC instead) * Our **temperature of interest is the lowest.** * Exchange configurations between replicas using MC (in both methods) * Continue simulation * Repeat step 2 until converged.

Answer 46

* As in MC, interested in the free energy difference between states. * **If E lower make swap** * If not, a random number decides whether it is accepted or not * If accepted, take the high temperature simulation's coordinates and exchange them with the low temperature configuration * Over time, periodic exchanges between replicas **allow regions of phase space that were initially accessible only at high T to become accessible** at low T (takes a long time for this to occur though) N.B. These are not unfavourable regions, they are merely blocked from being sampled by high E barriers, and may be lower energy minima
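
The swap test can be sketched with the standard temperature replica-exchange criterion, p = min(1, exp[(β_cold - β_hot)(U_cold - U_hot)]) with β = 1/k_BT. The function name and the energies and temperatures used below are hypothetical.

```python
import math

K_B = 0.0083144626  # Boltzmann constant in kJ/(mol K)

def swap_probability(u_cold, u_hot, t_cold, t_hot):
    """Probability of exchanging configurations between a cold and a hot replica."""
    beta_cold = 1.0 / (K_B * t_cold)
    beta_hot = 1.0 / (K_B * t_hot)
    exponent = (beta_cold - beta_hot) * (u_cold - u_hot)
    if exponent >= 0.0:
        return 1.0  # hot replica has found a lower energy: always swap
    return math.exp(exponent)

# Hot replica (350 K) currently lower in energy than the cold one (300 K).
p_favourable = swap_probability(-100.0, -120.0, 300.0, 350.0)

# Hot replica higher in energy: the swap is accepted only some of the time,
# by comparing this probability with a uniform random number in [0, 1).
p_unfavourable = swap_probability(-120.0, -100.0, 300.0, 350.0)
```

The closer the two temperatures, the smaller the prefactor (β_cold - β_hot) and the more often an unfavourable swap is accepted, which is why the spacing of the temperature ladder matters.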

Answer 47

* States are not connected in time * Can’t use to calculate time dependent properties (e.g. diffusion coefficients) as the timescale used is unrealistic

Answer 48

* Many more processors are required to run many replicas in parallel. This along with the size of the system will be tied to the computational resource available.

Answer 49

* The probability of a transition from configuration 1 to 2 is a function of the energy difference, ΔU, between those states. * The probability of a configuration can be written in terms of the **Boltzmann distribution** where the ideal partition function is a normalizing factor

Answer 50

* Boltzmann probability is compared to a random number between 0 and 1 * If #_rand < p(new), configuration accepted, allowing an increase in energy * If #_rand > p(new), new configuration rejected and another copy of the original microstate is added to the set

Answer 51

* **Temperature** across simulations is **constant** * Instead a **soft-core potential** is used across replicas, where the **Lennard-Jones potential is gradually softened**, meaning that atoms can sit on top of one another while still remaining in certain conformations * Useful in ligand binding, where certain bound conformations can be locked in deep minima. * H-REMD allows ligand to rotate in pocket and explore different orientations while still being bound. * These can then be swapped into a correct LJP, giving an indication of the transitions between them

Answer 52

* Set of temperatures used such that the highest temperature will enable rapid exploration of phase space. * Finding this temperature can be difficult when we don’t know the PES * Spacing between temperatures must also be optimised to minimise convergence time
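
The notes do not say how the temperatures should be spaced. One commonly used starting point, assumed here rather than taken from the source, is a geometric ladder in which each temperature is a constant multiple of the one below it. The function name and the 300 K to 600 K range are illustrative.

```python
def geometric_ladder(t_min, t_max, n_replicas):
    """Temperatures with a constant ratio between neighbours (a common first guess)."""
    if n_replicas < 2:
        raise ValueError("need at least two replicas")
    ratio = (t_max / t_min) ** (1.0 / (n_replicas - 1))
    return [t_min * ratio ** k for k in range(n_replicas)]

# Hypothetical ladder: lowest rung is the temperature of interest.
ladder = geometric_ladder(300.0, 600.0, 4)
ratios = [hi / lo for lo, hi in zip(ladder, ladder[1:])]
```

Such a ladder is only a first guess: exchange rates between neighbours still have to be checked in a test run, and extra replicas may be needed where the energy changes sharply with temperature.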

Answer 53

* Must run a test to see if exchange is frequent/probable enough to reach convergence. * Otherwise, may just switch back and forth between two states of similar energy.

Answer 54

* If one is simulating a system involving a phase change due to increased temperature, this will result in a large energy change (e.g. a protein unfolding). * Must have a lot of replicas around the phase transition, where configurations suddenly differ greatly.

(78 cards)