Key notes Flashcards
1
Q
- There exist many computational methods that are available depending on the length/time scale we are interested in.
- Classes in order of increasing … scale and decreasing … scale they investigate
- Quantum/… … : includes atoms, electrons in an explicit solvent using … … (Schrodinger). Accurate but computationally unfeasible for >… atoms.
- … -…: all/most atoms included in an explicit solvent but instead uses…as a method of reference, which describe atoms via….
- Coarse-grained: group 4-5 heavy atoms into beads, in an explicit or … solvent using MD. Far less interactions to compute but at the cost of accuracy.
- … -…-… : interaction sites grouped, comprising of many atoms, generally proteins/peptides, using an implicit solvent with … dynamics.
- … : Materials represented as a continuous mass in an implicit solvent, using … … (instead of QM/EOM).
A
- There exist many computational methods that are available depending on the length/time scale we are interested in.
- Classes in order of increasing length scale and decreasing time scale they investigate
- Quantum/Ab initio: includes atoms, electrons in an explicit solvent using quantum mechanics (Schrodinger). Accurate but computationally unfeasible for >3 atoms.
- All-atom: all/most atoms included in an explicit solvent but instead uses MD as a method of reference, which describe atoms via EOMs.
- Coarse-grained: group 4-5 heavy atoms into beads, in an explicit or implicit solvent using MD. Far less interactions to compute but at the cost of accuracy.
- Supra-coarse-grained: interaction sites grouped, comprising of many atoms, generally proteins/peptides, using an implicit solvent with stochastic dynamics.
- Continuum: Materials represented as a continuous mass in an implicit solvent, using continuous dynamics (instead of QM/EOM).
2
Q
- Define rare events in the context of molecular simulations.
A
- Processes that involve time scales much longer than what we can computationally afford.
- Definition relative to our method of choice.
3
Q
What are enhanced sampling techniques
A
- Computational methods that allow one to overcome the timescale problem by sampling the phase space of our system to a greater extent. We can use MD or MC to do this.
4
Q
- (IMP) How is system free energy related to the timescale problem.
A
- Our phase space if a collection of all configuration’s position and momenta.
- A rare event is likely to be separated from our starting configuration by a high free energy barrier that cannot be overcome using MD/MC alone in a feasible timescale. Instead we are likely to spend all/most of our simulation time trapped in that local minima.
5
Q
- (IMP) Briefly describe the two main methods of enhanced sampling.
A
- Free energy-based methods: characterize PES of system
- E.g. umbrella sampling, replica exchange MD, metadynamics
- Choice of reaction coordinate/dof important and will affect free E of system.
- No direct info about kinetics of the process
- Path sampling-based methods: explore all possible pathways
- E.g. transition path/interface sampling, forward flux sampling
- More suited for processes with many possible pathways between
- No direct info about the thermodynamics of the system.
6
Q
- What is an order parameter?
A
- An order parameter or collective variable, allows the different configurations/states of a system to be distinguished
- Allows system to be driven from A to B (or v.v) through application of enhanced sampling method of choice.
7
Q
- What are the requirements of an OP? what is the difficulty in this requirement?
A
- Must be differentiable with respect to atomic position
- This is simple in the cases illustrated but can be very challenging for complex systems that is simple enough to compute but accurately represents dynamics.
- Can also be difficult when no prior knowledge of configurational space is known.
8
Q
- What is affected by our OP choice in free energy-based methods.
A
- Our choice of OP determines the resulting free energy surface (FES)
- This multidimensional hyper surface is re-mapped on to a simple coarse-grained FES (1D/2D)
9
Q
- What is problem with this new FES?
A
- Can often be an oversimplification as only representative of the chosen order parameter.
- The true FES, which represents all configurational space does not equal the FES we are sampling, constrained to our OP of choice
10
Q
- (IMP) How does statistical mechanics relate to our FES assumption?
A
- Stat mech confirms true FES ≠ our FES according to this OP, as uses configurational partition function, Z which does not contain info about kinetics/ dynamics of the system and depends only on particles position in the system only.
11
Q
(PPQ) How can we use commitor analysis to probe the accuracy of our OP?
A
- Select a value, OP* of OP corresponding to a putative transition state (maximum)
- Sample an ensemble of configurations characterized by OP*
- Run several MD simulations for each, varying velocity (making them statistically independent)
- Find probability of OP* configuration making to it to B
- Plot histogram of these probabilities.
12
Q
- (PPQ) Which of these histograms indicates a more accurate OP?
A
- (a) indicates no matter where you start you always end up with same probability of ending up in A or B, NO indication this is a TS
- (b) indicates probability of ending up in B increasing as you move further along coordinate, 0.5 at middle. Representative of the putative TS
- Therefore, b is better.
13
Q
(IMP)
- … … … is a path based enhanced sampling method where instead of computing an overall (v.low) … , path is divided into a series of …
- Each … has an increasing value of our …, … each corresponding to a possible … with said value of …
- As λ increases likelihood of going from … to … , rather than back to A, …
A
(IMP)
- Forward flux sampling is a path based enhanced sampling method where instead of computing an overall (v.low) probability, path is divided into a series of interfaces
- Each interface has an increasing value of our OP, λ, each corresponding to a possible configuration with said value of λ
- As λ increases likelihood of going from A to B, rather than back to A, increases
14
Q
(IMP) How are trajectories generating in FFS?
A
- Begin by looking at fluctuations of a long unbiased MD run
- At each interface λi large # of trial molecular dynamics runs done
- The few that reach the next interface λi+1 are used as a starting point to reach the following interface.
15
Q
(IMP)
- ML methods are good at … data, but poor are … predictions.
- In other words, ML can give accurate prediction … … … … (filling the gaps), but poor at predicting data … of it.
- The … /… of data accessible and how we … it is much more important than the algorithm we feed it in to.
A
(IMP)
- ML methods are good at interpolating data, but poor are extrapolating predictions.
- In other words, ML can give accurate prediction within a data set (filling the gaps), but poor at predicting data outside of it.
- The amount/quality of data accessible and how we describe it is much more important than the algorithm we feed it in to.
16
Q
Why is there a need for ML-based interatomic potentials in simulations?
A
- Classical forcefields lack the detail to accurately reproduce complex systems, as functional forms can be very limiting.
- The timescales these systems exist in are also far too large for quantum calculations.
17
Q
- What are the pros and cons of ML-based interatomic potentials?
A
- Pros: fast, long large scale simulations with quasi quantum chemistry accuracy (if data set is good
- Cons: Not easy to craft dataset, can take years to improve iteratively.