Structure bioinformatics - computational approaches Flashcards

Question

To simulate molecular trajectories, what do we need to define?

Answer 1

The forces: Related to how the atoms are interacting with each other and how they are influencing each others movement. The velocities: Temperature related, related to how fast the atoms are moving.

Answer 2

- Non-bonding interactions - Bonded-terms - Angle terms - Torsion terms (dihedral angles). - Improper torsional terms We want all of these parameters to be close to energy equilibrium to make the force and energy around the atoms favorable. We penalize distance and angles outside of equilibrium.

Answer 3

Van der Waals interactions: When atoms come close enough so that the outer electron layer of the atoms can touch, Van derivative Waals interactions can be formed. These interactions are distance dependent, if they get too close they repulse each other and if they are too far apart the interaction is 0. These interactions are crucial in modeling the forces that work on a system, we want these as close to equilibrium as possible. When modeling the forces we use Lennard-Jones potential to model the forces as a function of the distance between the atoms.

Answer 4

Electrostatic interaction refers to the attractive or repulsive forces between electrically charged particles defined by Coulomb potential and calculated using partial charges. These are modeled together with Van der Waals interactions with the purpose of understanding the forces on a system that are due to non-bonding interactions. To reproduce the electrostatic interactions we need to select partial charges to all individual atoms.

Answer 5

- Angle terms - Torsion terms (dihedral angles) - Improper torsional terms

Answer 6

The energy equilibrium for distance between atoms is the length of the bond. We penalize all distances smaller or greater than this since it will increase the forces which will make the energy less favorable.

Answer 7

there is an equilibrium angle and you penalize if you are not at equilibrium. These are less rigid than bonds meaning that they can move more without the energy becoming to unfavorable. We need 3 atoms for an angle.

Answer 8

The bonds can move around the axis and this will create different angles between planes. You need 4 atoms in a row to define these angles.

Answer 9

Usually if you have bonds and angles then you do not have non-bonded interactions between these atoms. In a bigger molecule, atoms further from each other than 4 connections can have non-bonded interactions and you should compute the energies for these by looking at the distance between them. There is however a distance cutoff where you assume that the non-bonded interactions are 0.

Answer 10

It represents the forces that come from the bond term (the bond vibrations influenced by how much the bond is stretched)

Answer 11

It's the Lennerd-Jones potential that describes the forces that come from the Van der Waals interactions between atoms.

Answer 12

It represents Coulombs law that describes the forces on a system that come from electrostatic interactions.

Answer 13

Minimization refers to the process of finding the configuration of a molecular system where the potential energy is minimized. The potential energy landscape represents the energy of a system as a function of the coordinates of its atoms, and the goal of minimization is to identify the lowest-energy state or a local minimum on this landscape. Energy minimization is a crucial step in molecular dynamics simulations and structure optimization. It is used to relax the initial atomic coordinates, remove steric clashes, and find stable conformations. Ligand binding also happens at low energies.

Answer 14

A local minimum on a potential energy landscape is a configuration of atomic coordinates where the potential energy is lower than in the immediate surrounding region. It's a point where the system is stable with respect to small variations in atomic positions. In a simulation of a bigger molecule we would probably find different local minima if we run the simulation several times because the atoms move around and the energy landscape changes which will give different local minima.

Answer 15

The global minimum is the lowest point on the entire potential energy landscape. It represents the most stable configuration of the system among all possible atomic arrangements. Finding the global minimum is essential for understanding the most energetically favorable state of the system. We would like to find the global minima but usually it is only possible to find the local minima. We can only find global minima if we look at the whole systems with molecular dynamics.

Answer 16

Input data: - The force field of the system. - Coordinates of the atoms The coordinates are used by the force field to get the energy landscapes and from the force field we can also compute the forces on each atom giving us their accelerations. Velocities are used to control the temperature of the system according to the thermostat algorithm. Minimize the system energy to find the local minimum and avoid potential clashes between atoms. Run simulation and compute trajectories. Compute the free energies.

Answer 17

Free energy drives physical processes like protein folding and ligand binding and describes spontaneous processes. High free energy would mean that the model is not very good because ligand binding and such happens at low energies.

Answer 18

Molecular dynamics are performed in finite boxes and need to represent infinity. Periodic boundary conditions (PBC) are a set of conditions used to simulate a system as if it were part of an infinity.

Answer 19

PBC involves creating periodic replicas of the simulation box in all three spatial dimensions (x, y, and z). These replicas are essentially copies of the original box. If a particle crosses the boundary in the x-direction, for example, it reappears on the other side of the box in the same x-position but with the same velocity.

Answer 20

The free energy is a sum of enthalpy and entropy. The enthalpic contribution depends on the interactions in your system and the entropy of the order of your system. Disorder = high entropy. Potential energy is the energy stored in bonds ect.

Answer 21

When you have minimized your system you run multiple short MD simulations as you are increasing the temperature to reach lab conditions.

Answer 22

When you lastly run the simulation and compute the trajectories.

Answer 23

Because they look at the whole/general structure and therefore become sensitive to general orientation/position in space so we need to superimpose prediction and template before using the metrics.

Answer 24

Both are metrics to see how similar a predicted fold is to a template fold. Both looks at the distances in the placement of alfa carbons in the backbone between template and structure. However RMSD looks at the whole alignment wether the atoms are superimposed or not and gives equal weight to all distances by taking the average distance. TM only looks at the overlapping parts of the superimposed structures and takes no average. It is designed to be size-independent

Answer 25

Homology modeling: Only works with high sequence identity and if the sequences with high identity has an experimental structure. Threading: could be time-consuming to test all available folds so you have to choose a selection of them to test. Alpha fold: world better for helices than sheets.

Answer 26

Molecular docking is a fast way to filter through many compounds and find the ones that are worth working with further. It is good to use as an initial screening method. However, the approximations that the algorithm makes to make it faster gives us lower accuracy. The docking algorithms are also not good at choosing the ligands with the highest affinity out of multiple ligands with known affinity. We need MD simulations for that. Therefore, when you have found the initial compounds, the molecular simulations will give you a more realistic and accurate result of how the protein and ligand are going to work together since we can find the most energetically favorable state of the system and we don't take as many short-cuts ect. The molecular dynamics is to computationally heavy to use as a screening method. Therefore in drug discovery, use molecular docking as a screening method and then look further at the results and make them better by using molecular dynamics simulations.

Answer 27

Because enzymatic reactions include the breaking and forming of bonds which cannot be modeled by MD simulations because in the mathematical representations of the forces bonds cannot break.

Answer 28

Screening databases and chemical libraries for compounds. Docking screening - which compounds fit into the binding pocket? Docking algorithm: - sampling of ligand conformations - ligand scoring.

Answer 29

The drug like chemical space is too big. High throughput screening libraries. Commercial chemical space.

Answer 30

Get the structure Define the force field - using the forcefield parameters. Minimize the system to find local/global minima Equilibrium phase where we slowly increase the temperature to reach lab conditions Production phase where we run the simulation and get the trajectories. Compute the free energies.

Answer 31

Within the molecule only the vibrations of the bond modeled by Hooke's law but between the separate O2 molecules we also have the non-bonded terms Van her waals and electrostatic interactions modeled by Lennard-jones potential and Colombs law.

Answer 32

In reality bonds would break if we pulled the atoms too far apart but in the mathematic representations of the forces the come from bonds the energy increases the more we pull the bond.

Answer 33

Alphafold uses machine learning to predict the protein structure. You input an amino acid sequence and alphafold does multiple sequence alignments to find a sequence representation of the input. It also looks for related structures in PDB to create a pair representation of the input. The pair and sequence representation is put through a transformer called the evoformer that extracts information out of the representations and a structural hypothesis is formed. The structural hypothesis is used to improve the MSA which then gives a new hypothesis. This is done in 48 blocks. The refined hypothesis is then put through a second neural network that gives an initial prediction of the structure and applies physical and chemical constraints. This prediction goes back though the evoformer and the second network 3 times before an output is given.

Structure bioinformatics - computational approaches Flashcards

(57 cards)