protein modeling Flashcards

Question

what to consider ? Homology modelling, Carefully align similar regions

Answer 1

2. Align the sequence of target and template(s) → Which parts of the sequence are similar? → Are there parts you don’t want to align?

Answer 2

Extracting spatial restraints → How is the spatial environment of a residue in the template? → Transfer the information to the target

Answer 3

→ Transfer the spatial orientation from the template to the structure → Keep as many restraints as possible

Answer 4

Refine the model → Side chain orientations → Removing clashes → Energy minimisation → ….

Answer 5

Don’t just use, what you get Evaluation → Checking for clashes → Weird side chain orientations → Ramachandran plot → Kinked backbones → ….

Answer 6

* Does the backbone adopt angles and conformations that are theoretically allowed?

Answer 7

1. Choose the right template 2. Align the sequence 3. Extracting spatial restraints 4. Modelling 5. Refine the model 6. Evaluation if you dont like what you see you start again with different input

Answer 8

* Template and Target have a high sequence similarity/identity * Known structural motifs → e.g. α-helices

Answer 9

* Loops (structurally more complex) * No templates with similar sequence * Reproducing different protein conformations ( Choice of template improtant ) * Details e.g. binding site conformation & side chain orientations ( you can model with a ligand in the binding site)

Answer 10

* Choose templates carefully → Resolution → Sequence similarity → Conformation → Bound ligand? * Pay attention to how the sequences are aligned * Evaluate a model before using it further * Is an additional optimization required?

Answer 11

* “Predicting the 3D structure only based on the amino acid sequence” * Not a new concept but incredibly challenging * Massive advances in the recent years by using AI/ML approaches → AlphaFold2 → RosettaFold → OmegaFold → OpenFold → ESMFold Code available online

Answer 12

* AlphaFolddatabase: pre-predicted structures * ColabFold: → Colabnotebook for custom protein structure prediction * Local installation: → Make your own predictions (probably doesn’t work on the average PC) * Available code: → Can be downloaded and adapted * Adaptations of the original code: → AlphaFoldMultimer: Protein complex predictions → Prediction of different conformations

Answer 13

* Confidence metric: pLDDT(based on Local Distance Difference Test metric) * pLDDT>90: expected to be modelled with high accuracy * 90>pLDDT>70: well modelled, generally good backbone prediction * 70>pLDDT>50: low confidence modelling, be cautious * 50>pLDDT: should not be interpreted, likely disordered (unstructured or only structured in complex with other protein

Answer 14

* Higher confidence for structured domains with many inter-residue contacts * Low confidence for loops, linkers & unstructured regions * Ignorant of different conformations (in parts fixed by adapted, separate codes) * No predictions for non-protein components * Lack of details (e.g.binding sites) & no way to directly influence this * Ignorant of environment, e.g.membra

Answer 15

* Reasons to use a predicted structure: → No experimental structure available → No experimental structure in the desired conformation available → No full-length experimental structure available * Homology modelling? → Modelling of specific features in a specific way (e.g. the ligand binding site) → More influence on specific features and how they should be modelled * AI-generated models? → No idea about the structure at all → No templates available → Quick impression of structural arrangement → No details required (e.g. as input for MD)

Answer 16

* Root-mean-square deviation of atomic positions (RMSD) * Quantitative measure for the similarity of two protein structures of the same protein * Usually calculated for the protein backbone (C, O, N and Cα) or Cα only * Usually also includes a rigid superimposition to minimize the resulting RMSD

Answer 17

in vitro Requires more time to cover a small portion of space More definite yes or no in silico Missing a few good choices Requires less time to cover a large portion of space

Answer 18

STRUCTURE BASED * Docking calculations * Virtual screening * MD simulations * … Keywords: * Homology Modelling * Force Fields (Molecular Mechanics) Ligand-based * Pharmacophore modelling * MedChemapproach ...

Answer 19

Prediction of the interactions between two molecules * Protein – Small molecules * Protein – Protein tses 2 have same basics but different methodology * (Small molecule – Small molecule)

Answer 20

Protein structure, Molecule library Docking calculation Ranking Post-processing (clustering, filtering, …) Visual inspection & molecule selection

Answer 21

Are there artifacts from exp. structure (stabilization)? * Correct mutations * Remove additional proteins Molecules from crystallization? (e.g.PEG) * Remove Additional molecules, proteins or protein parts (e.g. fusion proteins) can slow down the calculation! Are side chains missing or with two orientations? * Add missing side chains * (Rotamer libraries to predict orientation) Hydrogen atoms * Add if they are missing * Consider protonation states! →Can depend on the environment! tricky... where to protonate histidine for example Water molecules in the binding site? * Mediators of protein-ligand interactions?

Answer 22

Novel ligands * Diverse chemotypes * Molecules with diverse characteristics(MW, logP, charges, H-bond donors/acceptors..) * (Ultra-)Large libraries * virtual molecule libraries grew drastically in recent years! Specific ligands * Based on prior knowledge * In-house libraries * Natural products

Answer 23

* Add hydrogen atoms * Correct protonation states? → pH? * Usually 3D-conformations are necessary → Depending on docking program → Conformer generation can be tricky

Answer 24

In principle docking consists of two repeating steps Search for poses (Orienting the molecule in the binding site) Scoring (Judging the pose)

Answer 25

1. What is kept rigid? → Protein & ligand (rarely used nowadays) → Protein → Neither protein nor ligand (more complex) 2. Search algorithm → Stochastic (e.g.genetic algorithms) → Deterministic (e.g.Energy minimization) → Systematic (e.g.using an ensemble of pre-created conformers)

Answer 26

Bond Angle Torsion but mainly these cause the other ones dont change easily : Electrostatic interactions Van-der-Waals interactions

Answer 27

Scoring functions: Force field based Molecular mechanics Empirical Reproduce empirical data of a specific system (statistical analyses of experimental data) Knowledge based Some parameters stem from empirical data Another important factor: Entropic contributions (e.g.desolvation, restraining of flexibility,…) Partially included in some scoring functions; very complex to model! In general: The lower the energy value, the better!

Answer 28

The more exact the Scoring function, the better the prediction! →More time intensive →What is your aim? →Few molecules & exact poses: more accurate, but slower scoring function →Many molecules, quick poses over accuracy: less accurate, but faster scoring function Good rank ≠ True ligand Scoring functions are not perfect…

Answer 29

UsuallyTop 500-5000 molecules from the ranking Intramolecular angles? Clashes? Are polar groups interacting? Desolvation; stronger interactions Apolarinteractions? e.g.π-πinteractions

Answer 30

* Predicting ligand binding poses * Differentiate between ligands and non-binders * Discovery of chemically novel ligands * Predict absolute affinities * Differentiate between ligands and non-binders * Positioning of very flexible ligands (e.g. peptides) * Considering protein flexibility

Answer 31

Screen large molecular libraries Virtual Screening Rational modification of a hit molecule Rational combination of fragments Which amino acids are relevant for Protein-Ligand interaction? Don‘t forget about the experimental validation!

Answer 32

YES EXAMPLE Inactive conformation favours antagonists Active conformation favours agonists

Answer 33

ZINC12 library ~3.6 mio. mols 27 molecules tested 1 sub-micromolar hit=Novel scaffold! Create a library of similar molecules Docking & evaluation All 11 additionally tested molecules are ligands!

Answer 34

What was the aim? * Screening for any ligands to an “easy” target? → higher hit rate (30-40%?) * Searching for novel scaffolds? → lower hit rate might be fine * Screening to a new binding site, a target without ligands or a model? → 1 hit might be ok (Virtual) screens are not supposed to yield the perfect new drug, but only a starting point!

Answer 35

To study details that are not easily measured in real-life experiments. To study small and fast phenomena. Biomolecular simulations are most commonly “classical MD” We simulate proteins to validate and quantify the function, motions and interactions that determine molecular function. Computation electrophysiology Lead optimization (Generate the hit analogues by linking the hit with fragment molecules.) and testing them

Answer 36

Classical” MD = Newtons laws of motion a small change in one atom can make a huge difference in many after a few steps , 1) Position + Velocity of all atoms 2) Interactions with all other atoms (forcefield) 3) Calculate the forces acting on all atoms 4) Update atoms to new positions and velocities after a small time step. 5) Go to step 1 and repeat simulations can involve structure up to the whole cell with lees resolution as we go up in size

Answer 37

2 fs (0.000 000 000 000 002s)

Answer 38

12 000 (144M interactions)

Answer 39

10 ns (5M steps) KTH supercomputer: 256’000 cores to be able to proccess more

Answer 40

non-protein molecules (crystallographic waters, ligands, modified amino acids, etc.) ● alternate conformations ● missing side-chain atoms ● missing fragments ● clashes between atoms ● multiple copies of the same protein chains ● di-sulfide bonds ● wrong assignment of the N and O atoms in the amide groups of ASN and GLN, and the N and C atoms in the imidazole ring of HIS [

Answer 41

1. Clean 2. Box 3. Solvate ( add water ) 4. Neutralize (add ions) 5. Minimize energy 6. Equilibrate 7. Production MD and analysis

Answer 42

No chemistry No breaking or forming of bonds No pH Low complexity systems Single lipids in membranes No natural competing interactions Small systems / simple interactions Sampling Local minima Butterfly effect

Answer 43

Trajectory the simulation “movie” Topology a description of which atoms are in the simulation and how they are connected Force field a list of forces for all kinds of atoms. RMSD ”Root mean square distance”, a measure of how similar a configuration of points are to a reference configuration

Answer 44

Bigger motions and collective changes ( like comformational changes and protein tumbling ) takes longer to observe “Stuck in local minima” A high barrier = More unlikely to happen = Takes longer to happen spontaneously Transitions between protein states also has barriers X-ray structure of receptor in Apo state + A drug Expect: A simulation that shows the transition to the activated receptor Pulling simulations Attach the molecule being pulled to a spring, which is connected to a “fake” atom that moves along some interesting motion motion. Umbrella sampling Attach the molecule being pulled to a spring, which is connected to a “fake” atom that moves along some interesting motion motion forces the protein to sample a particular part of the transition

Answer 45

“Pairwise atom distance” RMSD NEEDS alignment to be meaningful Markov State Modeling represents the conformational space of a molecular system as a set of discrete states, where each state corresponds to a distinct region of the system's configurational space. Transitions between these states are governed by transition probabilities, which describe the likelihood of transitioning from one state to another in a given time interval. Principle component analysis Identification of Dominant Motions: PCA identifies the principal components (PCs) that describe the most significant collective motions in the system. Each PC represents a linear combination of atomic displacements, capturing correlated motions across the molecule.

Answer 46

Signaling Pathway: The WNT signaling pathway is a complex network of intracellular signaling cascades initiated by binding of WNT ligands to cell surface receptors. This pathway regulates diverse cellular processes such as cell proliferation, differentiation, migration, polarity, and stem cell maintenance. WNT Ligands: The WNT family comprises 19 members in humans, each encoded by a distinct gene. These proteins are characterized by their conserved cysteine-rich domain and are classified into several subgroups based on their sequence homology and functional properties. Receptors: WNT ligands bind to cell surface receptors belonging to the Frizzled (FZD) family of seven-pass transmembrane proteins. In addition to FZD receptors, WNT signaling can also be modulated by co-receptors such as LRP5/6 (Low-Density Lipoprotein Receptor-related Protein 5/6) and ROR1/2 (Receptor Tyrosine Kinase-like Orphan Receptor 1/2). WNT/-catenin pathway - Multiprotein complex - Signal initiation independent of conformational dynamics

Answer 47

The ternary complex model is a conceptual framework used to describe the interaction between a ligand (L), a receptor (R), and an effector (E) in pharmacology and signal transduction. This model provides insights into the functional consequences of ligand-receptor interactions and how they may lead to downstream biological effects. The ternary complex model is particularly relevant in the context of G protein-coupled receptors (GPCRs), which are a large family of cell surface receptors involved in various signaling pathways.

Answer 48

Dual color FRAP * Assesses proteinprotein interaction * Crosslinking‐based FRAP stands for Fluorescence Recovery After Photobleaching. It is a microscopy technique used to study the dynamics of fluorescently labeled molecules within living cells or biological tissues. FRAP provides insights into processes such as protein diffusion, membrane dynamics, and molecular interactions in real-time.

Answer 49

Understand WNT-binding Define the role of the linker domain Understand parameters of signal specification Define mechanisms of effector coupling Explore the drugability of FZDs

Answer 50

* Y2502.39 is not phosphorylated but crucial for stabilizing the receptor structure * Mutation of Y2502.39 affects DVL interaction but not G protein coupling * First evidence for conformational selection of pathway specifcity

Answer 51

Signaling * On the hunt for protein phosphorylation using MS and protein chemistry Bioinformatics: * Protein modeling * Molecular dynamics simulations (distance measurements)

Answer 52

Bioinformatics: * Protein modeling * Molecular dynamics simulations * Protein-protein contact

Answer 53

* WNTs induce a dynamic and reversible dissociation of the FZD6 dimer * The methodology allowed for the first time to assess dimer dynamics in living cells in the time course of minutes. * The dimer of FZD6 appears to stabilize an inactive conformation whereas the monomeric form is the conformation that mediates G protein signaling. * It remains unclear whether this concept is generally applicable for all FZDs

Answer 54

Bioinformatics: * Sequence alignment * Cancer database mining * Protein modeling * Molecular dynamics simulations (in silico mutation; distance measurements)

Answer 55

* Class F has a conserved molecular switch mechanism (R6.32- W7.55) essential for G protein activation * Mutation of the molecular switch (unlocking the interaction) impairs DVL coupling

Answer 56

Bioinformatics: * Sequence alignment * Protein modeling * Molecular dynamics simulations * Volumetric analysis * Ligand docking

Answer 57

* Targting FZDs with small molecules is possible * SAG1.3 is the first but still a lousy partial agonist at FZD6 * The FZD core contains a ligand binding pocket arguinig again for intrinsic existence of activation receptor core dynamics

Answer 58

Bioinformatics: * Sequence alignment * Protein modeling and mutagenesis * Molecular dynamics simulations * Volumetric analysis

Answer 59

* Mutagenesis approach allows definintion of sites defining effector coupling specificity * FZDs prefer coupling to DVL over G proteins * Receptor dynamics are essential for pathway selection

Answer 60

Bioinformatics: * CryoEM analysis * Protein modeling * Evolutionary analysis * Molecular dynamics simulations * Volumetric analysis

Answer 61

* Molecular switch acts very differently and explains why FZDs couple to DVL and less well to heterotrimeric G proteins * Water networks can be resolved and they turn out to be important for receptor activation * Conserved cholesterol binding site is of functional relevance

Answer 62

Disheveled (DVL) proteins are a family of cytoplasmic proteins that play critical roles in transducing signals from Wnt receptors, particularly Frizzled (FZD) receptors, to downstream signaling pathways. DVL proteins are key mediators of the Wnt signaling pathway, which is essential for various developmental processes, tissue homeostasis, and disease mechanisms.

Answer 63

* Mutagenesis * Sensor design and sites for bioorthogonal labeling * Validate dynamics * Relate receptor function to cancer * Identify FZD‐targeting compounds * Allow class F‐wide conclusions * Integrated use of methodology * Understanding activation and signal specification * Buidling new sensors * Finding FZD‐targeting molecules

protein modeling Flashcards

(87 cards)