Antibody Modelling Flashcards

Question 1

Q

What’s the situation with sequence and structural data?

Answer

A

We have loads of sequence data but not that much structural data. The theory is that sequence determines structure, so we should be able to use sequences to predict structures.

Question 2

Q

Why are predictions useful?

Answer

A

They are useful because:

They help us understand the sequence-structure gap
The predicted structural information can tell us about function
They can guide rational drug design
They can guide mutagenesis studies
They help to solve experimental structures
They focus attention on the fundamental chemistry of protein structures

Question 3

Q

How can protein similarity be measured?

Answer

A

Sequence identity - compares the sequences, reported as the percentage of aligned residues that are identical.
Root mean square deviation (RMSD) - compares structures: they are superimposed and the root mean square distance between equivalent atoms is calculated. 2.5 Angstrom is good; lower is better.
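
As a rough illustration of the two measures above, the sketch below computes percentage identity for two sequences that have already been aligned, and the root mean square deviation for two sets of coordinates that have already been superimposed. The function names and toy inputs are hypothetical. A real comparison would first need an alignment and an optimal superposition, and identity can be normalised in other ways than the one assumed here (gap-free columns only).

```python
import math

def sequence_identity(aligned_a, aligned_b):
    """Percentage of gap-free alignment columns where both residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Assumption: columns containing a gap ('-') are ignored entirely.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)

def rmsd(coords_a, coords_b):
    """Root mean square deviation between equivalent, already-superimposed points."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = sum(
        (a - b) ** 2
        for p, q in zip(coords_a, coords_b)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared / len(coords_a))

# Toy inputs (hypothetical): a short aligned pair and two 2-atom "structures".
print(sequence_identity("ACDE-F", "ACDQGF"))
print(rmsd([(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)], [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]))
```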

Question 4

Q

What is CASP?

Answer

A

Critical Assessment of protein Structure Prediction (CASP)
It runs blind trials of protein structure prediction software, so that methods can be assessed without bias and compared by other users.

Question 5

Q

Describe the 2 ab-initio energy calculation methods for structure prediction.

Answer

A

Ab-initio methods use calculations and simulations to determine structure. They were not the most successful methods for protein structure prediction.
Energy Minimisation - The atoms are described in terms of bond lengths, bond angles, dihedral rotations and non-bonded interactions. The conformation with the lowest energy is then searched for. Hydrophobicity terms need to be added, and structures can get stuck in false (local) energy minima.
Molecular Dynamics - The atoms are described in terms of bond lengths, bond angles, dihedral rotations and non-bonded interactions. Newton’s laws of motion are solved over time, which allows jumps over energy barriers to find the lowest energy conformation.
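
To make the "stuck in a false minimum" point concrete, here is a minimal sketch of energy minimisation by steepest descent on a made-up one-dimensional energy curve with two wells. The energy function, step size and starting points are assumptions chosen for illustration only. A real force field has many thousands of coordinates, and molecular dynamics is not shown here.

```python
def energy(x):
    """Toy 1D 'energy surface' with a shallow well (x > 0) and a deeper well (x < 0)."""
    return x**4 - 2 * x**2 + 0.3 * x

def gradient(x):
    """Analytical derivative of the toy energy."""
    return 4 * x**3 - 4 * x + 0.3

def minimise(x, step=0.01, iterations=5000):
    """Steepest descent: repeatedly move a small step downhill."""
    for _ in range(iterations):
        x -= step * gradient(x)
    return x

# Starting on the right-hand side, the search settles in the shallow (false) minimum;
# starting on the left, it reaches the deeper one.
from_right = minimise(1.5)
from_left = minimise(-1.5)
print(from_right, energy(from_right))
print(from_left, energy(from_left))
```

The minimiser only ever moves downhill, so the result depends entirely on the starting conformation. That is the limitation molecular dynamics tries to address by letting the system cross energy barriers.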

Question 6

Q

How does secondary structure prediction work?

Answer

A

It is based on the idea that local sequence determines local structure. Programs aim to predict secondary structure elements: alpha-helices, beta-strands and coils.

Question 7

Q

How can you measure the accuracy of structure prediction?

Answer

A

Accuracy (Q3, because it is a 3-state model) = no. of residues correctly predicted / total no. of residues considered.
For a typical protein with an average mix of alpha-helices, beta-strands and coils Q3 = 40%
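
The Q3 formula above can be sketched in a few lines. The one-letter state codes (H for helix, E for strand, C for coil) and the example strings are assumptions made for illustration.

```python
def q3(predicted, observed):
    """Percentage of residues whose predicted 3-state label matches the observed one."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("need two non-empty state strings of equal length")
    correct = sum(1 for p, o in zip(predicted, observed) if p == o)
    return 100.0 * correct / len(observed)

# Toy example (hypothetical): 10 residues, 2 of which are mispredicted.
print(q3("HHHHCCEEEE", "HHHCCCEEEC"))
```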

Question 8

Q

How does the Chou-Fasman principle work?

Answer

A

This is an early method that gives residues scores and then adds additional rules to determine secondary structure elements. It’s based on the idea that amino acids have a preference for certain secondary structure elements.
Propensity (e.g. of alanine for a helix) = (no. of alanines in helices / no. of alanines in the database) / (no. of amino acids in helices / no. of amino acids in the database)
Propensity = 1 indicates an average amino acid (no preference)
Propensity > 1 indicates a preference
Propensity < 1 indicates a dislike
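
The propensity ratio can be worked through on a tiny made-up "database". The sketch below only computes the propensity score. It does not implement the additional Chou-Fasman rules, and the sequence, the structure string and the function name are hypothetical.

```python
def propensity(residue, state, sequence, structure):
    """(fraction of this residue found in the state) / (fraction of all residues in the state)."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    n_residue = sequence.count(residue)
    n_state = structure.count(state)
    if n_residue == 0 or n_state == 0:
        raise ValueError("residue or state does not occur in the data")
    n_residue_in_state = sum(
        1 for aa, ss in zip(sequence, structure) if aa == residue and ss == state
    )
    return (n_residue_in_state / n_residue) / (n_state / len(sequence))

# Toy database (hypothetical): 10 residues, half of them helical (H), half coil (C).
sequence = "AAAGAGGGAG"
structure = "HHHHHCCCCC"
ala_helix = propensity("A", "H", sequence, structure)  # 4 of 5 alanines are helical
gly_helix = propensity("G", "H", sequence, structure)  # 1 of 5 glycines is helical
print(ala_helix, gly_helix)
```

In this toy data alanine scores above 1 for helix (a preference) and glycine scores below 1 (a dislike), which matches the interpretation given above.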

Question 9

Q

Describe stereochemical methods for secondary structure prediction.

Answer

A

These methods use the hydrophobicity of residues and the way it favours particular secondary structures,
e.g. hydrophobic residues are in the core, while polar or charged residues are exposed.
These methods achieve about 60% accuracy.

Question 10

Q

Describe the use of artificial neural networks for structure prediction.

Answer

A

Artificial neural networks and machine learning are used because you can supply input sequences together with their known output structures. The network learns the weights for its signals, and different architectures can be tried, so that it can predict outputs from new inputs.

Question 11

Q

Why has secondary structure prediction improved?

Answer

A

Better algorithms
More structural data
More sequence data

Question 12

Q

What is template-based modelling and what are some of the other terms used to describe the process?

Answer

A

Template-based modelling rests on the idea that homologous proteins have similar structures, so using known structures as templates is the best way to predict 3D structures.
This process is also called:
-homology modelling
-threading
-fold recognition
-comparative modelling

Question 13

Q

Describe the 6 steps of template-based modelling.

Answer

A

Find the template sequence - Compare the query sequence and its predicted secondary structure to the database, looking for at least 15% identity. MSAs are used if possible.
Align the sequences - Multiple structures allow gaps and loops to be predicted.
Substitute the sequence onto the structure - If residues match, copy them; if there is a clear substitution, use the backbone as a template; if there is an indel, place it in a suitable position, depending on its size.
Identify and model loop regions - If there’s a conserved backbone this is fairly easy. If it is not conserved, search the database for loops with similar sequences and orientations, then use these as templates. There are various algorithms for this; short loops are predicted better.
Add side chains - Use rotamer libraries and probabilities to ensure allowed chi angles and no steric clashes. Consider the packing and the formation of H-bonds, disulphides, etc.
Model refinement - Use energy calculations to minimise the structure, but don’t minimise too much.
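
The substitution step can be pictured as walking along the query-template alignment and deciding what to do at each column. The sketch below is only a labelling exercise under assumed conventions ('-' for a gap, hypothetical action names). It does not build any coordinates and is not how any particular modelling program works.

```python
def classify_alignment(query, template):
    """Label each alignment column with the modelling action it suggests."""
    if len(query) != len(template):
        raise ValueError("query and template must be aligned to the same length")
    actions = []
    for q, t in zip(query, template):
        if q == "-" and t == "-":
            continue                      # empty column: nothing to model
        if q == "-":
            actions.append("deletion")    # template residue with no query partner
        elif t == "-":
            actions.append("insertion")   # query residue with no template coordinates
        elif q == t:
            actions.append("copy")        # identical residue: reuse it as is
        else:
            actions.append("substitute")  # keep the backbone, rebuild the side chain
    return actions

# Toy alignment (hypothetical sequences).
print(classify_alignment("ACD-EF", "ACEG-F"))
```

Runs of "insertion" and "deletion" labels mark the regions that would be passed on to the loop-modelling step.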

Question 14

Q

Describe Phyre2.

Answer

A

A webserver that carries out template-based homology modelling.
-input the query sequence
-create an MSA
-predict the secondary structure
-make an HMM using the secondary structure and the MSA
-produce an alignment using the HMM
-build the backbone based on this information
-build loops
-add side chains
-give an accuracy/confidence score
PhyreAlarm --> checks for updates
BackPhyre --> structure to sequence

Question 15

Q

How do monoclonal antibodies work and what are some of their modes of action?

Answer

A

They recognise and alter the activity of specific proteins. They can be used therapeutically, e.g. in cancer treatment.

Direct cytotoxicity - The binding of the antibody leads to the destruction of the cell
Immune modulation - The antibody targets cytokines to cells, which leads to cell death
Pretargeting - The antibody directs drugs or radioactive molecules to cells via binding

Question 16

Q

Describe the structure of antibodies

Answer

A

Antibodies have 2 light chains and 2 heavy chains, joined at the hinge region by disulphide bridges. Each domain forms 2 stacked beta sheets held together by a disulphide bridge.
Light chains - Vl and Cl
Heavy chains - Vh, Ch1, Ch2, Ch3 (plus Ch4 in IgM & IgE)
Fv - variable fragment
Fab - antigen binding fragment
Fc - crystallisable fragment
CDRs - Complementarity-determining regions bind to antigens, are made up of 6 loops and have high variability.

Question 17

Q

Describe Chothia’s analysis / CDR variability analysis

Answer

A

CDRs are variable and made up of 6 loops: L1, L2, L3, H1, H2 and H3. H3 is hypervariable; the rest are all variable.
The variability depends on key residues in the loops as well as on the length of the CDRs.

Question 19

Q

Describe humanisation

Answer

A

Human monoclonal antibodies can’t be produced by conventional methods. Mouse antibodies are good alternatives, but they can provoke an immune response in patients.
The best solution is to make chimeric or humanised antibodies:
-Chimeric: use a human Fc region and a mouse Fab/Fv region
-Humanised: use mouse CDRs and human everything else
-Some additional residues may need to be mutated to avoid steric clashes and increase binding.

Question 20

Q

Describe the Rosetta Antibody program

Answer

A

This is a webserver used to predict antibody structures. Blind trials show it was effective and that specialised software is better than standard protein prediction software.

Input the antibody sequence
Select Vl and Vh frameworks
Assemble the beta-barrel
Graft canonical loops using BLAST and superimposition. These loops include L1, L2, L3, H1 and H2.
Model the H3 loop, which has low resolution. This loop is longer and hypervariable, so it is less accurate and more difficult to model.
Refine and optimise the structure
Often multiple versions are produced for the user to choose from.

Question 21

Q

Describe the biopharmaceuticals market including a specific example

Answer

A

This market grows by 5-10% per annum. There is currently a worldwide total market of $100 billion.
Herceptin is a breast cancer antibody that has sales of $5 billion per annum.