Antibody Modelling Flashcards

1
Q

What’s the situation with sequence and structural data?

A

We have loads of sequence data but not that much structural data. The theory is that sequence determines structure, so we should be able to use sequences to predict structures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why are predictions useful?

A

They are useful because:

  • Help us understand the sequence-structure gap
  • The structural information can tell us about the function
  • It can guide rational drug design
  • It can guide mutagenesis studies
  • It helps to solve experimental structures
  • It focuses on fundamental chemistry of protein structures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can protein similarity be measured?

A
  • Sequence identity - compares the sequences
  • Root mean squared (RMS) - compares structures as they are superimposed and the average distance between 2 equivalent points is calculated. 2.5Angstrom is good, lower is better.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is CASP?

A

Critical Assessment of Protein Structure
This allows the blind trials of protein prediction software so they can be truly assessed with no bias and compared for other users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the 2 ab-initio energy calculation methods for structure prediction.

A

Ab-initio methods use calculations and simulations to determine structure. They were not the most successful methods at protein prediction.
Energy Minimisation - The atoms are described in terms of bond length, bond angle, bond dihedral rotations and interactions. The conformation with the lowest energy is then searched for. Hydrophobicity terms need to be added and structures can get stuck in false energy minimums.
Molecular Dynamics - The atoms are described in terms of bond length, bond angle, bond dihedral rotations and interactions. Newtons laws of motion are solved over time which allows jumps over energy barriers to find the lowest energy conformation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does secondary structure prediction work?

A

Based on the idea that local sequences determine local structure. Programs aim to predict secondary structure elements: alpha-helices, beta-strands and coils.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can you measure the accuracy of structure prediction?

A

Accuracy (Q3 - because its a 3-state model) = no. of residues correctly predicted/total no. of residues considered.
For a typical protein with an average mix of alpha-helices, beta-strands and coils Q3 = 40%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does the Chou-Fasman principle work?

A

This is an early method that gives residues scores and then adds additional rules to determine secondary structure elements. It’s based on the idea that amino acids have a preference for certain secondary structure elements.
Propensity of (e.g.alanine in a helix) = (no.of alanines in helices/no.of alanines in the database)/(no. of amino acids in helices/ no. of amino acids in database)
Propensity = 1 - average amino acid
Propensity > 1 - indicates a preference
Propensity < 1 - indicates a dislike

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe stereochemical methods for secondary structure prediction.

A

This recognises the hydrophobicity of residues and the way they favour particular secondary structures
e.g. hydrophobic residues are in the core, polar or charged residues are exposed.
This method produces 60% accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the use of artificial neural networks for structure prediction.

A

Artificial neural networks and machine learning is used as you can give input sequences and output structures, the machine will learn the weights for signals and will try different architectures so that it can predict outputs from inputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why has secondary structure improved?

A

Better algorithms
more structural data
more sequence data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is template-based modelling and what are some of the other terms used to describe the process?

A
Template-based modelling uses the basis that homologous proteins have similar structures so using these as templates is the best way to predict 3D structures. 
This process is also called:
-homology modelling
-threading
-fold recognition
-comparative modelling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe the 6 steps of template-based modelling.

A
  • Find the template sequence - compare query sequences and predicted secondary structures to the database looking for at least 15% identity. MSAs are used if possible.
  • Align the sequences - Multiple structures allow gaps and loops to be predicted.
  • Substitute the sequence for structure- if residues match replace them, if there is a clear substitution use the backbone as a template, if there is an indel put it in a suitable space, depending on the size of it.
  • Identify and model loop regions - If there’s a conserved backbone this is fairly easily however if it is not conserved search the database for similar sequences and orientations then use these as a template. There are various algorithms for this, short loops are predicted better.
  • Add side chains - Use rotamer libraries and probabilities to ensure allowed chi angles and no steric clashes. Consder the packing and formation of H-bonds, disulphides etc
  • Model refinement - Use energy calculations to minimise the structure, but don’t minimise too much.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe Phyre2.

A
A webserver that carries out template-based homology.
-input the query sequence
-create an MSA
-predict the secondary structure
-make a HMM using secondary structure and MSA
-produce an alignment using the HMM
-build the backbone based on this information
-build loops
-add side chains
-give an accuracy/confidence score
PhyreAlarm --> checks for updates
BackPhyre --> structure to sequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do monoclonal antibodies work and what are some of their modes of action?

A

They recognise and alter the activity of specific proteins. They can be used as a therapeutic method e.g. cancer treatment.

  • Direct cytotoxicity - the binding of the antibody leads to the destruction of the cell
  • Immune Modulation - It targets cytokines to cells which leads to cell death
  • Pretargetting - It directs drugs or radioactive molecules to cells via binding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe the structure of antibodies

A

Antibodies have 2 light chains and 2 heavy chains combined with hinge regions (disulphide bridges). Each domain forms 2 stacked beta sheets with a disulphide bridge.
Light chains - Vl and Cl
Heavy chains - Vh, Ch1, Ch2, Ch3 (Ch4 for IgM &IgE)
Fv - variable fragment
Fab - antigen binding fragment
Fc - crystallisable fragment
CDRs - Complementary determining regions bind to antigens, are made of 6 loops and have high variability.

17
Q

Describe Clothia’s analysis/ CDR variability analysis

A

CDRs are variable and made of 6 loops. L1, L2, L3, H1, H2, H3. H3 is hypervariable the rest are all variable.
The variability depends on key residues involved in the loops as well as the length of the CDRs.

18
Q

Describe Clothia’s analysis/ CDR variability analysis

A

CDRs are variable and made of 6 loops. L1, L2, L3, H1, H2, H3. H3 is hypervariable the rest are all variable.
The variability depends on key residues involved in the loops as well as the length of the CDRs.

19
Q

Describe humanisation

A

Human monoclonal antibodies can’t be produced by conventional methods. Mouse antibodies are good alternatives but they can lead to an immune response.
The best solution is to make chimeric antibodies:
-Use human Fc region and mouse Fab/Fv region
-Use mouse CDRs and human everything else
-Some additional residues may need to be mutated to avoid steric clashes and increase binding.

20
Q

Describe the Rosetta Antibody program

A

This is a webserver used to predict antibodies. Blind trials show it was effective and specialised software is better than standard protein prediction software.

  • Input the antibody sequence
  • Select Vl and Vh frameworks
  • Assemble the beta-barrel
  • Graft canonical loops using BLAST and superimposition. These loops inlclude L1, L2, L3, H1 and H2.
  • Model H3 loop which has low-resolution.This long is longer and hypervariable so is less accurate and more difficult to model.
  • Refine and optimise the structure
  • Often multiple versions are produce for the user to choose.
21
Q

Describe the biopharmaceuticals market including a specific example

A

This market grows by 5-10% per annum. There is currently a worldwide total market of $100 billion.
Herceptin is a breast cancer antibody that has sales of $5 billion per annum.