9. Protein structure prediction Flashcards

1
Q

The structure

A

is determined by the aa seq.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The folding will

A

correspond to the energy minima

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why protein modelling?

A
  • Structure is important for function

- Gap between known sequences and structures is huge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Common structures

A

a-helix
b-sheet
b-turn
random coil

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Program available for secondary structure prediction

A

DSSP

STRIDE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Different approaches

A
  • Statistical methods
  • Knowledge-based methods
  • Machine learning
  • Consensus method
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Evaluation 2nd structure

A

Q3

Sov

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Q3

A

fraction correctly predicted residues - Accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sov

A

Fractional overlap of segments - ability to pick up correct structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is Q3 and Sov used

A

They can only evaluate the method itself not your prediction as it looks at already known structures
- one should use both Q3 and Sov for good evaluation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Q3 equation

A

Correctly predicted residues/total residues =Q3%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Example of machine learning methods

A

PSIPRED
PHD
Jnet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Membrane topology

A

look if protein is bound to a membrane or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Common characteristics of TM region (3)

A
  • W or Y at the edges of the membrane
  • ca 20 hydrophobic residues inside the membrane
  • Positive inside
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Database to identify TM regions

A

TOPCON

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

TOPCON

A

accounts different methods and make a consensus

- gives exact positions of TM region in the bottom

17
Q

3D structure prediction

A
  • More complex than secondary

- Two main approaches

18
Q

Two main approaches of 3D modelling

A
  • Homology modelling

- Ab initio modelling

19
Q

Homology modelling

A
  • use known structure as template
  • higher seq similarity -> better prediction
  • alignment and template selection is very important
20
Q

Methodology (entire 3D)homology modelling (5steps)

A
  • Identify related structure
  • Align target seq to template structure
  • Generate “known” backbone and side chains
  • Generate loops
  • Refine
21
Q

Template selection

A
  • select optimal crystal structure
22
Q

Common approaches for template selection

A

sequence similarity
homology (both of these: BLAST, Prosite, Pfam)

Fold recognition

23
Q

Fold recognition (threading)

A
  • structure more conserved than sequence
  • compare to a known library of folds (CATH, SCOP
  • align sequence to a fold
  • ENERGY CALCULATIONS
  • does not require a similar sequence
24
Q

Loops

A
  • exposed regions are more variable than the protein core
  • often important for protein function
  • loops longer than 5 residues is hard to model
25
Q

Approaches short, med, long

A

short - analytical approach
medium - database approach
long - fragment based approach

26
Q

Optimisation of model

A
  • use ENERGY MINIMISATION to fix bad parts
  • side chain clashes
  • bad peptide bond angles
27
Q

Common errors (of 3D template) 5st

A
  • side chain packing
  • distortions/shifts
  • bad loops
  • misalignment
  • bad template
28
Q

Ab initio modelling

A
  • predict structure without any prior knowledge but the sequence
  • used when template-modelling fails
  • works best for small proteins
  • great computational cost
29
Q

CASP

A
  • evaluation of 3D structure methods

- similarity of model to the native structure

30
Q

Function prediction

A
predict the function
- use results from eg
 * signalP/TargetP
 * 2nd structure pred
 * topology predictions
 * 3D predictions
- use machine learning methods to connect
results usually given as gene ontology terms