9. Protein structure prediction Flashcards

Question 1

Q

The structure

Answer

A

is determined by the amino acid sequence.

Question 2

Q

The folding will

Answer

A

correspond to the energy minimum

Question 3

Q

Why protein modelling?

Answer

A

- Structure is important for function
- The gap between the number of known sequences and known structures is huge

Question 4

Q

Common secondary structures

Answer

A

α-helix
β-sheet
β-turn
random coil

Question 5

Q

Programs available for secondary structure prediction

Answer

A

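DSSP
STRIDE
(both assign secondary structure from known 3D coordinates; these assignments are the reference that predictions are compared against)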

Question 6

Q

Different approaches to secondary structure prediction

Answer

A

Statistical methods
Knowledge-based methods
Machine learning
Consensus method
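
One simple way to picture a consensus method is a per-residue majority vote over the outputs of several predictors. The sketch below is only an illustration under that assumption; real consensus servers may weight or combine methods differently. The function name `consensus_prediction`, the three-state alphabet (H, E, C) and the tie-breaking rule (fall back to coil) are hypothetical choices, not taken from the cards.

```python
from collections import Counter


def consensus_prediction(predictions, tie_state="C"):
    """Per-residue majority vote over equal-length secondary structure strings.

    predictions: list of strings such as "HHHEECC" (H = helix, E = strand, C = coil).
    tie_state:   state used when the two most common states tie (an assumption).
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictions must have the same length")

    consensus = []
    for column in zip(*predictions):
        counts = Counter(column).most_common()
        # Fall back to tie_state when the top two states have the same count.
        if len(counts) > 1 and counts[0][1] == counts[1][1]:
            consensus.append(tie_state)
        else:
            consensus.append(counts[0][0])
    return "".join(consensus)


if __name__ == "__main__":
    print(consensus_prediction(["HHHC", "HHEC", "HEEC"]))
```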

Question 7

Q

Evaluation of secondary structure prediction

Question 8

Q

Q3

Answer

A

Fraction of correctly predicted residues (accuracy)

Question 9

Q

SOV

Answer

A

Fractional overlap of predicted and observed segments; measures the ability to pick up the correct structural elements

Question 10

Q

How are Q3 and SOV used?

Answer

A

They can only evaluate the method itself, not your own prediction, because they compare predictions against already known structures
- one should use both Q3 and SOV for a good evaluation

Question 11

Q

Q3 equation

Answer

A

Q3 (%) = (correctly predicted residues / total residues) × 100
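
The Q3 ratio above can be sketched in a few lines. This is a minimal illustration, assuming the predicted and observed secondary structures are given as equal-length strings over three states (H, E, C); the function name `q3_score` is hypothetical.

```python
def q3_score(predicted, observed):
    """Return Q3 as a percentage: correctly predicted residues / total residues * 100."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must not be empty")
    # Count positions where the predicted state matches the observed state.
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


if __name__ == "__main__":
    # One of ten residues is predicted as helix but observed as coil.
    print(q3_score("HHHHCCCCEE", "HHHCCCCCEE"))
```

Because Q3 counts residues one by one, a prediction can score well while still breaking up segments, which is why the cards recommend using it together with SOV.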

Question 12

Q

Examples of machine learning methods

Answer

A

PSIPRED
PHD
Jnet

Question 13

Q

Membrane topology

Answer

A

check whether the protein is bound to a membrane or not

Question 14

Q

Common characteristics of a TM (transmembrane) region (3)

Answer

A

W or Y at the edges of the membrane
about 20 hydrophobic residues inside the membrane
positive-inside rule: positively charged residues (K, R) are enriched on the cytoplasmic side
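
The "about 20 hydrophobic residues" characteristic can be illustrated with a sliding-window hydropathy scan. The sketch below uses the Kyte-Doolittle hydropathy scale; the window length of 19 and the cutoff of 1.6 are commonly quoted settings for this scale but are assumptions here, as are the function names. It is a rough screen, not a replacement for a dedicated topology predictor.

```python
# Kyte-Doolittle hydropathy values per amino acid (one-letter codes).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=19):
    """Average hydropathy of every full window along the sequence.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    scores = [KD[aa] for aa in seq.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def candidate_tm_windows(seq, window=19, threshold=1.6):
    """Start indices (0-based) of windows whose average hydropathy reaches the cutoff."""
    profile = hydropathy_profile(seq, window)
    return [i for i, avg in enumerate(profile) if avg >= threshold]


if __name__ == "__main__":
    # Toy sequence: charged flanks around a 20-residue leucine stretch.
    toy = "DDDDD" + "L" * 20 + "KKKKK"
    print(candidate_tm_windows(toy))
```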

Question 15

Q

Database to identify TM regions

Question 16

Q

TOPCONS

Answer

A

combines several different methods into a consensus prediction
- gives the exact positions of the TM regions at the bottom of the output

Question 17

Q

3D structure prediction

Answer

A

More complex than secondary structure prediction
- Two main approaches

Question 18

Q

Two main approaches of 3D modelling

Answer

A

Homology modelling
Ab initio modelling

Question 19

Q

Homology modelling

Answer

A

use a known structure as template
higher sequence similarity -> better prediction
alignment and template selection are very important
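
Since model quality tracks sequence similarity, a quick check before modelling is the percent identity between the aligned target and template. The sketch below assumes the two sequences have already been aligned (same length, `-` for gaps) and counts identity over columns where neither sequence has a gap; other definitions of the denominator exist, so this choice and the function name `percent_identity` are assumptions.

```python
def percent_identity(aligned_target, aligned_template, gap="-"):
    """Percent identity over alignment columns that contain no gap."""
    if len(aligned_target) != len(aligned_template):
        raise ValueError("aligned sequences must have the same length")

    aligned = 0
    identical = 0
    for a, b in zip(aligned_target, aligned_template):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        aligned += 1
        if a.upper() == b.upper():
            identical += 1

    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


if __name__ == "__main__":
    # Five gap-free columns, four of them identical.
    print(percent_identity("ACD-EF", "ACDGEY"))
```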

Question 20

Q

Methodology of (entire 3D) homology modelling (5 steps)

Answer

A

Identify related structure
Align target sequence to template structure
Generate “known” backbone and side chains
Generate loops
Refine

Question 21

Q

Template selection

Answer

A

select optimal crystal structure

Question 22

Q

Common approaches for template selection

Answer

A

sequence similarity
homology (tools for both of these: BLAST, PROSITE, Pfam)
fold recognition

Question 23

Q

Fold recognition (threading)

Answer

A

structure is more conserved than sequence
compare to a known library of folds (CATH, SCOP)
align the sequence to a fold
energy calculations score how well the sequence fits the fold
does not require a similar sequence

Question 24

Q

Loops

Answer

A

exposed regions are more variable than the protein core
often important for protein function
loops longer than 5 residues are hard to model

Question 25

Q

Loop modelling approaches: short, medium, long loops

Answer

A

short - analytical approach
medium - database approach
long - fragment based approach

Question 26

Q

Optimisation of model

Answer

A

use energy minimisation to fix bad parts, e.g.
- side chain clashes
- bad peptide bond angles

Question 27

Q

Common errors (of 3D template) (5)

Answer

A

side chain packing
distortions/shifts
bad loops
misalignment
bad template

Question 28

Q

Ab initio modelling

Answer

A

predict the structure without any prior knowledge other than the sequence
used when template-based modelling fails
works best for small proteins
high computational cost

Question 29

Q

CASP

Answer

A

Critical Assessment of protein Structure Prediction: evaluation of 3D structure prediction methods
- measures the similarity of the model to the native structure

Question 30

Q

Function prediction

Answer

A

predict the function
- use results from e.g.
 * SignalP/TargetP
 * secondary structure prediction
 * topology predictions
 * 3D predictions
- use machine learning methods to connect them
- results are usually given as Gene Ontology (GO) terms