Protein Structural Bioinformatics Flashcards

1
Q

SCOP

A

scop
structural classification of protein
first classification software

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can proteins be classified?

A

By secondary structure:
a-helical - Secondary structure exclusively or almost exclusively a-helical

Beta-sheet - Secondary structure exclusively or almost exclusively beta-sheet

a+B - a-helices and beta-sheets separated in different parts of the molecule, absence of beta-alpha-beta super secondary structure

a/B - Helices and sheets assembled from beta-alpha-beta units

a/B-linear - Line through centers of strands of sheets roughly linear

a/B-Barrels - Line through centres of strands of sheet roughly circular

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The SCOP database

A

Structural classes:
- Folds
- Superfamilies
-Families

Small proteins:
- Cystine-knot cytokines
- Cystine-knot cytokines
- Transforming growth factor beta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is used to define domains in proteins?

A

The Gō plot

  1. Calculate radius of spherical volume of protein
  2. Calculate disease from each alpha carbon of each amino acid to all the others
  3. If the distance is greater than the spherical radius, score “+”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Disadvantages of the Gō method

A
  • Requires solved structure
  • Domain boundaries not always clear
  • Gō method now superseded by sequence-based algorithms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How Pfam builds domains

A
  • Start with a high quality protein structure (X-ray crystallography, good resolution, low Å)
  • BLAST PDB to find related protein structures
  • Align these – maximise structural homology (meaning adjust alignment so that boundaries of secondary structural elements match)
  • Build a statistical profile (Hidden Markov Model – HMM) of the “seed” alignment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How Pfam builds domains

A
  • Use the HMM to query GenPept – hmmsearch
  • Align the new hits to the HMM – hmmalign
  • Rebuild the HMM to include the new hits – hmmbuild
  • Repeat as desired, or until there are no new hits
  • “Structure, structure, structure” (Alex Bateman, founder of Pfam)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Disadvantages of Pfam

A

Each domain is defined by a HMM, and that HMM is only as good as the “seed” alignment used to construct it

Because the HMM building process is iterative, errors can be magnified

There are now so many domains in Pfam, that curation is uneven

Pfam was designed to support the Human Genome Project, and viruses were under-represented

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Homstrad meaning

A

HOMologous STRucture Alignment Database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Not enough structures

A

Structure determination is far harder than sequencing

Illumina and Minion sequencing have made sequencing ultra high throughput

There is no equivalent technological leap forward for structural biology

The Structural Genomics Consortium

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly