Lecture 4 Flashcards

1
Q

What is the aim of blast?

A

To find in a database of sequences, sequences or subsequences most similar to query.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the blast components?

A

Algorithm to FIND similar sequences

Probabilistic model to RANK them

(2 sequences needed and their similarity is quantified, then substitution matrix gives alignment score)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a profile?

A

A table listing the frequencies of AAs in each position of a protein sequence.

They are calculated from an MSA with a domain of interest.

Allow consensus sequence to be identified.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the use of profiles?

A

Highly conserved sets of residues likely to be part of active site –> clues to function.

Little conservation likely to be in surface loops.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can we use profile patterns to identify homologues?

A

Match query sequence against sequences in alignment table

Give higher weight to positions that are conserved.

Absolutely conserved regions should be insisted upon being found.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a good e value?

A

Lower than 0.0001

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Give an example of a multi domain protein.

A

GARs-AIRs-GARt in vertebrates

(In bacteria each enzyme domain is encoded separately)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is domain identification needed?

A

For high resolution structure

Sequence analysis

Multiple alignments

Fold recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the derivation of substitution matrices?

A

Count number of S –> T changes in homologous ALIGNED sequences

Use relative frequencies of changes to form
Scoring matrix for substitutions.

Likely changes score higher than 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What must you ensure when making matrices?

A

To restrict sequences to be sufficiently similar so that any position is not changed more than once!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is required for domain fusion?

A

Both domain scold correctly and bury fraction of solvent exposed surface area in inter domain surface.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is an HMM?

A

Machine learning algorithm used for profile searches of sequence databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly