BLAST searching Flashcards

1
Q

3 requirements for biological sequence database searching

A

Sensitivity (How correct it is)
Selectivity (Exclusion of incorrect hits)
Speed (How long it takes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Heuristics

A

Taking shortcuts for solving problems in a quick way that delivers a result that is sufficient enough to be useful given time constraints.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Local vs global alignment

A

Local alignment identifies short regions of similarity detecting conserved domains within proteins. Global alignment compares sequences over their entire length, suitable for homologous genes with full-length conservation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Steps in BLAST algorithm

A

Read input sentence, generate word seeds, search database for matches, score matches based on substitution matrix, extend matches bidirectionally until score reaches lowest point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Scoring matrices in BLAST

A

BLOSUM-Blocks Substitution Matrix - based on conserved protein blocks
PAM-Point Accepted Mutation - based on evolutionary mutations over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Blast programmes

A

BLASTN - nucleotide query and answer
BLASTP - protein query and answer
BLASTX - nucleotide query, protein answer
TBLASTN - protein query, nucleotide answer
TBLASTX - nucleotide (translated) query and answer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is E value in BLAST results?

A

Score for how similar the answer is to the sequence of the query, the closer to 0 the better.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why are protein sequences preferred in BLAST?

A

Proteins are conserved, less mutations (bacteria).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

High scoring segment pairs (HSPs)

A

Ungapped local alignments between a query and a database sequence with scores above a predefined threshold.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly