BINF Flashcards

1
Q

Protein Data Bank

A

Curated by humans
Contains multiple protein structures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

UniProtKB

A

Curated by humans
Contains gene-specific information
No nucleotide sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

RefSeq

A

Large but non-redundant genes and genomes
Great for BLAST searches (more targetted hits(

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

NCBI

A

Nucelotide and protein

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

BLAST

A

Basic local alignment search tool
most commonly used sequence alignment database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is sequence alignment?

A

To determine sequence similarity
To find common motifs
To find point mutations
To find insertions/deletions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the two steps involved in sequence alignment

A

Construction of the best alignment between seq
Assessment of similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Global seq alignment

A

Determines the best alignemnt over the entire length of two sequences
Best applied when the seq are similar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Local seq alignment

A

Determines the best alignment in shorted stretches than the entire sequence of two sequences
Best applied when the sequences are substantially different but have regions of similairy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Multiple seq alignemnt

A

simultaneous alignment of more than 2 sequences
Best applied when looking for conserved seq
Seq or patterns in a protein family

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When aligning two sequences, how fo you determine which is best?

A

use the concept of alignment score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

General approach to blast

A

It is the most commonly used sequence database for alignment
Uses a match word to start the alignemnt
High scoring words are extended in either diretion until alignment score starts to drop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

S=__

A

slignemnt score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

W=

A

word length. 8-9 nt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

P value

A

porbability that an alignment with a score greater than or queal to S occured by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

E

A

Expectation cut off

17
Q

hat does a very small E means

A

highgly significant match (2.1e-21)

18
Q

Rule of thumb E must be at most

A

1E-3

19
Q

What makes blast so fast?

A

It does not try to extend or link discontinuous segment pairs
It does not generate a global alignment
It requies words to match exactly but uses a look up table to score similar words
The longer the word the faster the analysis