Biodatabases Flashcards

1
Q

things to censider when compiling a database

A
  • How can other people access the data
  • How can the information be used
  • What to do with all the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

where is there a list of biological databases

A

Wikipedia

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

example of a genome browser

A

Ensembl

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is BLAST

A

Basic Local Alignment Search Tool

  • most used program in computational biology
  • access to many different databases. major databases are nucleotide, nr (non redundant protein database), genome databases.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

in BLAST, what are bit score and expected match?

A

Bit score - Independent of database size, how good the match is between 2 sequences
Expected match - liklihood of finding this match, by chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

in BLAST report, what does the alignment line show

A

shows AA sequence for both DNA sequences.

Highlights differences between sequences, and puts + if they are different AA but same protein.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

in BLAST, what is the positives score

A

% of functionally similar positions in the sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

5 BLAST algorithms

A

BLASTP - protein -> protein database
BLASTN - DNA -> DNA database
BLASTX - DNA, translated -> Protein database
TBLASTN -> protein -> Translated DNA database
TBLASTX - DNA, translated -> translated DNA database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

which BLAST algorithm has the highest query search?

A

tblastx

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is UniProt

A

Universal Protein Resource
Listed on NCBI but also has its own website
Checked computationally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is UniProt/TrEMBL

A

contains protein sequences associated with computationally generated annotation.
Unreviewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

UniProt/Swiss-Prot

A

high quality manually annotated, non redundant protein database. Reveiwed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Databases for species

A
many species have their own database
Flybase - drosophila
Zfin - zebrafish genome
Wormbase - C elegans
Reef genomics
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

why are 16S sequences used

A

16S gene - encodes 16S rRNA.

used to make phylogenies because of slow rates of evolution in this gene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is MG-RAST

A

metagenomics anaysis server

How well did you know this?
1
Not at all
2
3
4
5
Perfectly