Bioformatics 2 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is Fasta ?

A

Format used to introduce a protein sequence in a BLAST search

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is System Biology ?

A

Systems biology is a systematic approach to understanding all the genes and their expression under different conditions. You can only do this when you have all the proteins in a particular family.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why use Blast ?

A

Finding Model Organisms for Study of Disease
Example Cystic Fibrosis

BLAST helps you to find homologous genes and proteins

Homologous Proteins (or genes)

Have a common ancestor (theyre related)
Have similar structures
Have similar functions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the criterias for considering two sequences to be homologous ?

A

Proteins are homologous if
Their amino acid sequences are at least 25% identical

DNA sequences are homologous if
they are at least 70% identical

Note that sequences must be over 100 a.a. (or bp) in length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does BLAST DO ?

A

BLAST takes a query sequence

Compares it with millions of sequences in the Genbank databases

By constructing local alignments

Lists those that appear to be similar to the query sequence
The “hit list”

Tells you why it thinks they are homologous
BLAST makes suggestions

YOU make the conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do I input a query into BLAST?

A

Choose which “flavor” of BLAST to use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do I interpret the results of a BLAST search?

A

BLAST creates local alignments

What is a local alignment?
BLAST looks for similarities between regions of two sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The BLAST Output( GRAPHIC DISPLAY)

A
How good is the match ?
Red = excellent!
Pink = pretty good
Green = OK, but look at other factors
Blue = bad
Black = really bad!

How long are the matched segments?
Longer =Better

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The hit list

A
BLAST lists the best matches (hits) 
For each hit, BLAST provides:
Accession number – links to Genbank flatfile
Description
“G” = genome link
E-value
An indicator of how good a match to the query sequence
Score
Link to an alignment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an E-value?

A

E-value
The chance that the match could be random

The lower the E-value, the more significant the match
E = 10-4 is considered the cutoff point
E = 0 means that the two sequences are statistically identical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The Alignment

A
Look for:
Long regions of alignment
With few gaps
% identity should be >25% for proteins
(>70% for DNA)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Conclusion in Blast ?

A

Look at E-value
Look at graphic display
If necessary, look at alignment

Make your best guess!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly