Bioinformatics Flashcards

1
Q

What are some of the uses of bioinformatics?

A
  • next generation sequencing- gene expression analysis- microarrays
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is sequence identity?

A

A perfect match between an unknown sequence and a known sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is sequence homology?

A

A partial match between an unknown sequence and a known sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give 3 protein sequence databases. Do they have cross collaboration?

A

UniProtKB, UniProtKB/TrEMBL and RefSeqP - no cross collarboration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give 3 DNA sequence databases. Do they have cross collaboration?

A

ENA/EMBL, Genbank and DDBJ- cross collaboration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does Ensembl do?

A

Provides annotations of numerous genomes, including their protein products.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a pairwise sequence alignment?

A

All possible alignments between two sequences are checked for sequence homology.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a global alignment used for?

A

Looking for homology between the same protein from different species.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a local alignment used for?

A

To match cDNA to genomic DNA.To align two different protein sequences that share a common domain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a global alignment?

A

Aligns the length of two sequences. Any homologous sequences can be aligned globally as long as they are similar enough.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a local alignment?

A

Alignment of two sequences such that homologous subsequences are aligned in between regions of non-related and unaligned sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why would gaps be introduced?

A

In order to produce the best possible global alignment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the fasta format?

A

Most commonly used format for sequences. > followed by a description of the sequence and its accession number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is queried by blastn and in which database?

A

Nucleotide query, nucleotide database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is queried by blastp and in which database?

A

Amino acid query, amino acid database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is queried by tblastn and in which database?

A

Amino acid query, translated nucleotide database.

17
Q

What is queried by blastx and in which database?

A

Translated nucleotide query, amino acid database.

18
Q

What is queried by tblastx and in which database?

A

Translated nucleotide query, translated nucleotide database.

19
Q

Which BLAST searches are protein searches?

A

Blastp and blastx.

20
Q

What is the score?

A

Calculated by increasing the score for matches/similarities and decreasing for mismatches/gaps.

21
Q

What are identities?

A

The number of residues that are identical in the alignment.

22
Q

What are positives?

A

The number of similar residues in the alignment.

23
Q

What does gaps mean in a BLAST output?

A

The number of gaps in the alignment.

24
Q

What is the E (expect) value?

A

A measure of how reliable the alignment is.

25
How can sequences be considered homologous?
> 25% identity in amino acids sequence or >75% identity in nucleotide sequence, for sequences larger than 100 amino acids.
26
What is Clustal?
A multiple sequence alignment program.
27
What is considered a good alignment?
- at least 10-30 residues long- have at least 1-3 stars- have 5-7 colons - have a few periods
28
What does * represent in a Clustal output?
An entirely conserved column.
29
What does : represent in a Clustal output?
A column where all of the residues have roughly the same size and hydrophobicity.
30
What does . represent in a Clustal output?
A column where the size and hydrophobicity has been conserved over the course of evolution.
31
Why are multiple sequence alignments more informative than pairwise alignments?
They have a lower % identity.
32
What does a phylogenetic tree show?
The evolutionary relationship between species or sequences.