Bioinformatics Flashcards
What are some of the uses of bioinformatics?
- next generation sequencing
- gene expression analysis
- microarrays
What is sequence identity?
A perfect match between an unknown sequence and a known sequence.
What is sequence homology?
A partial match between an unknown sequence and a known sequence.
Give 3 protein sequence databases. Do they have cross collaboration?
UniProtKB, UniProtKB/TrEMBL and RefSeqP - no cross collarboration.
Give 3 DNA sequence databases. Do they have cross collaboration?
ENA/EMBL, Genbank and DDBJ- cross collaboration.
What does Ensembl do?
Provides annotations of numerous genomes, including their protein products.
What is a pairwise sequence alignment?
All possible alignments between two sequences are checked for sequence homology.
What is a global alignment used for?
Looking for homology between the same protein from different species.
What is a local alignment used for?
To match cDNA to genomic DNA.
To align two different protein sequences that share a common domain.
What is a global alignment?
Aligns the length of two sequences. Any homologous sequences can be aligned globally as long as they are similar enough.
What is a local alignment?
Alignment of two sequences such that homologous subsequences are aligned in between regions of non-related and unaligned sequences.
Why would gaps be introduced?
In order to produce the best possible global alignment.
What is the fasta format?
Most commonly used format for sequences.
> followed by a description of the sequence and its accession number
What is queried by blastn and in which database?
Nucleotide query, nucleotide database.
What is queried by blastp and in which database?
Amino acid query, amino acid database.