FINALS - Bioinformatics Flashcards
two sequences are the SAME
identity
LINING UP two or more sequences to search for the maximal regions
alignment
RELATEDNESS of sequences
similarity
FIXED SET of commands
algorithm
a SPACE introduced in alignment to compensate for insertions or deletion in one of the sequences being compared
GAP
SIMILARITY attributed to descent from a common ancestor
HOMOLOGY
the SEQUENCE PRESENTED for comparison
query
the genetic sequence DATABASE
genbank
number of MATCHES to the query by chance
e-value
study on EVOLUTIONARY RELATEDNESS among species by comparing homologies and differences
phylogenetics
uses computers to STORE and ANALYZE molecular biological information
bioinformatics
about RNA MOLECULES in a living organism
transcriptomics
genomes of MICROBES are described within a specific environment
MICROBIOMICS
description of the chemical processes involving METABOLITES
metabolomics
description of the sequences of the WHOLE GENOME of an organism
genomics
description of all the entire complement of PROTEINS
proteomics
DNA data bank of Japan
DDBJ
european bioinformatics institute
EMBL
database from NCBI
GenBank USA
databases for proteins in Japan
PDBj
databases for proteins in Europe
PDBe
databases for proteins in USA
RCSB PDB
determines the boundary of an exon and intron of eukaryotic gene
Ensembl
contain ORIGINAL DATA in the form or primary sequence data or structural data as submitted by the scientific community
primary databases
contain information that has been processed and DERIVED from teh raw data available in primary database
secondary databases
COLLECTS and PRESENT data after comparing and filtering them from different primary databases and exhibit only the non-redundant sequences
composite databases
a way of rearranging sequences of DNA, RNA, or protein to identify REGIONS OF SIMILARITY
sequence alignment
matching the residues of two sequences across their ENTIRE LENGTH
global alignment
matching of two sequences from regions which have MORE SIMILARITY WITH EACH OTHER
local alignment
multiple sequence alignmengt tool that ARRANGES the sequences of dna, rna, or protein to identify REGIONS OF SIMILARITY
MUSCLE
finds regions of LOCAL SIMILARITY between sequences
BLAST