L2 Flashcards
What is the major challenge of the genomics era?
To store and handle terabytes (TB) of sequence data through the establishment and use of computer databases
What is a database
A computerized archive used to store and organize data in such a way that information can be retrieved easily via a variety of search criteria
What are databases made of?
Computer hardware and software for data management
What should each record(entry) in a database contain?
A number of fields that hold the actual data items.
What is the process of making a query
Process by which a user expects the computer to retrieve a whole data record by specifying a particular piece of info to be found in a particular field.
What is knowledge discovery
A function of biological databases which refers to the identification of connections between pieces of information that were not known when the information was first entered
Types of databases
- flat file format
- relational database management
- object-oriented database management systems
What is a flat file format
A long text file that contains many entries separated by a delimeter (|)
What are database management systems
Sophisticated computer software programs for organizing, searching and accessing data
What are relational databases
They make us of a set of tables to organize data. They are created using a programming language known as structured query language (SQL)
Each table in a relational database is also called
Relation which is made up of columns and rows. Columns represent individual fields. Rows represent values in the fields
How is a query executed in a relational database
The system selects linked data items from different tables and combines the information into one report
What are primary databases?
They are archives of raw proteins or DNA sequence data submitted by the scientific community
Examples of primary databases
GenBank, Protein Data Bank (PDB)
What databases does the International Nucleotide Sequence Database Collaboration made of?
-GenBank
-European Molecular Biology Laboratory (EMBL)
-DNA Data Bank of Japan (DDBJ)