Managing Data Flashcards
What is the difference between an Online Database and a Local Database?
An online database is hosted on a remote server and accessed over the Internet, whereas a local database is stored on the device that uses it.
What is the difference between a Distributed Database and a Centralized Database?
A distributed database is physically stored across two or more devices connected by a communications network. A centralized database is stored at a single central location and accessed from multiple points.
What is the difference between a Data Warehouse and a Data Mart?
A data warehouse is a very large database that contains data from the whole business. A data mart holds a smaller subset of data for one section of the business. Data marts are easier, faster and cheaper to access than a data warehouse.
What is the role of Data Mining?
Businesses can use data mining to find patterns and relationships in data using mathematical methods. It can be used to describe patterns and trends and to predict future events.
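As a minimal sketch of pattern finding, the snippet below counts which pairs of products are bought together most often. The basket data and the names `baskets` and `pair_counts` are invented for illustration. Real data mining works on far larger datasets and uses more sophisticated methods.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets (illustrative data only).
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "butter", "eggs"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in baskets:
    # Sorting gives each pair one consistent order, e.g. ("bread", "butter").
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# The most frequent pair hints at a relationship between the two items.
most_common_pair, count = pair_counts.most_common(1)[0]
print(most_common_pair, count)
```

Here bread and butter appear together in three of the four baskets, which is the kind of relationship a business might use to plan promotions.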
What are some ethical implications of data marts, warehouses and mining?
- Privacy - Might collect large amounts of sensitive information.
- Discrimination - May reveal patterns that lead to discrimination.
- Transparency - Companies must be clear on what data they collect and how it will be used.
- Security - Large stores of data are appealing targets for hackers.
What is a Data Dictionary?
A Data Dictionary is a table that describes the data that is, or is going to be, stored in a database table. Staff refer to it when creating the database and entering data.
What are the elements of a Data Dictionary?
- Element Name
- Data Type
- Size
- Default Value
- Description
- Constraints (e.g. Required)
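To make the elements above concrete, here is a rough sketch of a data dictionary written as a Python list, with one entry per field. The "Student" table, its field names and its values are all hypothetical and chosen only for illustration.

```python
# Hypothetical data dictionary for a "Student" table (illustrative only).
data_dictionary = [
    {"element_name": "student_id", "data_type": "INTEGER", "size": 6,
     "default": None, "description": "Unique student number",
     "constraints": "Primary key, required"},
    {"element_name": "surname", "data_type": "TEXT", "size": 30,
     "default": None, "description": "Family name",
     "constraints": "Required"},
    {"element_name": "year_level", "data_type": "INTEGER", "size": 2,
     "default": 7, "description": "Current year level",
     "constraints": "Between 7 and 12"},
]

# Print one line per element, much as a reader would scan the dictionary.
for entry in data_dictionary:
    print(f'{entry["element_name"]:<12} {entry["data_type"]:<8} '
          f'{entry["size"]:>3}  {entry["constraints"]}')
```

Each entry carries the same six keys, matching the six elements listed above.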
What is Data?
A collection of values.
What is Data Duplication?
The repetition of data.
What is Data Redundancy?
Where the same piece of data is held in two or more separate places in a database.
What is Data Manipulation?
The modification of data to make it easier to read or better structured, e.g. sorting records into alphabetical order using SQL commands.
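As a small illustration of manipulating data with SQL, the sketch below sorts names alphabetically using `ORDER BY`. It uses Python's built-in `sqlite3` module with an in-memory database. The `customer` table and its contents are made up for this example.

```python
import sqlite3

# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (name TEXT)")
conn.executemany(
    "INSERT INTO customer (name) VALUES (?)",
    [("Nguyen",), ("Adams",), ("Smith",)],
)

# ORDER BY returns the rows sorted alphabetically by name.
sorted_names = [row[0] for row in conn.execute(
    "SELECT name FROM customer ORDER BY name ASC"
)]
print(sorted_names)
conn.close()
```

The stored rows are unchanged. Only the order in which they are presented differs.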
What is Data Security?
Ensuring that data is stored and transmitted securely so that unauthorised people or organisations cannot access it.
What is Data Integrity?
Data integrity reflects the quality of data. It ensures data accuracy and consistency over the database’s entire lifecycle by making sure data is recorded as intended.
What are the 3 types of Data Integrity?
- Referential Integrity - Ensures every foreign key value matches an existing primary key value in the related table, so records never refer to data that does not exist.
- Domain Integrity - Ensures entries in a column are valid by restricting data type, format, range etc.
- Entity Integrity - Each record has a unique primary key that is not NULL, so each row defines a unique entity.
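The three types can be sketched with SQLite constraints, run here through Python's built-in `sqlite3` module. The `course` and `student` tables, and the helper function `rejected`, are hypothetical names used only for this illustration. Note that SQLite only enforces foreign keys after `PRAGMA foreign_keys = ON`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves foreign key checks off by default
conn.execute("CREATE TABLE course (course_id INTEGER PRIMARY KEY, title TEXT)")
conn.execute("""
    CREATE TABLE student (
        student_id INTEGER PRIMARY KEY,                          -- entity integrity
        year_level INTEGER CHECK (year_level BETWEEN 7 AND 12),  -- domain integrity
        course_id  INTEGER REFERENCES course(course_id)          -- referential integrity
    )
""")
conn.execute("INSERT INTO course VALUES (1, 'Digital Technologies')")
conn.execute("INSERT INTO student VALUES (100, 10, 1)")  # a valid row

def rejected(sql):
    """Return True if the database refuses the statement."""
    try:
        conn.execute(sql)
        return False
    except sqlite3.IntegrityError:
        return True

results = {
    "entity": rejected("INSERT INTO student VALUES (100, 11, 1)"),       # duplicate primary key
    "domain": rejected("INSERT INTO student VALUES (101, 15, 1)"),       # year level out of range
    "referential": rejected("INSERT INTO student VALUES (102, 9, 99)"),  # course 99 does not exist
}
print(results)
conn.close()
```

Each of the three invalid inserts is refused by the database itself, so the rules hold no matter which program enters the data.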
What are three types of Data Anomalies?
- Update - When duplicated data is changed, every copy must be updated. If one is missed, the data becomes inconsistent.
- Insert - When data cannot be inserted because other, unrelated attributes must be supplied with it.
- Deletion - When deleting a record also deletes other information that is stored only in that record.
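A rough sketch of the update and deletion anomalies, assuming a single flat `enrolment` table that repeats the teacher's name on every row. The table, the names and the data are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# One flat table repeats the teacher's name on every enrolment row.
conn.execute("CREATE TABLE enrolment (student TEXT, course TEXT, teacher TEXT)")
conn.executemany("INSERT INTO enrolment VALUES (?, ?, ?)", [
    ("Ava", "Maths", "Mr Lee"),
    ("Ben", "Maths", "Mr Lee"),
    ("Cy", "Art", "Ms Kaur"),
])

# Update anomaly: only one of the two Maths rows is changed,
# so the table now names two different Maths teachers.
conn.execute("UPDATE enrolment SET teacher = 'Ms Patel' "
             "WHERE course = 'Maths' AND student = 'Ava'")
maths_teachers = {row[0] for row in conn.execute(
    "SELECT teacher FROM enrolment WHERE course = 'Maths'")}

# Deletion anomaly: removing the only Art student also removes
# the only record of who teaches Art.
conn.execute("DELETE FROM enrolment WHERE student = 'Cy'")
art_rows = conn.execute(
    "SELECT COUNT(*) FROM enrolment WHERE course = 'Art'").fetchone()[0]

print(maths_teachers, art_rows)
conn.close()
```

The same table also shows the insert anomaly: a new course with no students yet cannot be recorded without leaving the student column empty. Storing courses and teachers in their own table would avoid all three problems.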