Week 2 - Data Management Flashcards
What is data management?
It includes the collection, storage, retrieval, quality assurance, and security of data
Explain data-information-knowledge-wisdom (DIKW)
Data: raw observations of the world
Information: data that has been processed to provide meaning
Knowledge: what makes it possible to turn information into instructions, i.e. knowing how to do something
Wisdom: insight that is integrated and actionable
What is metadata?
Data that describes the properties or characteristics of end-user data and the context of those data
What does metadata do?
It enhances searchability, categorisation, and the efficiency of data management
Structured data
Strictly organised such that it is easily searchable - database with a rigid schema
Unstructured data
Has no predefined structure and requires special handling - e.g. email body, social media post
Semi-structured data
Mix of both structured and unstructured data
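Example (a rough sketch, not from the course material): the three kinds of data side by side in Python. The sample records, field names, and email text are made up for illustration.

```python
import json

# Structured: every record has the same fixed fields (a rigid schema).
structured_rows = [
    ("S001", "Ada", 21),
    ("S002", "Grace", 23),
]

# Semi-structured: self-describing keys, but records may differ in shape.
semi_structured = json.loads(
    '[{"id": "S001", "name": "Ada", "tags": ["maths"]},'
    ' {"id": "S002", "name": "Grace"}]'
)

# Unstructured: free text with no predefined fields.
unstructured = "Hi team, the report is attached. Let me know what you think!"

# All structured rows share one shape; the semi-structured records do not.
same_shape = len({len(row) for row in structured_rows}) == 1
key_sets = {frozenset(record) for record in semi_structured}
print(same_shape, len(key_sets))
```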
What is a database?
An organised collection of logically related data
What does a data management system provide?
Data integrity: ensuring accuracy and consistency
Data security: protecting sensitive information
Scalability: adapting to growing amounts of data
Collaboration: enabling cross-functional access and analysis
What are the fundamental database operations?
Create, read (retrieve), update, delete (CRUD)
They form the basis of data manipulation and access
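Example (a minimal sketch, assuming Python's built-in sqlite3 module and a made-up student table): each of the four operations maps onto one SQL statement.

```python
import sqlite3

# In-memory database; the "student" table and its rows are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (id INTEGER PRIMARY KEY, name TEXT)")

# Create: add a new row.
conn.execute("INSERT INTO student (id, name) VALUES (?, ?)", (1, "Ada"))

# Read: retrieve the row.
after_create = conn.execute(
    "SELECT name FROM student WHERE id = ?", (1,)
).fetchone()

# Update: change an existing row.
conn.execute("UPDATE student SET name = ? WHERE id = ?", ("Ada L.", 1))
after_update = conn.execute(
    "SELECT name FROM student WHERE id = ?", (1,)
).fetchone()

# Delete: remove the row.
conn.execute("DELETE FROM student WHERE id = ?", (1,))
remaining = conn.execute("SELECT COUNT(*) FROM student").fetchone()[0]

conn.close()
```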
What does ACID stand for?
Atomicity: all-or-nothing approach; a transaction is treated as a single indivisible unit of work [buying concert ticket]
Consistency: ensuring that transactions bring the database from one valid state to another [library checkout]
Isolation: making sure transactions are processed independently [airplane seat tickets]
Durability: guarantees that once a transaction is committed, it will remain even in the case of a system failure [saving a paper]
ACID are the principles that …
Ensure reliable transactions in a database
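Example (a sketch of atomicity only, assuming Python's sqlite3 module; the tables, the buy_ticket helper, and the prices are hypothetical): buying a concert ticket reserves a seat and charges the buyer. If the charge fails, the seat reservation is rolled back too.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The CHECK constraint makes an overdrawn balance an invalid state.
conn.execute(
    "CREATE TABLE account (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.execute("CREATE TABLE ticket (seat TEXT PRIMARY KEY, owner TEXT)")
conn.execute("INSERT INTO account VALUES ('buyer', 50)")
conn.commit()


def buy_ticket(conn, buyer, seat, price):
    """Reserve a seat and charge the buyer as one all-or-nothing transaction."""
    try:
        with conn:  # commits if the block succeeds, rolls back on an exception
            conn.execute("INSERT INTO ticket VALUES (?, ?)", (seat, buyer))
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE name = ?",
                (price, buyer),
            )
        return True
    except sqlite3.IntegrityError:
        return False


# Price exceeds the balance: the charge fails, so the seat insert is undone.
failed = buy_ticket(conn, "buyer", "A1", 80)
tickets_after_fail = conn.execute("SELECT COUNT(*) FROM ticket").fetchone()[0]

# Affordable price: both steps are committed together.
succeeded = buy_ticket(conn, "buyer", "A1", 30)
balance = conn.execute(
    "SELECT balance FROM account WHERE name = 'buyer'"
).fetchone()[0]
tickets_after_ok = conn.execute("SELECT COUNT(*) FROM ticket").fetchone()[0]
```

The same CHECK constraint also hints at consistency: the database refuses to move into a state that breaks its rules.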
Tabular data
[+] ideal for small amounts of data
[+] easy to create and use
[-] not suitable for complex relationships, only two-dimensional (rows and columns)
CSV files
Text file often used for data exchange between different systems
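Example (a small sketch using Python's standard csv module; the records are invented): one program writes rows as CSV text and another could read the same text back. An in-memory buffer stands in for a file on disk.

```python
import csv
import io

# Illustrative records; note the comma inside the second course name.
rows = [
    {"id": "1", "name": "Ada", "course": "Data Management"},
    {"id": "2", "name": "Grace", "course": "Databases, Advanced"},
]

# Write: header line first, then one line per record.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["id", "name", "course"])
writer.writeheader()
writer.writerows(rows)
csv_text = buffer.getvalue()

# Read: every value comes back as a string keyed by the header names.
read_back = [dict(record) for record in csv.DictReader(io.StringIO(csv_text))]
print(csv_text)
```

Values that contain a comma are quoted by the writer, so the reader can still split the fields correctly.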
What is a relational database?
It is a collection of tables (relations) that are linked to each other, typically through shared key columns
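Example (a minimal sketch, assuming Python's sqlite3 module; the student and enrolment tables are hypothetical): two tables are related through a shared key, and a join combines them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE student (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE enrolment (
        student_id INTEGER REFERENCES student(id),  -- link to student.id
        course TEXT
    );
    INSERT INTO student VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO enrolment VALUES
        (1, 'Databases'), (1, 'Statistics'), (2, 'Databases');
    """
)

# Join the two tables on the shared key to list each student's courses.
rows = conn.execute(
    """
    SELECT s.name, e.course
    FROM student AS s
    JOIN enrolment AS e ON e.student_id = s.id
    ORDER BY s.name, e.course
    """
).fetchall()
conn.close()
print(rows)
```

One student can have many enrolment rows, which is the kind of relationship a single flat table handles poorly.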