Week 2 - Mastering the Data I Flashcards
1
Q
What sources can you find data in?
A
- Internal (e.g. under the ERP system)
- External (e.g. open source (Kaggle.com), government websites, commercial providers).
2
Q
In most cases, where is data stored?
A
In relational databases.
3
Q
What do relational databases ensure when it comes to data?
A
- Ensures that data is complete.
- Ensures that data isn’t redundant, so they don’t take up too much space.
- Follows business rules and internal controls.
- Aid communication and integration of business processes.
4
Q
What are the four types of attributes in a relational database?
A
- Primary keys are unique identifiers.
- Foreign keys are attributes that point to a primary key in another table.
- Composite keys are a combination of two foreign keys used for line items.
- Descriptive attributes include everything else.
5
Q
What are the 5 steps for the extraction, transformation and loading (ETL) techniques.
A
- Determine the purpose and scope of the data request.
- Obtain the data.
- Validate the data for completeness and integrity.
- Clean the data.
- Load the data for data analysis.
6
Q
What are the 6 ethical considerations of data collection and data use?
A
- Informed consent
- Privacy
- Transparency
- Ownership
- Bias
- Misuse