M6 U3 - Data Management - Q2 Flashcards
What phases are involved in data understanding? (2)
- data acquisition (aka data gathering)
- data preparation
What is data acquisition?
Also known as data gathering, it involves gathering data from different sources and transforming the data into formats that are suitable for analytic solution development.
What happens when the requirements phase is completed?
the data science team will embark on data acquisition or data gathering
What’s data wrangling? (3 actions, 3 results)
The process of gathering, selecting, and transforming data to ensure that it is usable, free of noise and has as little bias as possible to meet defined analytic objectives.
What steps are involved in data wrangling? (3)
- Checking for missing values
- Identifying outliers
- Formatting the data.
What is data management?
- It’s an organization’s way of ________________ (4) data.
Makes sure that the data housed within an organization is ______________ (2)
- It’s an organization’s way of acquiring, storing, securing and processing data.
- Makes sure that the data housed within an organization is accessible and accurate
What group(s) manage data management?
- Managed by the IT team in an organization.
- Business users will participate too
List the organization responsible from the 11 knowledge areas for data management. List the areas.
- Data management body of knowledge (DAMA-DMBOK)
Who is involved in the data management process?
- Multiple departments
Who’s responsible for designing an organization’s data management framework?
data architects
Data Governance
- Defines how data is accessed and managed within an organization.
- planning, oversight, and control over management of data and the use of data and data-related resources.
Data Architecture
the overall structure of data and data-related resources as an integral part of the enterprise architecture
Data Modeling & Design
analysis, design, building, testing, and maintenance (was Data Development in the DAMA-DMBOK 1st edition)
Data Storage & Operations
structured physical data assets storage deployment and management (was Data Operations in the DAMA-DMBOK 1st edition)
Data Security
ensuring privacy, confidentiality and appropriate access