Module 2 - Follow the data lifecycle Flashcards
What is the first phase of data analysis?
Ask: Define the problem and confirm stakeholder expectations
What is the second phase of data analysis?
Prepare: Collect and store data for analysis
What is the third phase of data analysis?
Process: Clean and transform data to ensure integrity
What is the fourth phase of data analysis?
Analyze: Use data analysis tools to draw conclusions
What is the fifth phase of data analysis?
Share: Interpret and communicate results to others to make data-driven decisions
What is the sixth phase of data analysis?
Act: Put your insights to work in order to solve the original problem
What are the six stages of the data lifecycle?
- Plan
- Capture
- Manage
- Analyze
- Archive
- Destroy
Database
A collection of data stored in a computer system
In the data lifecycle, which phase involves using data to solve problems, make good decisions, and support business goals?
Analyze: In this phase, data is used to solve problems, make good decisions, and support business goals
Data lifecycle: Plan
Decide what kind of data is needed, how it will be managed, and who will be responsible for it.
Data lifecycle: Capture
Collect or bring in data from a variety of different sources.
Data lifecycle: Manage
Care for and maintain the data. This includes determining how and where it is stored and the tools used to do so.
Data lifecycle: Analyze
Use the data to solve problems, make decisions, and support business goals
Data lifecycle: Archive
Keep relevant data stored for long-term and future reference.
Data lifecycle: Destroy
Remove data from storage and delete any shared copies of the data.
What is a universal data management principle?
Govern how data is handled so that it is accurate, secure, and available to meet your organization’s needs.
What are the two parts of the Ask phase of data analysis?
- Define the problem to be solved
- Make sure we fully understand stakeholder expectations
Stakeholder
A person who has invested time and resources into a project and is interested in the outcome
Defining a problem
Look at the current state and identify how it’s different from the ideal state
Prepare phase
The emphasis is on identifying and locating data you can use to answer your questions.
Process phase
Refine your data! Find and eliminate any inaccuracies that can get in the way of results.
1. Clean the data
2. Transform data into a more useful format
3. Combine two or more datasets to make information more complete
4. Remove outliers (data points that could skew the information)
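Example (a minimal sketch, not part of the course material): the four Process steps above could look roughly like this in Python. The `sales` and `regions` data are made up for illustration, and the outlier rule (drop anything above three times the median) is only one assumed convention among many.

```python
from statistics import median

# Hypothetical raw records; all names and values are invented for illustration.
sales = [
    {"store": " north ", "revenue": "100"},
    {"store": "South", "revenue": "110"},
    {"store": "East", "revenue": None},       # missing value
    {"store": "West", "revenue": "95"},
    {"store": "Central", "revenue": "105"},
    {"store": "Online", "revenue": "5000"},   # suspiciously large
]
# Hypothetical second dataset: which region each store belongs to.
regions = {"North": "A", "South": "A", "West": "B", "Central": "B", "Online": "C"}

# 1. Clean: drop rows with missing revenue and tidy up store names.
cleaned = [
    {"store": row["store"].strip().title(), "revenue": row["revenue"]}
    for row in sales
    if row["revenue"] is not None
]

# 2. Transform: convert revenue from text to numbers.
transformed = [{**row, "revenue": float(row["revenue"])} for row in cleaned]

# 3. Combine: attach the region from the second dataset.
combined = [{**row, "region": regions.get(row["store"])} for row in transformed]

# 4. Remove outliers: assumed rule, drop values above 3x the median revenue.
typical = median(row["revenue"] for row in combined)
processed = [row for row in combined if row["revenue"] <= 3 * typical]

for row in processed:
    print(row)
```

In practice, whether a point counts as an outlier depends on the question being asked, so the threshold here should be read as a placeholder.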
Analyze phase
Turn the data you’ve gathered, prepared, and processed into actionable information
Share phase
Share what you’ve learned with your stakeholders
Act phase
Insights from data are put to work!
Common data analyst tools
- Spreadsheets
- Query languages
- Data visualization tools
Spreadsheet
A digital worksheet that stores, organizes, and sorts data
Formula
A set of instructions that performs a specific calculation using the data in a spreadsheet
Function
A preset command that automatically performs a specific process or task using the data in a spreadsheet
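Example (a rough analogy in Python, not spreadsheet syntax): a formula is a calculation you spell out yourself, while a function is a preset command that does the work for you. The `cells` list is a hypothetical stand-in for spreadsheet cells A1 to A3.

```python
# Hypothetical column of values, standing in for cells A1:A3.
cells = [12, 7, 5]

# Formula-style: instructions written out by hand, like =A1+A2+A3.
formula_total = cells[0] + cells[1] + cells[2]

# Function-style: a preset command does the same job, like =SUM(A1:A3).
function_total = sum(cells)

print(formula_total, function_total)
```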
Query language
A computer programming language that allows you to retrieve and manipulate data from a database
Query
A request for data or information from a database
SQL
Structured Query Language
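Example (a minimal sketch, assuming Python's built-in `sqlite3` module and a made-up `orders` table): a query is a request sent to a database, here written in SQL. The table name, columns, and rows are hypothetical.

```python
import sqlite3

# Create a throwaway in-memory database with a hypothetical orders table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer, amount) VALUES (?, ?)",
    [("Ana", 20.0), ("Ben", 35.5), ("Ana", 14.5)],
)

# The query: retrieve the total amount spent by each customer.
query = """
SELECT customer, SUM(amount) AS total
FROM orders
GROUP BY customer
ORDER BY customer
"""
rows = conn.execute(query).fetchall()
conn.close()

for customer, total in rows:
    print(customer, total)
```

The same `SELECT ... FROM ... GROUP BY` pattern carries over to most SQL databases, although details of syntax can differ between them.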