Course-5 Analyze data to Answer questions Flashcards
Analysis
The process used to make sense of the data collected
Goal of analysis
The goal of analysis is to identify trends and relationships within data so you can accurately answer the question you are asking.
The 4 phases of analysis
-Organize data
- Format and adjust data
-Get input from others
-Transform data
Sorting
when you arrange data in a meaningful order to make it easier to understand, analyze and visualize.
Filtering
is used when you are only interested in seeing data that meets a specific criteria, and hiding the rest.
Sort Sheet
All of the data in a spreadsheet is sorted by the ranking of a specific sorted column- data across rows is kept together.
Sort range
Nothing else on the spreadsheet is rearranged besides the specified cells in a column.
Customized sort order
When you sort data in a spreadsheet using multiple conditions.
Sort function syntax
=SORT(Range,sort_column,true or false) True ascending and false is descending
Incorrectly formatted data can:
1) Lead to mistakes
2) Take time to fix
3) Affect stakeholder’s decision-making
How does data validation work?
It allows you to control what can and can’t be entered in your worksheet.
Data validation
1) Allows dropdown lists with predetermined options
2) Create custom checkboxes
3) Protect structured data and formulas
Conditional formatting
A spreadsheet tool that changes how cells appear when values meet specific conditions.
Concatenate
A function that joins together two or more text strings
Text string
A group of characters within a cell, most often composed of letters
Openness ( Or open data)
Free access, usage, and sharing of data
if(end>start,end-start,1-start+end)
Best Practices for searching online
- Thinking skills
- Data analytics terms
- Basic knowledge of tools
Mental model
Your thought process and the way you approach a problem.
R
A programming language frequently used for statistical analysis, visualization, and other data analysis.
Aggregation
Collecting or gathering many separate pieces into a whole.
Data aggregation
The process of gathering data from multiple sources in order to combine it into a single summarized collection.
Data Aggregation example
-Puzzle pieces= Data
-Organization = Aggregation
-Pile of pieces= Summary
-Putting the pieces together= gaining insights.
Data Aggregation Benefits
- Identify trends
- Make comparisons
- Gain Insights
Data can also be aggregated over a given time period to provide statistics such as:
- Average
- Minimums
- Maximums
- Sums
Subquery
A query within another query
VLOOKUP( Vertical Lookup)
A function that searches for a certain value in a corresponding piece of information.
Value
A function that converts a text string that represents a number to a numerical value.