Week 3: Data Exploration & Processing in Excel Flashcards
- Understand the steps before data analysis. - Understand how to use Excel for data summarization and exploration. - Learn how to use PivotTables to describe data.
Structured data (definition)
Data that resides in a fixed field within a file or record
Unstructured data (definition)
Data that is usually not in a certain
format
Essential steps before data analysis
Data collection and data cleaning (wrangling)
Steps before formal data analysis
(1) Get familiar with the dataset, know the data volume, what each row and column
means, and the data format of each row and column.
(2) Data cleaning/data wrangling
Data cleaning/data wrangling steps
- Check for missing data, Extreme outliers, Erroneous value formats, Impossible values, Inconsistent values, Duplicate records
- Check if data re-organization is needed.
- Check if data transformation is needed.
- Check if textual information needs to be transformed into numbers
Extreme outliers (definition)
data points that are significantly different from the other observations
What PivotTables do
Let you quickly summarize your data in almost any way
imaginable to gain important insights