4. Data Visualization & Summarizing Data Flashcards
A visual dimension of a visualization that represents data
Aesthetic
Common types of data visualization
- Scatter plot
- Line graph
- Histogram
- Density chart
- Bar chart
- Stacked bar chart
- Pie chart (usually bad)
The variation in a single variable
Univariate statistics
The variation between two variables
Bivariate statistics
(type of data visualization)
relationship between two numeric variables
Scatter plot
(type of data visualization)
change in a numeric variable or proportion over time
Line graph
(type of data visualization)
univariate view of a numeric variable
Histogram
(type of data visualization)
differences in proportion or mean between categories
Bar chart
(type of data visualization)
proportions of various categories
Pie chart (usually bad)
What are the two types of Statistics?
- Descriptive: Describing a given dataset
Assuming that those data are the population - Inferential: Making inferences from a sample to a population. Quantifying the amount of uncertainty around the values you calculate
What are the three measures of central tendency?
- Mean
- Median
- Mode
What are the measures of spread in numerically describing data? (4)
Range
Quartiles; Inter-Quartile Range (IQR)
Variance
Standard Deviation
Mean
Add up all the values and divide by the total number of values
An observation that is extreme compared to the rest of the observations.
Outlier
Median
Line up the variable in order from lowest to highest and take the middle number; if there are an even number of observations then take the average between the two middle numbers