Data visualization Flashcards
What is a distribution?
The distribution of a variable refers to how FREQUENTLY certain values of that variable show up in our data.
1 Categorical Variables
Barchart
- distribution is explained by talking about how likely each category in the variable is most and least likely
1 quantitative variable
Histogram and box plots
-Shape ( Skewed, symmetric)
-Center (mean or medians)
-Spread ( SD, range or IQR)
-Outliers
Bivariate graphs
show relationship between two variables
2 categorical
1.Stacked bar chart
2. dodge bar
3. Filled bar
2 quant scatterplot description
-form (Linear..)
-Strenght
-Direction ( Pos or neg)
-outliers
Association
when describing an association we have an explanatory variable ( Suspected cause) and a response variable (Suspected effect).