Data Analytics Flashcards
Simple Barchart Function
Distribution of 1 categorical variable (e.g. sex as % of total population)
Simple Histogram Function
Distribution of 1 continuous variable
Dot Plot / Composite Boxplot
Association between 1 categorical and 1 continuous variable
Simple Scatter plot (both regression and smooth line)
2 continuous variables
Composite and Stacked Barchart Function
2 categorical variables
Realisations Def.
Observed (measured) data points drawn from a population
Frequency Distributions Def.
Ordered display of each value in data set together with how often it appears in data set
Relative Frequencies Def.
% of sample points that have a particular value
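Worked example (a minimal sketch, not from the original cards): the two definitions above can be computed with Python's standard library. The function name `frequency_table` and the sample values are hypothetical, invented purely for illustration.

```python
from collections import Counter

def frequency_table(values):
    """Return (value, count, relative frequency in %) rows, ordered by value."""
    counts = Counter(values)   # how often each value appears
    n = len(values)            # total number of sample points
    return [(v, c, 100 * c / n) for v, c in sorted(counts.items())]

# Hypothetical sample, made up for this example
sample = [2, 3, 3, 1, 2, 3, 4, 2, 3, 1]
table = frequency_table(sample)

for value, count, pct in table:
    print(f"value={value}  frequency={count}  relative frequency={pct:.0f}%")
```

The count column is the frequency distribution; dividing each count by the sample size gives the relative frequency.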
Why do bar graphs fail
The area inside the bar is a zone of irrelevance (it shows no data), and the individual data points can be obscured
Left Skewed Def.
Most data sits on the right. Tail falls to the left
Right Skewed Def.
Most data sits on the left. Tail falls to the right
Which value is greater in left skewed distribution (when not equal)
mean < median < mode
Which value is greater in right skewed distribution (when not equal)
Mean > median > mode
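Worked example (a sketch under stated assumptions): the orderings above can be checked on small data sets with Python's `statistics` module. Both data sets are hypothetical and were chosen to be clearly skewed with a single mode. The ordering is a rule of thumb for typical unimodal data and may not hold for every distribution.

```python
from statistics import mean, median, mode

# Hypothetical right-skewed sample: most values are low, long tail to the right
right_skewed = [1, 2, 2, 2, 3, 3, 4, 5, 9, 14]

# Mirror image of the sample above: most values are high, long tail to the left
left_skewed = [15 - x for x in right_skewed]

for name, data in [("right skewed", right_skewed), ("left skewed", left_skewed)]:
    print(f"{name}: mean={mean(data)}, median={median(data)}, mode={mode(data)}")
```

In the right-skewed sample the few large values pull the mean above the median; in the mirrored sample the few small values pull it below.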
Variance Def.
Average squared difference from the mean
Standard Deviation Def.
Typical distance of scores from the mean (square root of variance)
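Worked example (a minimal sketch): variance and standard deviation computed directly from the definitions above. This assumes the population form, which divides by n; the sample form divides by n - 1 instead. The helper names `variance` and `standard_deviation` and the scores are hypothetical.

```python
import math
import statistics

def variance(values):
    """Population variance: average squared difference from the mean."""
    m = sum(values) / len(values)
    return sum((x - m) ** 2 for x in values) / len(values)

def standard_deviation(values):
    """Square root of the population variance."""
    return math.sqrt(variance(values))

# Hypothetical scores, made up for this example (their mean is 5)
scores = [2, 4, 4, 4, 5, 5, 7, 9]

print("variance:", variance(scores))
print("standard deviation:", standard_deviation(scores))

# Cross-check against the standard library's population variance
print("statistics.pvariance:", statistics.pvariance(scores))
```

Taking the square root returns the spread to the original units of the scores, which is why the standard deviation is usually easier to interpret than the variance.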