Lecture 4: Introductions To Histograms & Distributions Flashcards
What is a histogram?
A histogram is a graph illustrating the distribution values in the data set (i.e the frequencies)
What’s a bin
Value ranges. Together they cover the full range of data
What’s a probability curve/probability density function (pdf lol)
A histogram with a smooth curve when we have an infinite number of values —> the width of the bins can be shrunk to 0 = smooth!
Distribution shape: ___——^^——___
Symmetric and unimodal
Distribution shape: __—^^—__—^^—__
Bimodal
Distribution shape: ____——^^^-_
Negatively skewed (LEFT)
Distribution shape: _-^^——___
Positively skewed (RIGHT)
THE SKEW IS __________?
WHERE THERE’S FEW!
Variables with a floor (hard lower bound) but no ceiling (hard upper bound) tend to have what kind of skew?
RIGHT SKEW!! Few on the upper side :D
If a distribution is symmetric and unimodal then the mean, median, and mode are _______?
THE SAME!!!
Describe the distribution shape of a skew! (Order of mean, median, and mode)
Mode is the most frequent value so it’s the highest point of the skew!
Then the median is a little lower
Then the skew pulls the mean away from the mode! (Most affected)