Class 3 Notes Questions Flashcards
What are random variables?
Most variables are random in the sense that we canβt predict their exact values from one value to the next.
Upper-case variables like π refer to the whole random variable,
whereas lower-case variables like π₯ refer to values of π.
What are π? π? xΜ? s? πxΜ?
π: population mean
π: population standard deviation
xΜ: sample mean
s: sample standard deviation
πxΜ: sample standard error
example of theoretical probability distribution to frequency distribution
sum of two dice with the βFreq. Dist. 10,000 Dice Rollsβ graph matching the βProb. Dist. (Theory)β graph
What is the Bernoulli Distribution? And an example.
Data with a Bernoulli distribution take on the discrete, binary values of 0 or 1.
For example, a flip of a fair coin provides one example of an random event that follows a Bernoulli distribution. That is the case because a flip of a fair coin has only two outcomes:
Heads (1) or Tails (0).
Assuming that the coin is truly fair, the probability of obtaining either Heads or Tails is 0.5. People usually assign the probability of obtaining Heads as π and Tails as 1 β π
What is a histogram? And an example.
Histograms capture the number of observations within specific intervals/ ranges that are called bins. Narrower bins provide more detail than wider bins, but the choice of bin size depends on the problem at hand.
F D C B A
Grade
F Dβ D D+Cβ C C+Bβ B B+Aβ A A+
Grade
Histogram vs. Bar graph
Histogram
- Distribution of non-categorical data
- Each bin shows a group of observations
- Bin order is important
Bar graph
- Count of categorical or binary data
- Each bar represents one category of data
- Bin order is arbitrary
What is a Q-Q plot? Specifically of Simulated Normal Data
The Q-Q plot assesses how much sample quantities look like theoretical quantities from a normal distribution. The Q-Q plot achieves that by dividing the data into smaller, equally-sized intervals/subsets called quantiles
Below is an example Q-Q plot. Most of the data hang close to the 45-degree line,
indicating that the sample data resemble a normal distribution.
What is skewness?
Skewness measures the asymmetry of a distribution. It indicates the extent to which a distribution leans to the left (negative skew) or to the right (positive skew).
What is kurtosis?
Kurtosis measures the βtailednessβ or the peakedness of a distribution. It indicates how heavy or light the tails of the distribution are compared to a normal distribution. The high kurtosis distribution is more pointy, whereas
the normal distribution is more bell-shaped.