Chapter 4, 5, 6, 7, 8 Flashcards
scatter plots
measure two quantitative variables on the same individual. Each point represents an individual.
Explanatory variable is plotted on the horizontal axis; response variable is plotted on the vertical axis. Not always clear which variable is which.
Two types of linearly-related variables
Positively associated when one increases, the other increases. (trends uphill)
Negatively associated when one increases, the other decreases. (trends downhill)
Testing for a linear relation
- find the absolute value of the correlation coefficient, |r|
- Find the critical value in Table II Appendix A for the given samples size, CV.
- If |r| > CV, we say a linear relation exists between the two variables. Otherwise, no linear relation exists.
Residual
The difference between the observed and predicted values of y is the error, or residual.
The criterion to determine the line that best describes the relations between two variables is based on the residuals. The most popular technique for making the residuals as small as possible is the method of lease squares.
Least-Squares Regression Criterion
The least-squares regression line is the line that minimizes the sum of the squared errors (residuals).
This line minimizes the sum of the squared vertical distance between the observed values of y and those predicted by the line “y-hat”.
linear correlation coefficient
measures the strength and direction of the linear relationship between two quantitative variables.
If r = +1, then a perfect positive relationship exists between the two variables.
If r = -1, then a perfect negative linear relation exists between the two variables.
r is specifically for a sample. Rho for population.
Unitless.
sample space of a probability experiment
S
the collection of all possible outcomes.
Unusual event in a probability experiment
an event that has a low chance of occurring. < 5%, by convention.
probability of an event
P(E)
An event is any collection of outcomes from a probability experiment.
The probability of drawing an ace from a standard deck of cards is 1/13 is an example of…
theoretical (classical), empirical (experimental), or subjective probability?
The probability of drawing an ace from a standard deck of cards is 1/13 is an example of… theoretical (classical) probability.
Based on the Department of Public Safety, 75% of all car crashes is due to driver error is an example of…
theoretical (classical), empirical (experimental), or subjective probability?
Based on the Department of Public Safety, 75% of all car crashes is due to driver error is an example of empirical (experimental) probability.
I’m 100% confident that you will win the match is an example of…
theoretical (classical), empirical (experimental), or subjective probability?
I’m 100% confident that you will win the match is an example of subjective probability.
Independent vs. dependent probability events
In an independent probability event, the outcome of one event does not affect the probability of the next. Example: rolling a dice.
In a dependent probability event, the outcome of one event affects the probability of the next. Example: drawing a card from a deck, without replacement.
Think: Does the given variable change the probability?
Does P(A) = P(A/B)?
P(A/B) notation means…
…the probability of A occurring, given that B has already occurred.
(A and B could be independent or dependent events).
Ex. What is the probability of someone having leprosy, given that they are from a low income country? P(having leprosy/from L.I. country). LI country is a restriction.
P (A and B) =
P(AandB) = P(A/B)P(B) = P(B/A)P(A)
If A and B are independent events, then P(A and B) =
= P(A) P(B)
i.e.
P(A) = P(A/B)