Stats Final Flashcards
Need an A
Correlation
Correlations test the extent to which variables are
related to one another.
Simple (“Bivariate”) Correlation
assesses the
relationship between two variables.
To use bivariate correlation:
● The same sample must be measured on two
variables
● Both variables must be measured with
continuous data
Pearson r Product-Moment Correlation Coefficient
Direction (type of relationship)
○ Positive (direct): as X increases, Y increases
○ Negative (inverse): as X increases, Y decreases
Magnitude
○ Range of values for r: -1 ← 0 → +1
○ The farther away from 0 (in a pos or neg direction), the stronger the
relationship “Closeness to the line”
Correlation Coefficient Magnitudes (3)
r=+1, r=-1,r=0
r = +1
A perfect linear positive relationship. All observations follow a
linear regression line with positive slope (SSE = 0)
r = -1
A perfect linear negative relationship. All observations follow a
linear regression line with negative slope (SSE = 0)
r = 0
There is no linear relationship between two variables (although
there can be non-linear relationship)
± 0.3
Usually don’t exceed
Testing for Statistical Significance
Testing for statistical significance involves testing the Pearson r to see
likelihood that observed relationship is not due to chance sampling error. (Ask: ● Hours of crime dramas watched (“How many hours of crime shows did you watch
this week?”)
● Fear of crime (“How likely are you to be a victim of crime?”)
Conduct a test with 𝛂 = .01.)
Correlation
a necessary condition for a causal relationship, but not a
sufficient condition
Regression, covariance, and correlation
based on the same conceptual
background - All three provide information about whether or not two variables stand in
a linear relationship
Correlation analysis
provides one simple coefficient that information us
about the strength of the relationship, whereas regression enables us to
predict values of the DV when knowing values of the IV.
What are the criteria that allow you to determine causality?
Spuriousness, correlation, correlation techniques to falsify, sophisticated statistical tests
casual relationship
Significant r
tells us that the relationship is not likely due to sampling
error, but it does not tell us meaning of the relationship
NOT SAMPLE ERROR, NOT MEANING
Plain Language
The more the more, the more the less, there is no r(df) = r, p <.05
Magnitude
○ Range of values for r: -1 ← 0 → +1
○ The farther away from 0 (in a pos or neg direction), the stronger the
relationship “Closeness to the line”
Coefficient of Determination
Tells us about shared variance.
Technically: the proportion of
variance in one variable that can
be accounted for (or explained
by) variance in another variable.
Number between 0-1 which tells us the variance
Why correlation is not causation
Spurious
Sample size
Sampling Error
X -> Y? Y-> X? Z?
Bivariate Linear Regression
allows you to
predict values of one variable from values
of another variable.
(Ad spend -> Increased revenue)
Regression Line
A regression line or “line of best fit” is drawn to
minimize the vertical distance between the line and
all data points.
We usually find it with y=mx+b