STATS Module 10 Summary Flashcards
Correlation Study purpose
Evaluates the association between two numerical variables
Key characteristics of association are..
- Must have variation
- Not used for predictions
- Strength of the correlation is used using Pearson correlation coefficient.
r= sample correlation coefficient
p= population correlation coefficient
+1 values indicate a strong positive relationship: values -1 indicate a strong negative relationship.
Null and Alternative hypothesis for correlation test?
Ho: p=0
Ha: pcannot=o
T-distribution test for correlation is given as
to=r-p/SE
SE = sqrt(1-r^2/df)
df = n-2
Statistical test for correlation
Compare To to Tc.
Scientific conclusion
- Non-directional hypothesis:
Reject the Ho: Evidence of an association
Fail to reject the Ho: No evidence of an assoication - With directionality:
Reject the Ho: Evidence of a positive/negative association
Rejetc the Ha: No evidence of a positive/negative association
Reporting: include r, df, To, and P-value.
Linear regression purpose?
Predict relationships between two variables
- Predictor variable (x): Independent variable
- Response variable (y): dependent variable
Linear regression equation?
slope (b): Change in y for one unit in x
Intercept (a): value of y when x=0
EQ: y= a+ bx
Linear regression statistical model?
Systamatic component: Linear equation
Random component: Assumes normal distribution for smapling error
Link function: Connects the systamtic and random components
Hypothesis testing in linear regression?
Intercept (a) tests if the response variable equals a referenc evalue when x=0
slope (b): Tests how much y changes for a unit change in x
Linear regression null distribution?
Null distribution as a t-distribution
Intercept: To= a-Ba/SE, df=n-2
Slope: To= b-Bb/SE, df=n-2
Conduct the statistical test for linear regression
Compare the To vs. the Tc score or Type 1 error rate vs the P-value.
Draw scientific conclusions/reporting
Reject the Ho: Evidence that intercept or slope is different from the reference value
Fail to reject the Ho: Evidenc that intercept or slope is not different from the reference value.
Reporting: test paramater, t-score, df, and p-value
Linear regression Assumptions:
1) Linearity: Relationships is Linear
2) Independence: Residuals are independent, check random sampling
3) Normality: Residuals are normally distributed
4) Homoscedasticity: Residual variance is consistent across predictor values.