Biostats_5_Correlation, P-value and Statistical Significance Flashcards
What is linear association?
Linear association describes a straight-line relationship between variables. A correlation coefficient quantifies this relationship numerically.
What is the correlation coefficient?
The correlation coefficient (r) ranges from -1 to +1 and describes the strength and direction of a linear relationship between two variables.
For positive associations, the closer to +1, the stronger the association.
For negative association, the closter to -1, the stronger the association.
What does positive correlation indicate?
Positive correlation indicates that as the value of one variable increases, the value of the other variable also increases.
r = +1.
In terms of correlation, as the value of one variable increases, the value of the other decreases represents what type of correlation coefficient?
When there is a negative correlation, there are inverse relationships between the varibles. For instance, the value of the dependent variable decreases as the independent variable increases. This is reflected by a downward-sloping best-fit line, with the correlation coefficient (R) being less than 0 but greater than -1. The closer R is to -1, the stronger the negative linear relationship between the two variables.
What does no correlation mean?
No correlation means no linear relationship exists between the variables.
where r is near 0.
What a correlation such as this indicate?
When R = 0, the best-fit line is flat, indicating no clear relationship between the independent and dependent variables. The scattered points suggest a lack of correlation, meaning there is no association between the two variables.
r = -1.
What is the coefficient of determination?
This is the proportion of variability in the dependent variable.
The coefficient of determination (r^2) represents the proportion of variance in the dependent variable explained by the independent variable.
For example, if r = -0.8; this means r^2 = 0.64 or 64%
A study is conducted to assess the relationship between body mass index (BMI) and daily physical activity (measured in hours). The investigators find that BMI is inversely related to daily physical activity, with a correlation coefficient of -0.6 (p < 0.01). According to this information, how much of the variability in BMI can be explained by daily physical activity?
A. 36%
B. 60%
C. 40%
D. 6%
Correct Answer: 36%
Explanation: The proportion of variability in the dependent variable (BMI) that can be explained by the independent variable (daily physical activity) is determined by the square of the correlation coefficient (r^2). The correlation coefficient (r) is given as -0.6. Squaring this value gives 0.36. This means 36% of the variability in BMI can be explained by daily physical activity. The negative sign of the correlation coefficient indicates the direction of the relationship (inverse), but it does not affect the proportion of variability explained.
What is the difference between causation and correlation?
A correlation does not imply causation; other factors may influence the relationship between variables.
The assumption that there is no association between two measured variables (e.g., the exposure and the outcome) or no significant difference between two studied populations other than what would be expected from sampling or experimental error, is called … ?
the null hypothesis
What are the two types of hypotheses and what do they mean?
Null Hypothesis (H₀): No association exists between the exposure and the outcome.
vvvvvvvvvvvvv
Alternative Hypothesis (H₁): Association exists between the exposure and the outcome.
What does an RR of 1.08 with p-value = 0.01 mean?
There is a statistically significant association between the exposure and outcome.
A p-value of 0.01 means there is a 1% probability the null hypothesis is true.
When the p-value is above the alpha threshold, a researcher should … ?
” Fail to reject the H₀ “
vvvvvvvvvvvvvvvvvv
This means accept the null hypothesis where the null hypothesis (H₀) states that there is no association between the exposure and the outcome.
What is the usual alpha threshold when conducting studies?
A threshold of 0.05 is standard in many fields, indicating a 5% chance of falsely rejecting the null hypothesis. This means that there is a 5% probability of concluding there is an effect when there is none (false positive).
Falsely rejecting the null hypothesis means that a researcher committed a ____ error
Type I error
vvvvvvvvvvvvvvvv
This is the same as creating a false positive.