STATISTICS QUALI (PART 2) Flashcards
CHAPTER 5
___ occurs when the null hypothesis is incorrectly rejected when it is actually true. (False positive). It means concluding that there is an effect or difference when in fact there isn’t.
• The probability of making this error is denoted by the significance level.
• The area under the curve beyond the critical value represents the probability of a type 1 error
EX : Imagine a pharmaceutical company testing a new drug to determine if it is more effective than a placebo. The result shows a p-value of 0.04 and the significance level is set at 0.05.
H0 : The new drug is no more effective than the placebo
H1: The new drug is more effective than the placebo
Decision: The null hypothesis is rejected
• if the null hypothesis is actually true (the drug is not more effective) this conclusion is a type 1 error
Type 1 error
CHAPTER 5
___ Occurs when a researcher reject the null hypothesis that is actually false. (False negative)
• The probability of making this error is denoted by beta (b)
• The power of a test is the probability of correctly rejecting a false null hypothesis. It is calculated as (1-\beta)
• “ Higher power reduces the likelihood of a type 2 error”
• The area under the curve that represents the probability of a type 2 error is typically on the opposite side of the critical region for a type 1 error
Type 2 error
CHAPTER 5 (MAKING SENSE OF STATISTICAL SIGNIFICANCE)
____ a quantitative measure of the magnitude of the difference between groups or the strength of the relationship between variables. Unlike p-values, which only tell you wether an effect exists, “ It tells how large that effect is, providing a sense of its practical significance”
• Indicates the size of the effect not just its existence
• Unlike p-values, effect sizes are not influenced by the sample size
• In Cohen’s D, it is used to measure the difference between two means in terms of standard deviation
• In Pearson’s r, it is used to measure the strength and direction of the relationship between two variables
• Used in ANOVA to measure the proportion of total variance that is attributed to an effect
Effect size
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ Help to interpret the magnitude of an effect in a study. Also known as Cohen’s convention
1. COHEN’S D
• small effect - .20
• Medium effect - .50
• Large effect - .80
2. PEARSON’S R
• Small effect - 0.1
• Medium effect - 0.3
• large effect - 0. 5
3. ETA SQUARE (Measuring the proportion of total variance attributed to an effect in ANOVA)
• Small effect - 0.01
• Medium effect - 0.06
• Large effect - 0.14
4. COHEN’S F (Measuring effect size in ANOVA )
• Small effect - 0.10
• Medium effect - 0.25
• Large effect - 0.40
5 COHEN’S W ( measuring effect size in chi-square test)
• Small effect - 0.10
• medium effect - 0.30
• Large effect - 0.50
Effect size conventions
CHAPTER 5
___ used to combine the results of multiple studies that address a similar research question. Aims to provide a more precise estimates of the effect size and resolve inconsistencies among individual studies.
• Involves systematically reviewing and statistically combining results from different studies to draw general conclusions about a specific research question
• Uses the effect size to calculate a weighted average effect size, providing a more precise estimate of the overall effect.
• Synthesizes data from MULTI studies to calculate a combines effect size
• Increase the statistical power and precision of the estimates by pooling data
Meta analysis
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ probability that a test will correctly reject a null hypothesis. It helps determine the likelihood of avoiding a TYPE 2 ERROR (failing to detect a true effect)
• Larger sample size increases power
• Larger effect size increases power
• Higher significance levels increase power but also increase the risk of Type 1 error
• Low variability Increases power
• Power analysis is used to determine the necessary sample size to achieve a desired power level, typically 80%
Power
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
____ tools used in statistical power analysis to determine the sample size needed to achieve a desired level of power for a given effect size and significance level.
• Helps to determine the number of participants needed to achieve a desired power level for a given effect size and significance level
• Helps balance the risk of Type 1 and type 2 error by adjusting sample size, effect size and significance level.
Power table
WHAT DETERMINES THE POWER OF A STUDY
1. ____ Increases power because it provides more information about the population, reducing the standard error making it easier to detect a true effect.
2. ___ Easier to detect, increasing the power of the study
3. ___ Increases power but also increases the risk of Type 1 error (false negative - set 0.05)
4. ____ using matched pairs, repeated measures or other designs that control for extraneous variables can increase power
5. ____ more powerful if the direction of the effect is correctly Specifed because the critical region is concentrated in one tail
6. ____ less powerful because the critical region is split between two tails, but it is more conservative and less prone to type 1 error
Larger sample size
Larger effect size
High significance level
Study design
One tailed
Two tailed test
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
____ refers to the real world importance or relevance of a research, beyond just its statistical significance.
• It considers the size of the effect. Even if the result is statistically significant, it might not be practically significant if the effect size is too small to matter in real-world applications
• Studies with larger effect sizes require smaller sample sizes to achieve a high power.
Practical intelligence
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ hypothesis test used to compare the means of two groups. It helps determine whether there is a significant difference between the group’s means, which can indicate an effect or relationship.
• If > the critical value,fail to reject the null hypothesis, if less than < or equal to the critical value, reject the null hypothesis
• Use Non-parametric test like the MANN-WHITNEY U TEST if the data does not meet the normality
• Use Welch’s T-test, if the variance is unequal
T-test
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ used to determine whether the mean of a single sample is significantly different from a KNOWN or POPULATION MEAN.
• Particularly useful when you want to compare the sample mean to a specific value
—— ASSUMPTIONS —
1. Data normally distributed (especially for sample sizes)
2. Observations should be independent
3. Measured on interval or ratio scale
• If non-normal distribution, use Non-parametric test like Wilcoxon signed rank test
EX : Testing if the average height of a sample of students is different from the national average height.
One sample t-test
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ represent the number of independent pieces of information available to estimate another piece of information.
• Often calculated as the sample size minus the number of parameters instead.
• They affect the shape of the sampling distribution and the precision of parameter estimates
• T-TEST - Used to compare means between groups. It affect the critical t- value
• CHI-SQUARE- calculated based on the number of categories
• ANOVA - used to determine the F- distribution
• SIMPLE REGRESSION - (n-2), n is the number of observation
• MULTIPLE REGRESSION- (n-k-1), k is the number of predictors
• GOODNESS OF FIT - (K-1) k, is the number of categories
• TEST OF INDEPENDENCE - (r-1) (c-1), where r is the number of rows and c is the number of columns
EX : suppose you have a sample of 10 students and you want to calculate the mean daily calcium intake.
df= n-1 = 10-1 = 9
Degrees of freedom
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ a type of probability distribution that is symmetric and bell shaped
• Particularly useful when dealing with small sample sizes or when the population standard deviation is unknown
• The shape of the distribution depends on the degrees of freedom, which are related to sample sizes. As the degrees of freedom increases, the t-distribution approaches the normal distribution
T-distribution
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ a standardized test statistic used in hypothesis testing particularly when dealing with small sample sizes or when the population standard deviation is unknown.
• A large ___ indicates a greater differences between the sample mean and the population mean.
• A smaller indicates a smaller difference
T-score
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ used to compare the means of two related groups.
• Particularly useful when the same subjects are measured under two different conditions or at two different times.
• Use, when you have paired data, such as measurement taken from the same subjects before and after a treatment
• When data is normally distributed
• Use Non-parametric test like Wilcoxon signed rank test if the difference do not meet the normality assumption
EX : suppose you want to test wether a new teaching method improves student performance. You measure the test scores of 10 students before and after using the new method.
T-test for dependent samples
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ The ability of a statistical method to remain effective even when certain assumptions are violated or when there are small deviations from ideal conditions.
• Robust methods are less effective by outliers or extreme values. ( Ex: The median is robust measure of central tendency because it is not influenced by extreme values unlike mean)
• Robust measures perfrom well even when assumptions (normality, homoscedasticity) are not fully met.
- Non-parametric test like Mann Whitney U test are robust alternative when the assumptions of normality is violated
• Robust regression analysis such as (LAD) are used when data contain outliers
Robustness
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
—– T-TEST FOR INDEPENDENT SAMPLES —
____ a type of experimental design where different participants are assigned to each condition or group.
• Participants are divided into separate groups with each group experiencing a different condition or level of independent variable
EX : In a study testing a new drug, one group receives the drug, while another group receives a placebo.
• Variability between participants can affect the results, though random assignment helps mitigate this.
Between-subjects design
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
____ compares the means of two independent groups to see if they are significantly different from each other.
• Population variance is not known
• When dependent variable is continous (test scores, weight)
• When you have two independent, unrelated groups (diff participants in each group)
• When the data is approximately normally distributed
• When the variances of the two groups are equal (homogeneity of variance) - can be tested using levene’s test
T-test for independent samples
CHAPTER 5 MAKING SENSE OF STATISTICAL SIGNIFICANCE
___ a non parametric test used to compare difference between two independent groups when the sample distributions are not normally distributed. It is an alternative to the independent samples t,-test and is particularly useful for ordinal data or when the assumptions of the t-test are not met.
• Data are not normally distributed
• When the data are ordinal
• Sample sized are small (less 30)
• If the u is less than the critical value, reject the null hypothesis
Mann Whitney U test