L6 to L10 - Data Analysis Flashcards
The general procedure for hypothesis testing
- Define Problem
- Number of groups, independence, type of data, tail test - State Hyp
- Compute Test Stats
- Find p-value
- Compare p-value with a
- State Conclusion
State the assumptions of Parametric Tests
- Samples are drawn from NORMALLY distributed ppn
2. All samples have equal variances
A crossover study compared the LDL cholesterol levels after a 2-week diet with oat bran cereal and corn flakes respectively, in 14 individuals with hypercholesterolemia. At a significance level of 0.05, is there a difference in LDL levels between the 2 diets?
Assuming data is normal, state the type of study to be carried out, the Hypothesis and the assumptions
- Study: Paired-samples t-test
- Hypothesis:
- H0: µd = 0, MEAN difference btwn LDL after oat and corn = 0
- H1: µd ≠ 0 - Assumptions
- Samples are drawn from NORMALLY distributed ppn
- All samples have equal variances
- *µd is normally distributed
For any tests involving all continuous data, what must be done before selecting the appropriate test?
Normality Test
Define the problem and state the test(s) to carry out, assuming data is continuous
Two different medications, tablets A and B were administered to two groups of patients to compare their ability to increase INR after one day. At a significance level of 0.05, is there a difference in the increase of INR in one day between the two tabs?
Define the problem:
- How many and which samples being compared: Tablets A and B
- INDEPENDENT samples
- Outcome of interest: Increase in INR
- Type of data to be analysed: Continuous data
- Tail: Two-tailed
Tests:
- Independent Samples t-test
- Normality test
- Variance test
Advantage of Levene’s test over F-test for equality of variance
- F-test: Assumption that ppn from which samples are obtained must be NORMAL must be met
- F-test: Compare variance of only 2 groups
Levene’s test: can use regardless of those (more than two groups, ppn no need normal)
Hypotheses for F-test of equality of variance
H0: Variance are EQUAL
H1: Variance are UNEQUAL
A clinical trial was conducted in 2 groups of study participants to compare tx A and B onset to relieving headache. Assume that data normally distributed. At α = 0.05, p = 0.02.
Tx A: Mean onset 34 min, SD 3.75, sample size 15
Tx B: Mean 25.8, SD 7.43, Sample 10
Variance test: p < 0.01
State the likely stats test used in this study, and formulate a conclusion for the study
Test: Independent-samples t-test with UNEQUAL VARIANCE
Conclusion: At a significance level of 0.05, there is a statsig difference in the MEAN onset to relieving headache between tx A (34.0 +/- 3.75 mins) and tx B (25.8 +/- 7.43 mins) (p = 0.02).
What are the other tests to be carried out for INdependent-samples t-test
- Equality of variance test
2. Normality test
One way ANOVA (OWA) is for what kind of comparison? State its Hypotheses
Comparing data between >2 independent groups and analyses variance
H0: Ppn means corresponding to random samples are equal
H1: Not equal (at least one is not equal)
For >2 independent groups, why should multiple independent t-test not be carried out?
Multiplicative rule of (1-a) causes increased probability of type I error and decreases probability of making correct conclusion
Assumptions of OWA
- Samples are random samples of ppn
- Ppns are independent
- Underlying ppn is normally distributed
- Ppn has equal variances*
- If non-equal variance, use Welch-Anova
For F test of ANOVA, what is the equation?
F = Sb/Sw
Also, Mean squares (variance) = Sum of squares/df
Sb = BETWEEN GROUP variance Sw = WITHIN GROUP variance
Briefly describe the principles of one-way anova
Analyses within group variation (individual eans vs ppn means) and Btwn group variations (underlying ppn means VS overall means)
- If Larger btwn grp variation, it can be implied that underlying population means are different
A study compared the pulmonary function (forced expiratory volume in one second, FEV1) for patients with coronary artery disease from 3 different hospitals. At a significance level of 0.05, is there a difference in FEV1 among these patients?
Define the Problem. State all statistical tests you will carry out to arrive at the final stats test to use to analyse the data
- How many samples: 3
- Outcome of Interest: FEV1
- Data to be analysed: Continuous data
Other tests:
- Normality Test
- Equality of variance using LEVENE’S test (F test not for >2 samples)
If Data is normal, and variance is equal, use OWA
A study compared the increase in breathing rate of three groups of 30 participants in three different types of exercises (Running, weightlifting, Skipping).
Normality test shows that p > 0.05 for all three samples.
Equality of variance via levene’s test shows that p < 0.05 for running and skipping, but p > 0.05 for weightlifting
State:
- The normality test that is used
- Statistical test to be used to analyse the data
- Normality test: Shapiro-Wilk for all three groups (n < 50)
- Stats test to be used:
– Breathing rate: Continuous Data
– All data are normally distributed
– Not all variance are equal
Hence use WELCH-ANOVA
A study compared the increase in breathing rate of three groups of 30 participants in three different types of exercises (Running, weightlifting, Skipping). At a significance level of 0.05, the p-value was found to be 0.0422.
Assuming OWA was used to analyse data, state the conclusion of this study
At a significance level of 0.05, not all MEAN increase in breathing rate for the three groups of participants are the same.
Purpose(s) of Post-hoc tests in OWA
Identify the groups with differences, while controlling the overall probability of making type I error on predetermined alpha
Main difference between post-hoc tests
How conservative they are
- More conservative = reduce stats power (greater chance for type II error) but controls type I error better
List and rank the post-hoc tests in terms of conservativeness
- Bonferroni Adjustment
- Least sig. difference (LSD) test
- Turkey’s test
- Scheefe’s procedure
- Dunnett’s test
Conservativeness
4 > 1 > 3 > 2
5: Special case