EBM4 Hypothesis testing Flashcards
How did we start this lecture
She started the lecture by showing us a poster from the UK claiming that third-generation OCs carry a 100% increased risk of VTE compared with older generations. The scare led to many unwanted pregnancies in the UK. She then showed us the math: the 100% figure overstates the danger because it is only a relative risk increase. In cases like this it is worthwhile to look at the absolute risk increase as well. Using numbers from the studies, she showed that the absolute risk increase is quite small, only 0.014%, so about 7,000 women would have to take the new-generation OCs to cause 1 additional case of VTE (this is the number needed to harm).
“Third generation OCs are associated with twice the risk of VTE (100% increase) compared to older products”. Identify what aspects of EBM are being discussed here
"Twice the risk" is a risk ratio (relative risk) of 2, and the "100% increase" is the relative risk increase.
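To make the relative versus absolute distinction concrete, here is a minimal Python sketch of the arithmetic. The baseline risk of 0.014% is an assumption inferred from the lecture figures (a doubled risk with an absolute increase of 0.014% implies a baseline of 0.014%); it is not quoted from the studies, and the function name `risk_measures` is purely illustrative.

```python
# Risks are written as proportions (0.00014 is 0.014%).
# Assumption: the baseline risk is inferred from the lecture figures, not taken from the studies.
risk_old = 0.00014  # older-generation OCs
risk_new = 0.00028  # third-generation OCs (twice the baseline)


def risk_measures(risk_exposed, risk_unexposed):
    """Return risk ratio, relative risk increase, absolute risk increase, and NNH."""
    risk_ratio = risk_exposed / risk_unexposed
    absolute_risk_increase = risk_exposed - risk_unexposed
    relative_risk_increase = absolute_risk_increase / risk_unexposed
    number_needed_to_harm = 1 / absolute_risk_increase
    return risk_ratio, relative_risk_increase, absolute_risk_increase, number_needed_to_harm


rr, rri, ari, nnh = risk_measures(risk_new, risk_old)
print(f"Risk ratio: {rr:.1f}")
print(f"Relative risk increase: {rri:.0%}")
print(f"Absolute risk increase: {ari:.3%}")
print(f"Number needed to harm: about {nnh:.0f}")
```

The same pair of risks gives a dramatic relative figure (100%) and a tiny absolute one (0.014%), and the number needed to harm comes out at roughly 7,000, in line with the lecture.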
What are the two kinds of variables
Dependent and independent variables. In a study, the exposure is the independent variable and the outcome is the dependent variable.
Know how to state null hypothesis in terms of the context of a specific study.
A null hypothesis can also be stated for a continuous variable, in which case we compare population means (for example age or BMI).
Why do we make a two-sided alternative hypothesis
To allow for a difference in either direction from the value stated in the null hypothesis.
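One common way to write this down, sketched here for a comparison of two population means (the symbols $\mu_1$ and $\mu_2$ are notation assumed for illustration, not taken from the lecture):

```latex
H_0 : \mu_1 = \mu_2
\qquad \text{versus} \qquad
H_A : \mu_1 \neq \mu_2
```

The "not equal" sign is what makes the alternative two-sided: it covers both $\mu_1 > \mu_2$ and $\mu_1 < \mu_2$.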
What are parametric methods
Parametric methods require us to make assumptions about the underlying distribution of the data (such as a normal, bell-shaped distribution and a sufficiently large sample size). Two statistical tools that use parametric methods are:
- t tests
- ANOVA
What is Chi squared test used for
The chi-squared test is used to compare proportions between two or more groups. It is for categorical data only (examples: blood type, state of residence, ethnicity).
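As a rough illustration of comparing proportions between two groups, the sketch below computes a Pearson chi-squared statistic for a 2x2 table using only the standard library. The counts are made up, the function name `chi_squared_2x2` is hypothetical, and no continuity correction is applied.

```python
import math


def chi_squared_2x2(table):
    """Pearson chi-squared statistic and p-value for a 2x2 table of counts."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the outcome were unrelated to the group.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # With 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts: rows are the two groups, columns are outcome yes / no.
table = [[30, 70],
         [15, 85]]
chi2, p = chi_squared_2x2(table)
print(f"chi-squared = {chi2:.2f}, p = {p:.3f}")
```

With these invented counts (30% versus 15% with the outcome) the statistic is about 6.45 and the p-value is about 0.01.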
What is t test and ANOVA used for
A t test is used to compare the mean values of two (and only two) groups. It applies to numerical data, which includes continuous measurements.
ANOVA stands for analysis of variance. It is used to compare mean values across more than two groups (it also works for exactly two).
ANOVA can do what a t test can, but a t test cannot do what ANOVA can.
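The relationship between the two tests can be sketched with the standard library alone. The example below assumes an equal-variance (pooled) two-sample t test and a one-way ANOVA; the BMI values and function names are hypothetical. It computes test statistics only, not p-values.

```python
from statistics import mean, variance


def pooled_t_statistic(group_a, group_b):
    """Two-sample t statistic assuming equal variances in both groups."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / (n_a + n_b - 2)
    standard_error = (pooled_var * (1 / n_a + 1 / n_b)) ** 0.5
    return (mean(group_a) - mean(group_b)) / standard_error


def one_way_anova_f(*groups):
    """F statistic: between-group mean square divided by within-group mean square."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    k, n = len(groups), len(all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


# Hypothetical BMI values for three groups.
group_a = [22.1, 24.3, 23.8, 25.0, 21.9]
group_b = [26.4, 27.1, 25.8, 28.3, 26.9]
group_c = [24.0, 23.5, 25.2, 24.8, 23.9]

t = pooled_t_statistic(group_a, group_b)
f_two = one_way_anova_f(group_a, group_b)
f_three = one_way_anova_f(group_a, group_b, group_c)
print(f"t = {t:.3f}, t squared = {t ** 2:.3f}, F (two groups) = {f_two:.3f}")
print(f"F (three groups) = {f_three:.3f}")
```

With two groups the ANOVA F statistic equals the square of the pooled t statistic, which is one way to see that ANOVA covers the t test case. Only ANOVA extends to the third group.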
Factors that go into 0.05
0.05 is not a “magic” number; the measure of association must also be taken into account.
What is Z score
A Z score expresses how many standard deviations a value lies from the mean.
All of these tests produce a test statistic: a quantitative estimate of the exposure-outcome relationship between the groups being compared.
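A Z score for a single value can be sketched as follows. The blood pressure figures are hypothetical and the function name `z_score` is illustrative.

```python
def z_score(value, population_mean, population_sd):
    """Number of standard deviations that value lies above (+) or below (-) the mean."""
    return (value - population_mean) / population_sd


# Hypothetical example: a systolic blood pressure of 140 in a population
# with mean 120 and standard deviation 10.
z = z_score(140, 120, 10)
print(f"z = {z:.1f}")
```

Here the value sits two standard deviations above the mean.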
What are the graphs of Z scores
She also showed us a graph of Z scores and their bell-shaped distribution. She then pointed out that if the test statistic is less than or equal to -1.96 or greater than or equal to 1.96 (that is, it lies outside this range), the result is statistically significant at the 0.05 level.
Another way to say this is that if the test statistic is beyond the critical values (below -1.96 or above 1.96), then p < 0.05.
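The link between the ±1.96 cut-offs and the 0.05 level can be checked with Python's `statistics.NormalDist`. This is a sketch for a two-sided test on a standard normal test statistic; the z values in the loop are arbitrary examples.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1


def two_sided_p_value(z):
    """Probability, under the null, of a Z statistic at least this far from 0 in either direction."""
    return 2 * (1 - standard_normal.cdf(abs(z)))


# Upper cut-off that leaves 2.5% in each tail (5% in total).
critical_value = standard_normal.inv_cdf(0.975)
print(f"critical value = {critical_value:.2f}")

for z in (1.0, 1.96, 2.5, -3.0):
    p = two_sided_p_value(z)
    verdict = "significant" if abs(z) >= critical_value else "not significant"
    print(f"z = {z:+.2f}  p = {p:.4f}  {verdict} at the 0.05 level")
```

A statistic of 1.96 gives a p-value just under 0.05, and anything further from zero gives a smaller one.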
What does the p value tell us
A p-value doesn’t tell us if the null hypothesis is correct or not. A low p-value means that the data collected are not very consistent with the null hypothesis. A high p-value indicates that the data are reasonably consistent with the null hypothesis. Remember that a p-value is calculated on the assumption that the null hypothesis is true.
Define P value
The probability that an association at least as strong as the one observed (i.e. as strong or stronger) would have arisen by chance alone if the null hypothesis were true.
In other words this means that the p value is a measure of relative consistency between the null hypothesis and the data collected.
NOTE: The p value is not the probability that the null hypothesis is correct.
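One way to see what "at least as strong as observed, by chance alone" means is an exact permutation test. This is an illustrative technique, not one from the lecture. The sketch below shuffles the group labels in every possible way and counts how often the difference in means is at least as large as the one actually observed. The measurements and the function name are hypothetical.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(group_a, group_b):
    """Exact two-sided permutation p-value for a difference in means."""
    pooled = group_a + group_b
    observed = abs(mean(group_a) - mean(group_b))
    indices = range(len(pooled))
    at_least_as_extreme = 0
    total = 0
    # Every way of relabelling len(group_a) of the pooled values as "group A".
    for chosen in combinations(indices, len(group_a)):
        chosen_set = set(chosen)
        relabelled_a = [pooled[i] for i in chosen]
        relabelled_b = [pooled[i] for i in indices if i not in chosen_set]
        difference = abs(mean(relabelled_a) - mean(relabelled_b))
        # Small tolerance guards against floating-point ties.
        if difference >= observed - 1e-12:
            at_least_as_extreme += 1
        total += 1
    return at_least_as_extreme / total


# Hypothetical measurements for an exposed and an unexposed group.
exposed = [5.1, 6.3, 5.8, 6.9]
unexposed = [4.2, 4.9, 5.0, 4.4]
p = permutation_p_value(exposed, unexposed)
print(f"p = {p:.3f}")
```

Because every exposed value here is higher than every unexposed value, only 2 of the 70 possible relabellings give a difference this large, so p is 2/70, about 0.029. The calculation assumes the labels are interchangeable, which is the null hypothesis, so it says how surprising the data are under the null and not how likely the null is.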
What happens when the p value is small
If the p value is small, we reject the null hypothesis and consider the findings of the experiment statistically significant.
What are chi squared test and t tests used for
Chi-squared tests are used for categorical data; t tests are used for numerical data.