Artikelen 2 Flashcards
Sir Ronald Fisher (1890-1962) introduced the terms ‘null hypothesis’ and ‘significance’, urged the systematic distinction between sample and operation, suggested a level of 0.05 as an arbitrary but convenient level to judge a result significant, and proposed many techniques, including the analysis of variance.
oke
Sometimes people say we only need to use probabilities in situations where we are ignorant; if we could know enough to make exact predictions, we would not need to talk in terms of probabilities. Popper argued that such subjective interpretations of probability did not apply in science. According to the objective interpretation of probability, probabilities exist in the world, independent of our states of knowledge. They are discovered by examining the world, not by reflecting on what we know or how much we believe.
oke
Von Mises’s long-run relative frequency interpretation of probability says that the probability of a head coming up is the proportion of times the coin produces heads in a hypothetical infinite number of tosses. Because the long-run relative frequency is a property of all the events in the collective, the probability of the next toss being a head applies to the collective, not to any single event.
oke
Sometimes significance is defined simply as ‘the probability of a Type I error’, but this is wrong. The probability of a Type I error when the null hypothesis is true is 0.05, but the probability of a Type I error when we have rejected the null hypothesis is 29 %.
oke
Power is defined as …
the probability of detecting an effect given that the effect really exists in the population.
in order to control B, you need to… 2 dingen!!
(1) Estimate the size of effect (e.g. mean difference) you think is interesting, given your theory is true.
(2) Estimate the amount of noise your data will have.
You can estimate the amount of noise in your data by looking at past similar studies, or by running a pilot study.
oke
Most studies do not systematicallv use power calculations to determine the number of participants, but they should. Ignoring the systematic control of Type II errors leads to inappropriate judgments about what results mean.
oke
wat is heel belangrijk als je een studie gaat repliceren
meestal heb je meer participanten nodig, omdat de power verlaagd is, omdat het een andere sample is!!! mean is anders, sd etc.
But assuming the population effect was estimated exactlv bv the American stud,
and the ,vithin-group nriance ,vas exactly as estimated by the American study of replicating with the same number of subjects as the original studv was about 0.67
Some reviewers will be tempted to think that the result is thrown in doubt by all the non-significant studies, but if the null hypothesis were true, one would expect an equal number of studies showing a significant effect in one direction as in the other.
oke
If your study has low power, a null result tells you nothing. If you set power at a high level in designing the experiment, you are entitled to accept the null hypothesis.
oke
You can deduce the probability of the experimental hypothesis being true.
DIT IS EEN BESCHRIJVING VAN POWER, NIET VAN DE P VALUE OF DE WAARHEID VAN DE H0!!!!!!!!!!!!!!!!!!!!!!!!11
Sensitivity can be determined in three ways:
power, confidence intervals and finding an effect significantly different from another reference one.
A stopping rule is a condition under which you will stop collecting data for a study. Running until you have a significant result is not a good stopping rule because you are guaranteed to obtain a significant result eventually.
OKE
A researcher might mainly want to look at one particular comparison, but threw in some other conditions out of curiosity. If she planned one particular comparison in advance, she can test at the 0.05 level, but the other tests must involve a correction, like Bonferroni.
OKE