Variance & Power Flashcards
Define the range of sample
The distance from the smallest data point from the largest
What is another name for the normal distribution
Bell curve or Gaussian distribution
Consider the plot of the results from the experiment to determine if either two genetic manipulations effected growth. Did a or b have a statistically significant effect. The mean effect of both A and B group are clearly larger then C can we make conclusions based on this
- A is different but not B because it overlaps
- no because the mean average falls within the confidence interval of 95% and if there is a lot of variation in the data the confidence might be wide, the mean value is an estimate only and does not reflect variation in the data
Two types of error
Type I: false pos (rejecting a null hypothesis when there is no real difference)
Type II: false negative (failing you reject a null hypothesis when the alternative hypothesis is the true state)
The 95% confidence interval bars around the means tells us what
This means that there is a 95% confidence that the real mean falls within within the interval bars
Therefore if the confidence interval overlaps we can not say that there is a difference because it is possible that the real mean in both groups is the same value
What is the relationship between statistical power and replication
The more replication (is coin flips) the greater the power
If the trial to test of a coin was fair involved 10 tests. The 95% confidence interval was 0.19 to 0.81. What does this mean
It means that we can not reject the Ho that the coin is fair as long as we get between 2(0.19) and 8(0.81)
What are the three main factors affecting statistical power
Sampling size
Effect size
Variation in the data
Draw the power curve and explain what it is
Illustrates the relationship between sampling effort and the inferential power of those samples for a given size. It shows the min number is samples necessary to provide sufficient power and when the addition of more samples will yield little improvement in power
A bad
B good
C bad
Define p-value
The prob that we would collect data with the observed level of difference if in reality the Ho is true. In other words if we see a very large difference the prob of this magnitude being observed given Ho is true is less than 5%
If confidence increases two the interval increase or decrease
Increases
Measures of variability
Describes the dispersion of the values
Range Variance Standard deviation Confidence interval Standard error of mean
Normal distribution
If a set of normally distributed about its mean, variation is distributed evenly around the mean
With SD we can estimate how dispersed the data are without actually sampling all individuals
If estimator varied widely under repeated sampling what happens to CI
Confidence in results will be lower as it is a less precise estimator or the pop param
Which of the following increase power? Increasing variation Increasing sample size Increasing acceptable error Increasing the mag of difference between means
No
Yes
No
Yes