Week 10: Survey Analysis Flashcards
What is the purpose of power calculations in research design?
Power calculations determine the minimum sample size needed to precisely detect a program’s impact and inform data collection planning, budget, timeline, accuracy, and precision. They are used to compute power and minimum detectable effect size.
How do you calculate power for comparing two means?
<power twomeans mean1 mean2, sd1(x) sd2(y) n1(x) n2(y) power (.8) alpha(x)>
Unless using an alpha level other than 0.05, it doesn’t need to be specified. By default power is 80%
If alpha is reduced from 0.05 to 0.01, how does it affect power?
Lowering alpha decreases power because the probability of a Type II error increases (assuming all else is equal), reducing the likelihood of detecting a true effect
How does increasing sample size affect power?
Increasing sample size generally increases power, reducing the chance of a Type II error
What is the command to perform a power test for two proportions?
<power twoproportions prop1 prop2, n1(x) n2(y) alpha(x)>
What is the impact of unequal group sizes on power?
Power decreases when sample sizes are unequal, even if the total sample size remains constant
How do you set survey design in Stata?
<svyset psuid [pweight=finalwgt], strata(stratid) singleunit(centered)>
The option <singleunit> allows for different ways of handling a single PSU in a stratum. There are usually at least two PSUs in each stratum. If there is only one PSU in a stratum (due to missing data or a subpopulation specification, for example) the variance cannot be estimated in that stratum. If the default option <missing> is used, there will be no SEs when Stata encounters a single PSU in a stratum. The <centered> option centres data with one sampling unit at the grand mean instead of the stratum mean</centered></missing></singleunit>
How do you obtain the mean and standard deviation of a variable considering survey design?
<svy: mean var_name>
<estat>
Once the survey design has been set, the <svy: > prefix can be used to calculate descriptive statistics (also includes <svy: tab cat_var> from which Stata will also give p values from chi-square tests)
</estat>
How does ignoring survey design affect statistical estimates?
Ignoring the survey design underestimates point estimates and their standard errors, increasing risk of Type I errors.
The sampling weight will affect the calculation of the point estimate, and the stratification/clustering will affect the calculation of standard errors.
How do you calculate the mean of a variable considering a categorical variable?
<svy, over(cat_var): mean var_name>
What happens to sample size requirements as the desired detectable difference decreases?
Smaller differences require larger sample sizes to achieve the same power and significance level
How does the SE change when using survey design in Stata?
SE increases when survey design features (e.g., weights, strata) are considered, providing more accurate confidence intervals
What is the built-in programme for calculating power?
<power>
The command is best used for simple randomisations with no clustering. It performs power and sample-size analysis for studies using hypothesis testing to infer about population parameters. You can compute sample size given power and effect size; or the minimum detectable effect size and the corresponding target parameter given power and sample size.
</power>
If the power of the analysis is 0.8611 at the 0.05 level, what does that mean for the difference between means?
The power is 84.1%, meaning there is 86% probability of finding a statistical difference at the 0.05 level in two means if a true difference exists
What happens to sample size requirements if we increase power?
Sample size increases