Biostatics 3 Flashcards
How is statistics divided?
Statistics is divided into descriptive and inferential statistics.
What is the purpose of descriptive statistics?
Presenting, organizing, and summarizing data.
What methods are used in descriptive statistics?
Methods include calculating measures of central tendency (mean, median, mode), measures of spread (range, variance, standard deviation), and using graphical representations (histograms, bar charts, pie charts).
What is the purpose of inferential statistics?
Drawing conclusions about a population based on data observed in a sample.
What methods are used in inferential statistics?
Methods include hypothesis testing, confidence intervals, and regression analysis.
How do descriptive and inferential statistics differ?
Descriptive statistics summarize and present data, while inferential statistics make predictions or inferences about a population based on a sample.
When would you use descriptive statistics?
Use descriptive statistics when you need to describe the basic features of the data in a study.
When would you use inferential statistics?
Use inferential statistics when you want to make predictions or generalizations about a larger population from a sample.
Give an example of descriptive statistics.
Calculating the average age of students in a class.
Give an example of inferential statistics.
Estimating the average age of all students in a school based on a sample of students.
What is the purpose of statistical inference?
To make conclusions about a population based on data observed in a sample.
What do statistical tests allow us to do?
They allow us to make statistical inferences.
What question does statistical inference help answer?
Is the result observed in our sample likely to be a true reflection of the result in the population?
What is a study population?
The defined patient population we are interested in.
What is a sample?
A subset of the study population from which data is collected.
What are sample statistics?
Measures computed from a sample, also known as point estimates.
What are true population parameters?
Fixed values that are unknown but represent the true characteristics of the entire population.
What is the goal of statistical inference?
To estimate unknown population parameters and determine the likelihood that the sample results reflect the true population characteristics.
What is the basis of statistical tests?
Hypothesis testing.
What do we start by assuming in hypothesis testing?
There is no relationship between the variables in the population.
What is the null hypothesis (H0)?
The assumption that there is no relationship between the variables.
What is the alternative hypothesis (H1)?
The assumption that there is a relationship between the variables.
Give an example of a null hypothesis in a study about pregnancy planning and age
There is no difference in age between women who plan their pregnancy and women who do not plan their pregnancy.
Give an example of an alternative hypothesis in a study about pregnancy planning and age.
There is a difference in age between women who plan their pregnancy and women who do not plan their pregnancy.
What is the goal of hypothesis testing?
To gather statistical evidence to support or refute the null hypothesis.
Why is hypothesis testing important in statistical analysis?
It helps determine whether observed data provide sufficient evidence to conclude that a relationship exists in the population
What is a Type 1 error?
A Type 1 error occurs when we incorrectly reject the null hypothesis.
What is another name for a Type 1 error?
False positive.
What does a Type 1 error imply?
It implies finding a relationship or effect when there is none.
What is a Type 2 error?
A Type 2 error occurs when we fail to reject the null hypothesis when it is false.
What is another name for a Type 2 error?
False negative.
What does a Type 2 error imply?
It implies not finding a relationship or effect when there is one.
Why is it important to understand Type 1 and Type 2 errors?
To accurately interpret the results of hypothesis testing and understand the potential risks of incorrect conclusions.
What is the p-value?
The p-value is the probability of finding the observed relationship, or one more extreme, assuming the null hypothesis is true.
What threshold is commonly used to determine statistical significance?
A threshold of 0.05.
What does it mean if p ≤ 0.05?
We reject the null hypothesis, and the p-value is said to be “significant.”
What does a significant p-value (p ≤ 0.05) imply?
It implies there is less than a 5% probability that the observed relationship is due to chance if the null hypothesis were true.
What does it mean if p > 0.05?
We do not reject the null hypothesis, and the p-value is said to be “not significant.”
What does a non-significant p-value (p > 0.05) imply?
It implies there is a high probability that the observed relationship could occur by chance if the null hypothesis were true.
Why is the p-value important in hypothesis testing?
It helps determine whether the observed results provide enough evidence to reject the null hypothesis.
What is the consequence of a low p-value (≤ 0.05)?
It indicates strong evidence against the null hypothesis, suggesting a true effect or relationship in the population.
What is the consequence of a high p-value (> 0.05)?
It indicates insufficient evidence to reject the null hypothesis, suggesting the observed relationship could be due to chance.
What is statistical power?
The probability of correctly finding a relationship when the null hypothesis is false.