Biostatics 3 Flashcards

1
Q

How is statistics divided?

A

Statistics is divided into descriptive and inferential statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the purpose of descriptive statistics?

A

Presenting, organizing, and summarizing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What methods are used in descriptive statistics?

A

Methods include calculating measures of central tendency (mean, median, mode), measures of spread (range, variance, standard deviation), and using graphical representations (histograms, bar charts, pie charts).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the purpose of inferential statistics?

A

Drawing conclusions about a population based on data observed in a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What methods are used in inferential statistics?

A

Methods include hypothesis testing, confidence intervals, and regression analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do descriptive and inferential statistics differ?

A

Descriptive statistics summarize and present data, while inferential statistics make predictions or inferences about a population based on a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When would you use descriptive statistics?

A

Use descriptive statistics when you need to describe the basic features of the data in a study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When would you use inferential statistics?

A

Use inferential statistics when you want to make predictions or generalizations about a larger population from a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Give an example of descriptive statistics.

A

Calculating the average age of students in a class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Give an example of inferential statistics.

A

Estimating the average age of all students in a school based on a sample of students.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of statistical inference?

A

To make conclusions about a population based on data observed in a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do statistical tests allow us to do?

A

They allow us to make statistical inferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What question does statistical inference help answer?

A

Is the result observed in our sample likely to be a true reflection of the result in the population?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a study population?

A

The defined patient population we are interested in.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a sample?

A

A subset of the study population from which data is collected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are sample statistics?

A

Measures computed from a sample, also known as point estimates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are true population parameters?

A

Fixed values that are unknown but represent the true characteristics of the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the goal of statistical inference?

A

To estimate unknown population parameters and determine the likelihood that the sample results reflect the true population characteristics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the basis of statistical tests?

A

Hypothesis testing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What do we start by assuming in hypothesis testing?

A

There is no relationship between the variables in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the null hypothesis (H0)?

A

The assumption that there is no relationship between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the alternative hypothesis (H1)?

A

The assumption that there is a relationship between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Give an example of a null hypothesis in a study about pregnancy planning and age

A

There is no difference in age between women who plan their pregnancy and women who do not plan their pregnancy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Give an example of an alternative hypothesis in a study about pregnancy planning and age.

A

There is a difference in age between women who plan their pregnancy and women who do not plan their pregnancy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the goal of hypothesis testing?

A

To gather statistical evidence to support or refute the null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Why is hypothesis testing important in statistical analysis?

A

It helps determine whether observed data provide sufficient evidence to conclude that a relationship exists in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is a Type 1 error?

A

A Type 1 error occurs when we incorrectly reject the null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is another name for a Type 1 error?

A

False positive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What does a Type 1 error imply?

A

It implies finding a relationship or effect when there is none.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is a Type 2 error?

A

A Type 2 error occurs when we fail to reject the null hypothesis when it is false.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is another name for a Type 2 error?

A

False negative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What does a Type 2 error imply?

A

It implies not finding a relationship or effect when there is one.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Why is it important to understand Type 1 and Type 2 errors?

A

To accurately interpret the results of hypothesis testing and understand the potential risks of incorrect conclusions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is the p-value?

A

The p-value is the probability of finding the observed relationship, or one more extreme, assuming the null hypothesis is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What threshold is commonly used to determine statistical significance?

A

A threshold of 0.05.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What does it mean if p ≤ 0.05?

A

We reject the null hypothesis, and the p-value is said to be “significant.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What does a significant p-value (p ≤ 0.05) imply?

A

It implies there is less than a 5% probability that the observed relationship is due to chance if the null hypothesis were true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What does it mean if p > 0.05?

A

We do not reject the null hypothesis, and the p-value is said to be “not significant.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What does a non-significant p-value (p > 0.05) imply?

A

It implies there is a high probability that the observed relationship could occur by chance if the null hypothesis were true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Why is the p-value important in hypothesis testing?

A

It helps determine whether the observed results provide enough evidence to reject the null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the consequence of a low p-value (≤ 0.05)?

A

It indicates strong evidence against the null hypothesis, suggesting a true effect or relationship in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is the consequence of a high p-value (> 0.05)?

A

It indicates insufficient evidence to reject the null hypothesis, suggesting the observed relationship could be due to chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is statistical power?

A

The probability of correctly finding a relationship when the null hypothesis is false.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Why is statistical power important?

A

It indicates the likelihood of detecting an actual effect or relationship in the population.

45
Q

How does sample size affect statistical power?

A

Larger samples have more statistical power, while smaller samples have less.

46
Q

How does effect size influence statistical power?

A

Larger effect sizes are easier to detect and require less power, while smaller effect sizes are harder to detect and require more power.

47
Q

How does the level of significance affect statistical power?

A

A higher level of significance (e.g., 0.05) requires less power, while a lower level of significance (e.g., a smaller p-value) requires more power

48
Q

What is the effect of a large sample size on power?

A

Increases power, making it easier to detect true relationships.

49
Q

What is the effect of a small sample size on power?

A

Decreases power, making it harder to detect true relationships.

50
Q

What happens when the effect size is large?

A

It is easier to find the effect, requiring less statistical power.

51
Q

What happens when the effect size is small?

A

It is harder to find the effect, requiring more statistical power.

52
Q

What should be considered to ensure adequate statistical power?

A

Consider the sample size, effect size, and desired level of significance to ensure the study is well-powered to detect true relationships.

53
Q

What does a p-value less than 0.05 indicate?

A

The sample size and power were adequate to detect the difference.

54
Q

What does a p-value greater than 0.05 indicate?

A

We cannot say that the null hypothesis is true, but there is not enough statistical evidence to reject it.

55
Q

What are possible reasons for a non-significant p-value?

A
  1. The study may be underpowered to detect the real difference.
  2. The sample size may be too small to detect the real difference.
  3. There may truly be no difference.
56
Q

How should p-values be reported?

A

P-values should be reported numerically, not just as “significant” or “not significant.”

57
Q

Why is it important to provide confidence intervals along with p-values?

A

Confidence intervals give an indication of the precision of summary statistics.

58
Q

What should be considered along with statistical significance?

A

Consider both statistical significance and clinical significance.

59
Q

What is the best approach for reporting statistical results?

A

Report both the p-values and confidence intervals to provide a complete picture of the data’s statistical and clinical significance.

60
Q

What is a confidence interval?

A

Another method for statistical inference, measuring the precision of our result in the sample in relation to the expected truth in the population.

61
Q

What do confidence intervals account for?

A

They account for sampling/random error.

62
Q

How do you interpret a confidence interval for a mean age?

A

If the mean age of women is 28 years with a 95% CI of 25-31 years, we are 95% confident that the true mean age of the population lies between 25 and 31 years.

63
Q

What does the 95% confidence level mean?

A

It means that 5% of the time, the true population mean will not lie within the sample’s 95% confidence interval just by chance.

64
Q

What do wide confidence intervals indicate?

A

Lack of precision. Example: 4.2 (0.3-16.2).

65
Q

What do narrow confidence intervals indicate?

A

Good precision. Example: 0.4 (0.2-0.6).

66
Q

How does sample size affect confidence intervals?

A

Larger samples are more likely to be representative of the population, resulting in more precise, smaller intervals.

67
Q

Why are confidence intervals important?

A

They provide a range within which we expect the true population parameter to lie, offering insight into the precision and reliability of our estimates.

68
Q

What indicates a non-significant result in terms of confidence intervals?

A

If the 95% confidence interval around your measure of association includes the null value, the p-value will be >0.05, and the result will not be statistically significant.

69
Q

How is statistical significance determined when testing the difference in mean gestational age?

A

If the 95% CI includes 0, there is a non-significant difference in the mean gestational age.

70
Q

How is statistical significance determined when measuring prevalence, risk, or odds ratio?

A

If the 95% CI includes 1.0, there is a non-significant association between the exposure and the outcome.

71
Q

What is the relationship between confidence intervals and p-values?

A

A 95% confidence interval that includes the null value corresponds to a p-value >0.05, indicating non-significance.

72
Q

Why are both confidence intervals and p-values important?

A

Confidence intervals provide a range of plausible values for the population parameter and indicate the precision of the estimate, while p-values indicate the probability of observing the data if the null hypothesis is true

73
Q

What should you consider when choosing a test to test your null hypothesis?

A

Consider the type of the dependent variable, whether the grouping variable is independent or paired, and the distribution of the data if the dependent variable is numerical.

74
Q

What types of dependent variables might you have?

A

Numerical or categorical.

75
Q

How do you determine if the grouping variable is independent or paired?

A

Independent groups are different and unrelated (e.g., men vs. women). Paired groups are related (e.g., before and after measurements in the same subjects).

76
Q

What should you check if the dependent variable is numerical?

A

Check the distribution of the data (normal or skewed).

77
Q

What do parametric tests assume?

A

They make assumptions about the distribution of the data (e.g., normal distribution).

78
Q

What do non-parametric tests assume?

A

They do not make assumptions about the distribution of the data.

79
Q

When should you use parametric tests?

A

When the data distribution meets the assumptions of the test (e.g., normal distribution for numerical data).

80
Q

When should you use non-parametric tests?

A

When the data distribution does not meet the assumptions of parametric tests or is unknown.

81
Q

What test is used to determine if a numerical variable is normally distributed?

A

Shapiro-Wilk test.

82
Q

What does the Shapiro-Wilk test do?

A

It tests whether the distribution of the variable is different from a normal distribution.

83
Q

What test is used for comparing a numerical variable across two independent groups if the distribution is normal?

A

T-test or Student’s T-test (parametric test).

84
Q

What does the T-test compare?

A

It tests for the difference in means.

85
Q

Why does the T-test not consider the rest of the distribution?

A

Because the characteristics of a normal distribution are fixed.

86
Q

What test is used for comparing a numerical variable across two independent groups if the distribution is skewed?

A

Mann-Whitney test or Wilcoxon rank-sum test (non-parametric test).

87
Q

What does the Mann-Whitney or Wilcoxon rank-sum test compare?

A

It tests for differences in the overall distribution of the variable.

88
Q

What test is used for comparing a numerical variable across more than two independent groups if the distribution is normal?

A

One-way ANOVA (parametric test).

89
Q

What does the One-way ANOVA test for?

A

It tests for the difference in means.

90
Q

What limitation does the One-way ANOVA have?

A

It tells you whether there are any differences, not which specific groups are different.

91
Q

What is needed if the One-way ANOVA finds a difference?

A

Additional tests (post-hoc tests) to see which groups are different.

92
Q

What test is used for comparing a numerical variable across more than two independent groups if the distribution is skewed?

A

Kruskal-Wallis test (non-parametric test).

93
Q

What does the Kruskal-Wallis test for?

A

It tests for differences in the overall distribution of the variable.

94
Q

What limitation does the Kruskal-Wallis test have?

A

It tells you whether there are any differences, not which specific groups are different.

95
Q

What is needed if the Kruskal-Wallis test finds a difference?

A

Additional tests (post-hoc tests) to see which groups are different.

96
Q

What are paired groups?

A

Groups that are related to each other, e.g., the same people measured at two different points in time.

97
Q

What test is used for comparing a numerical variable across two paired groups if the distribution is normal?

A

Paired T-test (parametric test).

98
Q

What does the Paired T-test compare?

A

It tests for the difference in means.

99
Q

What test is used for comparing a numerical variable across two paired groups if the distribution is skewed?

A

Wilcoxon signed-rank test (non-parametric test).

100
Q

What does the Wilcoxon signed-rank test compare?

A

It tests for differences in the overall distribution of the variable.

101
Q

What does the Chi-squared test compare?

A

It tests whether the observed frequencies are different from the frequencies that would be expected if there were no differences between the groups.

102
Q

When should you use Fisher’s exact test instead of the Chi-squared test?

A

When the expected frequencies are less than 5.

103
Q

How can you visually assess the relationship between two numerical variables?

A

Use a two-way scatter plot.

104
Q

What test is used for linear relationships if the variables are normally distributed?

A

Pearson’s correlation (parametric)

105
Q

What is the range of Pearson’s correlation coefficient (r)?

A

It ranges from -1 through 0 to +1.

106
Q

What test is used if the variables are not normally distributed?

A

Spearman’s correlation coefficient (non-parametric).

107
Q

What is the null hypothesis for testing the relationship between two numerical variables?

A

There is no relationship between the two variables in the population

108
Q

What is the alternative hypothesis for testing the relationship between two numerical variables?

A

There is a relationship between the two variables in the population.