L5: Descriptive & Inferential Statistics Flashcards

1
Q

What are the two branches of statistics?

A

Descriptive statistics:
Methods for organizing and summarizing a set of data that help to describe the attributes of a group or population.

Inferential statistics:
Statistical methods used to draw conclusions from a sample and make inferences to the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three types of variables?

A
  • Nominal e.g. gender, race etc.
  • Ordinal e.g. cancer stages, likert scale data
  • Continuous e.g. height and weight
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to describe nominal data numerically and graphically?

A

Numerically: n (%)
Graphically: pie chart, bar chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to describe ordinal data numerically and graphically?

A

For most ordinal data eg cancer stages:

  • numerically as n (%)
  • graphically as pie chart, bar chart

For likert scale data:

  • numerically as n (%), graphically as pie chart, bar chart OR
  • numerically as median (IQR), graphically as box plot
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to describe continuous data numerically and graphically?

A

Graphically: histogram, box plot
Numerically:
- if normally distributed, mean and SD
- if non-normally distributed, median (IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the two approaches to statistical inference?

A
  • parameter estimation: seeks an approximate calculation of a population parameter
  • hypothesis testing: seeks to validate a supposition based on limited evidence, inferred using a sample from the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a point estimate and interval estimate (confidence interval)?

A
  • point estimate involves using the sample data to calculate a single number to estimate the parameter of interest
  • interval estimate (confidence interval) provides a range of reasonable values intended to contain the parameter of interest with 95% confidence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why does confidence interval provide us more information than p-value?

A
  • provides info on precision of point estimate (narrower CI-> more precise point estimate; wider CI -> less precise point estimate)
  • also tells us statistical significance (when difference does not contain 0 and when ratio does not contain 1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the width of the confidence interval affected by?

A
  • higher the confidence level, wider the CI
  • larger the sample size, narrower the CI
  • higher the standard deviation, wider the CI
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is p-value?

A

Probability that the observed result or a more extreme result would occur by chance alone, assuming that H0 is true.
The smaller the p-value, the stronger the evidence against H0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a type I error?

A
False positive (reject H0 when the truth is no effect) 
Probability of type 1 error = significance level = alpha = 0.05
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a type 2 error?

A
False negative (fail to reject H0 when truth is there is an effect) 
Probability of type 2 error = beta = typically 0.2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is statistical power?

A

Probability of correctly rejecting a false H0 when the truth is an effect exists.
Statistical power = 1- beta = 0.8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between statistical and clinical significance?

A
  • statistical significance is heavily dependent on sample size. with large sample size, even small effect can appear statistically significant. however, with small sample size, even large effect which are clinically consequential can appear non statistically significant
  • hence, DO NOT just simply look at whether statistically significant. look at point estimate and confidence interval to interpret if clinically significant or not.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What normality test are used for which sample size?

A
  • Shapiro wilk for n <50

- Kolmogorov-smirnov test for n more than equal to 50

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to decide on appropriate statistical test for hypothesis testing?

A

When comparing data between/among groups:

  • number of groups
  • whether paired/independent
  • whether data is continuous/ordinal/nominal -> if continuous, whether normally distributed
  • assumptions underlying the specific stat test