L5 - Descriptive and Inferential Stats Flashcards

1
Q

List the type of variables and an example of each

A
  1. Nominal/ Categoriacal
    - Race, Gender
  2. Ordinal
    - Pain rating
  3. Continuous
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describe how you would present numerical data for the following variables:

  1. Highest Education Qualification
  2. Pain Score
  3. Mean Plasma Concentration of Ciprofloxacin
A
  1. n (%)
  2. n (%), or % or Median (IQR) where appropriate
  3. Mean (SD) or Median (IQR)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Common graphical displays that describe the data for the different types of variables

A
  1. Nominal/ Categorical: N.A.
  2. Ordinal: pie char, bar chart
  3. Continuous: Box plot, Histogram
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Inferential statistics. What is its main assumption?

A

Stats methods to draw conclusions from sample and imake inferences to an entire ppn

Assumption: Sample represents a RANDOM SAMPLE from underlying (unobserved) ppn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the two main approaches to inferential stats

A
  1. Param Estimation: estimate ppn params from sample stats (Point estimate and interval estimate)
  2. Hyp testing:to validate supposition based on limited evi, inferred using sample from ppn (Null and Alternative Hyp)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Sampling Distribution of sample means?

A

Under Param Estimation:

  • Repeated random samples taken
  • Means computed for each sample
  • Then means of all samples used as data to get point estimate and CI
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe the properties of sampling distribution of the means

A
  1. Mean ≈ ppn mean
  2. SD of sample means ≈ Population SD/rt(n), is also SEM (Std error of mean), which is used to get CI

(note: SD only look at one sample, while SEM look at SD of all the samples of the mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is SEM?

A

Estimate precision/reliability of sample as it relates to ppn from which sample was drawn.

i.e. Tells as where TRUE ppn mean may lie, via the CI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Central Limit Theorem?

A

Large sample sizes: Mean approx normally distributed, even if each sample amy not be noramlly distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe point estimate and interval estimate (CI)

A
  • Point estimate: single number to estimate param of interest
  • CI: Range of reasonable values intended to contain param of interest. Usually 95% confidence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the width of CI influenced by?

A
  1. Confidence Level (Higher = increased width)
  2. Increased n = Decreased width
  3. Increased SD = Increased Width
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe the Null and Alternative Hyp

A

H0: No difference/ relationship/ effect
H1: opposite of H0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define p-value

A

Probability that observed result occur by chance, assuming H0 true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define the following:

  1. a (alpha)
  2. B (beta)
  3. Type I error
  4. Type II error
  5. Stats Power
A
  1. a: Significance level of stats test, or chance to commit type I error
  2. B: Probability of failing to reject when there is a difference
  3. Type I error = a
  4. Type II error = B
  5. Stats power: Probability of correctly rejecting false H0 when there is effect = 1-B
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Does statsig = clisig?

A

NO

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Considerations for deciding on appropriate stats test when comparing data between / among groups

A
  1. Number of groups being compared
  2. Whether groups are independent or pairedrelated
  3. Type of variable the data is
    - For continuous: Whether data is normally distributed
  4. Whether assumptions for specific stats test are met
17
Q

Describe the two statistical tests for normality and state when to use each of them.

State their Hypotheses as well

A
  1. Shapiro-Wilk test for n<50
  2. Kolmogorov-Smirnov test for n≥50

H0: Distribution is normal
H1: Distribution NOT normal

p < 0.05: Sample data is NOT normal

18
Q

Considerations for deciding on appropriate stats test when examining and quantifying degree of LINEAR relationship between two NUMERICAL variables

A
  1. Whether one or both are continuous or ordinal
    - For continuous: check for normality
  2. Whether assumptions are met
19
Q

Considerations for deciding on appropriate stats test when ESTIMATING effect of an INDEPENDENT VARIABLE (i.e. predictor variable) (x) on DEPENDENT VARIABLE (i.e. Outcome variable) (y)

A
  1. Whether DEPENDENT variable is continuous or ordinal or nominal
  2. Whether assumptions are met