Inference Flashcards

1
Q

Types of Categorical data

A
  1. nominal = categories with no order
  2. ordinal = categories with order
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

types of numerical data

A
  1. discrete = whole numbers, counts
  2. continuous = recorded measures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Summary statistics for normal data

A

mean and s.d

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

summary statistics for non-normal data

A

median and interquartile range (Q3-Q1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Correlation definition

A

a measure of the degree of linear association between two numerical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

when is correlation not appropriate?

A

for method comparison studies; testing equivalence of two methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

p-value definition

A

the probability of getting the observed data if the null hypothesis were true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

comparing means of 2 normal samples

A

if numerical, use a t-test/2 sample t test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

coefficient of determination (R squared)

A

a measure of the amount of variability in the data which is explained by the regression line; the variability in y explained by x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Testing non-parametric data

A

either
a. transform the data so that it is normal
b. use a Mann-Whitney test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Testing qualitative data

A

> Testing proportions
With decent sample size, use Chi-squared
Give confidence interval for difference/ratio of 2 proportions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Testing Quantitative/dependent data

A

> if differences between 2 samples are normal then use a paired t-test and CI
if not normal use Wilcoxon signed rank test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Type 1 error

A

> false positive
reject the null hypothesis when it is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Type 2 error

A

> False negative result
accept the null hypothesis when it is false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Confidence interval definition

A

The range of values within which you can be 95% certain that the population mean value lies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what happens to CI’s as sample size increases?

A

Get smaller

16
Q

Difference between confidence and prediction intervals

A

> Confidence interval represents the mean of the possible values = a population estimate
Prediction interval = represents individual observations; encompasses the full range of possible values in the data
Prediction interval will be wider than confidence as individual observations have greater variability than the mean

17
Q

How to assess normality

A

> plot histogram to assess skew
Use Shapiro-Wilk test

18
Q

T-test assumptions

A

> Sample was randomly selected
Data are independent (if 2 sample)
Data are normally distributed
Similar variance between groups

19
Q

what do linear predictor coefficients represent?

A

how much y changes for a 1 unit increase in x

20
Q

Prediction interval definition

A

Range within which a single new predicted value of y will fall, with 95% confidence

21
Q

power definition

A

power = 1 - the probability of making a type 2 error (beta)
>the probability of correctly rejecting H0

22
Q

when to use a mann-whitney test

A

> 2 independent groups of a quantitative variable
one variable not normally distributed

23
Q

assumptions of a 2 sample t test

A

> normality
independence
homogeneity of variance