Descriptive Stats Flashcards

1
Q

Descriptive stats vs. inferential stats

A

D: describe, organize, or summarize data
I: generalize from sample of data to larger groups of subjects using inductive reasoning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population vs. sample

A

Pop: largest collection of entities about which an investigator wishes to draw conclusions
Sample: subset of population actually being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Probability sample

A

Investigator can specify chance of subject being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

4 types of probability samples

A

simple random
stratified random
cluster
systematic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

simple random sample

A

all members of pop have equal chance to be selected; “representative” if resembles source population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

stratified random sample

A

population divided into groups with shared characteristics, random samples from each group; may be more representative of population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

cluster samples

A

ex: randomly select 5 schools then randomly select equal # students from each
used when too expensive/ labor intensive to use other methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

systematic samples

A

systematic selection of subjects, e.g. every 5th pt admitted to hospital
may be = simple random without randomization but may be prone to selection bias if systematic error involved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

stratified vs. cluster sampling

A

strata are homogenous, then members of strata are randomly selected
clusters are heterogenous “natural groupings” and are selected at random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

probability addition rule

A

prob (Ex or Ey) = prob(Ex) + prob(Ey)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

probability multiplication rule

A

prob (Ex and Ey) = prob(Ex)*prob(Ey)

*assuming events are independent of each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

probability binomial distribution

A

probability that specific combos of 2 mutually exclusive independent events will occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

4 types of variables

A

nominal
ordinal
interval
ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

nominal variables

A

“categorical”
names or labels with no inherent order, e.g. race and gender
*includes dichotomous/ binomial data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ordinal variables

A

“ranked”

natural order exists but not evenly spaced, e.g. cancer grade, pain score, scale from best to worst

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

interval variables

A

space between adjacent scale values are equal, but no absolute 0, e.g. temperature in C; ratios cannot be computed (100* C =/= (2 x 50* C)

17
Q

ratio variables

A

true zero is absence of variables, and ratios can be calculated; e.g. annual income, BP, Kelvin temperature

18
Q

discrete variable

A

takes only certain limited values, no in-between; e.g. race, gender, # of anything

19
Q

continuous variables

A

may take any value though may have limited range; e.g. weight, age, BP

20
Q

dependent variable

A

outcome of interest in study that is expected to change based on intervention

21
Q

independent variable

A

intervention, exposure, or factor that may influence dependent variable

22
Q

3 ways to summarize data

A

frequency distribution
measures of central tendency
measures of dispersion

23
Q

frequency distribution

A

data points within certain categories = frequency -> percent/ relative frequency vs. all data points

24
Q

measures of central tendency

A

mean, median, mode

25
Q

measures of dispersion/ spread/ variation

A

range (max and min), interquartile range, variance, SD, variation coefficient

26
Q

variance equation

A

s^2 = (sum[{xi-xbar}^2]) / (n-1)

xi - xbar = deviation score

27
Q

coefficient of variation equation

A

SD / mean

28
Q

percent of population within 1, 2, and 3 SDs of mean

A

1: ~68%
2: ~95%
3: ~99.7%

29
Q

left/negative skew and right/positive skew

A

normal curve: mode = mean = median

left: mean

30
Q

what is z-score

A

indicates how many SDs an element is from the mean in a normal distribution

31
Q

z-score equation

A
z = (X - u)/ sigma
X = sample statistic/ estimate
u = mean
sigma = SD
32
Q

z-score and directionality

A

in a two-sided z-score table, positive value is for values to left of mean
for right side, use 1 - z-score