Lecture 1 - Mean, Mode, etc. Flashcards

1
Q

What is continuous data?

A

Data that can be divided up an infinite number of times. e.g. age decades -> years -> months -> days, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is discrete data?

A

Data that cannot be divided up. aka integer data. e.g. # of fractures someone in a car accident has

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is nominal data?

A

Data given arbitrary numeric labels. Used merely for identification. e.g. race. white = 0, black = 1, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is ordinal data?

A

Ordered rankings with numeric labels. Numbers have meaning in relation to one another, but no intrinsic value. e.g. APGAR score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is categorical data?

A

A type of ordinal data where higher numbers are in fact better, but the actual difference between is not known. e.g. pain scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is mean denoted in equations?

A

” X bar” (an X with a line on top of it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a major weakness of the mean?

A

It is strongly influenced by outlying values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are strengths of the median?

A

Impervious to outlying values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a disadvantage of the median?

A

It only uses a small portion of the data set and does not give you an idea of the full range of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is variance?

A

A measure of variability around the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When should you use the mean vs. mode?

A

Generally use the mean when variance is low and the mode when variance is high. If the values are well spread out, report both.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the standard deviation, numerically?

A

The square root of variance (converts variance back to a meaningful number for our actual data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is large s?

A

Large spread. No hard and fast rule. Generally when SD is 50% (or more) as big as the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a small s?

A

Anything <25% = small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the coefficient of variation (CV)?

A

Used to describe SD in relation the mean as a percentage. CV = (SD/mean) x100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What direction is this data skewed?

A

Right, because the outlier is on the right.

17
Q

What are the significant values portrayed in this box plot?

A

Bottom of box = 25th percentile

Middle of box = 50th percentile

Top of box = 75th percentile

Whiskers typically 1.5 IQR or min/max

18
Q

What is sample probabilities?

A

Using sets of data and calculate probabilities on them that we hope are unbiased and can be extrapolated to a larger population of interest.

19
Q

What is the probability that any 2 events will occur?

A

P (event 1 OR event 2) =

P (event 1) + P (event 2) - P (both events occur)

If both events CANNOT occur (e.g. both being male and female), do not subtract P

20
Q

What does this phrase mean:
P (reduced FEV1| elevated IL-8) = 11/22 = 0.50?

A

What is the probability of reduced FEV1 GIVEN the people who have elevated IL-8 (usually limits the total sample size to a subset of the population) Also can be phrased “conditioned on”

21
Q

When are two probabilities independent?

A

When the probability of something happening alone is the same as under a certain condition. e.g. probability of seeing a blue car after having seen a red car is just the probablity of seeing a blue car.

22
Q

If two probabilities are independent, what is the probability of getting one twice in a row?

A

Multiply each probability.

P (event1 AND event2)=P (event1) x P (event2)

oFor a simple example, the probability of getting two heads in two tosses of a fair coin is just P (heads) x P (heads) = 0.50 x 0.50 = 0.25