Introduction to Biostats (Lecture 2) Flashcards

1
Q

What is nominal data?

A
  • data can be placed into specific categories when order does not matter
  • only a few values exist
  • mutually exclusive
  • “yes or no” - event either occurred or it did not occur
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the difference between nominal and ordinal data?

A

ordinal data - has narrowly defined categories with few possible values in which the ranking-order matters
nominal data - data placed in specific categories when order does not matter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What kind of data would “male vs. female” be?

A

nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name type of data for: patient died or did not die

A

nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Type of data used for: RASS Scale in ICU Sedation

A

Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the two types of discrete/non-parametric data?

A
  1. nominal
  2. ordinal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Nominal and ordinal data are types of ________ data, which is a type of ____ data

A

discrete/non-parametric data; qualitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Characteristics of discrete/non-parametric data:

A
  1. “categorical” variables
  2. usually have a few possible values
  3. can be counted but limited mathematical manipulation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

scales, counts, and dichotomous outcomes (I.e., mortality) are examples of _______

A

discrete (non-parametric) data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two types of continuous/parametric data?

A
  1. interval
  2. ratio
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Characteristics of continuous/parametric data:

A
  1. May have an infinite number of values
  2. exist on some defined scale
  3. More mathematical manipulation possible
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Weight, temperature, and age are examples of _____

A

continuous/parametric data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

which type of data lacks a defined and meaningful zero point?

A

interval data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between interval and ratio data?

A

interval data lacks a defined and meaningful zero point, whereas ratio data has a meaningful minimum or zero on the scale
- also can have a negative value for interval data (I.e., temperature), but cannot have a negative value in ratio data (I.e. weight, drug concentration, heart rate, blood pressure)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is absolute zero?

A

non-existent measure at zero (can have in ratio data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Type of data for: Celcius or Fahrenheit temperature (degrees)

A

interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

data where measurements are on a defined scale and scale of ranked differences is meaningful

A

interval and ratio data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

type of data that can be compared

A

interval and ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

weight, drug concentration, heart rate, and blood pressure are examples of what type of data?

A

ratio data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

List and describe the 3 measures of central tendency:

A
  1. mean - the average of all values
  2. median - value in the middle of numerical order
  3. mode - value that appears most often
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Which measure of central tendency is best when describing interval and ratio data?

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Which central tendency is best when outliers are present?

A

median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Which central tendency is best for non-numerical qualities?

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are the 4 measures of distribution/spread?

A
  1. range
  2. interquartile range
  3. variance
  4. standard deviation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the purpose of measures of spread/distribution?

A

they are used to describe how data are spread to provide information about the variability in the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

the difference between the lowest and highest value in the data is _________

A

range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is interquartile range?

A

the difference between the 25th and 75th percentile (median is the 50th percentile)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is variance?

A

How far the values of a variable lie from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is preferred over variance?

A

standard of deviation

30
Q

What is the standard of deviation?

A

spread of observation distribution using the same units as the original data
provides insight to the dispersion of data points around the mean

31
Q

Larger standard of deviation indicates ____ in data

A

greater variability

32
Q

____ indicates the data is not evenly distributed around the mean

A

skewness

33
Q

What does “skewness” appear as on a graph?

A

data are more concentrated to either the right or left of the mean value (Left skewed/right skewed) and there is a longer tail on one side

34
Q

What does normal distribution appear as when plotted?

A

bell curve - distribution is symmetrical around the mean; mean, median and mode are equal

35
Q

What kinds of data does normal distribution apply to?

A

continuous - interval or ratio data

36
Q

What are the two specific distributions of a bell curve?

A
  1. z-distribution
  2. t-distribution
37
Q

What are the 6 ways that results are displayed graphically?

A
  1. Boxplot (Box and whisker plot)
  2. Stem and leaf plots
  3. bar charts/histograms
  4. pie chart
  5. scatterplot
  6. frequency table
38
Q

A boxplot represents a set of _____ data

A

continuous

39
Q

What is an independent variable?

A

what is being manipulated (I.e., new blood pressure medication)
- not influenced by other variables

40
Q

What is the dependent variable?

A

what is being measured (I.e., blood pressure reduction)
- outcome of interest (typically has multiple)
- value predicted by independent variables

41
Q

What are control variables?

A

what is held constant (I.e., age sex, ethnicity, etc.)

42
Q

What are confounding variables?

A

impacts the dependent variable but is not controlled by the researcher (I.e., genetic predepositions)

43
Q

What is “the range to provide the precision of an estimate with an acceptable level of error”?

A

interval estimate/confidence interval

44
Q

range of values that is believed to encompass the actual population value

A

confidence interval

45
Q

For a ____ between two values, if the % confidence interval does not cross 0, then the result is _______

A

difference; statistically significant

46
Q

For a ___ between two values, if the % confidence interval does not cross 1, then the result is _______

A

ratio; statistically significant

47
Q

What is the difference between a null and alternative hypothesis?

A

null hypothesis is what you are trying to disprove (the assertion that no difference exists between groups)
alternative hypothesis is the desired result/what you want to see (proving that a difference between groups does exist)

48
Q

What is tested in hypothesis testing?

A

only the null hypothesis

49
Q

In hypothesis testing, conclusions are made with respect to the _____.

A

null hypothesis

50
Q

What does it mean to “reject a null hypothesis”?

A

a difference was found

51
Q

What does it mean to “fail to reject” the null hypothesis?

A

found no difference

52
Q

What are the steps to hypothesis testing?

A
  1. formation of alternative hypothesis (Ha) from clinical question
  2. then null hypothesis (H0) is formulated
  3. hypothesis testing of only the null hypothesis
  4. conclusion are made with respect to null hypothesis
53
Q

T/F It is only possible to reject or fail to reject the null hypothesis

A

T

54
Q

What are the errors that hypothesis testing is prone to making?

A
  1. type I error - alpha - 5% error rate - rejecting null when the null is true (saying there IS a difference when there is NOT a difference)
  2. type II error - beta - 20% error rate - fail to reject null when null is false (saying there is NO difference when a difference does exist)
55
Q

Probability of committing one of the errors ____when repeating the analysis

A

increases

56
Q

What is a “power”?

A

the ability to find a difference if one exists

57
Q

power = ___ + ____

A

1 - B

58
Q

What is the power if type II error rate is 20%?

A

power of 80% (% chance that a difference will be found if it exists)

59
Q

The higher the power, the ____ the probability of type II error

A

lower

60
Q

what is p-value?

A

the probability of committing a type 1 error based on the assumption that the null hypothesis is true (no difference found)

61
Q

what does p=0 indicate?

A

no likelihood that a difference seen is due to chance

62
Q

what does p=1 indicate?

A

observed difference is definitely due to chance

63
Q

The smaller the p-value, the ____ the evidence for rejecting the null hypothesis

A

stronger

64
Q

What does p-value not tell us?

A
  1. does not tell us the probability of the null hypothesis being true
  2. it does not indicate the “size of effect”
  3. does not tell us whether something is practically or clinically significant
65
Q

What does statistical significance refer to?

A

the results of an analysis (Is the difference due to random variation?)

66
Q

When can you conclude that the results are statistically significant?

A
  • when there is enough evidence to reject the null hypothesis
  • p-value < a (reject null and accept alternative) - this states that the probability the observed difference is due to chance is < a
67
Q

T/F results either are or are not statistically significant, there is no in between or gray area

A

T

68
Q

What is clinical significance?

A

the practical significance of findings (will the results cause a change in practice?)

69
Q

NNT

A

number needed to treat (involved with risk/benefit measurement for clinical significance)

70
Q

NNH

A

number needed to harm (involved with risk/benefit measurement for clinical significance)