Biostatistics Flashcards

Question

List the four scales of measurement.

Answer 1

Nominal, Ordinal, Interval, and Ratio

Answer 2

Dichotomous data

Answer 3

Data that can be divided into qualitative categories or groups, but cannot be ordered.

Answer 4

Data that can be placed in a meaningful order, but lack information about the size of the interval.

Answer 5

Measurements that can be placed in a meaningful order, and have meaningful intervals between observations.

Answer 6

Measurements that can be placed in a meaningful order and have meaningful intervals between observations, but have no absolute zero.

Answer 7

data with meaningful order, meaningful intervals, and absolute zero.

Answer 8

Variables that can take only certain values and none in between

Answer 9

Variables that may take any value.

Answer 10

represents the percentage of observations that fall below a particular score.

Answer 11

divide distributions into a number of equal parts

Answer 12

Bimodal distribution

Answer 13

The median is more useful for highly skewed distributions.

Answer 14

The extent to which scores are clustered together or scattered about.

Answer 15

By subtracting the distribution's mean from the element.

Answer 16

By checking that the sum of the deviation scores for all the elements is 0

Answer 17

The mean of the squares of all the deviation scores in the distribution

Answer 18

Mean square

Answer 19

The square root of the variance (see attached image for formulas). Standard deviation measures how spread out the values in a dataset are around the mean (average): a small standard deviation means the data points lie close to the mean, whereas a large standard deviation means they are more widely spread.
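
The deviation score, variance, and standard deviation answers above can be tied together in a short sketch. The function names and the `scores` data are illustrative assumptions, not part of the cards. The variance divides by the number of elements, following the "mean of the squares" wording; a sample estimate would usually divide by n - 1 instead.

```python
import math

def deviation_scores(values):
    """Subtract the distribution's mean from each element."""
    mean = sum(values) / len(values)
    return [x - mean for x in values]

def population_variance(values):
    """Mean of the squared deviation scores (the 'mean square')."""
    devs = deviation_scores(values)
    return sum(d * d for d in devs) / len(devs)

def population_sd(values):
    """Square root of the variance."""
    return math.sqrt(population_variance(values))

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data; mean is 5

print(sum(deviation_scores(scores)))  # 0.0, the check described in the cards
print(population_variance(scores))    # 4.0
print(population_sd(scores))          # 2.0
```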

Answer 20

σ in a population; s in a sample, or SD.

Answer 21

The proportion of elements in the normal distribution is constant for a given number of standard deviations above or below the mean of the distribution.

Answer 22

68%, 95%, and 99% respectively.

Answer 23

68%, 95%, and 99.7% respectively.
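
The percentages quoted in the two answers above can be checked against the standard normal distribution. This is a minimal sketch using Python's `statistics.NormalDist`; the helper name `proportion_within` and the choice of cutoffs (1, 2, and 3 standard deviations, plus 1.96 and 2.58) are assumptions made for illustration.

```python
from statistics import NormalDist

def proportion_within(k):
    """Proportion of a normal distribution lying within k SDs of the mean."""
    std_normal = NormalDist()  # mean 0, SD 1
    return std_normal.cdf(k) - std_normal.cdf(-k)

for k in (1, 1.96, 2, 2.58, 3):
    print(k, round(proportion_within(k) * 100, 1))
```

Under these assumptions, 1, 2, and 3 standard deviations give roughly 68%, 95%, and 99.7%, while 1.96 and 2.58 standard deviations give roughly 95% and 99%.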

Answer 24

The location of any element in a normal distribution, expressed in terms of how many standard deviations it lies above or below the mean of the distribution.
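
As a small worked example of the definition above, a z score can be computed as follows. The function name and the numbers (a score of 130 in a distribution with mean 100 and SD 15) are hypothetical.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd

print(z_score(130, 100, 15))  # 2.0, two SDs above the mean
print(z_score(85, 100, 15))   # -1.0, one SD below the mean
```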

Answer 25

The population mean and standard deviation

Answer 26

Sample statistics

Answer 27

parameter (e.g. population mean or standard deviation)

Answer 28

Natural, expected random variation that will cause the sample statistic to differ from the population parameter.

Answer 29

1. The random sampling distribution of means always tends to be normal, irrespective of the shape of the population distribution. 2. The random sampling distribution of means becomes closer to normal as the sample size increases. 3. The mean of the random sampling distribution of means is equal to the mean of the original population.
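
The three statements above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the population is taken to be exponential with mean 1 and SD 1 (a deliberately skewed choice), and the sample size, number of samples, and seed are arbitrary.

```python
import math
import random
from statistics import mean, pstdev

random.seed(0)  # fixed seed so the run is repeatable

def sample_means(n, num_samples):
    """Means of repeated random samples drawn from a skewed population."""
    return [
        mean([random.expovariate(1.0) for _ in range(n)])
        for _ in range(num_samples)
    ]

means = sample_means(n=30, num_samples=2000)

# The mean of the sample means should sit near the population mean (1.0).
print(round(mean(means), 2))
# Their spread should sit near the population SD divided by sqrt(n).
print(round(pstdev(means), 2), round(1 / math.sqrt(30), 2))
```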

Answer 30

The population SD divided by the square root of the size of the samples.

Answer 31

1. Calculate the standard error. 2. Calculate the z score of the sample mean. 3. Find the proportion of the normal distribution that lies beyond that z score.
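
The three steps above can be sketched in code, using the standard error defined earlier (population SD divided by the square root of the sample size). The function names and the example numbers are assumptions for illustration; "beyond" is read here as the single tail further from the mean than the observed z score.

```python
import math
from statistics import NormalDist

def standard_error(pop_sd, n):
    """Step 1: population SD divided by the square root of the sample size."""
    return pop_sd / math.sqrt(n)

def proportion_beyond(sample_mean, pop_mean, pop_sd, n):
    """Steps 2 and 3: z score of the sample mean, then the tail area beyond it."""
    z = (sample_mean - pop_mean) / standard_error(pop_sd, n)
    return 1 - NormalDist().cdf(abs(z))

# Hypothetical example: population mean 100, SD 15, sample of 25 with mean 106.
print(standard_error(15, 25))                           # 3.0
print(round(proportion_beyond(106, 100, 15, 25), 4))    # 0.0228 (z = 2)
```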

Answer 32

The sample mean ± the z score obtained from the table multiplied by the standard error.

Answer 33

the sample mean plus or minus two standard errors

Answer 34

must be increased 4 fold
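
The confidence interval answers above can be checked numerically. This sketch assumes a known SD and uses `statistics.NormalDist.inv_cdf` in place of a printed z table; the function name and the numbers are illustrative. It also shows that a fourfold increase in sample size halves the width of the interval, because the standard error depends on the square root of the sample size.

```python
import math
from statistics import NormalDist

def confidence_interval(sample_mean, sd, n, level=0.95):
    """Sample mean plus or minus z times the standard error."""
    z = NormalDist().inv_cdf(0.5 + level / 2)  # about 1.96 for a 95% interval
    se = sd / math.sqrt(n)
    return sample_mean - z * se, sample_mean + z * se

low, high = confidence_interval(100, 15, 25)     # hypothetical sample, n = 25
low4, high4 = confidence_interval(100, 15, 100)  # same data, n increased fourfold

print(round(low, 2), round(high, 2))
print(round((high - low) / (high4 - low4), 2))   # ratio of the two widths
```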

Answer 35

The degree to which a figure is immune from random variation; in other words, how consistent or repeatable your measurements are, regardless of whether they’re right (think: hitting the same spot every time, even if it’s not the bullseye).

Answer 36

The degree to which an estimate is immune from systematic error or bias; in other words, how close your measurements are to the true or actual value (think: hitting the bullseye).

Answer 37

A Z-score is used when the population mean (μ) and population standard deviation (σ) are known; it is usually used with a large sample (n > 30) or with full population parameters, and it refers to the standard normal distribution. A T-score is used when the population standard deviation is not known and must be estimated from the sample; it is commonly used with small samples (n < 30) and refers to the T-distribution, which looks like a normal distribution but has fatter tails to account for the extra uncertainty. Summary: use a Z-score when the population SD (σ) is known; use a T-score when the sample SD (s) is used to estimate the population SD.

Answer 38

The values of z and t are similar when the sample size is large; t and z scores become increasingly different as the sample size gets smaller.

Answer 39

indirectly as degrees of freedom [df]

Answer 40

1. State the null and alternative hypotheses. 2. Select the decision criterion α (level of significance). 3. Establish the critical values. 4. Draw a random sample from the population, and calculate the mean of that sample. 5. Calculate the standard deviation and estimated standard error of the sample. 6. Calculate the value of the test statistic t that corresponds to the mean of the sample. 7. Compare the calculated value of t with the critical values of t, and then accept or reject the null hypothesis.
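
Steps 4 to 7 above can be sketched for a one-sample, two-tailed t test. The sample data, the hypothesized mean of 100, and the function names are assumptions for illustration. The critical value 2.262 is the standard t-table entry for 9 degrees of freedom at α = .05 (two-tailed); it is supplied by hand because the Python standard library has no t distribution.

```python
import math
from statistics import mean, stdev

def one_sample_t(sample, hypothesized_mean):
    """Return the t statistic and degrees of freedom for a one-sample test."""
    n = len(sample)
    est_se = stdev(sample) / math.sqrt(n)  # stdev divides by n - 1
    t = (mean(sample) - hypothesized_mean) / est_se
    return t, n - 1

def reject_null(t, critical_t):
    """Two-tailed decision: reject H0 when |t| reaches the critical value."""
    return abs(t) >= critical_t

sample = [104, 98, 110, 107, 101, 99, 112, 105, 103, 108]  # hypothetical data
t, df = one_sample_t(sample, hypothesized_mean=100)

print(df)                     # 9
print(round(t, 2))
print(reject_null(t, 2.262))  # True for this sample
```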

Answer 41

When the probability that the sample mean could have come from the hypothesized population is less than or equal to .05 (p ≤ .05).

Answer 42

the null hypothesis will be accepted as correct.

Answer 43

inside the range within which 95% of random sample means would be expected to fall

Answer 44

the result was unlikely to have occurred by chance.

Answer 45

H0 is true but rejected; a false positive conclusion (an effect is declared when none exists).

Answer 46

H0 is false but accepted; a false negative conclusion (a real effect is missed).

Answer 47

using a more stringent (lower) level of α

Answer 48

The ability of a test to reject H0 when it is false; power = 1 − β.
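
The relation power = 1 − β can be made concrete for one simple case. This is a sketch under stated assumptions: a one-tailed z test with a known population SD, where the true mean lies above the null mean. The function name and all numbers are hypothetical.

```python
import math
from statistics import NormalDist

def power_one_tailed_z(true_mean, null_mean, pop_sd, n, alpha=0.05):
    """Power of an upper-tailed z test when the true mean is true_mean."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)  # cutoff for rejecting H0
    shift = (true_mean - null_mean) / (pop_sd / math.sqrt(n))
    beta = std_normal.cdf(z_crit - shift)   # probability of missing the effect
    return 1 - beta

p25 = power_one_tailed_z(105, 100, 15, n=25)
p100 = power_one_tailed_z(105, 100, 15, n=100)
p25_strict = power_one_tailed_z(105, 100, 15, n=25, alpha=0.01)

print(round(p25, 2), round(p100, 2), round(p25_strict, 2))
```

In this setup, power rises when the sample size grows and falls when α is made more stringent.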

Answer 49

increasing the sample size

Answer 50

one tailed tests

Answer 51

when more than two means are being compared.

Answer 52

1. the variability resulting from the known differences between the groups. 2. the ordinary random variability within each group, expected in any set of data, caused by sampling error, individual differences between the patients and so on.

Answer 53

nominal data; chi square is a test of proportions.

Answer 54

Correlation and Regression

Answer 55

to quantify the strength and direction of the relationship between two variables.

Answer 56

to express the functional relationship between two variables, so that the value of one variable can be predicted from the knowledge of the other

Answer 57

-1 (perfect negative correlation) to +1 (perfect positive correlation). Positive correlation: high values of one variable are associated with high values of the other variable. Negative correlation: high values of one variable are associated with low values of the other variable.

Answer 58

1. Pearson product-moment correlation (r) for interval or ratio scale data. 2. Spearman rank-order correlation (ρ) for ordinal scale data

Answer 59

interval and ratio scale data

Answer 60

ordinal scale data

Answer 61

False; both techniques show only a linear association.

Answer 62

Close to 0, even if a strong non-linear relationship exists between the two variables, because Pearson’s r measures only linear relationships; it does not capture curves or more complex patterns. In other words, r underestimates the true strength of the relationship. Note: for non-linear relationships, methods such as Spearman’s rank correlation (for monotonic relationships) or non-linear regression can be used.
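
A minimal sketch of this effect: the hand-written `pearson_r` below (an illustrative helper, not a library function) returns 0 for a perfect quadratic relationship on symmetric data, and 1 for a perfect linear one. The data are made up for the example.

```python
import math

def pearson_r(xs, ys):
    """Pearson product-moment correlation computed from deviation scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

xs = list(range(-5, 6))
curved = [x * x for x in xs]       # y depends entirely on x, but not linearly
linear = [2 * x + 1 for x in xs]   # perfect linear relationship

print(pearson_r(xs, curved))            # 0.0
print(round(pearson_r(xs, linear), 6))  # 1.0
```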

(93 cards)