Biostatistics Flashcards

1
Q

Standard Deviation [SD; s]

A

Shows variability of observations in a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is standard error [SE] of the mean?

A

The standard deviation of the random sampling distribution of means
SE or SEM; in other words SE shows variability of sample means around the true population mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe 95% Confidence Interval [CI] of the mean.

A

Is the range of values for the population mean in which you are 95% sure the true population mean falls within the CI range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe 95% Confidence Interval [CI] of the mean.

A

Is the range of values for the population mean in which you are 95% sure that the true population mean falls within the CI range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

List the measures of central location.

A

Mean [average], Median [middle value], and Mode [most common value]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

List the measures of central variation.

A

Range, Percentiles, Standard Deviation, and Standard Error of the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a normal range in frequency distribution?

A

Mean +/- 2 SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is a Independent (non paired) t- test used?

A

An independent t-test is used when two groups of subjects are sampled on one occasion; e.g. testing the mean difference in body weights of subjects in group A and group B at time 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When is a dependent (paired) t- test used?

A

A dependent/paired t-test is used when a group variable of interest is measured on two occasions; e.g. testing the mean difference in body weights of subjects in group A at time 1 and time 2.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an element in statistics?

A

A single observation; denoted by X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The total number of elements in a population are denoted by?

A

N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The total number of elements in a sample are denoted by?

A

n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are probability samples?

A

Samples in which the researcher can specify the probability of any one element in the population being included.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Probability samples permit the use of what type of statistics?

A

Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Non-probability samples allow what type of statistics to be used?

A

Descriptive statistics only

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the four basic kinds of probability samples?

A
  1. simple random samples
  2. stratified random samples
  3. cluster samples
  4. systematic samples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a simple random sample?

A

A sample in which every individual n the population has an equal probability of being included in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

When are clustered samples used?

A

When drawing a simple random sample or stratified random sample is too expensive or laborious.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How is a cluster sample drawn?

A

The investigator starts by selecting a random set of clusters, and then selects a subset of sample from each cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

The probability of an event is denoted by what?

A

p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How are probabilities expressed?

A

Decimal fractions between 0-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the probability of an event “not occurring”?

A

q = 1-p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the addition rule of probability?

A

The probability of any one of several particular events OCCURRING TOGETHER is equal to the sum of their individual probabilities, provided the events are mutually exclusive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a typical medical use of the binomial distribution?

A

Genetic counseling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

List the four scales of measurement.

A

Nominal, Ordinal, Interval, and Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are nominal data that fall into only two groups called?

A

Dichotomous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Describe nominal scale data.

A

data can be divided into qualitative categories or groups, but cannot be ordered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Describe ordinal scale data.

A

data that can be placed in a meaningful order, but lacks information about the size of the interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What are interval scale data?

A

Measurements that can be placed in a meaningful order, and have meaningful interval between observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What are interval scale data?

A

Measurements that can be placed in a meaningful order, and have meaningful interval between observations but has no value of absolute zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What are ratio scale data?

A

data with meaningful order, meaningful intervals, and absolute zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What are discrete variables?

A

Variables that can take only certain values and none in between

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What are continuous variables?

A

Variables that may take any value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is a centile rank?

A

represents the percentage of observations that fall below a particular score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What are quantiles?

A

divide distributions into a number of equal parts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is a distribution called when 2 scores both occur with the greatest frequency?

A

Bimodal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

When is a median better measure than mean?

A

The median is more useful for highly skewed distributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

The median in a population is denoted by?

A

μ

39
Q

The median in a sample is denoted by?

A

X-bar
_
X

40
Q

The measure of central tendency that best resists the influence of fluctuation between different samples?

A

The mean

41
Q

Explain variability?

A

The extent to which scores are clustered together or scattered about.

42
Q

How do you find the deviation score of an element?

A

By subtracting the distribution’s mean from the element.

43
Q

How can you verify the results of calculating the deviation scores for all the elements in a distribution?

A

By checking that the sum of the deviation scores for all the elements is 0

44
Q

What is the variance of a distribution?

A

The mean of the squares of all the deviation scores in the distribution

45
Q

Variance is sometimes known as what?

A

Mean square

46
Q

What is standard deviation?

A

The square root of the variance.

47
Q

What are the symbols for standard deviation?

A

σ in a population
s in a sample
or SD

48
Q

Why is the standard deviation particularly useful in normal distributions?

A

The proportion of elements in the normal distribution is constant for a given number of standard deviations above or below the mean of the distribution.

49
Q

How much of the population falls within +/- 1, +/- 2, and +/-3 SD of the mean?

A

68%, 95%, and 99% respectively.

50
Q

How much of the population falls within +/- 1, +/- 2, and +/-3 SD of the mean?

A

68%, 95%, and 99.7% respectively.

51
Q

What is the Z score of an element?

A

The location of any element in a normal distribution, expressed in terms of how many standard deviations it lies above or below the mean of the distribution.

52
Q

List population parameters.

A

The population mean and standard deviation

53
Q

What are the sample mean and standard deviation called?

A

Sample statistics

54
Q

Inferential statistics involves what?

A

Using a statistic to estimate a parameter

55
Q

What is sampling error?

A

Natural, expected random variation that will cause the sample statistic to differ from the population parameter.

56
Q

Describe central limit theorem?

A
  1. Random sampling distribution of means always tends to be normal, irrespective of its population distribution.
  2. The random sampling distribution of means will become closer to normal as sample size increases.
  3. the mean of the random sampling distribution of means is equal to the mean of the original population.
57
Q

How is the standard error calculated?

A

The population SD divided by the square root of the size of the samples.

58
Q

What are the steps for determining the probability that a random sample of n will have a mean above X?

A
  1. calculate the standard error
  2. calculate the z score of the sample mean
  3. find the proportion of the normal distribution that lies beyond that z score
59
Q

what are confidence limits equal to?

A

the sample mean +/- the z scores obtained from the table multiplied by the standard error

60
Q

95% confidence limits are approximately equal to what?

A

the sample mean plus or minus two standard errors

61
Q

what must be done to the sample size to halve the confidence interval?

A

must be increased 4 fold

62
Q

what is precision?

A

the degree to which a figure is immune from random variation

63
Q

the width of the confidence interval reflects what?

A

precision

64
Q

what is accuracy?

A

the degree to which an estimate is immune from systematic error or bias

65
Q

When is a t score used instead of the z score?

A

t score is used when inferences about means are based on estimates of population parameters rather than the population parameters themselves

66
Q

how similar are the values for z and t?

A

The values of z and t are similar when the sample size is large; t and z scores become increasingly different when the sample size is different.

67
Q

how is the sample size expressed in a table of t values?

A

indirectly as degrees of freedom [df]

68
Q

what are degrees of freedom equal to?

A

n-1

69
Q

what are the steps to test a hypothesis about a mean?

A
  1. state the null and alternative hypothesis
  2. select the decision criterion α (level of significance)
  3. establish the critical values
  4. draw a random sample from the population, and calculate the mean of that sample
  5. calculate the standard deviation and estimated standard error of the sample
  6. calculate the value of the test statistic t that corresponds to the mean of the sample
  7. compare the calculated value of t with the critical values of t, and then accept or reject the null hypothesis
70
Q

When can a null hypothesis be rejected?

A

When the probability that the sample mean could have come from the hypothesized population is less than or equal to .05. (p < .05).

71
Q

If the probability of obtaining the sample mean is greater than .05, what will be made of the null hypothesis?

A

the null hypothesis will be accepted as correct.

72
Q

what is the area of acceptance?

A

inside the range within which 95% of random sample means would be expected to fall

73
Q

what does it mean when a result is significant as p <0.05?

A

the result was unlikely to have occurred by chance.

74
Q

what is a type I /α error?

A

H0 is true but rejected; false negative conclusion about null hypothesis

75
Q

what is a type II/β error?

A

H0 is false but accepted; false positive conclusion about null hypothesis

76
Q

how is it possible to guard against a type I error?

A

using a more stringent (lower) level of α

77
Q

what is the power of a test?

A

The ability of a test to reject H0 when it is false;

power = 1-β

78
Q

conventionally, a study is required to have a power of what to be acceptable?

A

0.8

79
Q

what is the most practical and important way of increasing the power of a statistical test?

A

increasing the sample size

80
Q

which are more powerful, one tailed tests or two tailed tests?

A

one tailed tests

81
Q

When is ANOVA used?

A

when more than two means are being compared.

82
Q

What contributes to the total variability in the results?

A
  1. the variability resulting from the known differences between the groups.
  2. the ordinary random variability within each group, expected in any set of data, caused by sampling error, individual differences between the patients and so on.
83
Q

what type of data does a chi-square test?

A

nominal data;

chi square is a test of proportions.

84
Q

what are the two basic kinds of correlational techniques?

A

Correlation and Regression

85
Q

what is correlation used for?

A

to quantify the strength and direction of the relationship between two variables.

86
Q

what is regression used for?

A

to express the functional relationship between two variables, so that the value of one variable can be predicted from the knowledge of the other

87
Q

what are the possible values of the correlation coefficient r?

A

-1 (-ve correlation) to +1 (+ve correlation);

+ve correlation: means high values of one variable are associated with high values of the other variable.

-ve correlation: means high values of one variable are associated with low values of the other variable.

88
Q

what are the two most commonly used correlation coefficients?

A
  1. Pearson product-moment correlation [r]

2. Spearman rank-order correlation [ρ]

89
Q

Pearson product-moment correlation used for?

A

interval scale data

ratio scale data

90
Q

Spearman rank-order correlation used for?

A

ordinal scale data

91
Q

True/False? Pearson and Spearman correlational techniques are for non-linear data?

A

False; both techniques show only a linear association.

92
Q

if there is a strong non-linear relationship between 2 variables, the Pearson or Spearman coefficients will be what?

A

an underestimate of the true strength of the relationship.

93
Q

does a correlation between two variables demonstrate a causal relationship?

A

No