Statistics Flashcards

Question

3 Measures of Variability

Answer 1

Range, Interquartile Range, Standard Deviation
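A minimal sketch of how these three measures might be computed with Python's standard library. The function name `variability` and the heart-rate values are made up for illustration; `statistics.quantiles` is used with its default "exclusive" method, and other quartile conventions can give slightly different IQR values.

```python
import statistics

def variability(values):
    """Return (range, IQR, sample SD) for a list of numbers."""
    data = sorted(values)
    value_range = data[-1] - data[0]             # largest minus smallest
    q1, _, q3 = statistics.quantiles(data, n=4)  # quartile cut points
    iqr = q3 - q1                                # spread of the middle 50%
    sd = statistics.stdev(data)                  # sample SD (n - 1 denominator)
    return value_range, iqr, sd

# Hypothetical heart rates in bpm, for illustration only.
heart_rates = [62, 70, 74, 78, 80, 82, 86, 90, 98]
rng, iqr, sd = variability(heart_rates)
print(f"range={rng}, IQR={iqr}, SD={sd:.2f}")
```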

Answer 2

1) Mean, median, and mode are exactly equal in value. 2) Frequency distribution is symmetrical about its center (the mean). 3) “[T]he area under the curve can be precisely defined by the mean and standard deviation (SD), as follows: a) ~68% of the data points fall within ± 1 SD of the mean, b) ~95% of the data points fall within ± 2 SD of the mean, c) ~99% of the data points fall within ± 3 SD of the mean” (19)
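The percentages in point 3 can be checked against the normal distribution itself. This sketch relies on the standard identity that the fraction of a normal distribution within ± k SD of the mean equals erf(k / √2); the function name `fraction_within` is illustrative.

```python
import math

def fraction_within(k):
    """Fraction of a normal distribution lying within +/- k SD of the mean."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    print(f"within +/- {k} SD: {fraction_within(k):.2%}")
```

The results round to roughly 68%, 95%, and a little over 99%, in line with the figures quoted above.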

Answer 3

Asymmetrical (non-normal, non-Gaussian) distributions of data (19).

Answer 4

A data point far away from the center of the data distribution (20).

Answer 5

1) Laboratory values for which the clinically normal (ie, expected) range lies near zero (eg, serum bilirubin or urinary protein concentration)...because abnormal values can be quite high on one side of the distribution but can never be less than zero on the other side. 2) Birth weight 3) Number of days in a hospital stay (22).

Answer 6

Use the median and interquartile range (IQR) (22).

Answer 7

The median and IQR should appear more frequently in medical literature than the mean and SD (22).

Answer 8

1) Difference between mean and SD produces an impossible result (eg, negative tumor size) 2) Mean differs markedly from median (23-24)
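Both warning signs can be checked mechanically. The sketch below assumes the measurements cannot be negative, and it needs a cutoff for "differs markedly", which the card does not define: the `ratio` parameter (0.2 of an SD) is a hypothetical choice, as are the function name and the sample data.

```python
import statistics

def skew_warnings(values, ratio=0.2):
    """Return (impossible_lower_bound, mean_far_from_median) for non-negative data.

    ratio is a hypothetical cutoff: the gap between mean and median counts as
    "marked" when it exceeds ratio * SD.
    """
    mean = statistics.mean(values)
    median = statistics.median(values)
    sd = statistics.stdev(values)
    impossible_lower_bound = (mean - sd) < 0              # sign 1: mean - SD is negative
    mean_far_from_median = abs(mean - median) > ratio * sd  # sign 2: mean and median disagree
    return impossible_lower_bound, mean_far_from_median

# Hypothetical lengths of hospital stay in days, with one long stay.
hospital_days = [1, 1, 2, 2, 3, 3, 4, 5, 30]
print(skew_warnings(hospital_days))
```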

Answer 9

Those that should be applied to normally distributed data (24)

Answer 10

Those that should be applied to non-normally distributed data (24)

Answer 11

Using simple counts or percentages of categories; depicted with dot, pie, or bar charts (25).

Answer 12

As for nominal data, using simple counts or percentages of categories; depicted with dot, pie, or bar charts (25).

Answer 13

Because they represent numeric counts, using measures of central tendency and dispersion (26).

Answer 14

Data from every individual in a population

Answer 15

An estimate of the variability among sample means. Calculated in order to get closer to the variability of a population than does standard deviation of a single sample (35-36).

Answer 16

Standard deviation / [divided by] square root of sample size (36).
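As a small illustration of this formula, the sketch below divides the sample SD by the square root of the sample size. The function name `sem` and the data are illustrative, not taken from the source.

```python
import math
import statistics

def sem(values):
    """Standard error of the mean: sample SD / sqrt(sample size)."""
    return statistics.stdev(values) / math.sqrt(len(values))

# Hypothetical heart rates in bpm, for illustration only.
heart_rates = [62, 70, 74, 78, 80, 82, 86, 90, 98]
print(f"SEM = {sem(heart_rates):.2f} bpm")
```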

Answer 17

“[P]erhaps the greatest value of the SEM is...that it enables us to determine the proportion of sample mean values that will be expected to fall within certain portions of the normal distribution (eg, 68% of all possible sample mean values will occur within ±1 SEM of the ‘true’ [population] mean value)” (36).

Answer 18

Eg, "The estimated mean (SEM) heart rate of the population is 80 (0.8) bpm" (36).

Answer 19

Mean with its 95% confidence interval (CI), which is approximately the mean ± 2 SEMs.

Answer 20

Along with the mean, the preferred way in the medical sciences to report the precision of an estimate. The 95% CI is approximately the mean ± 2 SEMs.
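A rough sketch of the "2 SEMs" rule of thumb, reusing the heart-rate example from an earlier card (mean 80 bpm, SEM 0.8 bpm). The function name `approx_ci95` is illustrative. The multiplier 2 is an approximation: for large samples the more exact value is about 1.96, and small samples call for a larger multiplier taken from the t distribution.

```python
def approx_ci95(mean, sem, multiplier=2.0):
    """Approximate 95% confidence interval as mean +/- multiplier * SEM."""
    half_width = multiplier * sem
    return mean - half_width, mean + half_width

lower, upper = approx_ci95(80, 0.8)
print(f"95% CI: about {lower:.1f} to {upper:.1f} bpm")
```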

Answer 21

Increase the sample size (38).

Answer 22

The hypothesis that the intervention has no effect on the data (45).

Answer 23

A description of how closely the observed data match the prediction of the null hypothesis. In the example, the number of SEMs between the observed mean heart rate difference and the difference predicted under the null hypothesis: (observed difference − null-hypothesis difference) / SEM (47).

Answer 24

Find the observed mean, SD, and SEM; see how much the observed mean differs from the mean (often 0) in the null hypothesis; divide that difference by the SEM to get the test statistic (the difference expressed in SEMs); consult a statistical (often t) table that gives the probability of a difference this large; and draw a conclusion based upon that probability (47).

Answer 25

"[T]he probability that, if the null hypothesis is true, chance alone could have produced a result as extreme as the one observed" (47). I.e., the probability that this could have happened by chance.

Answer 26

.05: “As a matter of convention, researchers commonly set the P value (the level at which results will be classified as statistically significant) at .05. This means that they are willing to accept a 5% chance that they could wrongly conclude that an observed result was real when, in fact, it was due to chance” (47).

Answer 27

Demonstrating (in clinical trials, between the control and study group) a large enough difference to have a practical impact on the patient (47).

Answer 28

Performed on a "study [that] uses paired data[: for instance,] two measurements from the same person at different times" (48).

Answer 29

The conventional P threshold of .05 (49).

Answer 30

“All of these tests determine the difference between the observed results and the results that would be expected according to the null hypothesis; they then standardize this difference between observed and expected results by dividing it by the applicable measure of variability (eg, the SEM)” (49).

Answer 31

"used to assess the statistical significance of results obtained with categorical data" (49). [Mnemonic: "Chi" for "Categorical."] Eg, a study of relapse/no relapse and percentages of individuals who fall into each camp (49).

Answer 32

"used to determine the statistical significance of the difference between the means obtained from 2 (and only 2) independent samples" (49). [Mnemonic: "t" for "two"]

Answer 33

"an extension of the t test...used to compare the means obtained from 3 or more groups" (50). N.b.: "The results of an ANOVA indicate only whether a statistically significant difference exists; they don’t indicate which group or groups are different from the others" (50).

Answer 34

Tests involving "parameters (measurable characteristics) such as group means, SD, and variance." The t test, paired t test, and ANOVA are parametric tests. N.b.: "researchers should use these parametric statistical tests only for data that are roughly normally distributed" (50).

Answer 35

To be performed on study data that are not normally distributed (50).

Answer 36

"[T]he nonparametric alternative to the paired t test" (50).

Answer 37

"[T]he nonparametric alternative to ANOVA" (50).

Answer 38

“[M]istakenly concluding that there is an effect when in fact there isn’t one" (50)

Answer 39

“[I]ncorrectly conclud[in]g that there is no effect, when in fact there really is one” (50).

Answer 40

"[M]any scientific publications expect authors to present both P values [for statistical significance] and [95%] confidence intervals [for clinical importance]" (51).
