Class 29-37: Intro To Biostats Flashcards

1
Q

How do researchers choose to accept or reject the null hypothesis?

A

Statistical analysis

Based on p and confidence intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 primary levels for variables based on answers:

A

Nominal
Ordinal
Interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

2 key attributes of data measurement:

A

Magnitude

Consistency of scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Each attribute can be assessed w/ a “yes” or “no” response to the inquiry of:

A

“Does it have it?”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

No magnitude and no consistency

Dichotomous/binary and NON ranked

Simply labeled variables w/o quantitative characteristics (dichotomous)

A

Nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Has magnitude but no consistency of scale

Ranked categories

Pain scales, severity of disease

A

Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Has magnitude and consistency of scale

Anything drawn w/ exact values

A

Interval / ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Nominal and ordinal attributes are considered:

Interval attributes are considered:

A

Discrete; continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which statistical tests are selected based on level of data being compared

A

ALL statistical tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

After data, is collected we can appropriate go _____ in specificity/detail of data measurement (levels), but we can never go _____

A

Down; up

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Non-comparative. Simple description of various elements of the study’s data

A

Descriptive statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Measures of central tendency and dispersion

A

Mode/ median/ mean
Outliers
Minimum / max /. Range
Interquartile range= Q3-Q1

Know how to calculate these b/c it will be free points on the test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The average of the squared differences in each individual measurement value and the groups mean

A

Variance- from the mean

The larger the number= the more variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Square root of variance value (restores units of mean)

A

Standard deviation (SD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Normally distributed= (one word)

A

Symmetrical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What graph is found when mean / median / mode are equal or near-equal:

A

Normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Interval data must be:

A

Normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Stats tests useful or _____________________ data are called parametric tests

A

Normally-distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

SD in normal distribution:

A

68%-top of bell curve

95%= +2 SD

99.7%= +3 SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Parametric tests

A

Fixed mean / median / mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Positively skewed graph

A

Asymmetrical distribution w/ one tail longer than the other

A distribution is skewed anytime the median differs from the mean (when mean is higher than median.

Mean > median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Negatively skewed graph:

A

Asymmetrically distributed w. One tail loner than another to the left.

Distribution is skewed anytime the median differs from the mean (when mean is lower than median)

Mean < median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q
Mean = 15.15
Median= 3
Mode= 0
A

Mean > median

Positively skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Outliers do not change:

A

The mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Range is:
subtraction btw min and max. The difference btw the 2 values.
26
Mean 5.05 Median: 7.0 Mode: 7 Skewed?
Mean < median Skewed to the left Negatively skewed.
27
Mean: 44.31 Median: 47.0 Mode: 49 Skewed?
Mean < median Negative skewed. Shift to the left
28
What Age extremes represent 95% of data?
Mean + (2xSD) Mean - (2xSD)
29
A a measure of the asymmetry of a distribution Perfectly normal distribution is symmetric and has value of 0
Skewness Want to be closer to 0
30
A measure of the extent to which obs's cluster around the mean. For a normal distribution, the value of 0 += more cluster -=less cluster
Kurtosis Want to be closer to 0
31
Positive kurtosis =
More clustered
32
What is the age range of middle 68%? 95% 99%
One SD 2 SD 3 SD Add to mean 3 times or subtract to mean 3 time ??
33
Which of mean, mode, median could adequately represent the measure of central tendency for the gender variable?
Mode values would describe the # of each genders ( the most frequent)
34
Which variable is the most descriptive variable of nominal data
Mode is the most descriptive variable of data
35
What data type can median/ mean/ and mode ALL be defendable
Interval
36
Median and mode defendable for data type?
Ordinal
37
Req'd assumptions of interval data for proper parametric test: (2)
1. Normally- distributed -look at mean,median,mode-look at graph, look at skewness values, 2. Equal variances: run statistical test: levenne's test 3. Randomly-derived and independent * if these are not good enough, then interval / parametric CANT be used
38
Common statistical test for parametric test/interval data: Compares equal variances
Levene's test
39
How to handle data that's not normally distributed:
- could use ordinal family stat. Test. (Non-parametric tests) - could throw out outliers. - transform data to a standardized value. (Z score or log)- can create a normal distribution. [issue: how do you log BP?] Segars-picks ordinal typically ****TQ***
40
What should you always do for data interpretation?
Run descriptive statistics and graphs
41
Researchers will either accept or reject this perspective, based on:
Statistical analysis
42
Rejecting the null hypothesis when you shouldn't have!
Type 1 error (alpha)
43
Dont reject the null when you should have
Type II error (beta)
44
The ability of a study design to detect a true difference if one exists btw group-comparisons.... The level of accuracy in corectly accepting/rejecting the null
Power (1-ß) ß=beta=type 2 error
45
The larger the sample size, the greater the likelihood (ability) of detecting a difference if one truly exists. Increase in power
Sample size.
46
Most researchers accept up to _____ type 2 beta error rate...so must researchers want 80% chance of finding differences What about alpha error rate?
20% of the BETA rate!! 5% for type 1 (alpha) rate
47
The number one issue in power that ppl worry about?
Sample size!
48
3 common characteristics for sample size determination:
1. Minimum difference btw groups deemed "significant" 2. Expected variation of measurement. 3. Alpha (type 1) and beta (type 2) error rates and confidence interval. - add in anticipated drop-outs or loss to f.u
49
Statistical tests determine the possible error rate or chance in compared difference or relationship btw variables
P values Or type 1 error rate Or alpha error rate
50
How do p value and type 1 error rate differ?
They dont theyre the same thing
51
% chance of making type 1 error. By convention, accept about 5% risk of being wrong
P value
52
If P value is < 5% (or whichever is preset value), can you reject the null?
Yes! This is statistically significant.
53
Authors that pick 1% preset p value are _____likely to reject the null
LESS
54
If the p value is set higher (~10%), it is ________ to find group differences
Easier But they have a higher risk of being WRONG
55
Define probability value:
Probability of making type 1 error
56
Know how to explain the statistical significance of a p value in own words:
The P value is the probability or chance of making a type I error if rejecting the null hypothesis.
57
What 2 areas do you love to see high p values?
Table 1 characteristics And Levenne's test ----this shows that the studies randomization process worked!
58
Most common selections are 90,95,99% Calc at an a priori percentage of confidence that statistically the real difference or relationship resides. Based on: variation in sample and sample size
Confidence interval (CI)
59
Journals are moving away from solely reporting p values; or showing them at all.
Now people are using CI | Confidence interval.
60
Single best guess at relationship or difference btw groups The value of 0 represents no difference Range represents SD
Point estimate
61
If CI crosses 1.0 or an absolute difference of 0.0:
No difference Sameness NOT SIGNIFICANT = p>0.05
62
Slide 63: bivalirudin study. Interpret MAjor bleeding CI's:
Compared to unfractionated heparin, Bivalrudin 34% less likely to have major bleed than heparin, and can be said w. 95% confidence that the real group difference is btw 51% - 10% reduced risk, which is statistically significant.
63
Looking at forest plots, you dont want CI to cross:
1 on the x axis
64
Important consideration:
Does statistical significance actually confer meaningful, clinical significance?
65
4 key questions to selecting the correct statistical test:
1. What data level is being recorded?**most important 2. What type of comparison/assessment is desired? 3. How many groups are being compared? 4. Is the data independent or related (paired)?
66
1. What data level is being recorded?
What data have magnitude? (Y/n) Does data have a fixed interval? (Y/n) -the remaining 3 questions get you around the other portions of each individual sheet
67
2. What type of comparison/assessment is desired?
Correlation test
68
Provides a quantitative measure of the strength and direction of a relationship btw variables
Correlation Values range from -1 to +1
69
A correlation that controls for confounding variables
Partial correlation
70
Are 2 things related and how are they related?
Correlation
71
Positive correlation: +1.0
One goes up; the other does as well
72
Negative correlation -1.0
One goes up; the other goes down
73
Zero correlation: 0
No correlation whatsoever
74
Perfect correlation:
45 degree angle up or down
75
Artificially fitted and slanted so that the difference btw line and each individual measurement is the smallest distance possible. Like an average of all of the dots
Correlation line
76
Nominal correlation:
Contingency coefficient
77
Ordinal correlation:
Spearman correlation
78
Interval correlation:
Pearson correlation- "is there a linear relationship?" P> 0.05 for a pearson correlation just means there is no linear correlation; there may still be non-linear correlations present!
79
All correlations can be run as ___________________ to control for confounding
Partial correlation
80
NOT only used to see if ppl survive. Frequency of occurrence or change in category over time Help us understand when time is important=changes over time.
Survival test
81
Compares the proportion of, or time-to, event occcurences btw groups Visual image of changes over time. Commonly represented by:
Kaplan-meier curve
82
Nominal survival test:
Log-rank test
83
Ordinal survival test:
Cox-proportional hazards test
84
Interval survival test:
Kaplan-meier test *nominal, ordinal, and interval can all be represented by kaplan-meier curve
85
Outcome prediction/ association of the groups: Allows us to generate an asso. Or To take a bunch of variables and try to predict an outcome.
Regression test
86
Nominal regression test:
Logistic regression
87
Ordinal regression test:
Multinominal logistic regression
88
Interval regression test:
Linear regression
89
You can obtain an ____ from a regression
OR odds ratio
90
Data that is derived independent of their group. Groups are treated and followed independently from other groups
Independent data
91
2 groups of independent data of nominal data: Males vs. females
Pearson's- chi squared test (x squared)
92
3 groups (or more) of independent data of nominal data:
Chi-square test of independence (x squared)
93
Greater than 2 groups w/ expected cell count. Of <5 event occurrences
Fischer's exact test
94
Greater than or equal to 3 groups of independent data-nominal data, for statistically significant findings (P<0.05) in 3 or more comparisons, one must perform subsequent analysis:
Post-hoc testing: to determine which groups are different
95
POst hoc testing for nominal data: Adjusts the p value for # of comparisons being made, very conservative. Take # of comparisons to do and divide the p values 0.05/3 (groups) = 0.01667 is new p value
Bonferroni correction: test of inequality
96
2 groups of paired/related data nominal data:
Mcnemar test
97
>3 groups of paired/related data:
Cochran
98
Paired data = related data From the same person. He will use before and after / pre-post-/ baseline end/ as the terminology..
McNemar test (2 groups related data) Cochran (>3 groups of paired/related data)
99
2 groups of independent data ordinal:
Mann-whitney test
100
>3 groups of independent data ordinal:
Kruskal-wallis test
101
2 groups of paired/related data ordinal:
Wilcoxon signed rank test
102
>3 groups of paired/related data ordinal:
Friedman test
103
Post hoc tests for 3 or more group comparisons: See where the various differences are in between the groups. Not needed if initially NOT stat. Significant!
Student-newman-keul test Dunnett test Dunn test
104
Post hoc test compares all pairwise comparisons possible All groups must be equal in size Ordinal data
Student-newman-keul test
105
Post hoc test for 3 or more tests compares all pairwise comparisons against a single control All groups must be equal in size Ordinal data
Dunnett size
106
Useful post hoc test comparing groups unequally: ordinal
Dunn test
107
2 groups of independent data-interval data
Student t-test
108
>3 groups of independent data-interval
ANOVA: analysis of variance-powerful enough to handle any number of groups MANOVA: multiple analysis of variance
109
Must perform a ______ if statically significant and >3 groups
Post hoc
110
> 3 groups of independent data w/ confounders in interval data
ANCOVA: Analysis of co-variance MANCOVA: multiple analysis of co-variance
111
2 groups of paired/related interval data
Paired t-test Data that is related
112
>3 groups of paired/related interval data
Repeated measures ANOVA Repeated measures MANOVA
113
>3 groups of paired/related data w/ confounders in interval data.
Repeated measures ANCOVA or MANCOVA
114
Can use student-newman-keul, dunnett, or dunn test for post hoc testing. What're 2 others for interval data?
Tukey or scheffe test: compares all pairwise comparisons possible. All groups must be = in size.. --tukey: more conservative than Stu.N.K. --scheffe: MOST conservative. Bonferroni correction: adjusts p values for # of comparisons made
115
Agreement btw evaluators (consistency of decisions, determinations) Looking for consistency of agreement
Kappa statistic Difference in radiologist reading films -1 to +1 +=good agreement -=poor agreement
116
Review: 4 key questions to ask:
1. What data level is being recorded? 2. what type of comparison/assessment is desired? 3. how many groups? 4. Is the data independent or related (paired)?