Class 29-37: Intro To Biostats Flashcards
How do researchers choose to accept or reject the null hypothesis?
Statistical analysis
Based on p and confidence intervals.
3 primary levels for variables based on answers:
Nominal
Ordinal
Interval
2 key attributes of data measurement:
Magnitude
Consistency of scale.
Each attribute can be assessed w/ a “yes” or “no” response to the inquiry of:
“Does it have it?”
No magnitude and no consistency
Dichotomous/binary and NON ranked
Simply labeled variables w/o quantitative characteristics (dichotomous)
Nominal
Has magnitude but no consistency of scale
Ranked categories
Pain scales, severity of disease
Ordinal
Has magnitude and consistency of scale
Anything drawn w/ exact values
Interval / ratio
Nominal and ordinal attributes are considered:
Interval attributes are considered:
Discrete; continuous
Which statistical tests are selected based on level of data being compared
ALL statistical tests
After data, is collected we can appropriate go _____ in specificity/detail of data measurement (levels), but we can never go _____
Down; up
Non-comparative. Simple description of various elements of the study’s data
Descriptive statistics.
Measures of central tendency and dispersion
Mode/ median/ mean
Outliers
Minimum / max /. Range
Interquartile range= Q3-Q1
Know how to calculate these b/c it will be free points on the test.
The average of the squared differences in each individual measurement value and the groups mean
Variance- from the mean
The larger the number= the more variable
Square root of variance value (restores units of mean)
Standard deviation (SD)
Normally distributed= (one word)
Symmetrical
What graph is found when mean / median / mode are equal or near-equal:
Normally distributed
Interval data must be:
Normally distributed.
Stats tests useful or _____________________ data are called parametric tests
Normally-distributed
SD in normal distribution:
68%-top of bell curve
95%= +2 SD
99.7%= +3 SD
Parametric tests
Fixed mean / median / mode
Positively skewed graph
Asymmetrical distribution w/ one tail longer than the other
A distribution is skewed anytime the median differs from the mean (when mean is higher than median.
Mean > median
Negatively skewed graph:
Asymmetrically distributed w. One tail loner than another to the left.
Distribution is skewed anytime the median differs from the mean (when mean is lower than median)
Mean < median
Mean = 15.15 Median= 3 Mode= 0
Mean > median
Positively skewed
Outliers do not change:
The mode
Range is:
subtraction btw min and max.
The difference btw the 2 values.
Mean 5.05
Median: 7.0
Mode: 7
Skewed?
Mean < median
Skewed to the left
Negatively skewed.
Mean: 44.31
Median: 47.0
Mode: 49
Skewed?
Mean < median
Negative skewed.
Shift to the left
What Age extremes represent 95% of data?
Mean + (2xSD)
Mean - (2xSD)
A a measure of the asymmetry of a distribution
Perfectly normal distribution is symmetric and has value of 0
Skewness
Want to be closer to 0
A measure of the extent to which obs’s cluster around the mean. For a normal distribution, the value of 0
+= more cluster
-=less cluster
Kurtosis
Want to be closer to 0
Positive kurtosis =
More clustered
What is the age range of middle 68%?
95%
99%
One SD
2 SD
3 SD
Add to mean 3 times or
subtract to mean 3 time ??
Which of mean, mode, median could adequately represent the measure of central tendency for the gender variable?
Mode values would describe the # of each genders ( the most frequent)
Which variable is the most descriptive variable of nominal data
Mode is the most descriptive variable of data
What data type can median/ mean/ and mode ALL be defendable
Interval
Median and mode defendable for data type?
Ordinal
Req’d assumptions of interval data for proper parametric test:
(2)
- Normally- distributed -look at mean,median,mode-look at graph, look at skewness values,
- Equal variances:
run statistical test: levenne’s test - Randomly-derived and independent
* if these are not good enough, then interval / parametric CANT be used
Common statistical test for parametric test/interval data:
Compares equal variances
Levene’s test
How to handle data that’s not normally distributed:
- could use ordinal family stat. Test. (Non-parametric tests)
- could throw out outliers.
- transform data to a standardized value. (Z score or log)- can create a normal distribution. [issue: how do you log BP?]
Segars-picks ordinal typically
*TQ
What should you always do for data interpretation?
Run descriptive statistics and graphs
Researchers will either accept or reject this perspective, based on:
Statistical analysis
Rejecting the null hypothesis when you shouldn’t have!
Type 1 error (alpha)
Dont reject the null when you should have
Type II error (beta)
The ability of a study design to detect a true difference if one exists btw group-comparisons….
The level of accuracy in corectly accepting/rejecting the null
Power (1-ß)
ß=beta=type 2 error
The larger the sample size, the greater the likelihood (ability) of detecting a difference if one truly exists.
Increase in power
Sample size.
Most researchers accept up to _____ type 2 beta error rate…so must researchers want 80% chance of finding differences
What about alpha error rate?
20% of the BETA rate!!
5% for type 1 (alpha) rate