E-module 2 - Choosing statistics Flashcards
What are the 2 types of analysis of data?
Correlation
Comparison
Definitions of correlation and comparison as types of data analysis.
Correlation
- hypothesis tests to evaluate relationship between variables
Comparison
- hypothesis tests to evaluate differences between groups/populations
What are the 2 types and 4 subtypes of data?
Types
- Quantitative - numeric information
- Qualitative/Categorical - information that can’t be measured
Subtypes
- Quantitative gives rise to CONTINUOUS and DISCRETE (counted) data
- Qualitative/categorical gives rise to NOMINAL (unordered) and ORDINAL (ordered) data
Which subtypes of data are PARAMETRIC and NON-PARAMETRIC?
PARAMETRIC
- continuous quantitative
NON-PARAMETRIC
- discrete quantitative
- nominal categorical
- ordinal categorical
What is the principle segregating continuous and discrete data?
Continuous can be subdivided (potentially) infinitely where discrete cannot
- e.g. age is continuous if measured exactly in months, days, hours etc, but discrete if measured in years (overlap here within a category between continuous and discrete data)
What is important to remember about continuous vs discrete data?
Means or rates are always continuous data. Likelihood is they were generated using discrete data BUT they themselves are continuous
e. g. heart rate is continuous but number of heartbeats in a minute is discrete
- this is because continuous data can take ANY value (e.g. 2, 2.5, 3) but discrete data cannot take certain values (e.g. 2, 3 but NOT 2.5)
When do you check for normality in data?
When you have CONTINUOUS data - discrete and qualitative data is ALWAYS NON-PARAMETRIC
What does normality measure?
Measures central tendency and dispersion of data
What are the 2 tests used for testing normality and their conditions?
Shapiro-Wilk test - n<50
Kolmogorov-Smirnov test - n>50
When can the data be conferred as normal or not normal?
p-value of normality test
if p<0.05 data is NOT normal, otherwise data is normal
What are the 3 outcomes and subsequent distributions of data after normality testing?
YES - Gaussian/Normal distribution
NO - Skewed distribution
NO - Kurtosis
What are the 2 main features of normal distributions?
- 68-95-99.7 rule - 2/3rds of data lies within 1 SD of the mean, 95% within 2 SDs, 99.7% within 3 SDs
AND - Distribution is symmetrical
What are the features of skewed distributions?
ASYMMETRICAL
- mean, median and mode all separated (usually found together in normal dist.) (mode at top of curve, median just downslope from top, mean just further downslope from the median)
- skew is named according to which direction has the long tail e.g. right/positive skew = long positive/right tail and vice versa
- uneven tails with many data points at high/low end of range
What are the features of Kurtosis?
Kurtosis is where data is heavy or light-tailed with respect to a normal distribution
- heavy-tailed = outliers create a wide distribution (graph is flattened)
- light-tailed = lack of outliers creates a narrow distribution (graph is steepened)
Definition of unpaired/independent and paired/dependent data/groups and an example of a study employing this?
Paired/dependent = when two (or more) sets of data have come from the same individual e.g. same subject at different points of the day
- longitudinal study