Statistics Flashcards
(85 cards)
Statistic of central tendency for nominal data
Mode
Yes or no data
Nominal
Standard error of mean from std dev
Standard error of the mean (SEM) = standard deviation / square root of sample size
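A minimal sketch of the formula on this card, using only the Python standard library. The sample values and the function name `standard_error_of_mean` are made up for illustration.

```python
import math
import statistics

def standard_error_of_mean(values):
    """Sample standard deviation divided by the square root of n."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    return statistics.stdev(values) / math.sqrt(n)

# Made-up sample, for illustration only.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
sem = standard_error_of_mean(sample)
print(f"SD = {statistics.stdev(sample):.3f}, SEM = {sem:.3f}")
```

The SEM shrinks as the sample size grows, even when the SD stays about the same.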
Test for categorical (nominal) variables
Fisher's exact test
Odds ratio interpretation for OR of 1.18 (95% CI 1.04, 1.33)
Odds of the event are elevated by 18% (95% CI: 4% to 33%); statistically significant because the CI doesn't include 1
Type of trial you use odds ratio to measure significance
Case-control; sometimes cross-sectional or cohort with some modifications
Calculate odds ratio
(a/c) divided by (b/d) = ad/bc
Where a = exposed cases, b = exposed non-cases, c = unexposed cases, d = unexposed non-cases
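The ad/bc calculation can be sketched as below. The counts are hypothetical, and the 95% confidence interval uses the common log (Woolf) approximation, which is an assumption added here and not something stated on the card.

```python
import math

def odds_ratio(a, b, c, d):
    """a = exposed cases, b = exposed non-cases,
    c = unexposed cases, d = unexposed non-cases."""
    return (a * d) / (b * c)

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Approximate 95% CI on the log scale (Woolf method, an assumption)."""
    log_or = math.log(odds_ratio(a, b, c, d))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return math.exp(log_or - z * se), math.exp(log_or + z * se)

# Hypothetical 2x2 counts, for illustration only.
a, b, c, d = 20, 10, 30, 40
or_value = odds_ratio(a, b, c, d)
lower, upper = odds_ratio_ci(a, b, c, d)

# The result is treated as statistically significant when the CI excludes 1.
significant = not (lower <= 1.0 <= upper)
print(f"OR = {or_value:.2f}, 95% CI ({lower:.2f}, {upper:.2f}), significant: {significant}")
```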
Continuous data
Data along an infinite or finite continuum that can be broken down into an infinite degree of detail (weight, temperature, etc.)
When would you use the Kruskal-Wallis test?
Nonparametric comparison of 3 or more independent groups when data are ordinal or not normally distributed
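A rough sketch of how the Kruskal-Wallis H statistic is built from ranks. It gives tied values their average rank but omits the tie correction and the p-value lookup, so treat it as a study aid and not a full implementation. The data and function names are made up.

```python
def average_ranks(values):
    """Rank values from 1..N, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def kruskal_wallis_h(groups):
    """H = 12 / (N(N+1)) * sum(R_i^2 / n_i) - 3(N+1), without tie correction."""
    pooled = [x for g in groups for x in g]
    ranks = average_ranks(pooled)
    n_total = len(pooled)
    weighted = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        weighted += rank_sum ** 2 / len(g)
        start += len(g)
    return 12 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)

# Made-up ordinal scores for three independent groups.
h_stat = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print(f"H = {h_stat:.2f}")
```

H is compared against a chi-square distribution with (number of groups - 1) degrees of freedom when samples are large enough.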
PANSS represents which type of data
Continuous, even though made up of multiple ordinal scales
Central tendency stat for ordinal data (ranked in order)
Median (mean not appropriate since data are categorical and not to be treated as continuous)
Types of continuous variables with examples
Interval and ratio
Interval - eg temperature degrees Celsius - equal intervals and zero is arbitrary
Ratio - like interval but there is a true zero - ex: weight, blood pressure
Test to see if data are normally distributed
Kolmogorov-Smirnov
2 discrete probability distributions
Binomial - only two possible outcomes, like heads or tails
Poisson - probability distribution for counting the number of events over a period of time - ex: number of ADRs from drug X over a time period
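Both distributions can be sketched with their textbook probability mass functions. The parameter values below (10 coin flips, an average of 2 ADRs per period) are made up for illustration.

```python
import math

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent yes/no trials."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

def poisson_pmf(k, rate):
    """Probability of exactly k events when the average count per period is `rate`."""
    return math.exp(-rate) * rate ** k / math.factorial(k)

# Probability of exactly 5 heads in 10 fair coin flips.
p_five_heads = binomial_pmf(5, 10, 0.5)

# Probability of seeing no ADRs in a period that averages 2 ADRs (made-up rate).
p_no_adr = poisson_pmf(0, 2.0)

print(f"P(5 heads in 10 flips) = {p_five_heads:.4f}")
print(f"P(0 ADRs | mean 2) = {p_no_adr:.4f}")
```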
Kurtosis
How peaked or flat a distribution is (how heavy its tails are) - normal distribution = 3
Skewness - symmetry of distribution - is data clustered at the low end positively or negatively skewed?
Clustered at low end - positively skewed - outliers on the high end pull the mean higher, so mean is higher than median
Clustered at high end - negatively skewed - low numbers pull the mean down, so mean is lower than median
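The kurtosis and skewness cards can be checked numerically with moment-based formulas. This sketch uses the simple population-moment versions (no small-sample correction), which is an assumption; the data are made up.

```python
import statistics

def central_moment(values, k):
    """Average of (x - mean)^k over the data."""
    m = statistics.fmean(values)
    return sum((x - m) ** k for x in values) / len(values)

def skewness(values):
    """Third standardized moment: > 0 is positive skew, < 0 is negative skew."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def kurtosis(values):
    """Fourth standardized moment: about 3 for a normal distribution."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2

# Made-up data clustered at the low end with one high outlier.
right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]
left_skewed = [-x for x in right_skewed]  # mirror image, clustered at the high end

for name, data in [("right", right_skewed), ("left", left_skewed)]:
    print(name, "skew:", round(skewness(data), 2),
          "mean:", statistics.fmean(data), "median:", statistics.median(data))
```

In the right-skewed sample the mean sits above the median, and in the mirrored sample it sits below, matching the cards above.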
Standard error of the mean
Different from SD - doesn't tell you how values compare to the mean; tells you how this sample's mean compares to the means of other samples from the same population
- for studies with more than 1 sample
- is SD / sq root of n
Non parametric test criteria
Non-normally distributed data
E.g., nominal or ordinal variables with sample size under 30
Also, ordinal scales with fewer than 12 categories, e.g., PANSS
Definition of beta
Probability of making a type II error
Usually < 0.2, preferably < 0.10
Definition of alpha
Probability of making a type I error
Inversely related to beta
Continuous variable parametric test
- compare two means?
If independent samples - Student's t test
Paired or matched data - paired t test
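A sketch of the two test statistics named on this card. It computes only t and the degrees of freedom; looking up the p-value needs a t table or a statistics library. The independent-samples version assumes equal variances (pooled), and all data are made up.

```python
import math
import statistics

def student_t(sample_a, sample_b):
    """Independent-samples t statistic with pooled variance."""
    n1, n2 = len(sample_a), len(sample_b)
    v1, v2 = statistics.variance(sample_a), statistics.variance(sample_b)
    pooled_var = ((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2)
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = (statistics.fmean(sample_a) - statistics.fmean(sample_b)) / se
    return t, n1 + n2 - 2

def paired_t(before, after):
    """Paired t statistic computed on the within-subject differences."""
    diffs = [y - x for x, y in zip(before, after)]
    n = len(diffs)
    se = statistics.stdev(diffs) / math.sqrt(n)
    return statistics.fmean(diffs) / se, n - 1

# Two separate groups of subjects (made-up values).
t_ind, df_ind = student_t([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])

# The same four subjects measured twice (made-up values).
t_pair, df_pair = paired_t([10, 12, 14, 16], [11, 14, 15, 18])

print(f"independent: t = {t_ind:.3f}, df = {df_ind}")
print(f"paired:      t = {t_pair:.3f}, df = {df_pair}")
```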
Comparison of 3 or more groups
One-way ANOVA - helps avoid type I error
- which is inflated by performing multiple t tests
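The reason multiple t tests inflate type I error can be shown with a short calculation. This sketch assumes the tests are independent, which is a simplification; the function names are made up.

```python
from math import comb

def pairwise_comparisons(k_groups):
    """Number of two-group comparisons needed to cover k groups."""
    return comb(k_groups, 2)

def familywise_error(alpha, n_tests):
    """Chance of at least one false positive across n independent tests."""
    return 1 - (1 - alpha) ** n_tests

for k in (3, 5):
    m = pairwise_comparisons(k)
    print(f"{k} groups -> {m} t tests -> familywise error {familywise_error(0.05, m):.3f}")
```

With alpha = 0.05 per test, three groups already push the overall type I error above 14% under this assumption.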
Anova detects what?
A difference among the 3 or more groups
- then, a multiple comparison method must be employed to detect which groups differ
- Dunnett, Bonferroni, Tukey, etc.
- repeated measures ANOVA - subjects in these are paired and serve as their own control (participate in >1 treatment group)
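A sketch of the one-way ANOVA F statistic (between-group variance over within-group variance), followed by the Bonferroni-adjusted alpha for the pairwise follow-up. It covers ordinary one-way ANOVA only, not the repeated measures version, and omits the p-value lookup. Data and names are made up.

```python
import statistics

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for independent groups."""
    all_values = [x for g in groups for x in g]
    grand_mean = statistics.fmean(all_values)
    ss_between = sum(len(g) * (statistics.fmean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum(sum((x - statistics.fmean(g)) ** 2 for x in g) for g in groups)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

def bonferroni_alpha(alpha, n_comparisons):
    """Per-comparison alpha that keeps the overall type I error near alpha."""
    return alpha / n_comparisons

# Three made-up treatment groups.
f_stat, df_b, df_w = one_way_anova_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]])
adjusted = bonferroni_alpha(0.05, 3)  # 3 pairwise comparisons among 3 groups

print(f"F({df_b}, {df_w}) = {f_stat:.2f}; Bonferroni alpha per comparison = {adjusted:.4f}")
```

A large F says only that some group differs; the adjusted alpha is then applied to each pairwise comparison to find which one.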
Nominal variables (nonparametric tests)
Chi-square test
- ex: test difference in baseline characteristics such as sex, smoking status, alcohol use (yes/no variables like this)
- tests observed vs expected frequencies
- requires larger samples
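The observed vs expected comparison can be sketched as follows. Expected counts come from the row and column totals; the 2x2 table is made up (for example, smokers vs non-smokers across two study arms), and no continuity correction is applied.

```python
def chi_square_statistic(table):
    """Return (chi-square, degrees of freedom) for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if rows and columns were unrelated.
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

# Hypothetical counts: rows = study arm, columns = smoker yes / no.
observed_table = [[10, 20], [20, 10]]
chi2, df = chi_square_statistic(observed_table)
print(f"chi-square = {chi2:.3f}, df = {df}")
```

With 1 degree of freedom, the statistic is compared against 3.841 for alpha = 0.05. When expected counts are small, Fisher's exact test is the usual alternative.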