Exam III Flashcards
What are the two key attributes of data measurement (variable)?
- magnitude
2. consistency of scale/ fixed interval
What is the third variable used to assess data when the first two are answered with yes’?
-rational/absolute zero
What are the three levels of measurement?
- nominal
- ordinal
- interval
Examples of nominal measurements
-variables that are simply labeled variables without quantitative characteristics
- gender
- hair color
- occupation
examples of ordinal variables
-variables that have magnitude but no consistency of scale
- interval of ages (1-18, 19-50)
- months homeless (less than three, greater than three)
examples of interval data
-number with units at the end
- age
- number of siblings
What variables are considered discrete vs continuous?
discrete: nominal, ordinal
continuous: interval
What are the measures of central tendency utilized for describing continuous data?
- mode/mean/median
- outliers
- min/max/range
- interquartile range
What is variance?
difference in each individual measurement value and the groups’ mean
What is standard deviation?
- square root of variance value
- know eqn
How do you know when a graph is normally distributed?
-the mean/median/mode are near equal
What are parametric tests?
-stats tests useful for normally distributed data
What are the two types of graphical shapes?
- positively skewed
- negatively skewed
How can you tell the difference between a positively and negatively skewed graph based on stats alone?
positively skewed:
- mean is greater than median
negatively skewed:
-mean is less than median
Definition of skewness
-a measure of asymmetry of a distribution
+a perfectly normal distribution is symmetric and has a skewness value of 0
What is kurtosis?
- a measure of the extent to which observations cluster around the mean. For a normal distribution, the value of the kurtosis statistic is 0.
- how peaked the graph is
positive vs negative kurtosis
positive -> more cluster
negative -> less cluster
What do you get when you add or subtract a std dev from the mean?
-range of middle 68%, 95%, and 99%
What are the required assumptions for interval data to select a parametric test?
- normally distributed
- equal variances
+Levene’s test - randomly derived and independent
What is the Levene’s test?
-a test for variablity
How does one handle interval data that is not normally distributed?
- use a statistical test that does not require the data to be normally distributed (non-parametric)
- transform data to a standardized value (z-score or log)
What is power?
1-beta (type 2 error)
- the ability of a study design, its methodology, and the selected test statistic to detect a true difference if one truly exists between group comparisons (analogous to sensitivity)
- researchers typically choose 80% power to truly distinguish a difference between two groups
What does power have to do with sample size?
-the larger the sample size, the greater the likelihood of detecting a difference if one truly exists (increase in power)
What needs to be determined with each sample? (for size)
- minimum difference between groups deemed significant
+the smaller the difference between groups necessary to be considered significant, the greater the N needed - expected variation of measurement
- alpha and beta error rates
+alpha -> type 1 (5%)
+beta -> type 2 (20%)
+add in anticipated drop outs or loss to follow ups
What is the p value and why is it important?
-statistical tests that determine the possible differences or relationships between variables
-same as type 1 error and alpha
-if less than 0.05, then you can claim there is a difference between groups
+small chance that the difference didn’t occur by chance
When the p value is high which hypothesis will you be more likely to accept?
null hypothesis
What is type 1 error (alpha)?
- false positive
- rejecting the null hypothesis when it is actually true
What is type 2 error (beta)?
- false negative
- accepting the null hypothesis when you should have rejected it
When comparing baseline characteristics in a study what you want the p values to be?
- high p values, want no difference between groups
- same for Levene’s test for baseline
How would one interpret the p value in words?
-probability of making a type 1 error if null is rejected
What is the confidence interval?
-percentage of confidence that statistically the real difference or relationship resides
-based on:
+variation in sample (v/sd)
+sample size
What is the most commonly selected percentage for a CI?
95%