Lecture 1 - Mean, Mode, etc. Flashcards
What is continuous data?
Data that can be divided up an infinite number of times. e.g. age decades -> years -> months -> days, etc.
What is discrete data?
Data that cannot be divided up. aka integer data. e.g. # of fractures someone in a car accident has
What is nominal data?
Data given arbitrary numeric labels. Used merely for identification. e.g. race. white = 0, black = 1, etc.
What is ordinal data?
Ordered rankings with numeric labels. Numbers have meaning in relation to one another, but no intrinsic value. e.g. APGAR score.
What is categorical data?
A type of ordinal data where higher numbers are in fact better, but the actual difference between is not known. e.g. pain scores.
How is mean denoted in equations?
” X bar” (an X with a line on top of it)
What is a major weakness of the mean?
It is strongly influenced by outlying values
What are strengths of the median?
Impervious to outlying values
What is a disadvantage of the median?
It only uses a small portion of the data set and does not give you an idea of the full range of values
What is variance?
A measure of variability around the mean
When should you use the mean vs. mode?
Generally use the mean when variance is low and the mode when variance is high. If the values are well spread out, report both.
What is the standard deviation, numerically?
The square root of variance (converts variance back to a meaningful number for our actual data)
What is large s?
Large spread. No hard and fast rule. Generally when SD is 50% (or more) as big as the mean.
What is a small s?
Anything <25% = small
What is the coefficient of variation (CV)?
Used to describe SD in relation the mean as a percentage. CV = (SD/mean) x100