Summarizing and displaying data Flashcards
What is a variable?
A characteristic that can take different values or categories, e.g. blood pressure, nationality
What are the types of measurement scales?
- numerical = continuous/discrete
- categorical = nominal/ordinal
What are the types of variables?
- numerical
-continuous = any value on a number line, including fractions (0, 0.1, 0.2)
-discrete = whole numbers (1, 2, 3)
- categorical
-nominal = categories have no order (blood group, nationality)
-ordinal = categories have an order (BMI category, disease severity)
-dichotomous/binary = only two categories exist (dead or alive)
What type of graph is best used for categorical data?
horizontal bar graph
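A minimal sketch of a horizontal bar graph drawn as text, one bar per category. The blood group values and the helper name `text_bar_chart` are made up for illustration.

```python
from collections import Counter

def text_bar_chart(values):
    """Return one line per category, with one '#' per observation."""
    counts = Counter(values)
    # Pad labels to the same width so the bars line up.
    width = max(len(str(label)) for label in counts)
    return [
        f"{str(label).ljust(width)} | {'#' * n} ({n})"
        for label, n in counts.most_common()  # most frequent category first
    ]

# Hypothetical categorical data: blood groups of six patients
blood_groups = ["O", "A", "O", "B", "O", "A"]
chart = text_bar_chart(blood_groups)
print("\n".join(chart))
```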
What type of graph is best used for a small amount of data on a continuous scale?
dot plot
What is needed when data sets become too large?
summaries
What are the methods of summarizing data for numerical and categorical data?
- numerical
a. measures of centrality
-means, modes, medians
b. measures of dispersion
-standard deviation, IQR, range, variance
- categorical
a. counts/frequencies
b. proportions/percentages
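The summaries listed above can be sketched with Python's standard library. The data values are invented for illustration, and the variance and standard deviation here are the sample versions (dividing by n - 1), which is an assumption about the convention intended.

```python
import statistics
from collections import Counter

# Hypothetical numerical data: systolic blood pressure readings (mmHg)
pressures = [110, 120, 120, 130, 140]

# Measures of centrality
mean = statistics.mean(pressures)
median = statistics.median(pressures)
mode = statistics.mode(pressures)

# Measures of dispersion
value_range = max(pressures) - min(pressures)
variance = statistics.variance(pressures)  # sample variance (n - 1)
std_dev = statistics.stdev(pressures)      # square root of the sample variance

# Hypothetical categorical data: blood groups
groups = ["A", "O", "O", "B", "O", "A"]
counts = Counter(groups)                                        # counts/frequencies
proportions = {g: n / len(groups) for g, n in counts.items()}   # proportions

print(mean, median, mode, value_range, variance, round(std_dev, 2))
print(dict(counts), proportions)
```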
What are other names for the measures of centrality?
also called measures of location or measures of central tendency
What do measures of dispersion/variability refer to?
how spread out the data is
What is the most commonly used measure of dispersion?
standard deviation
What does a lower value of standard deviation mean? A higher value?
lower = data points are, on average, closer to the mean (less spread) / higher = data points are, on average, farther from the mean (more spread)
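A quick illustration, using two made-up data sets with the same mean but different spread. The sample standard deviation is assumed here.

```python
import statistics

# Two hypothetical data sets with the same mean (50)
tight = [49, 50, 51]   # values close to the mean
wide = [30, 50, 70]    # values far from the mean

sd_tight = statistics.stdev(tight)
sd_wide = statistics.stdev(wide)

# Same center, but the second data set is much more spread out.
print(statistics.mean(tight), sd_tight)
print(statistics.mean(wide), sd_wide)
```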
Define the quartiles.
Q1: 25% of the data lie below Q1 (and 75% lie above Q1)
Q2: the same as the median; divides the data into two halves (50% above, 50% below)
Q3: 25% of the data lie above Q3 (and 75% lie below Q3)
Define quartiles in relation to percentages.
Q1 = 25th percentile
Q2 = 50th percentile
Q3 = 75th percentile
What is the interquartile range?
Q3 - Q1
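A short sketch of computing the quartiles and the IQR with `statistics.quantiles`. The data are invented, and note that software packages use slightly different quartile conventions, so results can differ a little between tools; this uses Python's default ("exclusive") method.

```python
import statistics

# Hypothetical data, already sorted for readability
data = [1, 2, 3, 4, 5, 6, 7]

# n=4 splits the data into four equal parts and returns the three cut points.
q1, q2, q3 = statistics.quantiles(data, n=4)

iqr = q3 - q1  # interquartile range

print(q1, q2, q3, iqr)
```

Q2 matches `statistics.median(data)`, as expected from the definition.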
When are the median and IQRs used? When are the mean and standard deviation used?
- median and IQR = when the distribution of values is not symmetric about the center (skewed) or has outliers
- mean and standard deviation = when the distribution is roughly symmetric, without outliers
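A small sketch of why this choice matters: one extreme value pulls the mean a long way but leaves the median where it was. The numbers are made up for illustration.

```python
import statistics

# Hypothetical data without and with one extreme value
symmetric = [2, 3, 4, 5, 6]
with_extreme = [2, 3, 4, 5, 60]

mean_before = statistics.mean(symmetric)
mean_after = statistics.mean(with_extreme)

median_before = statistics.median(symmetric)
median_after = statistics.median(with_extreme)

# The mean shifts toward the extreme value; the median does not move.
print(mean_before, mean_after)
print(median_before, median_after)
```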
What is an outlier?