Intro to sraristics Flashcards
What are the two categories used to classify data?
Numerical and categorical
What are the two types of numerical variables?
Continuous and discrete
What are the two types of categorical variables?
Ordinal and nominal
Describe continuous variables.
When a continuum of values is possible. For example,height (m). E.g. 1.87m, 1.58m, 1.77m.
Describe discrete variables.
When only discrete values can be used (a whole number). For example, Number of people. E.g. 0, 1, 2.
Describe ordinal variables.
Categories that have an order. For example, size. E.g. small, medium, large.
Describe nominal variables.
Categories that have no order. For example, eye color. E.g. brown, blue, hazel.
What graph is most suitable to represent nominal data?
A Pareto chart.
What graph is most suitable to represent ordinal or discrete data?
A bar chart.
What graph is most suitable to represent continuous data.
A histogram or bar chart.
What are five ways the shape of the distribution of a histogram described?
- Symmetrical or bell shaped (uni-modal (one peak))
- Skewed to the left (left side is the tail)
- Skewed to the right (right side is the tail)
- Symmetrical and bi-modal (two peaks)
- Symmetrical and uniform (flat)
What are the three numerical summaries for center or location?
Mode, median and mean.
What are the three numerical summaries for spread?
Range, inter-quartile range (IQR) and standard deviation.
What is the mode?
The value that occurs the most.
What is the median?
The middle value located after the values are arranged from highest to lowest. Defined for ordinal,discrete and continuous data. If there are an even number of variables there can be two values for the median (M).