Chapter 2 Flashcards
What can we do with qualitative data?
- frequency distributions
- Relative frequency
- Percentages
- Graphs
Frequency
Count of how often each category occurs
Relative frequency
Frequency/sum of all frequencies
Percentage
Relative frequency x 100%
Bar graph
A graph made of bars whose heights represent the frequencies of respective categories.
Pareto chart
A bar graph with bars arranged by their heights in descending order.
Pie chart
A circle divided into portions that represent the relative frequencys or percentages belonging to different categories.
3 ways to graph qualitative data
l. Pie chart
2. Bar graph
3. Pareto chart
3 major decisions when constructing a frequency table
- Number of classes (how many groups)
- class width ((largest value - smallest value) / number of classes)
- Lower limit of the first class (must be less than or equal to our numerical data)
Histogram
A graph in which classes are marked on the horizontal axis and the frequencies, or percentages are marked on the vertical axis. Drawn with no spaces.
Polygon
A graph formed by joining the midpoints of the tops of successive bars in a histogram w/ straight lines.
Frequency distribution curve
For large data sets, a polygon can eventually become a smooth curve.
Less-than class method for histograms
Avoids the gap between classes.
Slightly shifts the upper-limit, so they are just “less than” the next lower limit.
Useful for data sets that contain fractional values (continuous data types).
Single-valued classes
When your data only consists of a few distinct values.
Treats your data as qualitative instead of quantitative.
Types of graphs for quantitative data
- Histogram
- Polygon
- Frequency curve
Cumulative frequency
Gives the total number of values that fall below the upper boundary of each class.
Shapes of histograms
- Symmetric (mirror image)
- Skewed (right or left)
- Uniform
Truncating axes
Graphs can distort the data (intentionally or unintentionally) to make things seem more extreme.
Stem and leaf displays
Displays quantitative data values divided into two portions.
1. The leaf - second digit
2. The stem - first digit
The leaves are typically reordered to sort them.
Grouped stem-and-leaf displays
Useful when you want less stems and more leaves.
Can separate the digits in certain classes with a “*”.
Splitting stem + leaf displays
A stem can be used multiples times to break the leaves into groups.
Useful when you want more stems and less leaves.
See more detail, distributions and peaks.
Dotplots
Horizontal axis consists of values of variables.
Each dot represents a data point.
Similar to a bar graph, but height is shown using dots.
Can be useful for detecting outliers/ extreme values.