2.1-2.3 Flashcards
Define class
A class is one of the categories into which qualitative data can be classified.
Define class frequency
The class frequency is the number of observations in the data set that fall into a particular class.
Define class relative frequency and give the formula.
The class relative frequency is the class frequency divided by the total number of observations in the data set; that is,
class relative frequency = class frequency/n
Define class percentage and give the formula.
The class percentage is the class relative frequency multiplied by 100; that is,
Class percentage = (class relative frequency)*100
Describe a bar graph.
The categories (classes) of the qualitative variable are represented by bars, where the height of each bar is either the class frequency, class relative frequency, or class percentage.
Describe a pie chart.
The categories (classes) of the qualitative variable are represented by slices of a pie (circle). The size of each slice is proportional to the class relative frequency.
Describe a Pareto diagram.
A bar graph with the categories (classes) of the qualitative variable (i.e., the bars) arranged by height in descending order from left to right.
What are three graphical methods for describing quantitative data?
dot plots, stem-and-leaf displays, and histograms
What is a benefit of a stem and leaf and a dot plot display as compared to a histogram?
Histograms do not let us identify individual measurement. While they are visible to some extent in a dot plot and clearly visible in a stem and leaf display. Since STALDs arranges the data in ascending order, it is easy to locate individual measurements.
Describe a dot plot.
The numerical value of each quantitative measurement in the data set is represented by a dot on a horizontal scale. When data values repeat, the dots are placed above one another vertically.
Describe a stem and leaf display.
The numerical value of the quantitative variable is partitioned into a “stem” and a “leaf.” The possible stems are listed in order in a column. The leaf for each quantitative measurement in the data set is placed in the corresponding stem row. Leaves for observations with the same stem value are listed in increasing order horizontally.
Describe a histogram.
The possible numerical values of the quantitative variable are partitioned into class intervals, each of which has the same width. These intervals form the scale of the horizontal axis. The frequency or relative frequency of observations in each class interval is determined. A vertical bar is placed over each class interval, with the height of the bar equal to either the class frequency or class relative frequency.
What is the central tendency of a set of measurements?
the tendency of the data to cluster, or center, about certain numerical values.
What is the variability of a set of measurements?
, the spread of the data
What is the most popular and best understood measure of central tendency for quantitative data sets?
The most popular and best understood measure of central tendency for a quantitative data set is the arithmetic mean.
Define the mean.
The mean of a set of quantitative data is the sum of the measurements, divided by the number of measurements contained in the data set.
What are the symbols for sample mean and population mean>
x = sample mean
mu = population mean
What factors influences how accurately x estimates mu?
- Size of the sample (larger samples = more accurate)
- Variability/spread of data (more variable = less accurate)
What is the median?
The median of a quantitative data set is the middle number when the measurements are arranged in ascending (or descending) order.
How do you calculate the median of a sample?
Arrange the n measurements from the smallest to the largest.
If n is odd, M is the middle number.
If n is even, M is the mean of the middle two numbers.
What are the symbols for sample and population mean?
M = sample n = population
Between median and mean, which is more sensitive to extremely large or small measurements?
Mean.
What does it mean if a data set is skewed?
A data set is said to be skewed if one tail of the distribution has more extreme observations than the other tail.
What is the mode?
The mode is the measurement that occurs most frequently in the data set.
What type of data is the mode particularly useful for?
Qualitative
What is the modal class?
The measurement class containing the largest relative frequency is called the modal class.
What does the choice of measurement of central tendency depend on?
The choice of which measure of central tendency to use will depend on the properties of the data set analyzed and the application of interest. Consequently, it is vital that you understand how the mean, median, and mode are computed.