CH 1.3: Graphical Summaries Flashcards
Define Stem-and-Leaf Plots
A plot in which each data value is split into a stem, consisting of the leftmost one or two digits, and a leaf, consisting of the next digit. It gives an informal summary of the data.
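As a rough illustration of the stem and leaf split, here is a minimal Python sketch. It assumes non-negative integers whose last digit is the leaf and whose remaining digits are the stem. The function name `stem_and_leaf` and the sample data are made up for this example.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Map each stem (all digits but the last) to its sorted list of leaves (last digit)."""
    groups = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)  # e.g. 23 -> stem 2, leaf 3
        groups[stem].append(leaf)
    return dict(groups)


data = [12, 15, 21, 23, 23, 30, 34, 38, 41]  # made-up sample
plot = stem_and_leaf(data)
for stem in sorted(plot):
    leaves = " ".join(str(leaf) for leaf in plot[stem])
    print(f"{stem} | {leaves}")
```

Each printed row holds one stem followed by its leaves, so longer rows mark the ranges where the data are concentrated.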
Define Dotplot
A plot that places a dot for each data value, showing where the values are concentrated and where there are gaps. It gives an informal summary of the data.
Define Histograms
A histogram is a table or plot that shows the frequency, relative frequency, or density of the data values over predetermined intervals (classes).
+ Frequency: The number of data values in the interval.
+ Relative Frequency: The number of data values in the interval divided by the total number of data values.
+ Density: The relative frequency divided by the interval width. (Scales each bar so that its area equals the proportion of data in the interval.)
+ Set Interval: When the data set is large, log2(n) or 2n^(1/3) can be used as a reasonable starting number of intervals, where n is the number of data values.
*Important: if the intervals used in the histogram are not of equal width, the density must be used for the plot’s y-axis.
**Also note that the relative frequency of an interval can be recovered by multiplying its density by its width.
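The three histogram quantities can be checked with a short Python sketch. The helper names `histogram_table` and `suggested_class_counts`, the sample data, and the interval edges are all hypothetical. The sketch also assumes that intervals include their left endpoint only and that the logarithm in the class-count rule is base 2.

```python
import math


def histogram_table(values, edges):
    """Return (frequency, relative frequency, density) for each interval [lo, hi)."""
    n = len(values)
    rows = []
    for lo, hi in zip(edges, edges[1:]):
        freq = sum(1 for v in values if lo <= v < hi)
        rel = freq / n             # share of all data values
        density = rel / (hi - lo)  # relative frequency per unit of width
        rows.append((freq, rel, density))
    return rows


def suggested_class_counts(n):
    """Two rough starting points for the number of intervals (base-2 log assumed)."""
    return math.log2(n), 2 * n ** (1 / 3)


data = [1, 2, 2, 3, 4, 6, 7, 9, 12, 18]  # made-up sample, n = 10
edges = [0, 5, 10, 20]                   # unequal widths: 5, 5, 10
rows = histogram_table(data, edges)
for (lo, hi), (freq, rel, density) in zip(zip(edges, edges[1:]), rows):
    print(f"[{lo}, {hi}): freq={freq}, rel={rel:.2f}, density={density:.3f}")
```

Because the last interval is twice as wide as the others, its density (0.02) is lower than its relative frequency (0.2) would suggest, which is why density belongs on the y-axis when widths differ. Multiplying each density by its width gives back the relative frequency.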
When is a histogram plot symmetric?
If its right half is a mirror image of its left half.
When is a histogram skewed? Define the type of skewed.
When the plot is not symmetric.
+ Right skewed/positively skewed is when the plot has a long tail to the right, and the mean is typically greater than the median.
+ Left skewed/negatively skewed is when the plot has a long tail to the left, and the mean is typically less than the median.
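The relationship between the mean and the median can be seen on two small made-up samples. This is only an illustrative sketch using Python's standard `statistics` module. The variable names are invented for the example.

```python
from statistics import mean, median

right_skewed = [1, 2, 2, 3, 3, 4, 20]       # one large value stretches the right tail
left_skewed = [1, 17, 18, 18, 19, 19, 20]   # one small value stretches the left tail

for name, sample in [("right", right_skewed), ("left", left_skewed)]:
    print(f"{name}: mean={mean(sample)}, median={median(sample)}")
```

In the right-skewed sample the mean (5) is pulled above the median (3), and in the left-skewed sample the mean (16) is pulled below the median (18).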
Define Unimodal
When a histogram plot has one peak/mode/local maximum.
Define Bimodal
When a histogram plot has two peaks/modes/local maximums.
Describe a boxplot
A graphic that presents the median, the first and third quartiles, and any outliers present.
Define interquartile range
The difference between the third and first quartiles. Also known as the IQR.
Define Outlier
Any data value that is more than 1.5 IQR above the third quartile or more than 1.5 IQR below the first quartile. Outliers are plotted individually in a boxplot.
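The IQR and the 1.5 IQR outlier rule can be sketched in Python as follows. Quartile conventions differ between textbooks and software. This sketch assumes the default method of `statistics.quantiles`, which may give slightly different quartiles than a hand calculation. The function name `find_outliers` and the data are made up.

```python
from statistics import quantiles


def find_outliers(values):
    """Return (q1, q3, iqr, outliers) using the 1.5 * IQR rule."""
    q1, _, q3 = quantiles(values, n=4)  # quartile cut points; convention is an assumption
    iqr = q3 - q1
    lower = q1 - 1.5 * iqr              # values below this are outliers
    upper = q3 + 1.5 * iqr              # values above this are outliers
    outliers = [v for v in values if v < lower or v > upper]
    return q1, q3, iqr, outliers


data = [2, 4, 5, 7, 8, 9, 11, 12, 14, 15, 40]  # made-up sample
q1, q3, iqr, outliers = find_outliers(data)
print(f"Q1={q1}, Q3={q3}, IQR={iqr}, outliers={outliers}")
```

Here Q1 is 5 and Q3 is 14, so the IQR is 9 and the cutoffs are -8.5 and 27.5. Only the value 40 falls outside them.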
Define Bivariate
Data where each item is a pair of values.
Define Multivariate Data
Data for which each item consists of more than one value.