Formulas to remember Flashcards
Formula for the width of a class in a histogram
(Largest value - Smallest Value) / (Number of Classes)
What is the reccomended amount of classes for a histogram?
2^k > n
To determine if our choices are valid, we can also follow the rule of 2 to the k larger than n.
The first value of k that brings it above n is the number of classes we should choose
It is reccomended that it be between 5 and 20
What is the difference between Histograms and Bar Charts?
The histogram is specifically for class intervals and numbers that we can add together to create cumulative frequency distributions. These are specific to quantitative data. Earlier, we depicted sleep hours in a histogram.
In contrast, bar charts deal with categories, classes, and labels, such as nominal and ordinal data. Earlier, we depicted student motivation for a major as a bar chart.
Histogram
A histogram displays:
Class intervals
Class frequencies
Histograms are appropriate for quantitative data.
Bar Chart
A bar chart displays:
Frequencies for a set of categories or classes
Bar charts are better for qualitative data.
Ogive
Definition: Graphical representation showing cumulative frequency through class midpoints.
Calculation: Plot cumulative frequency against class midpoints.
Cumulative Nature: Frequency increases cumulatively.
Endpoint: Typically stops at 100% frequency.
Purpose: Alternative data visualization.
Example: First class interval 25-31, midpoint 28, plotted on ogive.
Application: Analyzing cumulative trends in data.
Stem and Leaf Plot
Definition: Technique for visualizing data distribution by showing stems (leading digits) and leaves (trailing digits).
Purpose: Provides a snapshot of data distribution, aiding in quick understanding.
Procedure:
Choosing Stem: Select stem (leading digit) based on data range.
Adding Leaves: For each stem, list leaves (trailing digits) in ascending order.
Rounding Data: Sometimes necessary to round data to fit display, particularly when using larger leaf units.
Application: Used in descriptive statistics to illustrate data distribution effectively.
Calculating the Median
Calculating the Median
Step 1: Sort Values
Arrange values in ascending or descending order.
Step 2: Verify Odd/Even
Determine if the dataset has an odd or even number of values.
Step 3: Split Dataset
Divide the dataset into two equal halves.
Step 4: Round Up
Round the split position to the nearest integer.
Conclusion: The value at the rounded-up position represents the median.