Module 2 Flashcards
What is a variable?
Definition: A characteristic that varies across observations.
Types: Categorical (qualitative) & Numeric (quantitative).
Types of numeric data
Discrete: Countable values (e.g., number of children).
Continuous: Uncountable values within an interval (e.g., weight).
Summarizing categorical data
Group data into categories.
Record frequency.
Calculate relative frequency.
Convert to percentages.
Visualizing categorical data
Bar Chart: Bars proportional to frequency.
Pie Chart: Segments proportional to relative frequency.
Histogram
No gaps between bars, height reflects frequency.
Types of variables
Categorical Variables:
- Represent labels or categories.
- Examples: Marital status, grade in a course, eye color.
Numeric Variables:
- Represent meaningful numbers.
- Subtypes: Discrete & Continuous.
Bar charts
Definition: Graphical display of frequencies for categorical data.
Key Points:
Bars can be vertical or horizontal.
Bar length is proportional to frequency.
Bars must have uniform width.
Pie chart
Definition: Circular chart divided into segments.
Key Points:
Each segment represents relative frequency.
Good for showing proportions visually.
Frequency distribution
Definition: Groups numeric data into intervals.
Steps:
Define intervals.
Count the number of observations in each interval.
Summarize as a table or chart.
Relative Frequency
Definition: Proportion of observations in a category or interval.
Formula:
RelativeFrequency=
FrequencyinInterval/TotalFrequency
Cumulative Frequency
Definition: Total number of observations below a specific interval.
Use Case: Tracks data accumulation across intervals.
Cumulative relative frequency
Definition: Proportion of observations below a specific interval.
Key Points: Helps visualize cumulative trends.
Histogram
Definition: Graphical representation of numeric data using bars.
Key Characteristics:
No gaps between bars.
Bars represent frequencies for numeric intervals.
Shows shape (symmetric or skewed).
Rules for creating graphs
Use the simplest graph possible.
Clearly label axes with scales.
Bars (in bar charts) must be the same width.
Avoid exaggerated vertical axis limits.
Do not stretch or compress the vertical axis.
Skewed distribution
Positive Skew: Right tail is elongated.
Negative Skew: Left tail is elongated.
Symmetric: Mirror image around the center.