Module 2 Flashcards

1
Q

What is a variable?

A

Definition: A characteristic that varies across observations.
Types: Categorical (qualitative) & Numeric (quantitative).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Types of numeric data

A

Discrete: Countable values (e.g., number of children).
Continuous: Uncountable values within an interval (e.g., weight).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Summarizing categorical data

A

Group data into categories.
Record frequency.
Calculate relative frequency.
Convert to percentages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Visualizing categorical data

A

Bar Chart: Bars proportional to frequency.
Pie Chart: Segments proportional to relative frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Histogram

A

No gaps between bars, height reflects frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Types of variables

A

Categorical Variables:
- Represent labels or categories.
- Examples: Marital status, grade in a course, eye color.

Numeric Variables:
- Represent meaningful numbers.
- Subtypes: Discrete & Continuous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Bar charts

A

Definition: Graphical display of frequencies for categorical data.
Key Points:
Bars can be vertical or horizontal.
Bar length is proportional to frequency.
Bars must have uniform width.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Pie chart

A

Definition: Circular chart divided into segments.
Key Points:
Each segment represents relative frequency.
Good for showing proportions visually.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Frequency distribution

A

Definition: Groups numeric data into intervals.
Steps:
Define intervals.
Count the number of observations in each interval.
Summarize as a table or chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Relative Frequency

A

Definition: Proportion of observations in a category or interval.
Formula:

RelativeFrequency=
FrequencyinInterval/TotalFrequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Cumulative Frequency

A

Definition: Total number of observations below a specific interval.
Use Case: Tracks data accumulation across intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Cumulative relative frequency

A

Definition: Proportion of observations below a specific interval.
Key Points: Helps visualize cumulative trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Histogram

A

Definition: Graphical representation of numeric data using bars.
Key Characteristics:
No gaps between bars.
Bars represent frequencies for numeric intervals.
Shows shape (symmetric or skewed).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Rules for creating graphs

A

Use the simplest graph possible.
Clearly label axes with scales.
Bars (in bar charts) must be the same width.
Avoid exaggerated vertical axis limits.
Do not stretch or compress the vertical axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Skewed distribution

A

Positive Skew: Right tail is elongated.
Negative Skew: Left tail is elongated.
Symmetric: Mirror image around the center.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Contingency table

A

Definition: Table summarizing frequencies of two categorical variables.
Use Case: Displays combinations of categories (e.g., Myers-Briggs type by gender).

17
Q

Stacked column chart

A

Purpose: Visualize relationships between two categorical variables.
Example: Compare proportions within categories (e.g., Myers-Briggs by gender).

18
Q

Interval width

A

Definition: Range of values within a group for numeric data.
Interval width = max - min/number of intervals

19
Q

Shape of distribution

A

Symmetric: Evenly distributed around the center.
Skewed: Data leans left (negative) or right (positive).
Use Case: Histogram visualization.