Chapter 3: Displaying and Describing Categorical Data Flashcards

1
Q

Define ‘Frequency table’.

A

A table that lists the categories in a categorical variable and gives the count (or percentage) of observations for each category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define ‘Relative frequency table’.

A

A table that displays the proportions or percentages, rather than the counts, of values in each category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define ‘Distribution’.

A

The distribution of a variable gives:

  1. the possible values of the variable, and
  2. the relative frequency of each value.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define ‘Area principle’.

A

In a statistical display, each data value should be represented by the same amount of area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define ‘Bar chart’.

A

A chart that shows bars whose heights represent the count (or percentage) of observations for each category of a categorical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define ‘Relative frequency chart’.

A

A chart that shows bars whose heights represent the percentages, instead of the counts, of observations for each category of a categorical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define ‘Pie chart’.

A

A chart that shows how a “whole” divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define ‘Categorical data condition’.

A

The methods in this chapter (chapter 2) are appropriate for displaying and describing categorical data. Be careful not to use them with quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define ‘Contingency table’.

A

A table that displays counts and, sometimes, percentages of individuals falling into named categories on two or more variables. The table categorizes the individuals on all variables at once to reveal possible patterns in one variable that may be contingent on the category of the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define ‘Marginal distribution’.

A

In a contingency table, the distribution of either variable alone. The counts or percentages are the totals found in the margins (last row or column) of the table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define ‘Joint distribution’.

A

The distribution of both variables in a contingency table, expressed as a percentage or a count.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define ‘Conditional distribution’.

A

The distribution of a variable restricting the WHO to consider only a smaller group of individuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define ‘Independent’.

A

Variables are said to be independent of the conditional distribution of one variable is the same for each category of the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define ‘Segmented bar chart’.

A

A chart that displays the conditional distribution of a categorical variable within each category of another variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define ‘Simpson’s paradox’.

A

Relationships among proportions taken within different groups or subsets can appear to contradict relationships among the overall proportions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can we summarize categorical data?

A
  1. Make and interpret a frequency table for a categorical variable. We can summarize categorical data by counting the number of cases in each category, sometimes expressing the resulting distribution as percentages.
  2. Make and interpret a bar chart or a pie chart (using area principle).
  3. Make and interpret a contingency table. When we want to see how two categorical variables are related, we put the counts (and/or %) in a two-way table.
  4. Make and interpret bar charts and pie charts of marginal distributions. Along with marginal distribution, we also look at conditional distribution of a variable within each category of the other variable. This tells us the association between them and if there is dependence.