Chapter 2: Displaying and Describing Categorical Data Flashcards
Define ‘Frequency table’.
A table that lists the categories in a categorical variable and gives the count (or percentage) of observations for each category.
Define ‘Relative frequency table’.
A table that displays the proportions or percentages, rather than the counts, of values in each category.
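As an illustration of the two table types above, here is a minimal Python sketch. The category labels and observations are invented for this example, and `freq`, `rel_freq`, and `total` are hypothetical names, not anything from the chapter.

```python
from collections import Counter

# Hypothetical observations of one categorical variable (made up for illustration).
classes = ["First", "Second", "Third", "Third", "Crew",
           "Crew", "Third", "First", "Crew", "Crew"]

# Frequency table: each category mapped to its count.
freq = Counter(classes)

# Relative frequency table: each category mapped to its proportion of all observations.
total = sum(freq.values())
rel_freq = {category: count / total for category, count in freq.items()}

# Print category, count, and percentage side by side.
for category in freq:
    print(f"{category:<8}{freq[category]:>4}{rel_freq[category]:>8.0%}")
```

The proportions in a relative frequency table should add up to 1 (or 100%), which is a quick check that no category was missed.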
Define ‘Distribution’.
The distribution of a variable gives:
- the possible values of the variable, and
- the relative frequency of each value.
Define ‘Area principle’.
In a statistical display, each data value should be represented by the same amount of area.
Define ‘Bar chart’.
A chart that shows bars whose heights represent the count (or percentage) of observations for each category of a categorical variable.
Define ‘Relative frequency chart’.
A chart that shows bars whose heights represent the percentages, instead of the counts, of observations for each category of a categorical variable.
Define ‘Pie chart’.
A chart that shows how a “whole” divides into categories by showing wedges of a circle whose areas correspond to the proportion in each category.
Define ‘Categorical data condition’.
The methods in this chapter are appropriate for displaying and describing categorical data. Be careful not to use them with quantitative data.
Define ‘Contingency table’.
A table that displays counts and, sometimes, percentages of individuals falling into named categories on two or more variables. The table categorizes the individuals on all variables at once to reveal possible patterns in one variable that may be contingent on the category of the other.
Define ‘Marginal distribution’.
In a contingency table, the distribution of either variable alone. The counts or percentages are the totals found in the margins (last row or column) of the table.
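A short sketch of how the marginal totals come out of a contingency table. The counts and labels are invented for this example, and the dictionary layout (`table`, `row_totals`, `col_totals`) is just one assumed way to store such a table.

```python
# Hypothetical contingency table: rows are one variable (group),
# columns are the other (response). Counts are made up for illustration.
table = {
    "Group A": {"Yes": 20, "No": 10},
    "Group B": {"Yes": 15, "No": 15},
    "Group C": {"Yes": 10, "No": 30},
}

# Marginal distribution of the row variable: the total of each row.
row_totals = {row: sum(cells.values()) for row, cells in table.items()}

# Marginal distribution of the column variable: the total of each column.
col_totals = {}
for cells in table.values():
    for col, count in cells.items():
        col_totals[col] = col_totals.get(col, 0) + count

# The grand total is the same whether rows or columns are summed.
grand_total = sum(row_totals.values())

print(row_totals)   # totals for the right-hand margin
print(col_totals)   # totals for the bottom margin
print(grand_total)
```

Dividing each marginal total by `grand_total` gives the marginal distribution as percentages rather than counts.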
Define ‘Joint distribution’.
The distribution of both variables in a contingency table, expressed as percentages or counts.
Define ‘Conditional distribution’.
The distribution of a variable when the WHO is restricted to only a smaller group of individuals.
Define ‘Independent’.
Variables are said to be independent if the conditional distribution of one variable is the same for each category of the other.
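The two ideas above can be sketched together: compute the conditional distribution of one variable within each category of the other, then compare them. The tables below are invented, and `conditional_distribution` and `looks_independent` are hypothetical helper names. The comparison assumes exact agreement within a small tolerance; with real data the conditional distributions are rarely identical, so this is only an illustration of the definition.

```python
def conditional_distribution(cells):
    """Proportions of the column variable within a single row (one group)."""
    total = sum(cells.values())
    return {col: count / total for col, count in cells.items()}


def looks_independent(table, tol=1e-9):
    """True if every row has the same conditional distribution (within tol)."""
    conditionals = [conditional_distribution(cells) for cells in table.values()]
    first = conditionals[0]
    return all(abs(dist[col] - first[col]) <= tol
               for dist in conditionals for col in first)


# Hypothetical table where both groups split 40% Yes / 60% No.
same_split = {
    "Group A": {"Yes": 20, "No": 30},
    "Group B": {"Yes": 40, "No": 60},
}

# Hypothetical table where the split differs by group.
different_split = {
    "Group A": {"Yes": 20, "No": 30},
    "Group B": {"Yes": 70, "No": 30},
}

print(conditional_distribution(same_split["Group A"]))
print(looks_independent(same_split))
print(looks_independent(different_split))
```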
Define ‘Segmented bar chart’.
A chart that displays the conditional distribution of a categorical variable within each category of another variable.
Define ‘Simpson’s paradox’.
A situation in which relationships among proportions taken within different groups or subsets appear to contradict relationships among the overall proportions.
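A small numeric sketch can make the paradox concrete. All counts below are invented for illustration: two hypothetical performers, A and B, each attempt "easy" and "hard" tasks, and the names `results`, `within`, and `overall` are assumptions of this example.

```python
# Hypothetical (successes, attempts) for two performers on two kinds of task.
results = {
    "A": {"easy": (9, 10), "hard": (30, 100)},
    "B": {"easy": (80, 100), "hard": (2, 10)},
}

# Success proportion within each group (task type), per performer.
within = {
    name: {group: s / n for group, (s, n) in groups.items()}
    for name, groups in results.items()
}

# Overall success proportion, pooling both groups for each performer.
overall = {
    name: sum(s for s, _ in groups.values()) / sum(n for _, n in groups.values())
    for name, groups in results.items()
}

for name in results:
    print(name, within[name], round(overall[name], 3))
```

In this made-up data, A has the higher success proportion on easy tasks and on hard tasks, yet B has the higher overall proportion, because A attempted mostly hard tasks and B mostly easy ones.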