Module 4: Visualizing Data Flashcards

1
Q

Contingency table

A

-shows frequency or proportion of sampling units in each level of a categorical variable
-the frequency is simply the number of sampling units that falls in each level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Contingency table proportions

A

-help with visualizing the relative distribution of sampling units among the levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

One-way and two-way categorical data

A

-just refer to the number of categorical variables you observe for each sampling unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Marginal distributions

A

-good way to see overall patterns in data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Row marginal distribution

A

-shows the total counts for each row across columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Column marginal distributions

A

-shows the total counts for each column across all rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Column marginal distributions

A

-shows the total counts for each column across all rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

To calculate marginal distributions as proportions

A

-sum all frequencies in the table
-row: sum frequencies across all columns for each row and divide by table total
-column: sum frequencies across all rows for each column and divide by table total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Conditional distributions

A

-shows the interaction between categorical variables (shown as a separate table)
-to create one you first have to select one of the variables to be the primary variable to be the primary and the other to be the secondary (conditional) variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Seeing the pattern

A

-conditional distributions allow us to see how the secondary variable changes across the primary variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bar graphs

A

-can be used to visualize categorical data
-most relevant information should be on the horizontal axis
-two variable bar graphs: first designate one variable as the grouping variable and the create it as a grouped bar chart or stacked bar chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Histograms

A

-divide numerical data into a number of bins of equal size
-count how many sampling units fit within each bin (frequency)
-create a plot where each bin has a bar with a height equal to the frequency of that bin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Box plots

A

-box: drawn from 1st quartile to 3rd quartile
-solid line: drawn at the median value
-extreme threshold: 1.5 x IQR above and below box
-whiskers: drawn from edge of box plot to last data point within threshold
-extreme values: symbols drawn overtop any data points outside the extreme threshold

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Scatter plot

A

-used to visualize the relationship between 2 numerical variables, each point on scatter plot is a sampling unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Line plot

A

-used when you have data on 2 numerical variables, and where the researcher has taken repeated measures from the same sampling unit (data points are not independent of eachother)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly