Module 4 Flashcards

1
Q

What is a contingency table?

A

A table of the frequencies or proportions of sampling units within the different levels of a categorical variable.
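
As a rough illustration, a two-way contingency table can be built by counting sampling units in each combination of levels. This is a minimal sketch using only the Python standard library; the habitat and species data and all variable names are hypothetical and not part of the course material.

```python
from collections import Counter

# Hypothetical observations: one (habitat, species) pair per sampling unit.
observations = [
    ("forest", "robin"), ("forest", "robin"), ("forest", "sparrow"),
    ("meadow", "robin"), ("meadow", "sparrow"), ("meadow", "sparrow"),
    ("meadow", "sparrow"),
]

# Count how many sampling units fall in each combination of levels.
counts = Counter(observations)

rows = sorted({habitat for habitat, _ in observations})
cols = sorted({species for _, species in observations})

# Contingency table as a nested dict: table[row][column] = frequency.
table = {r: {c: counts[(r, c)] for c in cols} for r in rows}

# The same table expressed as proportions of all sampling units.
total = len(observations)
proportions = {r: {c: table[r][c] / total for c in cols} for r in rows}

for r in rows:
    print(r, table[r])
```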

2
Q

What are one-way and two-way contingency tables?

A
  • they just refer to the number of categorical variables you observe for each sampling unit (one variable for a one-way table, two for a two-way table)
3
Q

What are marginal distributions?

A
  • one way to see overall patterns in the data
  • calculate row and column frequencies
  • the row and column sums of a two-way contingency table. They can be shown as frequencies or proportions.
4
Q

How do you find marginal distributions for rows vs. columns?

A

rows: sum frequencies across all columns for each row

columns: sum frequencies across all rows for each column
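
The row and column sums can be sketched in a few lines of Python. The table below is a made-up example (hypothetical habitats and species), stored as a nested dict, and is only meant to show the arithmetic.

```python
# Hypothetical two-way contingency table of frequencies:
# rows are habitats, columns are species.
table = {
    "forest": {"robin": 2, "sparrow": 1},
    "meadow": {"robin": 1, "sparrow": 3},
}

# Row marginals: sum across all columns for each row.
row_totals = {row: sum(cells.values()) for row, cells in table.items()}

# Column marginals: sum across all rows for each column.
columns = list(next(iter(table.values())).keys())
col_totals = {col: sum(table[row][col] for row in table) for col in columns}

# Marginals can also be shown as proportions of the grand total.
grand_total = sum(row_totals.values())
row_props = {row: n / grand_total for row, n in row_totals.items()}

print(row_totals)  # {'forest': 3, 'meadow': 4}
print(col_totals)  # {'robin': 3, 'sparrow': 4}
```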

5
Q

What are conditional distributions?

A

two-way tables that show the proportion of sampling units for one variable within each level of the second variable. They show the interaction between the categorical variables and are presented as a separate table.

6
Q

How do you create a conditional distribution?

A

select one of the categorical variables to be the primary variable and the other one to be the secondary (conditional) variable

7
Q

How are conditional distributions calculated?

A

calculated as the frequency from the contingency table divided by the marginal frequency of the primary variable

  • identify the primary and secondary variables
  • for each cell in the new table, divide the value from the contingency table by the marginal frequency of the corresponding level of the primary variable
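
The calculation can be sketched as follows. The table is a hypothetical example in which habitat (the rows) is taken as the primary variable and species (the columns) as the secondary variable; the names are illustrative only.

```python
# Hypothetical two-way contingency table of frequencies.
table = {
    "forest": {"robin": 2, "sparrow": 1},
    "meadow": {"robin": 1, "sparrow": 3},
}

# Primary variable: habitat (rows). Secondary variable: species (columns).
# Marginal frequency of each level of the primary variable.
primary_totals = {row: sum(cells.values()) for row, cells in table.items()}

# Divide each cell by the marginal total of its primary level.
conditional = {
    row: {col: n / primary_totals[row] for col, n in cells.items()}
    for row, cells in table.items()
}

for row, cells in conditional.items():
    print(row, {col: round(p, 2) for col, p in cells.items()})
# forest {'robin': 0.67, 'sparrow': 0.33}
# meadow {'robin': 0.25, 'sparrow': 0.75}
```

Each row of the result sums to 1, which is a quick check that the division used the right marginal.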
8
Q

What do conditional distributions show us about the variables?

A

allow us to see how the secondary variable changes across the levels of the primary variable

9
Q

What is a bar graph?

A

used to visualize categorical data

vertical or horizontal orientation

10
Q

What are two-variable bar graphs?

A
  • can be used to display data with two categorical measurement variables
  • designate one variable as the grouping variable (it forms the base of the figure, and the levels of the other variable are shown within each level)
  • next step: do we create it as a grouped bar chart or a stacked bar chart?
11
Q

What type of variable is good for a grouping variable?

A

ordinal categorical variables

12
Q

What are grouped bar charts?

A
  • the levels of the second variable are shown beside each other within each level of the grouping variable
  • levels of the grouping variable are separated by a large gap
13
Q

What is a stacked bar chart?

A
  • levels of the second variable are stacked on top of one another within each level of the grouping variable
  • just one bar for each level of the grouping variable (color is used to separate the levels of the second variable)
14
Q

What are histograms?

A
  • visualize numerical data
  • split numerical data into bins of equal size and display the number of sampling units in each bin
15
Q

What are the three steps for creating a histogram?

A
  • divide the numerical variable into a number of bins of equal size
  • count how many sampling units fit within each bin (frequency)
  • create a plot where each bin has a bar with a height equal to the frequency of that bin, making sure there are no gaps between the bars
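
The first two steps (binning and counting) can be sketched in plain Python, with a text printout standing in for the plot. The measurements, the range of 0 to 10, and the choice of five bins are all hypothetical.

```python
# Hypothetical measurements of a numerical variable.
values = [1.2, 2.5, 2.7, 3.1, 4.8, 5.0, 5.5, 6.9, 7.3, 9.9]

# Step 1: divide the variable's range into bins of equal size.
lo, hi = 0.0, 10.0   # assumed range covered by the bins
n_bins = 5
width = (hi - lo) / n_bins

# Step 2: count how many sampling units fall within each bin.
counts = [0] * n_bins
for v in values:
    # A value equal to the upper limit is placed in the last bin.
    idx = min(int((v - lo) / width), n_bins - 1)
    counts[idx] += 1

# Step 3 (text stand-in for the plot): one bar per bin, height = frequency.
for i, n in enumerate(counts):
    left = lo + i * width
    right = left + width
    print(f"[{left:.0f}, {right:.0f}) " + "#" * n)
```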
16
Q

Advantages of histograms?

A
  • good way to visualize the pattern of relative abundance of your sampling units along the numerical variable
17
Q

Disadvantage of histogram?

A

complicated to display histograms when the dataset has many levels of a categorical variable

18
Q

What is a bin?

A

a small range of the numerical variable. The numerical variable is divided into a number of bins of equal size forming the base of the figure.

19
Q

What are box plots?

A
  • visualize numerical data
  • based on quartiles; popular because they show five descriptive statistics in a relatively compact design
20
Q

What do boxplots show?

A
  • 1st quartile
  • 3rd quartile
  • minimum
  • median
  • maximum
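
These five statistics can be computed with Python's standard library. This is only a sketch on made-up data; quartile conventions differ between textbooks and software, and the "inclusive" method used here is an assumption rather than the course's stated definition.

```python
import statistics

# Hypothetical measurements of a numerical variable.
data = [2, 4, 4, 5, 7, 8, 9, 10, 12]

# n=4 splits the data into quartiles and returns the three cut points.
# method="inclusive" is one of several common quartile conventions.
q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")

minimum = min(data)
maximum = max(data)

print("minimum:", minimum)
print("1st quartile:", q1)
print("median:", median)
print("3rd quartile:", q3)
print("maximum:", maximum)
```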
21
Q

What happens with grouped box plots for categorical groups?

A
  • designate one categorical variable as the grouping variable
  • the second as the secondary variable
  • grouping variable forms the base of the figure
  • the levels of the secondary variable are shown within each level of the grouping variable
22
Q

When should we use histograms vs. box plots?

A

if you have numerical data for a small number of categorical groups and want to showcase the shape of the data distribution, then histograms are the better choice.

if you have many categorical groups, or are not interested in showcasing the shape of the data distribution, then use a box plot.

23
Q

What is a scatter plot?

A
  • used to visualize the relationship between two numerical variables
  • each point on the scatterplot is a sampling unit
  • both numerical variables are measured from the same sampling unit
24
Q

What is the independent vs dependent variable?

A

independent: the experimental treatment that is manipulated

dependent: the measured response under those treatments

25
Q

What are line plots?

A

used when you have data on two numerical variables, and where the researcher has taken repeated measures from the same sampling unit

the repeated measurements for each sampling unit are connected together by a line so that the viewer knows the data points are not independent of each other