Brief Review of Intro to Statistics (Module 14) Flashcards
What type of data analysis would be appropriate for 1 continuous explanatory variable (x) and 1 continuous response variable (y)?
simple linear regression
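Worked sketch: the cards do not say how the line is fitted, so this assumes ordinary least squares, the usual choice. The function name and the data are hypothetical and chosen only for illustration.

```python
# Minimal least-squares fit of y = intercept + slope * x.
# Assumption: ordinary least squares; the data below are made up.
def fit_simple_linear_regression(xs, ys):
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

# Hypothetical data lying exactly on y = 1 + 2x.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.0, 5.0, 7.0, 9.0, 11.0]
intercept, slope = fit_simple_linear_regression(xs, ys)
print(intercept, slope)
```

Because the made-up points sit exactly on a line, the fit recovers an intercept of 1 and a slope of 2.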
What type of data analysis would be appropriate for multiple continuous explanatory variables (x1, x2, … x[n]) and 1 continuous response variable (y)?
multiple linear regression
What 2 types of data analyses would be appropriate for 1 categorical variable (x) and 1 continuous response variable (y)?
(1) T-test
(2) one-way ANOVA
Under what circumstance would we run a T-test rather than a one-way ANOVA?
If the categorical explanatory variable is binary, such as for sex (‘male’, ‘female’) or the main belligerents in the Wars of the Roses (‘House of York’, ‘House of Lancaster’), we perform a T-test.
Under what circumstance would we run a one-way ANOVA rather than a T-test?
If the categorical explanatory variable has more than two possibilities, such as regions of Italy (‘Tuscany’, ‘Campania’, ‘Sicily’, etc.), or different types of fruit people commonly pack for lunch (‘Oranges’, ‘Bananas’, ‘Apples’, etc.), we perform a one-way ANOVA.
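Worked sketch: one way to see how the two tests relate is that, with exactly two groups, the one-way ANOVA F statistic equals the square of the t statistic. This assumes the equal-variance (pooled, Student's) form of the T-test, which the cards do not specify. The function names and data are hypothetical.

```python
import math

def one_way_anova_f(groups):
    """F statistic: between-group mean square over within-group mean square."""
    all_values = [v for g in groups for v in g]
    n_total = len(all_values)
    k = len(groups)
    grand_mean = sum(all_values) / n_total
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))

def pooled_t(a, b):
    """Two-sample t statistic, assuming equal variances (pooled estimate)."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    ss_a = sum((v - mean_a) ** 2 for v in a)
    ss_b = sum((v - mean_b) ** 2 for v in b)
    pooled_var = (ss_a + ss_b) / (len(a) + len(b) - 2)
    standard_error = math.sqrt(pooled_var * (1 / len(a) + 1 / len(b)))
    return (mean_a - mean_b) / standard_error

# Made-up measurements for a binary explanatory variable (two groups).
group_a = [5.1, 4.9, 5.6, 5.8]
group_b = [6.2, 6.8, 6.1, 7.0]
t_stat = pooled_t(group_a, group_b)
f_stat = one_way_anova_f([group_a, group_b])
print(t_stat ** 2, f_stat)  # the two values agree up to rounding

# With a third made-up group, only the ANOVA applies.
group_c = [7.9, 8.3, 7.6, 8.1]
f_three = one_way_anova_f([group_a, group_b, group_c])
print(f_three)
```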
What is the difference between a one-way ANOVA and a two-way ANOVA?
A one-way ANOVA will have only one explanatory variable (x), while a two-way ANOVA will have two explanatory variables (x1, x2).
What is an example of a question we might use a two-way ANOVA to answer that concerns people’s preferred fruit (x1), their average annual mileage (y), and the U.S. state in which they live (x2)?
Is the average annual mileage that people drive influenced by their favorite fruit and does that depend on the U.S. state where they live?
What does the phrase “n-way ANOVA” mean?
ANOVAs can be performed with as many explanatory variables as one wants - “n” represents the number in the analysis, whether that is two or three or ten, etc.
What type of data analysis would be appropriate for 1 categorical explanatory variable (x) and 1 categorical response variable (y)?
Analysis of Contingency Tables
In an analysis of contingency tables, if the explanatory and the response variables are both binary, what term do we use to describe the test we perform?
Analysis of a Two-Way (2-by-2) Contingency Table
What is it called if one or both of the explanatory or response variables in an Analysis of Contingency Tables has more than two possible entries?
Analysis of an R-by-C (Row-by-Column) Contingency Table
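Worked sketch: the cards do not name a specific test for contingency tables. A common choice, assumed here, is Pearson's chi-square statistic, which compares each observed count with the count expected if the two variables were independent. The function name and the tables are hypothetical.

```python
def chi_square_statistic(table):
    """Pearson chi-square statistic and degrees of freedom for an R-by-C table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Made-up 2-by-2 table: rows are x categories, columns are y categories.
associated = [[10, 20], [20, 10]]
stat_assoc, dof_assoc = chi_square_statistic(associated)

# Made-up table whose rows are exactly proportional (no association).
independent = [[10, 20], [20, 40]]
stat_indep, dof_indep = chi_square_statistic(independent)
print(stat_assoc, dof_assoc, stat_indep, dof_indep)
```

The same function handles larger R-by-C tables; only the degrees of freedom, (R - 1) times (C - 1), change.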
What type of analysis would we perform if we have 1 continuous explanatory variable (x) and 1 categorical response variable (y)?
simple logistic regression
What type of analysis would we perform if we have more than 1 continuous explanatory variable (x1, x2, … x[n]) and 1 categorical response variable (y)?
multiple logistic regression
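Worked sketch: a simple logistic regression models the probability of a binary outcome as a logistic (sigmoid) function of intercept + slope * x. The fit below uses plain gradient descent on the log-loss, which is an assumption made for brevity; statistical software typically uses other fitting routines. The function names, learning rate, step count, and data are all hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_simple_logistic_regression(xs, ys, learning_rate=0.1, steps=5000):
    """Fit P(y = 1 | x) = sigmoid(intercept + slope * x) by gradient descent."""
    intercept, slope = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        grad_intercept = 0.0
        grad_slope = 0.0
        for x, y in zip(xs, ys):
            # Gradient of the log-loss: (predicted probability - outcome).
            error = sigmoid(intercept + slope * x) - y
            grad_intercept += error
            grad_slope += error * x
        intercept -= learning_rate * grad_intercept / n
        slope -= learning_rate * grad_slope / n
    return intercept, slope

# Made-up data: x is continuous, y is a binary category coded 0 or 1.
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1]
intercept, slope = fit_simple_logistic_regression(xs, ys)
p_low = sigmoid(intercept + slope * 0.5)
p_high = sigmoid(intercept + slope * 4.0)
print(slope, p_low, p_high)
```

A multiple logistic regression follows the same pattern, with one slope per explanatory variable.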
T-tests, ANOVA, linear regressions and logistic regressions are all part of what family of mathematical concepts?
linear models
What is the term and a description for the first assumption of linear models?
LINEARITY - stipulates that there is a linear relationship between the explanatory variable (x) and the response variable (y)
What is the term and a description for the second assumption of linear models?
NORMALITY - for any given value of the explanatory variable (x), the values of the response variable (y) have normally distributed errors
What is the term and a description for the third assumption of linear models?
HOMOGENEITY OF VARIANCE - the variance in the response variable (y) is constant across the range of values of the explanatory variable (x)
What is the term and a description for the fourth assumption of linear models?
INDEPENDENCE - for any given value of the explanatory variable (x), the values from the response variable (y) have independent errors
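Worked sketch: the normality, homogeneity of variance, and independence assumptions all concern the errors, which in practice are examined through the residuals (observed y minus fitted y). The snippet below fits a least-squares line and compares the residual spread in the lower and upper halves of x. This is only a rough, informal look, not a formal test. The function names and data are hypothetical.

```python
def residuals_from_line(xs, ys):
    """Residuals (observed minus fitted) from a least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]

def sample_variance(values):
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

# Made-up data, already sorted by x so the halves split low x from high x.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1]
residuals = residuals_from_line(xs, ys)

half = len(xs) // 2
spread_low = sample_variance(residuals[:half])
spread_high = sample_variance(residuals[half:])
print(spread_low, spread_high)  # similar values are consistent with constant variance
```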
FILL IN THE BLANKS: For linear models, a decent method of (1)______________ and a strong (2)____________________ will make it easier to analyze the data than (3)______________ or (4)_______________ it after the fact to better fit the (5)___________________.
(1) sampling
(2) experimental design
(3) transforming
(4) sub-setting
(5) assumptions of linear models
What kind of data is able to be analyzed in multiple different ways and may reveal answers to multiple different questions?
data collected in accordance with a sound experimental design