Lectures 1-4: Datasets, Variables, Distributions, Estimation, Quantitative Methods Flashcards
What is a variable?
A variable represents a characteristic for each case within a dataset, which can be described using more than one value – e.g. a variable might record incomes or unemployment rates.
What is a nominal (a.k.a. categorical) variable?
A variable with distinct categories that does not tell you anything about the relationship between them, and cannot be ranked in terms of value or order – e.g. birthplace, religion.
Are nominal variables and categorical variables the same thing?
Yes.
What is an ordinal variable?
A variable with categories than can be ordered or ranked according to some sort of criterion (but where you cannot specify the precise size of the interval between any two categories) – e.g. a ranking of low-, semi- or high-skilled workers.
What is an interval (a.k.a. ratio) variable?
A variable on a scale, with an exact distance between any pair of values. It may be either continuous (e.g. height, income) OR discontinuous/discrete (e.g. indivisible units such as numbers of factories or people).
Are interval variables and ratio variables different?
No. They are the same thing.
What is a dummy variable?
A variable that cannot be measured but can still be used by assigning values that represent two (or more) categories – e.g. where 0 = no and 1 = yes.
What is an independent (a.k.a. explanatory) variable?
A variable that explains your dependent variable.
What is a dependent (a.k.a. response) variable?
It represents a phenomenon that you want to understand through comparison to other variables (i.e. your independent variables).
What are univariate (a.k.a. descriptive) statistics?
They capture the distribution of an individual variable; univariate analysis is the simplest form of statistical analysis.
What types of methods would you use to investigate univariate statistics?
For qualitative variables: frequency, mode and median. For quantitative variables: mean, median, mode, standard deviation, etc.
What are bivariate statistics?
They capture the relationship between 2 variables – e.g. racism and income.
What types of methods would you use to investigate bivariate statistics?
For qualitative variables: crosstabulate, Cramer’s V, logistic/multinomial regression. For quantitative variables: correlation (if the independent variable is quantitative), simple regression.
What are multivariate statistics?
They capture or model the relationships among 3 or more variables.
What types of methods would you use to investigate multivariate statistics?
For qualitative variables: logistic/multinomial regression. For quantitative variables: multiple regression.