Chapter 18 Flashcards
Coarse Categorization
simplify variables measured on an interval or ratio scale – reduces the estimate of association
Contingency Table
a table where you lay out all the possible pairs of values that states might have on two variables: (ex. high-high, high-low, low-high, and low-low)
Dichotomous Variable
when there are only two possible values for the variable
Dummy Coding
under this convention, we arbitrarily assign a value of 1 to one level of the variable and a value of 0 to the other level – used for coding a dichotomous variable
Linear Association
for some data, when we look at a scatterplot, it might appear that a curved rather than a straight line best summarizes the association between the 2 variables. By convention, however, we normally use a straight line as the prediction function, and thus, we normally refer to associations between variables as linear associations.
Marginal Frequencies
they tell us how many states have a given value on one of the variables, ignoring their value on the other. (p.456 for more info).
Median Split
splitting variables into two categorical variables in relation to the median: low for below the median, high for above the median
Null Hypothesis
a statement that specifies what we hope is NOT true in the population.
Partial Association
the association that persists between 2 variables when a 3rd variable that might explain their association is held constant.
Pearson Product Moment Correlation Coefficient (r)
the most widely used index of association between continuous variables, ranges between -1.0 and +1.0 with = meaning no associations and +- 1.0 meaning perfect association
P-Value
a probability level that gives you grounds to reject or accept the null hypothesis
R^2
square of the value r and indicates the proportion of variability in one variable accounted for by the other variable; aka tells us by what proportion our predictions of one variable improve when we base those predictions on knowledge of the other variable
Scatterplot
We construct a table in which each case is represented by a single point on a two-dimensional graph, positioned according to its values on the 2 variables.
Statistical Significance
an index of the degree of confidence we can have that an association we observe in a sample would emerge if we were to replicate the study using another sample from the same population.
Spurious Association
an association in which two or more events or variables are not causally related to each other, yet it may be wrongly inferred that they are