Week 3 Flashcards
Crosstabulation
a tabular method for summarizing the distributional characteristics of the data for two variables simultaneously.
Crosstabulation takes the form of:
- Contingency table: qualitative data
- Correlation table: quantitative data
- A mix of qualitative and quantitative data
Covariance definition
Covariance is a measure of how much two random variables vary together. It’s similar to variance, but where variance tells you how a single variable varies, co variance tells you how two variables vary together.

Correlation definition

Absence of correlation
- The absence of correlation between X and Y (ρ = 0) is always due to the covariance (sxy = 0).
- Nul covariance can be attributed to:
- Perfect independence between the variables
- Nonlinear relationship between the variables
Spurious relationship
when the correlation between two phenomena taken alone is basically nonsense.
chi-square statistic
The chi-squared statistic is a single number that tells you how much difference exists between your observed counts and the counts you would expect if there were no relationship at all in the population.

Crosstabulation: an example of a contingency table

Crosstabulation: an example of a correlation table

It can be shown (after some algebra) that if X and Y are independent, the ij frequency must be equal to:

Theoretical independent frequencies definition
the frequencies that would be observed in case
of perfect independence between X and Y
Measure of association:
