L9 - Regression analysis Flashcards
Cross-tabulation: Chi-square test definition (Malhotra, 2013)
A statistical technique that describes two or more variables simultaneously and results in a table that reflects the joint distribution of two or more variables that have a limited number of categories or distinct value.
When to use cross-tabulation
- to test the difference/association between variables
- to compare the behaviour and intentions for different categories of predictor variables such as income, sex and marital status.
Role of Cross-tabulation (Malhotra, 2013)
(1) Simple to conduct analysis and appealing to less sophisticated researchers.
(2) Results can be easily interpreted and understood.
(3) Clear interpretation provides a stronger link between research results and managerial action.
(4) Greater insights into a complex phenomenon than a single multivariate analysis.
(5) Alleviate the problem of sparse cells in discrete multivariate analysis
Four possibilities of cross-tabulation for three or more variables (Malhotra, 2013)
- Refined association between two original variables.
- No association between two original variables despite initial observation.
- Some association between two original variables despite initial observation.
- No change in the initial association.
Process in cross-tabulation (Malhotra, 2013)
1) Test Ho
2) If reject Ho, determine the strength of association by phi coefficient, contingency coefficient, etc.
3) Interpret the pattern of relationship by computing the percentages in the direction of the independent variables
4) Conclude
Cons of Cross-tabulation (Malhotra, 2013)
1) Produce an endless variety of cross tabulation tables.
2) Complex and inefficient as it only examines the association between variables, not causation.
Expected count (expected frequency) calculation
fe = nr*nc / n
Chi-square calculation
X^2 = Σ (observed frequency - expected frequency)^2 / expected frequency
Chi-square analysis definition (slide)
- assess how closely the observed frequencies fit the pattern of the expected frequencies, and is referred to as a “goodness-of-fit” (poor fit - reject Ho).
- analyze the nominal-nominal and nominal-ordinal scaled.
Chi-square distribution definition
A skewed distribution whose shape depends solely on the number of df. As the number of df increases, the chi-square distribution becomes more symmetrical.
Measures for the strength of association
Phi coefficient (Ф), Contingency coefficient, Cramer’s V, Lambda coefficient, Other statistic (tau b, tau c, gamma)
Phi coefficient definition
to measure the strength of association in the special case of a table with two rows and two columns.
Phi coefficient (Ф) calculation
Ф = √ ( X2 / n )
+ Ф = 0: no association
+ Ф = 1: perfectly positive association
+ Ф = -1: perfectly negative association
Relationships between variables can be described in several ways:
Presence, direction, strength of association, and type of relationship (linear or curvilinear).
Covariation definition
The amount of change in one variable that is consistently related to the change in another variable of interest. Or simply, it is the degree of association between 2 variables.