ch7 Flashcards
What is archival data
- Data gathered from existing sources
- Collected for another purpose than that of the current study
- Secondary data (i.s.o. primary)
Beware: existing survey data = secondary data
Internal archival data
Company records and archives
External archival data
- Publicly available data
* Commercially available data
What are the strengths of archival research:
- Tapping into industry wisdom
- Examining effects across time
- Examining effects across countries
- Examining socially sensitive phenomena
Piecing together archival data
•Unit of analysis
= level of data analysis/data aggregation
= level at which DV is measured
•Make sure your unit of analysis correspondents
IV measured at same level or higher level than DV
Unit of Analysis levels
- Country
- Industry
- Firm
- Brand
- Consumer
Unit of Analysis levels (longitudinal data, time-variant)
- Year
- Quarter
- Month
- Week
- Day
Sources of measurement unreliability in archival research
- Missing data
- Inaccurately recorded data
- Fake data
Combining multiple archival indicators into a single measure
1) Standardise each indicator
2) Average the standardised indicators
Solutions to missing data in cross-sectional data sets (meaning without time dimension)
- Listwise deletion; delete entire row if 1 variable is missing *check user manual
- Mean substitution: replace missing value for observation I and variable j with average value on variable j for all other observations
Solution to missing data in longitudinal data sets
Interpolation
Solution to inaccurately recorded data (longitudinal data set)
•Inaccuracies that turn up as extreme data points
SOLUTION: Remove observation: run analyses with and without observation
•Inaccuracies that are not extreme
SOLUTION: Trim/truncate (in large data sets). Remove a fraction of observations, e.g. 1% most extreme observations
*Plot data to notice extreme data points
Fake observations
- Be critical!
- Who collected the data?
- When? Where?
- For what purpose?
Measurement validity
Does a measure represent the construct it is supposed to measure?
Considerable conceptual overlap: valid measure
Little conceptual overlap: questionable proxy
An archival measure may only be a “proxy” of the underlying construct (approximation of construct)
How to validate than an archival measure is a good measure rather than a bad proxy?
- Provide precedence (referring to high quality studies)
- Provide sound logic to support that considerable conceptual overlap exists between construct and measure
- Provide evidence of a substantial correlation between your proxy and a valid survey measure for a small subsample of your data
- provide evidence of substantial correlations (r>.3) with related constructs (“nomological validity”)