Secondary Data Analysis Flashcards

1
Q

What is secondary data analysis?

A

Secondary data analysis ; retrospective analysis ; using existing data sets

Conducting a research project without collecting ‘fresh’ data - i.e. using data that has already been collected in a fresh context.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When would secondary data analysis occur?

A
  • your supervisor / a colleague might have data collected from a previous study
  • you could use a publicly available research dataset where lots of things are measured (these are often big, longitudinal, population representative data)
  • you could use historical or archival data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some data repositories?

A
  • APA
  • ICPSR (database for versions of datasets): Inter-university Consortium for Political and Social Research i
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the benefits of secondary data analysis?

A
  • easier (can be faster, doesn’t involve you collecting data)
  • its richer (access to populations and resources that you wouldn’t have access to - time, money, vulnerable populations) and involves types of research studies you couldn’t do yourself (longitudinal data, birth cohorts)
  • ethical use of data (advancing the benefits of the data, no additional distress on participants)
  • can increase research transparency
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the disadvantages of using secondary data?

A
  • you don’t have control over how the data were collected
  • often mismatch between the study design, measures used and your rq/hypotheses (study was designed for a different RQ to yours, often non-experimental, maybe not ideal measures used)
  • sometimes it costs to access the data set
  • working with other people
  • easy to engage in questionable research practices
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the traditional research steps?

A

observation
question
hypothesis
prediction
experiment
results
theory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the steps involved for secondary data analysis?

A
  1. Determine RQ (possibly tentative hypotheses)
  2. Find appropriate dataset for your RQ
  3. Identify the study’s design, variables
  4. Refine hypotheses in light of actual variables
  5. Analyse data
  6. Make conclusions and write up results
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The first step of secondary data analysis

A

Determine RQ
- what is the area?
- literature review (what’s known, what is the gap?)
- define research question
- hypotheses -what do you want to investigate, what are your predictions?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Step 2

A

Find appropriate dataset for your RQ
- fits your area / phenomena
- may need to apply to access and use the data for research purposes

Sometimes the first 2 steps are reversed because data presents itself!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Step 3

A

Identify the study’s design, variables, etc

  • what was the original RQ
  • what was the study design
  • who was the population
  • what constructs were targeted, how were they operationalized
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Step 4

A

Refine hypotheses in light of actual variables
- planned RQ might not be testable on this data
- revise and refine hypotheses in light of which data is actually there

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Step 5

A

Analyse data
- descriptive statistics to summarise
- formal analyses to address hypotheses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Step 6

A

Make conclusions and write up results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Important considerations for secondary data analysis

A
  • may still need ethical approval at the start as the data will be used for purposes other than the original intention
  • at what point does compromise undermine your conclusions Too much
  • analytical methods need to match the sampling design
  • don’t data snoop
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Important considerations for BIG datasets

A
  • longitudinal projects often have a lot of missing data –> is this differential attrition? how will you deal with this?
  • need to consider and make decisions about significance levels and effect size (as p value is inversely proportionate to sample size)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly