Secondary data analysis Flashcards

1
Q

Secondary data

A

Any data collected by others (i.e., not by yourself) at an earlier point in time

2
Q

Primary data

A

Data that we collect ourselves; original information gathered first-hand by the researcher

3
Q

Expert-coded datasets

A

Experts provide estimates and assessments of various measures; usually opinion-based

V-Dem: Varieties of Democracy; experts code country democracy scores based on set criteria
CHES: Chapel Hill Expert Survey; experts estimate party ideology and party positions

4
Q

Researcher-coded datasets

A

Coded by teams of researchers based on publicly available information (e.g., news sources, academic articles)

COW (Correlates of War), UCDP (Uppsala Conflict Data Program): teams of researchers code data on conflicts and casualties

5
Q

Types of secondary data

A
  • Surveys run by others (a survey you collect yourself is primary data)
  • Official statistics
  • Quantified texts, e.g., party manifestos
  • Existing qualitative sources
6
Q

Categorical variable

A

Binary variable: There are only two categories (e.g., dead or alive).

Nominal variable: There are more than two categories, with no inherent order (e.g., whether someone is an omnivore, vegetarian, vegan, or fruitarian).

7
Q

Ordinal variable

A

The same as a nominal variable, but the categories have a logical order (e.g., whether a student got a fail, pass, merit, or distinction in an exam).
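
A minimal Python sketch of the nominal/ordinal distinction, using only the standard library. The `Grade` class and the sample lists are hypothetical illustrations, not part of the original card. The point is that order-based summaries such as a median are meaningful for ordinal data, while nominal data supports only counting and the mode.

```python
from enum import IntEnum
from statistics import median_low, mode


class Grade(IntEnum):
    """Hypothetical ordinal variable: the integer values encode rank only."""
    FAIL = 0
    PASS = 1
    MERIT = 2
    DISTINCTION = 3


# Ordinal data: categories can be compared and sorted.
grades = [Grade.MERIT, Grade.FAIL, Grade.DISTINCTION, Grade.PASS, Grade.MERIT]
median_grade = median_low(grades)  # middle value after sorting by rank

# Nominal data: no order exists, so only the most frequent category is meaningful.
diets = ["omnivore", "vegan", "omnivore", "vegetarian"]
most_common_diet = mode(diets)

print(median_grade.name, most_common_diet)
```

The gaps between ranks carry no meaning here: nothing says the step from pass to merit equals the step from merit to distinction, which is why a mean of the grades would not be a sensible summary.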

8
Q

Continuous variable

A

Entities get a distinct score

Interval variable: Equal intervals on the variable represent equal differences in the property being measured (e.g., the difference between 6 and 8 is equivalent to the difference between 13 and 15).
Ratio variable: The same as an interval variable, but the ratios of scores on the scale must also make sense (e.g., a score of 16 on an anxiety scale means that the person is, in reality, twice as anxious as someone scoring 8).
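
A short Python sketch of why ratios need a true zero point. Temperature is used as an assumed example that does not appear in the original card: Celsius is commonly treated as an interval scale and Kelvin as a ratio scale. The helper `celsius_to_kelvin` is a hypothetical name introduced for illustration.

```python
import math


def celsius_to_kelvin(c: float) -> float:
    """Shift the zero point from the freezing point of water to absolute zero."""
    return c + 273.15


a_c, b_c = 10.0, 20.0
a_k, b_k = celsius_to_kelvin(a_c), celsius_to_kelvin(b_c)

# Differences survive the change of scale: both are interval-level comparisons.
same_difference = math.isclose(b_c - a_c, b_k - a_k)

# Ratios do not: 20 °C looks like "twice" 10 °C, but in Kelvin the ratio is close to 1.
ratio_c = b_c / a_c
ratio_k = b_k / a_k

print(same_difference, ratio_c, round(ratio_k, 3))
```

Because the Celsius zero is arbitrary, the ratio of 2 says nothing about the physical quantity; only the Kelvin ratio does. The same logic applies to the anxiety scale: "twice as anxious" is a valid claim only if a score of 0 means no anxiety at all.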

9
Q

Benefits of secondary data - Quality

A

Secondary data are often of higher quality: they are usually part of a collective project and larger in scale, which tends to support quality

10
Q

Benefits of secondary data - Time scale

A

Because these data collection efforts are part of larger projects, the data are often collected repeatedly over long periods, as opposed to single surveys that capture one moment in time

11
Q

Disadvantages of secondary data

A

Unable to make causal inferences: because secondary data sources are often not experimental (i.e., the researcher is not manipulating a treatment), it is difficult to make causal inferences

Inaccuracies
