Quiz 3 Flashcards

1
Q

Experiment

A

A method in which researchers randomly assign individuals to experimental conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Benefits of randomness

A
  • Gold standard for causal inference
  • We can make sure that the treatment is the only systematic difference between the two groups under study
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Fundamental problem of causal inference

A

We can’t observe a unit in both its treated and controlled statuses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Characteristics of lab experiments

A
  • Researcher controls the environment
    -Ideal for when experimental manipulations don’t exist in the real world
  • Make sure there is full randomization
  • Everyone in the treatment group receives the same treatment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Disadvantages of lab experiments

A
  • Artificial environment
    -Hard to measure long term behavior
    -Non-representative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Field Experiments

A

Experiments which apply the logic of randomization and variable manipulation to naturally occurring situations

Ex: the ability of sanction message to reduce hate speech on Twitter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Advantages and Disadvantages of field experiments

A

A: more true to life than lab experiments

D:
-Greater chance of failing to fully randomize
-Participant might not fully comply
-Costly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Survey Experiments

A

Experimental manipulation takes the form of a survey, with the same issues that lab experiments have

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Natural ‘experiments’

A

Observational studies that carry some characteristics of experiments, but researchers can’t randomize

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Criteria for evaluating experiment quality

A

Internal and external validity
(Lab experiments are stronger in the former while field experiments are in the latter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Internal Validity

A

The degree to which the research procedure demonstrates a true causal relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Reasons why causality might be compromised

A

-Failure to fully randomize selection bias
-Non-compliance with treatment
-Maturation
-Contamination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

External Validity

A

The extent to which the results of a study can be generalized across populations, times, and settings. Generalizable to the entire population of interest and to all time periods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Causal inference from strongest to weakest

A

-Experiments
-Natural experiments
-Observational studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Observational Study

A

Designs in which researcher doesn’t interact with or intervene in the data generation process, but instead merely observes causal sequences and covariations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Cross-sectional design

A

Measurements of the independent and dependent variables are taken at the same time

17
Q

Longitudinal/time series designs

A

Measurements of the independent and dependent variables are taken at different points in time

18
Q

3 Vs of Big Data

A

Volume, Variety and Velocity (lots of data, in a variety of formats, being created constantly)

19
Q

Characteristics of big data

A

-big (lots of info useful for studying rare events and for heterogeneity)
-Constantly changing (good for the study of unexpected events)
-non-reactive (lack of subject behavioral change when they know they’re being observed)
-incomplete (missing data to operationalize concepts, demographic information, and behavior on other platforms)
-inaccessible
-nonrepresentative
-drift in subject over time
-influenced by algorithm
-dirty
-sensitive

20
Q

Farber’s findings

A

Taxi drivers work more on days when wages are higher

21
Q

Now casting

A

Attempts to use ideas from forecasting to measure the current state of the world

22
Q

What went wrong with Ginsberg’s now casting

A
  • outperformed by simpler modeling
    -drift and algorithmic confounding
23
Q

Ginsberg flu data study

A

-Combined Google Trends with CDC fluency data and used it to now cast flu prevalence

24
Q

Seigle Findings (Trump Article)

A

No empirical support for the hypothesis that Trump’s divisive campaign increased hate speech on Twitter

25
Visconti Findings (Chile and natural disasters)
Material damage caused by disaster increased the probability of voters selecting left wing and independent candidates
26
Population
Any well-defined set of units of analysis
27
Sample
Any subset of units collected in some manner from a population
28
Coverage bias
Incomplete or inappropriate sample frame
29
Two types of samples
Random (probability) sample Non probability sample
30
Simple Random Sample
Each individual or group has an equal change of selection. Assumed in most statistics formulas