lectures 7 & 8 Flashcards
Issue with observational research:
Confounding variables (spurious causation)
Causal effect of the treatment
the difference within and between groups
Advantages of experiments (3)
- Standardisation i.e. the same : stimuli, procedure, responses, variables
- Random assignment → controls for confounding variables
- Time order i.e. exposure to treatment (X) precedes the change in the outcome (Y)
disadvantages of experiments (5)
- Feasibility
- Unrepresentative subject pool
- Artificial environment
- Noncompliance and attrition
- ethical considerations
natural experiments
Natural experiments are close to experiments but without the researcher’s ability to assign subjects to one group or another.
→ take advantage of situations in the real world, in which a treatment has been assigned at random, or as-if random, to a group of individuals by nature, politicians, etc.
We can compare the outcomes of the treatment and control groups.
E.g. lotteries
difference in difference studies limitations
- Parallel trends assumption: in the absence of treatment, the two cases would have followed the same path
- There is no other relevant factor that also changed between the two cases in the period.
- There is no interference between the cases
probability distributions
A mathematical function that gives the probabilities of occurrence of different possible outcomes.
can be:
Symmetric
Bimodal
Skewed right
Skewed left
Properties of the normal distribution:
- The probability is highest at the mean
- The distribution is symmetric around the mean
- It is defined by two parameters: µ (the mean) and σ (the standard deviation)
- Two parmeters → infinite number of normal distributions
Proportions under the normal distribution
68% of the observations fall within one standard deviation above and below the mean.
95% of the observations fall within two standard deviations above and below the mean
99.7% of the observations fall within three standard deviations above and below the mean.
standard deviations measures are often called
z scores
z scores
indicates how many standard deviations a score is above or below the mean of its distribution, regardless of the original units of measurement.
Z = (X - µ) / σ
X Is the original score, µ is the mean, and σ is the standard deviation. The z-score tells you how many standard deviations it is above or below the mean.
Z-scores: The standard normal distribution
The standard normal distribution is a special case of the normal distribution, for which µ = 0 and σ = 1
If the original distribution approximates a normal curve, then the shift to z-scores will produce a standard normal distribution, i.e., it has a mean of 0 and a SD of 1.
Parameters
numerical measures that describe certain characteristics of a population
Sample statistics
using estimates to describe the population
→ used to estimate a parameter, we rely on the sample mean to estimate the population mean
The sampling distribution
a probability distribution of a statistic that is obtained through repeated sampling of a specific population.
It describes a range of possible outcomes for a statistic, such as the mean or mode of some variable, of a population.