Theory Cards: Econometrics, Statistics, Causal Inference Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is the purpose of A/B testing in experiments?

A/B Testing

A

A/B testing aims to determine if changes to a variable (e.g., a webpage design) lead to a statistically significant difference in a key metric by comparing control and treatment groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a null hypothesis in A/B testing?

A/B Testing

A

The null hypothesis states that there is no effect or difference between the control and treatment groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a p-value in hypothesis testing?

A/B Testing

A

A p-value represents the probability of observing results as extreme as those in the experiment if the null hypothesis is true. If p < 0.05, results are considered statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Type I error in A/B testing?

A/B Testing

A

A Type I error (false positive) occurs when the null hypothesis is incorrectly rejected, suggesting an effect exists when it doesn’t.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Type II error in A/B testing?

A/B testing

A

A Type II error (false negative) happens when the null hypothesis is not rejected despite there being an actual effect, failing to detect a real difference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is power analysis in the context of A/B testing?

A/B testing

A

ower analysis calculates the probability of correctly rejecting a false null hypothesis, reducing the chance of Type II errors and increasing confidence in detecting meaningful effects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is sample size important in A/B testing?

A/B testing

A

A sufficient sample size is needed to detect a statistically significant difference. It depends on the expected effect size, statistical power (usually 0.8), and significance level (often 0.05).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a multi-variant test?

A/B testing

A

A multi-variant test compares more than two variations simultaneously, rather than just a control and treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Name a common pitfall in A/B testing and its solution.

A/B testing

A

Sample contamination (when participants in one group are influenced by another) can skew results. One solution is to strictly separate groups or adjust analysis methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What statistical test would you use for comparing means in A/B testing?

A/B testing

A

A t-test is commonly used to compare the means of two samples, such as conversion rates or average order values, assuming normally distributed data and equal variances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When should T-Test be used?

A/B testing

A

For comparing the means for 2 samples (conversion rates or avg order values)
Assumes normally distributed data and equal variances in both groups
Example: comparing avg revenue per user between a control and a treatment group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When Should Chi-Square Test be used?

A/B testing

A

For categorical data (e.g. converted vs not converted) between 2 groups
Assumes that each observation is independent and expected frequencies in each cell are adequate
Example:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When should Z-Test be used?

A/B testing

A

Similar to t-test but specifically for large samples or when the population variance is known
Assumes normally distributed data with large sample sizes
Example: comparing click-through rates when you have very large sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When should non-parametric tests be used?

A/B testing

A

When the data doesnt meet the assumptions of normality(e.g. revenue data often has outliers and is not normally distributed)
Example: For skewed data, like transaction amounts between 2 groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When should Bayesian Test be used?

A/B testing

A

To estimate the probability of one version being better than the other. Provides probability rather than a p-values
Example: estimate how likely the new version is to be better by a specific margin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the difference between correlation and causation?

Causal Inference

A

Correlation is when two variables move together but don’t imply one causes the other. Causation implies a direct effect of one variable on another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a confounding variable in causal inference?

Causal Inference

A

confounding variable is an external factor that influences both variables studied, potentially creating a false impression of causation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are counterfactuals in causal inference?

Causal Inference

A

Counterfactuals represent “what could have happened” under a different scenario, such as considering if a recovery would still happen without taking medicine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the Difference-in-Differences method used for?

Causal Inference

A

It’s used in policy impact analysis where randomized trials are not feasible, comparing changes over time between a treatment and control group.

20
Q

What is an instrumental variable (IV) in causal inference?

Causal Inference

A

An IV is a variable affecting the treatment but not directly influencing the outcome except through the treatment, used to address unobserved confounding.

Example: Imagine studying the effect of education on income, but family background (unobserved) affects both. IF we use “distance to the nearest college” as an instrument, it affects the likelyhood of attending a college (treatment) but likely doesn’t directly influence income.

Quasi-Experimental Methods:

21
Q

How does Propensity Score Matching (PSM) work?

Causal Inference

A

PSM matches individuals in treatment and control groups based on observed characteristics to simulate randomization, though it only controls for observed variables.

Example: if we want to study the impact of a job training program, we could match participants(reatment) with non-participants(control) based on characteristics like age, education, and prior job experience.
Limitation: PSM can only control for observed variables.

Quasi-Experimental Methods:

22
Q

What is the use case of Quasi-Experimental Methods? What are some of these methods?

Causal Inference

A

These methods help us make causal inferences when randomized experiments arent feasible
e.g.

  • Diffrence in difference
  • Instrumental Variables
  • Propensity score matching
  • regression Discontinuity Design (RDD)
23
Q

What is Regression Discontinuity Design (RDD)?

Causal Inference

A

Used when treatment is assigned based on a cutoff score (e.g. test scores, age)
Compares individuals just above and just below the cutoff, assuming they are otherwise similar.
Example: suppose scholarships are given only to students with GPA >3.0. RDD would compare students just above (eligible) and just below (ineligible) the cutoff to estimate the effect of scholarships on academic success.
Requires a clear cutoff.

24
Q

What are the key assumptions of regression analysis?

Econometrics

A
  • Linearity: linear relationship between predictors and outcome variable
  • Independence: observations are independent of each other
  • Homoscedasticity: Constant variance of residuals across values of independent variables
  • No multicollinearity: independent variables arent highly correlated
25
Q

What are the 2 types of Panel Data Models?

Econometrics

A
  • Fixed Effects Model
  • Random Effects Model
26
Q

What is the Fixed Effects model in panel data analysis?

Econometrics

A

It controls for unobserved variables that are constant over time but vary between entities, useful when unique characteristics could bias results.

Example: analyzing how wages change over time within a group of individuals where we assume each person has unique, unobserved traits that don’t change over time

27
Q

When is a Random Effects model suitable in panel data analysis?

Econometrics

A

It’s used when variation across entities is assumed to be random and uncorrelated with predictors, ideal when entity-specific traits are randomly distributed.

Example: If studying wage changes across different cities, assuming that individual city characteristics are randomly distributed and uncorrelated with factors like education or experience.

28
Q

What are the 2 types of models in Time Series Analysis?

Time Series Analysis

A
  • Autoregressive (AR) models
  • Moving Average (MA) Models
29
Q

What is an Autoregressive (AR) model?

Time Series Analysis

A

An AR model uses past values of a variable to predict its future values, like predicting tomorrow’s temperature based on today’s.

30
Q

**

What is a Moving Average (MA) model?

Time Series Analysis

A

An MA model uses past forecast errors to predict future values, adjusting for consistent under- or overestimation in past forecasts.

31
Q

What is stationarity in time series?

Time Series Analysis

A

A time series is stationary if its statistical properties, like mean and variance, remain constant over time, which is essential for accurate modeling.

32
Q

What are some of the common techniques in Time Series Forecasting?

Time Series Analysis

A

Common techniques include Exponential Smoothing (which gives more weight to recent observations) and ARIMA (AutoRegressive Integrated Moving Average) for stationary data.

33
Q

What is Heteroskedasticity and how can you correct it?

Time Series Analysis

A

Heteroskedasticity:
Occurs when the variance of residuals (errors) isnt constant across observations. It can lead to inefficient estimates and affect the reliability of hypothesis tests.
Correction: use techniques such as robust standard errors or transform variables (e.g. log transformation) to stabilize variance.
Example: in a model predicting household income, high-income households might have more variance than low-income households, violating the homoscedasticity assumption.

34
Q

What is Autocorrelation and how can you correct it?

Time Series Analysis

A

Occurs when residuals are correlated across time (common in time series data). This violates the independence assumption in OLS and can lead to biased estimates.
Correction: Use models that account for autocorrelation, like ARIMA, or add lagged variables to the model
Example: Monthly sales data might show autocorrelation, as sales in one month can be correlated with sales in the previous month.

35
Q

What is the Akaike Information Criterion (AIC)?

A

AIC is a measure used to compare models, balancing goodness of fit with model complexity; lower AIC indicates a better model.

36
Q

What is cross-validation?

A

Cross-validation evaluates model performance by splitting data into training and testing sets multiple times, often using K-fold cross-validation.

Example: in a dataset with 1000 observations, we could use 10-fold cross-validation to train the model on the 90% of the data and test on 10%, rotating through all portions.

37
Q

What is the Central Limit Theorem?

Statistical Modeling and Inference

A

he Central Limit Theorem states that the sample mean’s distribution approaches normal as sample size increases, regardless of the population’s original distribution.

38
Q

What is Maximum Likelihood Estimation (MLE)?

Statistical Modeling and Inference

A

MLE estimates parameters by maximizing the likelihood that the observed data occurred under the model, used in logistic regression for probability estimation.

39
Q

What is Bayesian inference?

Statistical Modeling and Inference

A

Bayesian inference uses Bayes’ theorem to update the probability of a hypothesis as new evidence emerges, combining prior beliefs with new data.

40
Q

What are the key types of distributions?

Distributions

A

*** Normal (Gaussian) Distribution: **symmetrical, bell-shaped curve.
Example: heights or test scores in a population

* Binomial Distribution: Describes the number of successes in a fixed number of independent trials, each with the same probability of success.
Example: Flipping a coin 10 times to get a certain number of heads

*** Poisson Distribution: **represents the number of events occurring in a fixed interval of time or space, with a known constant man rate and independent of previous events.
Example: Number of customer arrivals at a store in an hour
* Exponential Distributon: Continuous distribution, describes the time between independent events occurring at a constant average rate.

41
Q

What are the properties, Use Case, and Key Concepts of Normal (Gaussian) distribution?

Distributions

A
  • **Properties: **Symmetric, bell-shaped curve; defined by mean (μ) and standard deviation (σ); 68% of data within ±1σ, 95% within ±2σ, and 99.7% within ±3σ.
  • **Examples: **Heights of people, measurement errors, exam scores.
  • Use Case: Commonly used for naturally occurring data that clusters around an average.
  • **Key concepts: **68-95-99.7 rule (about 68% of data falls within 1 standad deviation, 95% within 2, and 99.7% within 3)

The width of the curve is defined by standard deviation

To draw you need to know:
* The avg measurement (mean)
* The standard deviation of the measurements

42
Q

What are the properties, Use Case, and Key Concepts of Binomial Distribution?

Distributions

A
  • Properties: Discrete distribution; defined by number of trials (n) and probability of success (p); mean = np, variance = np(1-p); symmetric for p ≈ 0.5, skewed otherwise.
  • **Examples: ** Number of heads in 10 coin flips, number of conversions in a marketing campaign, pass/fail outcomes in a series of tests.
  • **Use Case: ** Used when there’s a fixed number of independent trials, each with the same probability of success.

Laplace’s Rule of Succession:
48 out of 50 positive reviews: add one more positive and one more negative
–> 49/52 ~ 94.2% chance of having a positive experience

43
Q

What are the properties, Use Case, and Key Concepts of Poisson Distribution?

Distributions

A
  • **Properties: **Models count of events in a fixed interval; mean and variance are equal (λ); skewed for small λ, more symmetric as λ increases.
  • **Examples: **Number of customer calls per hour, website visits per minute, number of defects per unit.
  • Use Case: Suitable for modeling rare events over time or space, where events occur independently.
  • **Key Concept: **Lambda (A)- the average rate of occurence, which is the mean and the variance of the distribution *
44
Q

What are the properties, Use Case, and Key Concepts of Exponential Distribution?

Distributions

A
  • **Properties: **Models time until the next event; defined by rate parameter (λ) with mean = 1/λ and variance = 1/λ²; memoryless property (probability of future events does not depend on past events).
  • **Examples: **Time between arrivals in a queue, time until a machine failure, time until customer churn.
  • Use Case: Ideal for modeling “time until” events in systems with a constant rate.
45
Q

What are the properties, Use Case, and Key Concepts of Uniform Distribution?

A
  • **Properties: **All outcomes within a specified range are equally likely; mean = (a + b) / 2, variance = (b - a)² / 12 for continuous uniform; flat distribution.
  • Examples: Random assignment within a time range, selecting random test data, random numbers within a set range.
  • Use Case: Used when all values within a range are equally likely, either in continuous or discrete form.
46
Q

What are the properties, Use Case, and Key Concepts of Gama and Beta Distributions?

A

Gamma Distribution
* Properties: Continuous, skewed distribution; defined by shape (k) and scale (θ) parameters; mean = kθ, variance = kθ²; as k increases, shape resembles a normal distribution.
* Examples: Insurance claim amounts, waiting times in queuing systems, rainfall amounts.
* Use Case: Suitable for modeling waiting times where multiple events need to happen, such as total claim costs or aggregated waiting times.

Beta Distribution
* Properties: Continuous distribution on [0,1]; defined by shape parameters α and β; mean = α / (α + β), variance = αβ / [(α + β)²(α + β + 1)]; flexible shape, can be symmetric or skewed.
* Examples: Conversion rates, probability of success in Bayesian inference, proportions in surveys.
* Use Case: Used for modeling probabilities, proportions, or percentages, especially in Bayesian statistics.