Final exam Flashcards

1
Q

What is monitoring?

A

Monitoring is a continuous process that tracks what is happening within a program and uses the data collected to inform program implementation and day-to-day management and decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What 3 types of questions can answer evaluation?

A
  1. Descriptive questions. The evaluation seeks to determine what is taking place and describes processes, conditions, organizational relationships, and stakeholder views.
  2. Normative questions. The evaluation compares what is taking place to what should be taking place; it assesses activities and whether or not targets are accomplished. Normative questions can apply to inputs, activities, and outputs.
  3. Cause-and-effect questions. The evaluation examines outcomes and tries to assess what difference the intervention makes in outcomes. (Impact evaliation)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the evaluation?

A

Evaluations are periodic, objective assessments of a planned, ongoing, or completed project, program, or policy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 5 criteria for the program to be evaluated?

A
  1. Innovative. It is testing a new, promising approach.
  2. Replicable. The program can be scaled up or can be applied in a different setting.
  3. Strategically relevant. The program is a flagship initiative; requires substantial resources; covers, or could be expanded to cover, a large number of people; or could generate substantial savings.
  4. Untested. Little is known about the effectiveness of the program, globally or in a particular context.
  5. Influential. The results will be used to inform key policy decisions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is theory of change?

A

A theory of change is a description of how an intervention is supposed to deliver the desired results. It describes the causal logic of how and why a particular project, program, or policy will reach its intended outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is a causal effect?

A

a causal effect is a change in some feature of the world that would result from a change to some other feature of the world.
the difference in the potential outcomes for some unit under two different treatment statuses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is counterfactual comparisons?

A

It’s an experiment where at least one of the worlds we are comparing isn’t the real, factual world - it’s in our imaginations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is treatment variable?

A

It’s X. used to describe any intervention in the world

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is outcome or dependent variable?

A

It’s Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the fundamental problems of the casual inference?

A
  1. Individual cause effect can never be directly observed
  2. At any given time, we only observe any given unit in one state of affair (At a given time, a child either participates in deworming
    program or not)
  3. We can’t observe the difference: Yi(1) − Yi(0)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do we make progress on answering casual questions if effects are fundamentally unobservable?

A
  1. Conduct a randomize trial, assigning some people for treatment and others no, and then comparing the average outcomes for people in the untreated group to the average outcomes for people in the treated group
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is potential outcome?

A

the potential outcome for some unit under some treatment status is the outcome that unit would experience under that treatment status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is causal relationships?

A

Refers to the cause-and-effect connection between the treatment variable (X) and outcome variable (Y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the 2 different conditions based on whether the individual receives the treatment?

A
  1. Treatment is the condition with the treatment (Xi=1)
  2. Control is the condition without the treatment (Xi=0)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the causal effect of X on Y?

A

It’s the change in the outcome variable Y caused by a change in the treatment variable X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When interpreting the sign of causal effects, we should interpret:

A
  1. a positive effect as the treatment causing an increase in the outcome variable
  2. a negative effect as the treatment causing a decrease in the outcome variable
  3. an effect of zero as the treatment causing no change in the outcome variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the average treatment effect (average causal effect)?

A

It’s an average of the individual casual effects of X and Y across a group of individuals. It’s the average change in Y caused by a change in X for a group of individuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the randomized experiment?

A

it’s a type of study design in which treatment assignment is randomized

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is the purpose of randomize experiment?

A

by randomly assigning treatment, we ensure that treatment and control groups are, on average, identical ro each other in all observed and unobserved pre-treatment characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the difference-in-means estimator?

A

it produces a valid estimate of the average treatment effect when the treatment and control groups are comparable with respect to all the variables that night affect the outcome other than the treatment variable itself

average_effect=average_treatment - average_control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is experimental data?

A

data collected from a randomized experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

what is observational data?

A

data collected about naturally occurring events. Studies that use observational data are called observational studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How to set working directory to the folder containing the dataset using in R?

A

setwd(“way_to_the_folder”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How to read a dataset in R?

A

read.csv(“name_of_dataset”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

what is head()?

A

shows first 5 observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

what is == in R?

A

the relational operator that evaluates whether two values are equal to each other. The output is a logical value: TRUE or FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

what is the function ifelse() stands for in R?

A

creates the contents of a new variable based on the values of an existing one: “if logical test is true, return this, else return that”.
Теоритично це дає можливість щось замінити на щось. Накриклад, якщо у нас текстовий показник, ми його можемо замінити на цифровий.

ifelse(data$var==”yes”, 1,0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How to store values as a new variable?

A

data$var<-ifelse(data$var==”yes”, 1,0)

ifelse() is just an example

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

what can we do by using square breaks [] in R?

A

[] is the operator used to extract a selection of observations from a variable. To its left, we specify the variable we want to subset. Inside the [] we specify the creation of selection.

Example: data$var1[data$variable2==1]

28
Q

what is the unit of measurement of the difference-in-means estimator if outcome variable is non-binary

A

in the same unit of measurement as the outcome variable

29
Q

what is the unit of measurement of the difference-in-means estimator if outcome variable is binary

A

in percentage points (after multiplying the result by 100)

30
Q

When making predictions, we distinguish between 2 types of variables:

A
  1. the predictor (X): variable that we use as the basis for our predictors
  2. the outcome variable (Y): variable that we are trying to predict based on the values of the predictor
31
Q

what is predicted outcomes (Yˆ)?

A

the values of Y we predict based on (i) the fitted model that summarizes the relationship between X and Y in a dataset where we observe both X and Y for each observation and (ii) the observed values of X

32
Q

How can we make a prediction?

A
  1. fit a model: (i) we observe both X and Y (ii) we summarize the relationship between the average Y and X with a model
  2. make predictions: (i) we observe X but not Y (ii) we compute Yˆ by plugging the observed values of X into the fitted model
33
Q

What is the prediction error (residual)?

A

it measures how far our prediction is from the observed value; it’s the difference between the observed outcome and predicted outcome

34
Q

what is the formula for linear regression model?

A

Yi = α + βXi + єi

Yi - is the outcome for observation i
α (the Greek letter alpha) - is the intercept coefficient
β (the Greek letter beta) - is the slope coefficient
Xi - is the value of the predictor (or independent variable) for
observation i
єi (pronounced epsilon sub i) - is the error for observation i

35
Q

what are the 2 coefficients that define any line need to be estimated for fitting line?

A

the intercept and the slope

36
Q

what is the intercept?

A

Yˆ when X=0. Increasing and decreasing the intercept moves the line up and down

37
Q

What is the slope?

A

a line specifies the angle or stepless of the line. Associated with on-unit increase in X

38
Q

What is the least squares method (OLS)?

A

indetifies the line that minimizes the ‘sum of the squared residuals”

39
Q

how to find the total number of observation in the dataset? (R fuction)

A

dim(dataset_name)

40
Q

how to create the scatter plot if two variables?

A

plot(data$x_var, data$y_var)

41
Q

What is the correlation coefficient?

A

it ranges from -1 to 1 and summarizes the direction and strength of the linear association between two variables

42
Q

What is R function for correlation coefficient?

A

cor(data$var_1, data$var_2)

43
Q

what does the range from -1 to 1 of the correlation coefficient means?

A
  1. If the correlation coefficient is positive (closer to 1), it means the two variables tend to increase or decrease together. When one goes up, the other tends to go up, and when one goes down, the other tends to go down.
  2. If the correlation coefficient is negative (closer to -1), it means there’s an inverse relationship. When one variable goes up, the other tends to go down, and vice versa.
  3. A correlation coefficient close to 0 suggests a weak or no linear relationship.
44
Q

how to estimate the coefficient of the linear model using the least squares model in R?

A

lm(data$y_var ~ data$x_var)

45
Q

How to fit the line to the scatter plot by using the R function?

A

abline()

46
Q

what is log() fuction in R?

A

it computes the natural logarithm of the argument specified inside the parentheses.

this a kind of a transformation that will make the variable of interest more normally distributed and in turn improve the fit of the line to the data

47
Q

How to create histogram in R?

A

hist(data$variable)

48
Q

what is fitted log-log linear model?

A

it’s a fitted linear model in which both Y and X have been log-transformed. In this model, we interpret ß as the predicted percentage change in the outcome associated with an increase in the predictor of 1 percent

49
Q

what is coefficient of determination (Rˆ2)?

A

ranges from 0 to 1 and measures the proportion of the variation of the outcome variable explained by the model. The higher the Rˆ2, the better the model fits the data.

50
Q

In terms of simple linear model how can we define Rˆ2?

A

it equivalent to the correlation between X and Y squared:
Rˆ2=cor(X,Y)ˆ2

based on this it becomes clear that the higher the correlation between X and Y (in absolute terms) the better the model fits the data.

51
Q

what is the estimate decomposition?

A

Estimate=Estimand+Bias+Noise

52
Q

what is the estimate?

A

the number we get as a result of our analysis

53
Q

what is the estimand?

A

is the true quantity of interest in the population that we are trying to learn about. Out hope is that our estimate closely approximates our estimand

54
Q

What is bias?

A

errors that occur for systematic reasons

55
Q

what is noise

A

idiosyncratic error (random mistake) that occur because of chance

56
Q

what is sampling variation?

A

refers to the natural differences or fluctuations that can occur when you take different samples from the same population

57
Q

Randomized control trials reduce bias through

A
  1. Randomization of treatment status
  2. careful selection of units and measurements
58
Q

what is the standard error?

A

tells you how much you can expect that sample mean to vary from the actual population mean.

  1. If the standard error is large, then the estimates would be very spread out and the estimator is relatively imprecise
  2. if the standard error is small then the estimates would be very close together and the estimator is relatively precise
59
Q

what is hypothesis testing?

A

a strategy for assessing the probability of getting a result as extreme as yours under the assumption that the null hypotheses is true.

60
Q

What is null hypothesis?

A

This is a statement of no effect or no difference. It represents the status quo or the idea that there is no change or no relationship in the population.

61
Q

what is p-value

A

the probability that we observe a value of the test statistic at least as extreme as the one we actually observed if the (sharp) null hypothesis is true

A smaller p-value provides stronger evidence against the null

62
Q

what is the confidence interval?

A

the range od values that is likely to include the true value of the parameter (estimand)

63
Q

what is a significance level?

A

determines the rejection threshold of the test and characterizes the probability of false rejection of the null hypothesis

64
Q

what is summary()$coef for?

A

provides a table with the following statistics related to a fitted linear model: estimated regression coefficients, standard errors, test statistics and two-sided p values

65
Q

what is the confidential interval?

A

EST+/-1.96*SE

66
Q

What is a cofounding variable or cofounder?

A

a variable that affects both the likelihood of receiving the treatment and the outcome

67
Q

In the presence of confounding variables:

A

treatment and control groups are not comparable, correlation don’t necessarily imply causation and the DiM estimator don’t provide a valid estimate of the average treatment effect

68
Q

why are there no cofounding variables in randomized experiments?

A

by randomly assigning treatment we break the link between any potential confounders and the treatment variables, thereby eliminating all potential confounding variables