Exam 2: Lectures 5-7 Flashcards
two-sample t-test
used to determine whether or not means on two independent samples from a normal distribution are equal or whether they differ
hypotheses for two-sample t-tests
π»_0: γππππγ_1 = γππππγ_2
π»_π: γππππγ_1 β γππππγ_2
Sensitivity analysis
to see whether the best decision changes as one or more inputs change
two sample t-test info
If we fail to reject null hypothesis (p-value β₯ .05),
If we reject null hypothesis (p-value < .05),
what assumption does a two sample t-test have?
that the variances of two groups are equal in the population
Observational study
analyze data already available to us
Designed experiment
control for various factors such as age, gender, or socioeconomic status so that we can learn more precisely what is responsible for the effects we observe
Null hypothesis for one-way ANOVA
there are no differences in population means across conditions
Alternative hypothesis for one-way ANOVA
at least one of them will be different
what is the analysis of variance (ANOVA)
The procedure for analyzing difference between more than two population means
experimental units
Entities measured at each treatment level (or combination of levels)
In ANOVA, what is the variable of primary interest that we wish to measure?
dependent variable
post hoc test
run to confirm where the differences occur between groups
Given the 0.05 significance level, please understand hypothesis testing with a given p-value
If p-value < .05 , then reject the null hypothesis
If p-value >0.05, then fail to reject the null hypothesis
decision tree
enables a decision maker to view all important aspects of the problem at once: the decision alternatives, the uncertain outcomes and their probabilities, the economic consequences, and the chronological order of events
expected monetary value (EMV)
a weighted average of the possible payoffs for this decision, weighted by the probabilities of the outcomes
what things do decision trees consist of
composed of nodes (circles, squares, and triangles) and branches (lines)
decision node
represents a time when the decision maker makes a decision
probability node
represents a time when the result of an uncertain outcome becomes known
end node
indicates that the problem is completedβall decisions have been made, all uncertainty has been resolved, and all payoffs and costs have been incurred
In every regression study, a single variable that we are trying to explain or predict is..
the dependent variable
simple linear regression
quantifies the relationship where there is a single explanatory variable
purpose of simple linear regression
Evaluate the significance of independent variable in explaining the behavior of dependent variable
Predict the values of dependent variable based on the values of independent variable
r^2
the percentage of variation of the dependent variable explained by the regression
rule of thumb about r^2
< .10: trivial
.10 β .30: small to medium
.30 β .50: medium to large
>.50: large to very large
Please clearly understand the interpretation of the regression coefficients for simple linear regression
If Y is dependent variable, and X1 through Xk are explanatory variables, then a multiple regression equation has the form:
π½_0 is the Y-intercept, and π½1 through π½k are the slopes
General Multiple Regression Equation:
Y = π½_0 + π½1X1i + π½2X2i + β¦ + π½kXki
π½_0 and π½s in the equation are regression coefficients
Each slope coefficient is the expected change in Y when this particular X increases by one unit and the other Xs in the equation remain constant
cross-sectional data
usually data gathered from approximately the same period of time from a population
time series data
involve one or more variables that are observed at several, usually equally spaced, points in time
correlation
numerical summary measures that indicate the strength of linear relationships between pairs of variables (continuous variables)
how do we test the assumption of homogeneity in variance
Leveneβs Test of Equality of Variances
H0: γπβ β γ_π = γπβ β γ_π (Two groupsβ variances are equal)
H1: γπβ β γ_πβ γπβ β γ_π (Two groupsβ variances are unequal)
outlier
observations that fall outside of the general pattern of the rest of the observations