Final session 11b Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

when to use Simple Linear Regression

A

When we want to summarize the linear relationship between two variables, X and Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

how do we do Simple Linear Regression

A

We can do this by drawing a straight line on the scatterplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

a straight line on the scatterplot is called what

A

regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The regression line is a straight line that describes how what

A

Y changes as X changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

For a given observation (i), our simple linear regression equation:

A

Yi = b0 + b1X1i + ei

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

explain the parts to Yi = b0 + b1X1i + ei

A

Yi is the value of the DV for observation i X1i is the value of the IV for observation i ei is the residual for person i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Residuals are called what in the population

A

errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

whats assumed about Errors

A

assumed normally distributed with a constant variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to determine the best regression line?

A

The “best” regression line is one that has the smallest residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is Residual

A
  • “vertical” difference between the regression line and each data
    point
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what method is typically used determine the best regression line

A

Method of least squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is Method of least squares

A

the most common method. The least-squares regression line of Y on X1 is the line that makes the sum of squared residuals as small as possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The least-squares regression line of Y on X1 is determined in such a way that it makes SSR ….

A

as small as possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

b0 and b1 are determined todo what

A

minimize SSR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the goal of least squares

A

In other words, this method aims to minimize the unexplained portion of Y by the regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how to Simple regression used for prediction

A

Using our regression line, if we only had a value on X1, we could predict the value of Y

Plug in value of predictor(s) into the equation for the regression line

17
Q

what is the Statistical test for significance for

A

Often times it is of interest to test the relationship between Y and a predictor variable
If we call β1 the population value for b1, is β1 equal to zero in the population?

18
Q

Statistical test for significance of the slope: give the hypotheses

A

Typically we are testing this hypothesis:
H0 :β1 =0

There is no linear relationship between X1 and Y (no effect of X1 on
Y)
H1 :β1 ̸=0

There is a linear relationship between X1 and Y

19
Q

To test H0 for a slope, we also use what

A

a t-test of the form:

t = b1 − 0 / sb1

20
Q

what is the df for Statistical test for significance of the slope

A

dfR = N − 2

21
Q

the observed value of t is greater than a critical value of t with dfR = N − 2 (and α = .05), we may reject the null hypothesis
This indicates what

A

that the slope is significantly different from zero, suggesting a statistically significant effect of X1 on Y

22
Q

For the t-test to provide accurate results, the following assumptions are required:

A

Relationship between predictor and outcome is linear Independent observations
Homoscedasticity
Normally distributed errors

23
Q

what is Homoscedasticity

A

Variance of errors does not depend on the value of the predictor, X1

24
Q

(simple linear regression) As in ANOVA, we can also divide the variance (or variation) in the DV (Y ) into different parts resulting from different sources
In regression analysis, the total variation in Y is partitioned into:

A

SSM : The variation in Y that is explained by the model (i.e., regression line)
SSR: The variation in Y that is unexplained by the regression line (i.e., the residuals)
SST : Total amount of variation in Y

SST =SSM +SSR

25
Q

for Simple Linear Regression, The F statistic can be used for testing what

A

whether the model overall significantly predicts the dependent variable

26
Q

The F statistic can be used for testing whether the model overall significantly predicts the dependent variable
In the case of a single predictor, what is the hypotheses

A

H0 :β1 =0 H1 :β1 ̸=0

27
Q

If the observed F ratio is greater than a critical value of F with dfM and dfR at α, we may _____ H0

A

reject

28
Q

explain Coefficient of Determination (R2)

A

Proportion of the total variation in Y accounted for by the model R2 = SSM / SST

29
Q

Coefficient of Determination (R2) Ranges from 0 to 1 explain

A

The larger R2, the more variance of the DV is explained 0 = No explanation
1 = Perfect explanation

30
Q

In simple regression, the relationship with Pearson correlation (r) is:

A

r2 = R2

31
Q

R2 is____ high; An adjusted value, R2 adj , is______

A

biased, unbiased