Simple Linear Regression Flashcards

1
Q

Assumptions for Simple Linear Regression

A

Data must be continuous, interval or ratio scale, normally distributed and no significant skewness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Determining equation of a straight line

A

Y = a + bX

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Least-squares criterion

A

Regression line is fitted with reference to the difference between the line and individual data points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Residual

A

Differences between the data point and the regression line (the unexplained variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Variance

A

Sum of squares/degrees of freedom.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Coefficient of Explanation (r2)

A

Regression sum of squares of Y/total sum of squares of Y. The proportion of variance that is explained by the model. Regression coefficient or scatter plot indicates positive/negative relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Standard Error

A

Simply the Squared root of theunexplained variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Calculating Confidence/prediction intervals

A

We can distinguish between confidence intervals around the line itself and the prediction itself and the prediction intervals that describe the estimates to which estimates of Y vary about the line. The prediction intervals are larger and reflect the collective uncertainties of scatter and errors in sampling a and b.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Standardised Residual

A

Residuals are important because their character has important implications for the regression model. Their magnitude contributes to prediction limits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Homoscedasticity

A

For a model to be wholly reliable, the residuals should be distributed normally about the line. A requirement of homoscedasticity is that the degree of scatter should not vary greatly along the range of X.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Heteroscedastic

A

For a model to be wholly reliable, the residuals should be distributed normally about the line. If it is not fulfilled the data is said to be heteroscedastic and the regression equation may be unreliable for some purposes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Autocorrelation

A

There should be independence between residuals and, consequently, no autocorrelation between them. Autocorrelation is indicated by either long runs of positie or negative residuals along the regression line, or, on the other hand, by rapid and regular fluctuations of residuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Measures of autocorrelation

A

Durbin-Watson d-statistic. Based on the sequence of residuals so that: d = sum of successive squared differences/sum of sqaured residuals.

D-W value - indicate whether to accept or reject null hypothesis. 0.-1.475 indicates positive autocorrelation and 2.5-4.0 indicates negative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Non-Linear Regression

A

There are a number of bivariate relationships in Geography in which incremental changes in the predictor variable (X) are not accompanied by correspondingly uniform changes in the dependent variable (Y). Such relationships are said to be linear and are summarised by curves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Difficulty that lies in non-linear regression

A

There are many possivle forms of non-linear relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Exponential curves use the universal constant of

A

2.7182

17
Q

Partial regression coefficients

A

Represent changes in slope as X increases and are scale dependent.

18
Q

Non linear regression power curve

A

Equation is Y = aXb