1. Introduction to Linear Model Flashcards

1
Q

What is a model?

A

Formal representation of a system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are models represented as in statistics?

A

Models are represented as functions

e.g. height = months of age x 50 cm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why are models represented as functions?

A

Function allows a model/belief about how something works in the world

Allows precise specification about what is important (argument of belief) and how it occurs (operations)

Precise specification = Prediction = Prediction is tested against real world data

If a model is a true representation then real world data would closely match

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between a deterministic model and a statistical model?

A

Deterministic model = For exact relationship
Statistical model = Case-by-case variability (shows difference in individual data points)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a linear model?

A

Estimating a model for a relationship

Linear model tries to explain variation in an outcome (Y axis/Dependent variable) using one or more predictor (X axis, Independent variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the basic linear model equation?

A

yi = β0 + β1𝑥i + ei

yi = Outcome variable
𝑥i = predictor variable
β0 = intercept
β1 = slope
ei = residual

Subscript i = Each PPT has their own value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the residual?

A

Measure of how well the model fits each data point

Distance between model line (on y axis) and data point

Residual = Positive above line and negative below line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the two types of outliers we can get?

A

Marginal - outliers along one axis (x or y)
Jointly - Outliers that don’t fit with the rest of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the principle of least squares?

A

Process of obtaining a line of best fit from data based on sum of squares of errors, minimum value of estimation . It predicts the behaviour of the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the principle of least squares do to our data?

A

Minimises residuals for each data point

Doing it across all data = Predicted values are as close to actual measured values of outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the method of least squares?

A

Fit a line
Calculate residuals
Square residuals
Sum up squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we interpret the intercept of a simple linear model?

A

Expected value of y when x is 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we interpret the slope of a simple linear model?

A

Number of units that y increases for a unit increase of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does e ~ N(0, sigma) mean?

A

Distributed in a normal distribution with a mean of 0

Sigma means standard deviation (estimated using model residuals)

Residuals should be the same at any point along the x axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a large sigma suggest?

A

Data is more spread out/further away from the line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly