1. Introduction to Linear Model Flashcards

Question 1

Q

What is a model?

Answer

A

Formal representation of a system

Question 2

Q

What are models represented as in statistics?

Answer

A

Models are represented as functions

e.g. height = months of age x 50 cm

Question 3

Q

Why are models represented as functions?

Answer

A

Function allows a model/belief about how something works in the world

Allows precise specification about what is important (argument of belief) and how it occurs (operations)

Precise specification = Prediction = Prediction is tested against real world data

If a model is a true representation then real world data would closely match

Question 4

Q

What is the difference between a deterministic model and a statistical model?

Answer

A

Deterministic model = For exact relationship
Statistical model = Case-by-case variability (shows difference in individual data points)

Question 5

Q

What is a linear model?

Answer

A

Estimating a model for a relationship

Linear model tries to explain variation in an outcome (Y axis/Dependent variable) using one or more predictor (X axis, Independent variable)

Question 6

Q

What is the basic linear model equation?

Answer

A

yi = β0 + β1𝑥i + ei

yi = Outcome variable
𝑥i = predictor variable
β0 = intercept
β1 = slope
ei = residual

Subscript i = Each PPT has their own value

Question 7

Q

What is the residual?

Answer

A

Measure of how well the model fits each data point

Distance between model line (on y axis) and data point

Residual = Positive above line and negative below line

Question 8

Q

What are the two types of outliers we can get?

Answer

A

Marginal - outliers along one axis (x or y)
Jointly - Outliers that don’t fit with the rest of the data

Question 9

Q

What is the principle of least squares?

Answer

A

Process of obtaining a line of best fit from data based on sum of squares of errors, minimum value of estimation . It predicts the behaviour of the dependent variable

Question 10

Q

What does the principle of least squares do to our data?

Answer

A

Minimises residuals for each data point

Doing it across all data = Predicted values are as close to actual measured values of outcome

Question 11

Q

What is the method of least squares?

Answer

A

Fit a line
Calculate residuals
Square residuals
Sum up squares

Question 12

Q

How do we interpret the intercept of a simple linear model?

Answer

A

Expected value of y when x is 0

Question 13

Q

How do we interpret the slope of a simple linear model?

Answer

A

Number of units that y increases for a unit increase of x

Question 14

Q

What does e ~ N(0, sigma) mean?

Answer

A

Distributed in a normal distribution with a mean of 0

Sigma means standard deviation (estimated using model residuals)

Residuals should be the same at any point along the x axis

Question 15

Q

What does a large sigma suggest?

Answer

A

Data is more spread out/further away from the line

1. Introduction to Linear Model Flashcards

(15 cards)