Statistics L1 Flashcards
What do we use to estimate linear regression?
Equation of a straight line
y = mx + c
What is the least-squares criterion?
Ensure equal total distance of the points either side of a regression line, to the line. The value for this distance should aim to be as small as possible
What two things are needed to determine the visual unexplained and explained variance?
Regression line and mean value of y line
How do you determine the explained and unexplained variance visually?
Explained = the distance between the regression line (predicted values) and the mean value of y line Unexplained = the distance between the observed y value and the mean y-value line
What is the f-ratio?
The ratio of explained to unexplained variance
What is the ideal condition for an f-ratio
f > 1
What is the coefficient of explanation and what is its notation?
r^2 = the proportion/percentage of variance that is explained by the model
What distinguishes linear and multiple regression?
linear only involves only 2 variables whereas multiple regression involves many variables
What is factor analysis?
Technique to determine whether a group of results can be attributed to a singular or multiple underlying factors and to what extent. Transform variables in to factors
What is cluster analysis?
Technique to determine whether a group of results can be clumped/clustered together to reflect meaningful and definable piles of similar information e.g. whether they all coordinate based on colour