Quantitative methods Flashcards
Durbin-Watson tests for?
Value 0 =
Value 2 =
DW lower =
DW middle =
DW upper =
serial correlation
Value 0 = perfect serial correlation
Value 2 = No serial correlation
DW lower = reject the null
DW middle = inconclusive
DW upper = do not reject the null
T stat =
t stat = Coefficient - mean / SE
mean might be zero
R^2 coefficient of determination =
Adjusted R^2 =
Correlation coefficient =
SST =
MSE =
SEE =
R^2 = RSS / SST =
Adjusted R^2 = lower than R^2
Square root of R^2
SST = SSE + RSS
Total Variaiton = SSEunexplained + RSSexplained
MSE = SSE / (N-k-1)
SEE = Square root of MSE
What is multicollinearity?
Signs of multicollinearity
Problems caused by it?
Correcting multicollinearity?
What is multicollinearity?
Two or more independent variable are highly correlated.
Signs of multicollinearity? High R^2, low t-score, use of a matrix.
Problems caused by it? Estimates of regression coefficients can be unreliable.
Correcting multicollinearity? 1. Remove some of the variables 2. Re-run the model
What is serial correlation?
Test
Correcting for serial correlation?
When variables in a time series appear correlated over previous periods of time (used to predict security price changes. Error terms appear correlated. Use DW upper and lower limits to see. Above upper to 2 no serial correlation exists
Test = Durbin Watson
Correcting for serial correlation? Hansen Method adjusts the SE of regression coefficients UPWARDS until there is no serial correlation.
Which model assigns a 1 or 0 to the value of an independent variable?
Discriminant analysis models
Hansen method adjusts what/?
Adjusts standard errors for both conditional heteroskedasticity and serial correlation
What do probit models test for?
How do they estimate the value of the dependant variable?
What sort of variables can probit test?
Test for normal distribution.
Dependant variable is expected to be 1.
Test for qualitative variables
What is homoskedasticity?
What is conditional heteroskedasticity?
What is unconditional heteroskedasticity?
Variance of the error term is constant across all observations
Variance of error terms changes in a systematic manner that is correlated with values of the independant variable
Variance of the error term changes in an unsystematic way that is not correlated with the independent variables.
Dicky Fuller tests for?
Durbin Watson tests for?
Breusch Pagen tests for?
Dicky Fuller tests for Non-stationarity
Durbin Watson tests for serial correlation
Breusch Pagen tests for conditional Heteroskedasticity using chi-squared.
Y = b0 + b1X1 + b2X2 + error term
b0 =
b1X1 =
b2X2 =
Y = What you are forecasting b0 = Intercept b1X1 = Independent variable x regression coefficient b2X2 = Independent variable x regression coefficient
T-stat =
Coefficient - mean / SE
Coefficient could be advertising or hours worked
F Stat =
F Table =
F stat > F table
F Stat = (RSS/K) / (SSE/n-k-1) or RSS/K
F Table = k / n-k-1
F stat > F table then one independent variable explains variance of the dependent variable
k = number of independant variables ie advertising and hours worked k = 2 n = number of data sets usually years
How many dummy variables needed for 4 quarters?
Dummy variables is ONE less. So 3.
Mean reverting level =
MRL = b0 / 1 - b1
AR Model with 101 observations, SE =
What test is used and what level shows AR model is not correctly specified? =
Less than this level shows?
SE = 1 / Square root of n
T-distribution is used at 5%, above 2 is mis-specified.
Less than this level shows that it is specified correctly.
Random walk =
Random walk unit root lag coefficient b1 =
Mean Reverting =
Random walk = Is Significant
lag coefficient = 1
Mean Reverting = Not significant
ARCH Errors =
error terms are heteroskedastic and SE of regression coefficient is incorrect.
Signs include lagged variable being significantly different to zero in the model.
Dicky fuller
a) has a problem with the unit root
b) does not have a problem with the unit root
Dicky fuller
a) has a problem with the unit root = accept the null
b) does not have a problem with the unit root = reject the null as they are cointegrated.
First differenced random walk =
Yt = b0 + error term
Where Yt = Xt - Xt-1
A lag coefficient > 1 concludes
Lag coefficient > 1 = The model has an explosive root.
First differencing =
Allows analysts to conclude original time series are random walk
Dendogram hierarchical clustering with short dendrites indicates?
Denrites are vertical lines and shorter lines indicate similar clusters of data.
Supervised learning
vs
Unsupervised learning
Supervised learning uses prelabelled data such as fraudulent activities
vs
Unsupervised learning does not use pre-labelled data and algorithms try to describe the data.