Forecasting cards Flashcards
ways to decompose seasonality from data
classical seasonal decomposition, X11 decomposition, SEATS decomposition
ARIMA
Autoregressive, integrated moving average
Establish stationarity
Rolling statistics (rolling mean should be similar to zero)
ADF test
Autoregressive model
a model that assumes output is based on its own previous values and a stochastic term
Augmented Dickey–Fuller test
Test to determine if unit root. The more negative, the less likely a unit root.
If there is a unit root present in the time series (null hypothesis), it implies that the time series is non-stationary.
If the null hypothesis is not rejected, the time series is non-stationary. If the null hypothesis is rejected, it’s stationary.
KPSS test
Test for stationarity of a time series. Null hypothesis is that time series is stationary.
K-means clustering
a process that assigns each data point to its nearest center point and then recomputes the mean of each group.
Elbow method determines the best number of clusters, by mapping out average distance to each cluster center
DBSCAN
Creates clusters by iteratively going over each point and if there is enough other points within a certain distance, a cluster is started and it continues to take other points that satisfy the criteria within the cluster. Points that do not reach the threshold are classified as noise