Data Analysis Flashcards

1
Q

ggplot()

A

Scatterplot that shows correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

GGally: ggpairs ()

A

Scatterplot matrix shows the correlation between various time series (more than 2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Positive correlation in lag plots

A

Means strong seasonality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does correlation measure?

A

It measures the extent of linear relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does autocorrelation measure?

A

It measures the linear relationship between lagged values of a time series

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a “lag”?

A

A fixed amount of passing time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How data changes depending on the autocorrelations for the lags?

A
  1. When data have a trend, the autocorrelations for small legs tend to be larger and positive because obdervations nearby in time are also nearby in value. Thus, the ACF of a trended time series tends to have positive values that slowly decrease as the lags increase
  2. When data are seasonal, the autocorrelations will be latger for the seasonal lags than for other lags
  3. When data are both trended and seasonal you see combination of those effects.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

White noise

A

For white noise series we expect autocorrelation to be close to zero, 95% of spikes in the correlogram (ACF) to lie withing blue lines (+-2^T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

3 types of time series pattern?

A

Seasonality, trend and cycles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

3 components that are comprising time series? Formula

A

A trend-cycle component (or trend), a seasonal component, a remainder component (containing anythjng else in the series)
yt = St +Tt + Rt

y-data
S-seasonal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Box Cox transformation purpose

A

It transforms the data so that it closely resembles a normal distribution. (To make errors normally distributed to construct confidence intervals and conduct hypothesis tests).
Additionally transforming our variables csn improve the predictive power of our models because transformations can cut away white noise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is guerrero feature?

A

It can be used to choose the best lambda value for u in box cox transformstion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly