Statistical modelling in Space and Time Flashcards

Question

Equation for the variogram

Answer 1

γ(h) = σ² − C(h) Given a covariance function we can calculate the corresponding variogram

Answer 2

C(h) → 0 as h → ∞ lim h→∞ [γ(h)] = σ² And we have C(h) = σ² − γ(h) = lim k→∞ [γ(k)] − γ(h)

Answer 3

Kriging is using the variogram to interpolate between data | i.e. using a variogram instead of a covariance function

Answer 4

In simple kriging the mean is assumed to be constant and known (i.e. zero)

Answer 5

The mean is estimated as well as the parameters in the variogram

Answer 6

The mean is a function of some covariates, usually but not exclusively, spatial co-ordinates.

Answer 7

1. number of pairs in each bin - more data so trust more thus larger weighting than elsewhere 2. the theoretical variogram - where variogram is higher, trust it more(?) 3. equal weights

Answer 8

Calculate the sample variogram Choose a shape for the variogram Fit that variogram to the sample variogram by weighted least squares

Answer 9

Hawkins and Cressie is an alternative to estimating the sample variogram.

Answer 10

squared loss = mean of the posterior; absolute loss = median; (0, 1) loss = mode (also known as maximum a posteriori (MAP) estimates)

Answer 11

``` Subjective Bayes Objective Bayes Conjugate Priors Non-Informative Priors Informative Priors (MCMC methods) ```

Answer 12

Gibbs Sampler | Metropolis-Hastings

Answer 13

Conjugate prior is one such that the formula for the posterior and the prior are the same

Answer 14

Improper priors / Non-informative priors

Answer 15

Maximise the posterior (MAP) MCMC Approximate the posterior and sample from that Discretise the prior on δ

Answer 16

Monte Carlo

Answer 17

Leave one out Leave N out If a completely independent data set, hold some back and use to check (Indivudual Prediction Errors)

Answer 18

Parameters ICs BCs

Answer 19

STRUCTURAL UNCERTAINTY; - Uncertainty in the underlying science (Don't know the world perfectly, hence equations not perfect) - Uncertainty in the solution of the equations (the discretisation adds additional uncertainty etc) UNCERTAINTY IN THE INPUTS

Answer 20

A Gaussian process to model the simulator output as a function of its input

Answer 21

How sensitive is the simulator output to a change in an input (or combination of inputs)

Answer 22

If we are uncertain about the simulator inputs what does that say about our uncertainty on the outputs

Answer 23

Optimised Latin Hypercubes and quasi-Monte Carlo sequences(Sobol)

Answer 24

``` Maximin - maximise the minimum distance between points Orthogonal designs (e.g. good coverage on x1 + x2 ) ```

Answer 25

Distributing points in some space such that they are evenly distributed with respect to some (mostly geometrically defined) subsets. The discrepancy (irregularity) measures how far a given distribution deviates from an ideal one.

Answer 26

Specify the Gaussian process model (mean and covariance function) Select the prior distributions for the GP hyperparameters Choose a design for training and validation Run the ensemble of model runs Fit the emulator to the simulator runs Validate and re-fit if needed

Answer 27

Time has one direction Time series is considered as discrete as it is often collected at regular intervals (Trends more common in time series - need to consider more) Time series are used to extrapolate, whereas spatial data is normally used for interpolation

Answer 28

Seasonal anomalies | Seasonal differences

Answer 29

Strictly Stationary | Strict stationarity implies weak/second-order stationarity and the converse is true for Gaussian processes only.

Answer 30

Intrinsic stationarity

Answer 31

Variance = 1

Answer 32

The Wiener-Khintchine Theorem

Answer 33

The Fourier transform of a valid covariance (correlation) function is a density function (and vice versa) Allows you to talk about time series in a Fourier Space

Answer 34

The fourier transform of the ACF is called the Spectral Density Function (spectrum) NB the fourier transform of the spectrum is also the ACF

Answer 35

Take the fourier transform of the cross-covariance between 2 time series x_t and y_t

Answer 36

``` Complex number S_xy = c_xy − iq_xy c_xy is the co-spectrum q_xy is the quad-spectrum S_yx is the complex conjugate of S_xy ```

Answer 37

Coherency = correlation between the x and y at a particular frequency

Answer 38

Can because of complex number Amplitude: sqrt[(c_xy)^2 + (q_xy)^2] Phase: arctan(c_xy/q_xy)

Answer 39

Yes | It is model-free

Answer 40

Difference equations

Answer 41

εt is white noise. εt is i.i.d (independently and identically distributed) from a normal distribution with mean zero and with variance σ^2_w

Answer 42

Auto-covariance function / variance of the process h is the lag ρ(h=0) = 1

Answer 43

``` x_t = ε_t + β1*ε_{t-1} x_t = (1 + β1*B)ε_t ```

Answer 44

x_t = α*x_{t-1} + ε_t

Answer 45

|α| < 1 ~ stationary process α = 1 ~ random walk |α| > 1 ~ explosive process

Answer 46

An AR(q) model is actually an MA model with infinite order. An MA model is an infinite order AR model. (Can be shown for q=1 and for any q i.e. all AR processes)

Answer 47

Innovation variance

Answer 48

PACF: what correlation do I have in the next lag, that isn't explained by all the previous lags

Answer 49

Causal form, define xt in terms of εt (put everything on the MA side of the ARMA model) Invertible form, define εt in terms of xt (put everything of the AR side of the ARMA model)

Answer 50

Let θ(z) and Φ(z) be the AR and MA polynomials with B (the backward shift operator) replaced with a complex number z. An ARMA process is causal iff Φ(z) =/= 0 for |z| <= 1

Answer 51

Let θ(z) and Φ(z) be the AR and MA polynomials with B (the backward shift operator) replaced with a complex number z. An ARMA process is invertible iff θ(z) is =/= 0 for |z| <= 1

Answer 52

Best AIC or BIC values Akaike Information Critera (AIC) Bayesian Information Critera (BIC)

Answer 53

ARIMA | DLM

Answer 54

Residuals should be normally distributed | The residuals should be uncorrelated (can tell by ACF, or spectrum)

Answer 55

AR can be done differently because linear; One step ahead prediction error, by minimising the mean squared prediction error - substituting in the best linear predictor. ARMA and MA; The Durbin-Levinson Algorithm

Answer 56

The state space | What we're most interested in...

Answer 57

DLM | Dynamic Linear Model

Answer 58

State equation | Observation equation

Answer 59

G_t (Φ_t) in state equation | F_t (A_t) in observation equation (F^T)

Answer 60

REGRESSION MATRICES F_t (A_t) ~ for the observation equation G_t (Φ_t) ~ for the state equation VARIANCE MATRICES OF THE NORMAL DISTRIBUTION OF THE ERROR TERM V_t ~ for the observable equation W_t ~ for the state equation

Answer 61

Time Series DLM (TSDLM) | Constant DLM

Answer 62

``` A Univariate DLM has y_t (Y_t) and v_t univariate. Note x_t (θ_t) can still be a vector in a univariate DLM. ```

Answer 63

We are trying to estimate the properties of the state (x_t) from the data (y_s) If t > s this is forecasting (using only the past) If t = s this is filtering (using the past and present) If t < s this is smoothing (using past, present and future)

Answer 64

Numerically MLE Bayes Apart from some special cases MCMC Gibbs sampler (Usually use this)

Answer 65

A way to forecast data

Answer 66

Combine data with output from a numerical model to make a better forecast 1) Kalman filter 2) Variational Methods

Answer 67

Run an ensemble of models that are spread enough to | allow us to calculate P_ft

Answer 68

1. CHANGE SPACE: Warp space so that our conventional methods work 2. CHANGE METHODS: Explicitly use a GP model that includes the non-stationarity

Answer 69

y(x) = µ(x) + σ(x) + ε(x) ``` • y(x) - our output • µ(x) - deterministic mean function • σ(x) - zero mean GP • ε(x) - nugget to cope with measurement error and small scale variability ```

Answer 70

* Geographic space is G-space | * Deformed space is D-space (D=f(G))

Answer 71

A model where you can estimate all the parameters

Answer 72

1. Use a non-stationary covariance function (scale/marginalise) 2. Use the process convolution definition of GPs 3. Reformulate the GP as the solution to a stochastic partial differentiable equation

Answer 73

The covariance function

Answer 74

Stochastic Partial Differential Equation (NB, not deterministic) These are Partial Differential Equations driven by white noise.

Answer 75

LINEAR SPDEs | The solution of all linear SPDEs are Gaussian processes.

Answer 76

The inverse of the covariance matrix

Answer 77

It is sparse So can use sparse-matrix methods Which are fast calculations

Answer 78

Integrated Nested Laplace Approximation A procedure that uses the Laplace approximation to approximate the posterior for a set of Gaussian models (including GMRF)

Answer 79

MCMC for Bayesian inference

Answer 80

S_s ~ spatial variance matrix (qxq) | S_t ~ temporal variance matrix (pxp)

Answer 81

Empirical Orthogonal Functions (EOFs) | PCA

Answer 82

Separability implies that - the spatial correlation structure does not change with time and that - the time structure does not change with space. The covariance function in space and time can be separated into a spatial part and a temporal part.

Answer 83

Generalised DLM | Coregionalisation?