Statistical Analysis of fMRI Data: The General Linear Model Flashcards

1
Q

What is the design matrix?

A

It is a matrix that contains variation over time of all independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are regressors of interest?

A

They are the values in the design matrix associated with independent variables that the researcher is interested in. Each regressor of interest is usually associated with one experimental condition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are covariates (nuisance regressors)?

A

They are other variables that the researcher is not interested in, but that still are expected to predict variations of the signal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Are regressors paired exactly with the occurrence of the independent variable?

A

No, they are convoluted to account for the distortion in the signal caused by the Haemodynamic Response Function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When are nuisance regressors actually useful? Can you make some examples?

A

Nuisance regressors are useful when they help predict some variance in the Y (BOLD signal) and reduce epsilon (error). Movement of a participant when blinking eyes or moving a hand to perform a task, heart rate and respiration are examples of useful nuisance regressors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the Beta values telling us? Make an example, what happens if column 3 of the design Matrix has a big beta?

A

Beta values are the coefficient of a regressor. They are telling us how much the variation in the independent variable predicts change in the system. For instance a big beta in column three means that a small change in the independent variable 3 is associated with a big change in the signal Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference in terms of relationship between variables and signal in a simple linear regression and a multiple linear regression?

A

In a simple linear regression values of the independent variables are each linearly associated directly with changes of the signal, in a multiple linear regression the interactions are also among the variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a constant regressor?

A

It is a regressor variable with an associated beta that is used to account for a background noise in the signal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is epsilon assumed to be?

A

Independent and identically distributed according to zero mean normal distribution with unknown variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In a geometric interpretation of the GLM, how would you interpret the vector of residuals?

A

The shortest line in the hyperplane connecting the modeled signal with the actual signal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the mathematical and non mathematical objective of the process called Ordinary Least Squares (OLS)?
Explain its mathematical expression ๐‘‹T๐œ€ฦธ= 0
๐‘‹T( ๐‘ฆ โˆ’ ๐‘‹๐›ฝ)= 0

A

The mathematical objective is to reduce the squared (so that it accommodates for positive and negative) error in the GLM.
The non mathematical is to reduce to the minimum the discrepancy between modeled and actual signal, which is what the second equation is saying

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does efficiency measure? What does it mean to have high efficiency?

A

Efficiency measures the ability of a regressor to predict the signal independently of other regressors. A low efficiency is bad because it gets impossible to tell which condition elicited a certain BOLD response (imagine for instance nuisance regressors covarying strongly with a regressor of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a t-contrast measuring in GLM?

A

If there is a difference between two regressors, it calculates the probability of differences between two conditons.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is an F-contrast measuring in GLM?

A

If there is an effect of first or second regressor, as the t-contrast test it it tested against a null hypothesis.
Itโ€™s a statistical test used to measure the overall significance of a set of predictors or conditions in explaining the observed variance in the data.
It tests the significance of a linear combination of multiple predictors or contrasts. It evaluates whether a combination of predictors, taken together, has a significant impact on the response variable. F-contrasts can be used to test hypotheses about the overall effect of a group of conditions or to compare models with different sets of predictors. (Last part is ChatGPT)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Slow drifts, what are they and how to take care of them

A

Low frequency signals, pure noise for our purposes, we can set a low pass filter and also apply a Discrete Cosine Transform (a nuisance regressor) and regress the slow drifts out of the signal โ€œtemporal filteringโ€

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to deal with the fact that stimulus response functions are boxcar shaped and not reflecting the haemodynamic response function?

A

They are convoluted with te HRF, so also temporally shifted, to give a more realistic prediction of the BOLD signal.

17
Q

How to deal with the fact that despite spatial realignemnt there are still some movement artifacts in the signal?

A

Include the movement artifacts as some nuisance regressor

18
Q

Serial auto correlation, what is it due to and how is it approached?

A

Consecutive fMRI scans are not statistically
independent, i.e. fMRI time series have serial correlation.
An auto-regressive model of order one [AR(1)] is fit
to account for serial correlations.

19
Q

What is the sphericity assumption?

A

Key assumption in the analysis of repeated measures data, (in this case time series data). It assumes that the variances of the differences between all pairs of conditions are equal. In other words, it assumes that the covariance structure of the data is spherical or circular.

20
Q

How do we get from a i.i.d noise model to an enhanced noise model?

A
21
Q

What is the problem of multiple comparisons due to?

A

When running t tests on multiple voxels, despite the p number being really low, the probability of type 1 errors is multiplied by the number of voxels serially analyzed. atype of correction, e.g Bonferroni one, is needed

22
Q

What is Gaussian Random Field Theory?

A

It is a theory that tries to estimate the maximum amount of correlation between voxels (due to both proximity and smoothing in preprocessing) and then taking it into account to temper statistical corrections that would be too conservative. e.g. untempered Bonferroni

23
Q

Parametric Modulators

A

parametric modulator is a regressor that captures the relationship between a specific experimental condition or stimulus and a CONTINUOUS variable of interest. It allows for modeling the effect of the variable on brain activity in a more nuanced and flexible manner.

24
Q

Second Level GLM, what are they?

A

Once the first-level analysis is completed for each participant, the second-level GLM is used to combine and analyze the results across participants. It allows for making inferences about the group or population as a whole. The primary goal of the second-level analysis is to identify consistent patterns of activation or effects across participants and assess their statistical significance.

The second-level GLM involves modeling the individual-level results or parameters obtained from the first-level analysis as the dependent variable. This typically includes the estimated coefficients, contrast estimates, or statistical maps representing the activation or effect size at each voxel or region of interest.

The second-level GLM can include various statistical methods, such as t-tests, ANOVA (analysis of variance), mixed-effects models, or nonparametric tests, depending on the research question and study design. It allows for testing group-level hypotheses, such as identifying brain regions showing significant activation across the entire group or comparing the effects between different conditions, groups, or populations.

In summary, the second-level GLM in fMRI data analysis is performed after the first-level analysis and aims to combine and analyze the results across participants to make group-level inferences. It enables researchers to identify consistent patterns of activation or effects and assess their statistical significance in the context of the entire study population