Week 8 Flashcards

1
Q

For continous variables, which information do we want to look at: M (SD) or N(%)?

A

Mean and Standard Deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

With fixed effects in a random intercept model, what are you estimating between X and Y?

A

The overall association between X and Y for the WHOLE sample without allowing the strength of direction of the effect/slope to vary randomly by person.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

If you have a fixed effect in a random intercept model, are you assuming the effect of X on Y will be different or the same for everybody?

A

The same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

If you have a random intercept with a fixed effect, and say Y is stress and X is energy, what is the random intercept actually showing?

A

That the mean value of stress does vary between persons because it is random, however, the overall effect of energy on stress for the whole sample is fixed - this does not vary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The Int_Fri variable contains information on whether a person interacted with a friend on a day. One = they did, Zero = they didn’t.

Does the 0 and 1 column here tell us that 64% of people interacted with a friend on any day?

A

No. It tells us that of the observations (days) in the survey, 64% featured an interaction with a friend

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In the Bstress variable, or between stress, the average scores of stress were firstly averaged across the days per person

(e.g ID 10 got 3, 2, and 5 across three days. Then divided that by 3)

THEN that score is ….?

A

Averaged across the sample for all people.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What type of variable is being shown here and how has the data been dealt with?

A

The betweenstress variable (Bstress), which is avg. score per person across days, and for those we only want ONE row of data input per person (not duplicated operator being used).

Why? Because we want to avoid inflating the mean for the individuals who filled out more observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

To ensure that repeated baseline variables (seen in between person variables such as sex on loneliness measures, because often the things that are repeated are biological sex) do not contribute a higher weighting to the mean, what might we do?

A

We drop the duplicated ID by using thegltable code but als data= dm[!duplicated(ID)]

If you have long format data set and want to report descriptives statistics on variables that do NOT change across time points, this is one way to include non duplicated ID’s.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When looking at descriptives of any between subjects variables in multilevel data need to ask yourself…?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If the variable does NOT change within a person in a dataset, do you want the mean of all observations?

A

No, as people with more observations will count more towards the descriptives

Better to have the mean of the observations within the one ID.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Will this variance in this WITHIN continous variable show variance in both between and within aka total variance?

A

No, just mean of individual means so variance will be between units only.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In this within continous variable, are we seeing the mean of individual means, so between units only, or mean across all observations, so variance in both between AND within units also known as TOTAL Variance?

A

Yes! Variance in both between and within units, aka TOTAL VARIANCE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What could be a problem with taking the mean across ALL observations, not just people?

A

You include a lot of variance from people who provided more data. Potentially bias the results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In the mean of individual means, what will the variance be taken from?

A

Between units only

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In the mean across ALL observations, what will the variance be taken from?

A

Variance will be TOTAL variance taken from both between and within units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can you report variance on between units only?

A

Average by person and report descriptives on individual averages

or

Report descriptives on individual timepoints/assessments.

17
Q

How can you report total variance?

A

Report descriptives on the overall variable whih captures the total variance BUT may unequally weight participants (as takes into account variance from people who provided more observations)

18
Q

What are the two variance estimators for our LMM?

A
19
Q

Of the two variance estimators, ML and REML, which one would we use and why ?

A

Prefer to use REML as it doesn’t need to know the mean, therefore it is less biased, whereas ML estimates are biased.

REML is good if you need variance estimates, BUT because it transforms the data we can’t use for model comparison

20
Q

If you see a SD for a model is fixed at 0. And the model is normally distributed. What type of model can you assume you are looking at and why?

A

Because it is fixed at 0 we assume it is a fixed effect as 0 does not allow variation in the intercepts or for random effects.

So it’s just assumed you are looking at the estiamted mean across everyone.

If the intercepts are mixed effects then individual variation in the means(aka intercepts) is allowed for.