Linear Mixed Models Flashcards

1
Q

What is a outlier?

A

An observation that lies outside the overall pattern of the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are the 5 most common causes of outliers?

A

 Data entry or processing error
 Measurement error
 Experimental/Intentional Error

 Sampling Error

 No error at all, just a novelty in the data (think Malcolm Gladwell)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 3 methods too define a outliner? pros and cons of each

A

Z-score (i.e. two standard deviations)

Pro: easy implementation, effective if normal

Con: can eliminate natural tail scores

Inter-quartile range (IQR) method

Pro: easy implementation, more robust to slight deviations in normalcy

Con: only data between 25% and 75%

Cluster Analysis

In k-means outlier detection, the data are partitioned into groups by assigning them to the closest cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

______ can occur as a result of an outlier or experimental mortality

A

Missing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How you deal with an outlier is largely influenced by the nature of the data, type of experimental design and statistical analysis?

A

Is missingness random or systematic?

Cost-benefit

 Cost of running another participant?
 Missing one data point in a two-year longitudinal study

 Availability of participants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 4 solutions to missing data?

A

Discard data

Remember, in a repeated measures design discarding one observation leads to exclusion of all observations for that subject.

Imputation (i.e. replace missing data with substitute value)

  • replace missing data point with mean of observed variable
  • Last value carryforward
  • Use information from related observations
  • Estimation based upon individual and effect means
  • Use information from related observations (ie, mother income for dad)

Replace data

  • Most common for a between groups design without repeated measures

For data with more than two repeated measures use an analysis that models subject data as a random factor

  • mixed models
  • Based upon regression, each subject is treated as a random factor. Establishes a linear trend over data that estimates any missing data points
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How are ANOVA and regression the same?

A

ANOVA reports each mean and a p-value that says at least two means are different

Regression reports only one mean (the reference category) and the difference between the reference category and all other means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a mixed model?

A

Mixed Model
- A statistical model that contains both fixed effects and random effects

Fixed Effect

  • All levels of the effect are sampled

 e.g. the effect of an independent variable — all levels of the independent variable are measured

  • *Random Effect**
  • Only a random samples of levels is

acquired

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the equation for a mixed model?

A

Outcome = Intercept + (fixed effects)+ (random effects) + (error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 5 advantages of linear mixed models over ANOVAs w/ repeated measures?

A
  1. Missing Data
    - Anova all data dropped
    - LMM only that time point dropped
  2. Post hoc-test
    - Anova, cant run
    - LMM can run
  3. Flexibility w/ time
    - ANOVA time is category
    - LMM can be continious
  4. Easier to build more complex models that account for quantifiable sources of error
    - ANova , Can include covariates to attempt to account for the effect of an extraneous variable
    - LMM can directly model the effect of additional variables to explain their effect on the dependent variable
  5. Differing number of repeats/Different time points
    - ANova, time categorical, all categories presents an all data points must be mesured at same time
    - LMM, data can be measured at different points since time continious
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When is ANOVA better then LMM?

A

Simple models
 A pre-post test design with only two levels of the factor time
 A mixed measures ANOVA with two groups and two levels of time

ANOVA is typically a simple model–>always run the simplest model that gives the most accurate result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly