3 - Fundamental Skills Flashcards

1
Q

What are a covariate and a factor?

A

Covariate - an independent variable, not the primary one. It is measured (quantitative) and can be discrete (a count) or continuous (a measurement).

Factor - a categorical (qualitative) variable used to sort data into groups. It can be nominal (unordered classes) or ordinal (ordered, e.g. size classes).

2
Q

What is the p-value?

A

The probability of getting a test statistic at least as extreme as the one observed, if the null hypothesis is true.
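
As a minimal sketch of this definition, the snippet below computes a one-sided p-value for a coin-flip experiment. The coin example, the data (8 heads in 10 flips) and the name `binomial_upper_tail` are assumptions made for illustration, not part of the card.

```python
from math import comb

def binomial_upper_tail(n_flips: int, observed_heads: int, p_heads: float = 0.5) -> float:
    """P(heads >= observed_heads) if the null (coin has probability p_heads) is true."""
    return sum(
        comb(n_flips, k) * p_heads**k * (1 - p_heads) ** (n_flips - k)
        for k in range(observed_heads, n_flips + 1)
    )

# Hypothetical data: 8 heads in 10 flips, null hypothesis = fair coin.
p_value = binomial_upper_tail(10, 8)
print(round(p_value, 4))  # 0.0547
```

Here "that extreme or more" means 8, 9 or 10 heads, so the p-value is the sum of those three probabilities under the null.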

3
Q

How do we tell when to use the mean versus the median?

A

Median - may be more representative when the data are skewed, as an outlier can pull the mean.

Mean - suitable when the data are roughly normally distributed.
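
A small illustration of the outlier effect, using Python's standard `statistics` module. The data values are made up for the example.

```python
from statistics import mean, median

# Hypothetical data: a symmetric sample, then the same sample plus one outlier.
values = [10, 11, 12, 13, 14]
with_outlier = values + [100]

print(mean(values), median(values))              # mean and median agree
print(mean(with_outlier), median(with_outlier))  # mean is pulled up, median barely moves
```

With the outlier added, the mean more than doubles while the median only shifts from 12 to 12.5.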

4
Q

What is the difference in variation shown by the range and IQR?

A

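Range - the spread of the whole data set (maximum minus minimum), so it is sensitive to outliers.

IQR - the spread of the middle 50% of the data (Q3 minus Q1), so it is less susceptible to outliers.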
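
The difference can be sketched as follows. The helper names `value_range` and `iqr` and the data are assumptions for illustration; quartiles can be computed in several ways, and this sketch picks the `"inclusive"` method of `statistics.quantiles`.

```python
from statistics import quantiles

def value_range(data):
    """Spread of the whole data set: maximum minus minimum."""
    return max(data) - min(data)

def iqr(data):
    """Spread of the middle 50%: third quartile minus first quartile."""
    q1, _, q3 = quantiles(data, n=4, method="inclusive")
    return q3 - q1

# Hypothetical data: replace the largest value with an outlier.
values = [1, 2, 3, 4, 5, 6, 7, 8, 9]
with_outlier = values[:-1] + [90]

print(value_range(values), iqr(values))
print(value_range(with_outlier), iqr(with_outlier))
```

The range jumps from 8 to 89 when the outlier is introduced, while the IQR stays the same.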

5
Q

How do we test correlation?

A

Plot two covariates against each other in a scatterplot.
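
Alongside the scatterplot, the strength of a linear relationship is often summarised with a correlation coefficient. The sketch below computes Pearson's r by hand; choosing Pearson's r, the function name `pearson_r` and the data are assumptions, since the card only mentions the plot.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length covariates."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical data: y rises exactly in step with x, so r is 1 (up to rounding).
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]
print(pearson_r(x, y))
```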

6
Q

How do we test causality?

A

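Model one covariate (the response) as a function of another (the explanatory variable) in a GLM, shown as a scatterplot with a fitted line. A significant fit shows a relationship; whether it is causal also depends on the study design.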

7
Q

How do we test association?

A

Test whether two factors are associated with a chi-square test, shown in a bar plot.
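
A minimal sketch of the chi-square statistic for a table of counts cross-classified by two factors. The function name `chi_square_statistic` and the counts are made up for illustration; turning the statistic into a p-value would additionally need the chi-square distribution, which is not shown here.

```python
def chi_square_statistic(table):
    """Sum of (observed - expected)^2 / expected over every cell of a count table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the two factors were not associated.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic

# Hypothetical counts: rows are levels of one factor, columns are levels of the other.
observed = [[30, 10], [20, 40]]
print(chi_square_statistic(observed))
```

A table whose rows have identical proportions gives a statistic of 0; the larger the statistic, the stronger the evidence of association.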

8
Q

How do we test the means?

A

Test whether group means are statistically different with a t-test, shown in a box plot.
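
As a rough sketch, the t statistic is the difference between two group means divided by its standard error. The card does not say which t-test variant is meant; this example assumes the unequal-variance (Welch) form, and the name `welch_t` and the data are illustrative.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """t statistic for two independent groups, not assuming equal variances."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error

# Hypothetical measurements for the two levels of a factor.
group_a = [5, 6, 7, 8, 9]
group_b = [1, 2, 3, 4, 5]
print(welch_t(group_a, group_b))
```

The further the statistic is from 0, the less plausible it is that the two means are equal.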

9
Q

What kinds of response and explanatory variables are tested in models?

A

GLM - response (covariate), explanatory (covariate or factor).

T-test - response (covariate), explanatory (factor with two levels).

(Multiple) Regression - response (covariate), explanatory (one or more covariates).

10
Q

What is partitioning?

A

Splitting the total variation into the part that is explained by a particular variable and the part that is not.

11
Q

What is R-squared?

A

How much of the response variation is explained by the explanatory variable - a standardised proportion (0 to 1) of the variation explained by the model.

12
Q

When would we use the adjusted R-squared?

A

If there are multiple explanatory variables, because it penalises each extra variable added to the model.
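
The usual adjustment can be sketched as below. The formula is the standard one for a model with an intercept; the function name `adjusted_r_squared` and the example numbers are assumptions for illustration.

```python
def adjusted_r_squared(r_squared, n_observations, n_explanatory):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1)."""
    return 1 - (1 - r_squared) * (n_observations - 1) / (n_observations - n_explanatory - 1)

# Hypothetical model: R-squared of 0.8 from 21 observations and 4 explanatory variables.
print(adjusted_r_squared(0.8, 21, 4))
```

For a fixed R-squared, adding explanatory variables lowers the adjusted value, so an extra variable only helps if it explains enough additional variation.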

13
Q

What is the f-ratio?

A

The mean SS (SS divided by its degrees of freedom) for each explanatory variable, divided by the mean RSS. Each explanatory variable has its own F-ratio, and in GLMs the p-value is the probability of the F being that high or higher if the null is true.
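
A minimal sketch of the calculation for a single explanatory factor (a one-way layout). The function name `f_ratio` and the data are illustrative assumptions; the p-value step, which needs the F distribution, is left out.

```python
def f_ratio(groups):
    """Mean explained SS divided by mean residual SS for one explanatory factor."""
    all_values = [value for group in groups for value in group]
    grand_mean = sum(all_values) / len(all_values)
    # Explained SS: variation of the group means around the grand mean.
    ess = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Residual SS: variation of the values around their own group mean.
    rss = sum((value - sum(g) / len(g)) ** 2 for g in groups for value in g)
    explained_df = len(groups) - 1
    residual_df = len(all_values) - len(groups)
    return (ess / explained_df) / (rss / residual_df)

# Hypothetical response values for the three levels of a factor.
groups = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(f_ratio(groups))
```

In this example ESS is 54 on 2 degrees of freedom and RSS is 6 on 6 degrees of freedom, so the F-ratio is 27 / 1 = 27.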

14
Q

Sum of Squares.

A
  1. Calculate each deviation (mean - value).
  2. Square each deviation (so the deviations no longer total 0).
  3. Sum the squared deviations.
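
The three steps above can be sketched as follows. The function name `sum_of_squares` and the data are made up for the example.

```python
def sum_of_squares(values):
    """Sum of squared deviations from the mean, following the three steps."""
    mean_value = sum(values) / len(values)
    deviations = [mean_value - value for value in values]  # step 1
    squared = [deviation ** 2 for deviation in deviations]  # step 2
    return sum(squared)                                     # step 3

# Hypothetical data: the raw deviations are 2, 0 and -2, which total 0.
print(sum_of_squares([2, 4, 6]))  # 8.0
```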
15
Q

R-squared calculation.

A

R-squared = ESS (explained sum of squares) / TSS (total sum of squares)
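
To make the ratio concrete, the sketch below fits a straight line by least squares and partitions the variation. It assumes a simple regression with one covariate; the function name `partition_variation` and the data are illustrative.

```python
def partition_variation(x, y):
    """Return (TSS, ESS, RSS) for a least-squares straight-line fit of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    slope = (
        sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
        / sum((a - mean_x) ** 2 for a in x)
    )
    intercept = mean_y - slope * mean_x
    fitted = [intercept + slope * a for a in x]
    tss = sum((b - mean_y) ** 2 for b in y)               # total variation
    ess = sum((f - mean_y) ** 2 for f in fitted)          # explained by the model
    rss = sum((b - f) ** 2 for b, f in zip(y, fitted))    # left unexplained
    return tss, ess, rss

# Hypothetical data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
tss, ess, rss = partition_variation(x, y)
r_squared = ess / tss
print(round(r_squared, 3))  # 0.6
```

For a least-squares fit like this one, TSS equals ESS plus RSS, so R-squared can equally be written as 1 - RSS/TSS.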

16
Q

Mean RSS/ESS

A

Mean ESS = ESS / explained degrees of freedom; mean RSS = RSS / residual degrees of freedom.