First year Statistics Flashcards

To recap what was learnt about statistics in my first year.

1
Q

Give the types of data.

A

Discrete:
Nominal - lowest form, categorical, basic
Ordinal - has a rank, but categorical e.g. Likert Scales

Continuous:
Interval - regular intervals, ranked, no absolute 0 e.g. height, time
Ratio - highest power, ranked, absolute 0 e.g. %s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the assumptions for the different types of tests?

A

Sample size, data type, skewness, peakedness

Non-parametric: <30, all discrete tests, continuous data that is not normally distributed

Parametric: >30, continuous data that is normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name the inferential statistical tests for both types.

A

Non-parametric: Chi-Squared, Mann-Whitney, Kruskal-Wallis

Parametric: T-tests 1 and 2-tailed, ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name the types of relational statistics.

A

Measure correlation (not causation) based on distance from 0.

Non-parametric: Spearman’s Rank
Parametric: Pearson’s r

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different confidence intervals?

A
  1. 05 = 95%
  2. 01 = 99%
  3. 001 = 99.9%
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Simple Linear Regression? What does it permit us to do?

A

Permits us to make numerical predictions of one variable by reference to another
Involves the comparison of a dependent variable (changed) and independent variable (measured)

Attempts to predict the changes in Y based on changes observed in X; proportion of changes in Y explained by X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is variance?

A

Sum of squares/degrees of freedom

Explained = diffrence between mean and predicted values

Unexplained = difference between observed and predicted (a.k.a. RESIDUAL)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the outputs of a simple linear regression model? Explain their significance.

A

R-square = type and strength of relationship; +1 is strong positive, -1 strong negative

F-ratio = explained / unexplained variance;
The further from 1, the better the model at explaining changes in Y

Test statistic = if P > 0.05, then can accept alternative hypothesis and reject the null, and conclude that the model explains a significant proportion of changes in Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Summarise the overall aim of simple linear regression.

A

To maximise explained variance which is implied by a large F-ratio. The higher the R-square, the more variance in Y explained by C i.e. there is a higher proportion of variance explained by the model. This also suggests values were closer to the line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly