Year 1 Flashcards
What are the four scales of measurement?
Nominal, ordinal, interval, ratio
What distinguishes ordinal and interval?
Interval data has equal, measurable intervals between values; ordinal data only guarantees an ordering/ranking
What distinguishes interval and ratio?
Interval does not have an absolute zero point but ratio data does
What are the 4 moments of a distribution?
- Central tendency
- Dispersion
- Skewness
- Kurtosis
Negative skew?
Mode > mean (the tail extends towards lower values)
Positive skew?
Mode < mean (the tail extends towards higher values)
platykurtic, mesokurtic, leptokurtic?
Platykurtic (k<3)
Mesokurtic (k~3)
Leptokurtic (k>3)
Parametric data analysis requirements?
- continuous data
- n>30
- normally distributed
Non-parametric data analysis requirements?
- not continuous
- n<30
- non-normal
What process allows us to begin conducting arithmetic on parametric data?
Normalisation or standardisation
What happens to the mean and SD after you carry out normalisation on a distribution?
Mean = 0 SD = 1
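As a quick check, standardisation can be sketched in Python (the data values here are made up, purely for illustration):

```python
import numpy as np

# Hypothetical sample values, purely for illustration
data = np.array([12.0, 15.0, 9.0, 20.0, 14.0, 11.0, 18.0, 13.0])

# Standardisation: subtract the mean and divide by the standard deviation
z = (data - data.mean()) / data.std()

print(z.mean(), z.std())  # mean becomes 0, SD becomes 1
```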
What way is a null hypothesis always phrased?
Negatively
who decides the level of significance associated with hypothesis testing?
The analyst/user, based on judgement and the characteristics of the distribution (0.05 is the common convention)
When is something classed as not statistically significant?
If the test's p-value falls above the chosen significance threshold (e.g. p > 0.05).
What are the 2 ways of determining whether a distribution is normally distributed?
Q-Q plot
K-S test
How does a q-q plot work?
Sample quantiles are plotted against theoretical normal quantiles; the points should lie closely along the line (representing a normal distribution) and be evenly distributed either side of it.
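The Q-Q plot's straight-line check can also be read off numerically with scipy's probplot, which returns the correlation of the points with the reference line (the sample below is simulated, not real data):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
sample = rng.normal(loc=50, scale=5, size=200)  # simulated, roughly normal data

# probplot pairs each ordered sample value with the quantile expected
# under a normal distribution, plus a least-squares line through the points
(osm, osr), (slope, intercept, r) = stats.probplot(sample, dist="norm")

# r near 1 -> points hug the reference line -> consistent with normality
print(round(r, 3))
```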
How does a K-S test work?
Hypothesis testing: if the returned p-value lies above the significance threshold then there is NOT a statistically significant difference between the investigated distribution and a normal distribution, i.e. it can be treated as normally distributed
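A sketch of the K-S test using scipy (the sample here is an artificial, perfectly normal-shaped one built from the normal quantile function, just for illustration):

```python
import numpy as np
from scipy import stats

# Hypothetical sample: in practice you would pass your own measurements
sample = stats.norm.ppf(np.linspace(0.005, 0.995, 199))

# Compare the sample's empirical CDF against a standard normal CDF;
# a large p-value means no significant departure from normality
stat, p = stats.kstest(sample, "norm")
print(p > 0.05)
```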
What are inferential statistics?
Tests of difference, either between two samples or between a sample and a population
What are the 3 parametric inferential statistics?
One-sample t-test = sample and population
Two-sample t-test = sample and sample
ANOVA = 2+ samples
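All three parametric tests are available in scipy; a minimal sketch on simulated samples (the means and sample sizes are made up):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
a = rng.normal(100, 10, 40)  # sample from a population with mean 100
b = rng.normal(115, 10, 40)  # sample from a population with a higher mean
c = rng.normal(115, 10, 40)

# One-sample t-test: one sample vs a known population mean
t1, p1 = stats.ttest_1samp(a, 100)

# Two-sample t-test: sample vs sample
t2, p2 = stats.ttest_ind(a, b)

# One-way ANOVA: two or more samples
f, p3 = stats.f_oneway(a, b, c)
print(p1, p2, p3)
```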
What are relational statistics?
Testing for a relationship between variables i.e. correlation.
What is the parametric relational statistic test?
Pearson’s correlation coefficient
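A small Pearson example with made-up, near-linear data:

```python
import numpy as np
from scipy import stats

# Invented data, roughly y = 2x with small noise
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 3.8, 6.2, 7.9, 10.1, 11.9])

# r near +1 indicates a strong positive linear relationship
r, p = stats.pearsonr(x, y)
print(round(r, 3), p < 0.05)
```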
What are the 4 non-parametric statistical tests?
One-way chi-square = sample and population
Two-way chi-square = 2+ samples
Mann-Whitney U (MWU) = comparison of two samples (non-parametric counterpart of the two-sample t-test)
Kruskal-Wallis = non-parametric counterpart of ANOVA (2+ samples)
What is the non-parametric relational statistic test?
Spearman’s Rank
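Sketches of the non-parametric tests in scipy, run on made-up samples:

```python
from scipy import stats

# Two small invented samples with clearly different levels
g1 = [3, 4, 2, 5, 4, 3]
g2 = [8, 9, 7, 10, 9, 8]

# Mann-Whitney U: non-parametric comparison of two independent samples
u, p_u = stats.mannwhitneyu(g1, g2, alternative="two-sided")

# Kruskal-Wallis: non-parametric ANOVA counterpart (2+ samples)
h, p_kw = stats.kruskal(g1, g2)

# One-way chi-square: observed counts vs expected (uniform by default)
chi, p_chi = stats.chisquare([18, 22, 20, 20])

# Spearman's rank correlation on two short ranked series
rho, p_rho = stats.spearmanr([1, 2, 3, 4, 5], [2, 1, 4, 3, 5])
print(p_u < 0.05, p_chi > 0.05, rho)
```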
What is the ‘least squares criterion’?
the principle that the regression line is fitted so that the sum of the squared differences (residuals) between the points and the line is as small as possible, with the residuals balancing out either side of the line.
What is the f-ratio?
the ratio of explained variance to unexplained variance
What is the coefficient of explanation for linear regression?
R squared
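The least squares fit, F-ratio, and R squared from the last three cards can be tied together in one numpy sketch (the x/y values are invented):

```python
import numpy as np

# Made-up data with a clear linear trend plus a little noise
x = np.array([1, 2, 3, 4, 5, 6, 7, 8], dtype=float)
y = np.array([2.1, 4.0, 6.2, 7.9, 10.1, 12.0, 13.8, 16.2])

# Least-squares fit: the slope/intercept minimising the sum of squared residuals
slope, intercept = np.polyfit(x, y, 1)
fitted = slope * x + intercept
residuals = y - fitted

# Partition the total sum of squares into explained and unexplained parts
ss_total = ((y - y.mean()) ** 2).sum()
ss_explained = ((fitted - y.mean()) ** 2).sum()
ss_residual = (residuals ** 2).sum()

# R squared: proportion of variance explained by the line
r_squared = ss_explained / ss_total

# F-ratio: explained variance over unexplained variance (k = 1 predictor)
n, k = len(x), 1
f_ratio = (ss_explained / k) / (ss_residual / (n - k - 1))
print(round(r_squared, 4), round(f_ratio, 1))
```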
What are two sources of regression error?
Standard error = error reflecting natural scatter in the data that is inherently difficult for any line to represent
Sampling error = error arising when the fitted line's characteristics (slope, intercept) are incorrect or poor at representing the population because the sample is unrepresentative
What is homoscedasticity?
When residuals are consistently (evenly) spread either side of the regression line along its whole length.
Why is homoscedasticity important?
Because an even spread of residuals either side of the regression line is one of the assumptions underlying regression analysis
How do we test for homoscedasticity/heteroscedasticity in SPSS?
- scatter plot needs to be well scattered with no clear patterns
- P-plot needs to have similar amount of points either side of line and be well tied to the line
- histogram needs to be normal in style (Gaussian)
What is autocorrelation?
When the successive x and y observations (or their residuals) are not independent, i.e. each value is influenced by neighbouring values rather than standing alone
What is wrong with autocorrelation?
Our parametric tests assume it does not occur, so its presence violates their independence assumption
What test do we use for testing for autocorrelation?
durbin watson
What is the range of values for significant positive autocorrelation, no autocorrelation and significant negative autocorrelation?
positive autocorrelation = 0-1.475
no autocorrelation = 1.566-2.434
negative autocorrelation = 2.525 - 4
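The Durbin-Watson statistic itself is simple to compute from the residuals (this is a from-scratch sketch, not the SPSS output; statsmodels also provides a durbin_watson function):

```python
import numpy as np

def durbin_watson(residuals):
    # DW = sum of squared successive differences / sum of squared residuals
    # Ranges from 0 to 4; values near 2 suggest no autocorrelation
    e = np.asarray(residuals, dtype=float)
    return np.sum(np.diff(e) ** 2) / np.sum(e ** 2)

# Alternating residuals: strong negative autocorrelation (DW well above 2)
dw_alt = durbin_watson([1, -1, 1, -1, 1, -1])

# Independent random residuals: DW lands near 2
rng = np.random.default_rng(0)
dw_rand = durbin_watson(rng.normal(size=500))
print(round(dw_alt, 2), round(dw_rand, 2))
```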
what is the difference between autocorrelation and multicollinearity?
Autocorrelation involves successive observations of the same variable (and hence its correlation with y) not being independent, whereas multicollinearity is when different predictors are linked to each other, so that distinguishing their individual impact on y is difficult
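Multicollinearity can be screened by correlating the predictors with each other, or via the variance inflation factor (VIF); a sketch on simulated predictors (the 0.95 mixing weight and the VIF > ~10 rule of thumb are illustrative):

```python
import numpy as np

rng = np.random.default_rng(7)
x1 = rng.normal(size=200)
x2 = 0.95 * x1 + 0.05 * rng.normal(size=200)  # nearly a copy of x1
x3 = rng.normal(size=200)                     # unrelated predictor

# Screen for multicollinearity: correlation between predictors
r12 = np.corrcoef(x1, x2)[0, 1]  # close to 1 -> collinear
r13 = np.corrcoef(x1, x3)[0, 1]  # close to 0 -> fine

# Variance inflation factor for x2 against x1 (VIF > ~10 flags trouble)
vif_x2 = 1.0 / (1.0 - r12 ** 2)
print(round(r12, 3), round(r13, 3), round(vif_x2, 1))
```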