RDA Test 2 Flashcards

Question

What do measures of variability tell us?

Answer 1

About the spread of the data and, in some instances, the amount of ‘noise’ in the data set

Answer 2

- Range and interquartile range
- Mean absolute deviation
- Variance
- Standard deviation
- Standard error of the mean (SE)

Answer 3

The difference between the smallest and largest values in a distribution

Answer 4

It is susceptible to an extreme score in a distribution. The interquartile range can be used to remedy this.

Answer 5

When we take the centre 50% of values, between the 25th and 75th percentile
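Assuming the cards above describe the range and the interquartile range, the following Python sketch computes both for a made-up data set containing one extreme score. The data values and the choice of the `inclusive` quantile method are illustrative assumptions; other quantile conventions give slightly different quartiles.

```python
from statistics import quantiles

# Hypothetical scores; 40 is a deliberately extreme value.
scores = [2, 4, 4, 5, 7, 9, 40]

# Range: largest value minus smallest value.
value_range = max(scores) - min(scores)

# Quartile cut points (25th, 50th and 75th percentiles).
q1, q2, q3 = quantiles(scores, n=4, method="inclusive")

# Interquartile range: spread of the centre 50% of values.
iqr = q3 - q1

print(value_range, iqr)
```

The single extreme score stretches the range, while the interquartile range only reflects the middle of the distribution.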

Answer 6

A measure of how much difference or deviation there is from the mean

Answer 7

By working out the difference between each value and the mean (ignoring all the negative signs – otherwise they would sum to zero), adding them up, and then dividing by N
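Assuming this card describes how the mean absolute deviation is calculated, the steps can be sketched in Python as follows. The function name and the example values are hypothetical.

```python
def mean_absolute_deviation(values):
    """Average absolute distance of each value from the mean."""
    n = len(values)
    mean = sum(values) / n
    # abs() drops the negative signs so the deviations do not cancel out.
    return sum(abs(v - mean) for v in values) / n


# Mean is 5, so the absolute deviations are 3, 1, 1 and 3.
print(mean_absolute_deviation([2, 4, 6, 8]))
```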

Answer 8

An indication of the overall amount of variability in a set of data, expressed in squared units. It is used in many statistical formulae and is denoted s².

Answer 9

By adding up the squared differences (deviations) between each value in a set of data and the mean, and then dividing by N – 1
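Assuming this card describes the sample variance, a minimal Python sketch of the calculation is shown here. The helper name and data are made up for illustration; the standard library's `statistics.variance` is printed alongside as a cross-check because it also divides by N - 1.

```python
from statistics import variance


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by N - 1."""
    n = len(values)
    mean = sum(values) / n
    squared_deviations = [(v - mean) ** 2 for v in values]
    return sum(squared_deviations) / (n - 1)


data = [2, 4, 6, 8]
# Both values should agree.
print(sample_variance(data), variance(data))
```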

Answer 10

The square root of the variance. It is measured in the original units of measurement and relates to the standard normal distribution, so we get a much better sense of the distribution of scores in our data.
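Assuming this card describes the standard deviation, the sketch below takes the square root of the sample variance so that the result is back in the original units. The heights are hypothetical values in centimetres, and `sample_sd` is an illustrative helper name.

```python
from math import sqrt
from statistics import stdev


def sample_sd(values):
    """Square root of the sample variance (N - 1 in the denominator)."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)  # in cm squared
    return sqrt(var)  # back in cm


heights_cm = [150, 160, 170, 180]
# Cross-check against the standard library's sample standard deviation.
print(sample_sd(heights_cm), stdev(heights_cm))
```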

Answer 11

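The standard deviation of the sampling distribution of the sample mean, i.e. how much sample means are expected to vary around the population mean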
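Assuming this card refers to the standard error of the mean (SE), it is usually estimated from a single sample as the standard deviation divided by the square root of N. The card itself does not state that formula, so treat the sketch below as an assumption; the function name and data are illustrative.

```python
from math import sqrt
from statistics import stdev


def standard_error_of_mean(values):
    """Estimated SE of the mean: sample SD divided by sqrt(N)."""
    return stdev(values) / sqrt(len(values))


print(standard_error_of_mean([2, 4, 6, 8]))
```

Because N is in the denominator, larger samples give a smaller standard error for the same standard deviation.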

Answer 12

A measure of the tailedness of a distribution (how “pointy” a distribution is)

Answer 13

The degree of asymmetry in a distribution. It can vary in severity and can be positive (a positive skewness value), negative (a negative skewness value) or neither (a skewness value of 0).
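To illustrate how the sign of a skewness value follows the direction of the asymmetry, here is a small Python sketch. The card gives no formula, so the moment-based coefficient used here (third central moment divided by the second central moment raised to 1.5) is an assumption; statistics packages often apply additional sample-size corrections. The data sets are made up.

```python
def skewness(values):
    """Moment-based skewness: m3 / m2 ** 1.5 (no sample-size correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


print(skewness([1, 2, 2, 3, 10]))   # long tail towards high scores
print(skewness([1, 8, 9, 9, 10]))   # long tail towards low scores
print(skewness([1, 2, 3, 4, 5]))    # symmetric
```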

Answer 14

The relationship between variables

Answer 15

- The plot normally has the predictor variable on the x-axis, and the outcome variable on the y-axis
- We can also include a line of best fit (i.e. the best linear statistical model for the data)
- This can also include some measure of ‘noise’ around the best fitting model (e.g. standard error of the mean, confidence interval, etc.)

Answer 16

The range of data

Answer 17

- The middle 50% of data, which is not disturbed by the outliers (extreme scores, indicated by dots outside the whiskers)
- Any potential outliers that might be causing skew
- The median value in the distribution (the black bar in the box)
- The whiskers represent the top 25% and bottom 25% of values

Answer 18

If the box is proportional to the whiskers

Answer 19

The distribution of data

Answer 20

To assess whether the data is normally distributed. They do this by comparing a theoretical distribution of values (for the mean and standard deviation of our data) on the x-axis with our observed values plotted on the y-axis. If the data is normally distributed, then we expect the data points to form a straight line across the graph.
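Assuming this card describes a Q-Q plot, the pairing of theoretical and observed values can be sketched in Python without any plotting library. The plotting positions `(i + 0.5) / n` are one common convention and are an assumption here, as are the function name and the sample data.

```python
from statistics import NormalDist, mean, stdev


def qq_points(values):
    """Return (theoretical, observed) pairs for a normal Q-Q plot."""
    observed = sorted(values)
    n = len(observed)
    # Normal distribution with the mean and SD of our own data.
    fitted = NormalDist(mean(observed), stdev(observed))
    # Theoretical quantile for each ordered observation.
    theoretical = [fitted.inv_cdf((i + 0.5) / n) for i in range(n)]
    return list(zip(theoretical, observed))


sample = [4.1, 5.0, 5.2, 5.9, 6.1, 6.8, 7.4]
for expected, actual in qq_points(sample):
    print(round(expected, 2), actual)
```

Plotting the first element of each pair on the x-axis and the second on the y-axis gives the graph; points lying close to a straight line suggest approximate normality.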

Answer 21

The probability of an observation occurring is remote enough

Answer 22

The last 5% of the tail (the rejection region)

Answer 23

Tests used if the distribution is not normal (e.g. chi-square)

Answer 24

Tests used if the distribution is normal (e.g. t-test)

Answer 25

Whether our distribution of values is significantly different to normal. That is, whether it’s too asymmetrical. We use this to see if our distribution is normal or not.

Answer 26

The relationships between samples

Answer 27

- Explore whether there is a real relationship between variables which is unlikely to have occurred due to external factors (i.e. unlikely to have occurred due to chance)
- Also allows us to determine the:
  - Direction of the relationship
  - Strength of this relationship

Answer 28

Scatterplot

Answer 29

Pearson's r

Answer 30

Spearman's rho

Answer 31

Paired samples t-test

Answer 32

Independent samples t-test

Answer 33

Mann-Whitney

Answer 34

Causality. There may be a third variable that we are unaware of.

Answer 35

Predictor - X-axis
Outcome - Y-axis

Answer 36

Covariance

Answer 37

The degree to which scores on two variables deviate from their sample means together (in the same or in opposite directions)
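Assuming this card defines covariance, a minimal Python sketch multiplies each pair of deviations from the two sample means and averages the products using N - 1. The divisor, the function name and the temperature and sales figures are illustrative assumptions.

```python
def covariance(xs, ys):
    """Sample covariance: mean cross-product of deviations (N - 1)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Positive products mean both scores deviate in the same direction.
    cross_products = [(x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)]
    return sum(cross_products) / (n - 1)


temperature = [20, 22, 24, 26]      # hypothetical degrees Celsius
ice_creams_sold = [30, 34, 38, 42]  # hypothetical counts
print(covariance(temperature, ice_creams_sold))
```

The result is in mixed units (degrees times ice-creams), which is why it is hard to interpret on its own.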

Answer 38

Because two variables will most likely be two different types of measurement (e.g. number of ice-creams sold vs. temperature), so we need to convert them into a common metric. When we standardize our measures we end up with a value that ranges from -1 to +1.
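One way to see the standardization step is to divide the covariance by the product of the two standard deviations, which is the usual definition of Pearson's r. The sketch below is a hedged illustration with made-up data and a hypothetical function name.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance divided by the product of the two sample SDs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    # Dividing by both SDs removes the original units of measurement.
    return cov / (sd_x * sd_y)


temperature = [20, 22, 24, 26, 28]
ice_creams_sold = [30, 36, 35, 42, 45]
print(pearson_r(temperature, ice_creams_sold))
```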

Answer 39

We use the coefficient (r) and divide by the SE (standard error) of r
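The card does not give the formula for the standard error of r. The sketch below assumes the common textbook form, the square root of (1 - r²) / (N - 2), and divides r by it to obtain a test statistic that is typically evaluated with N - 2 degrees of freedom. Function and variable names are hypothetical.

```python
from math import sqrt


def t_for_r(r, n):
    """Test statistic for a correlation: r divided by its standard error."""
    # Assumed SE of r; undefined when r is exactly +1 or -1.
    se_r = sqrt((1 - r ** 2) / (n - 2))
    return r / se_r


# Example: r = 0.5 observed in a sample of 27 pairs.
print(t_for_r(0.5, 27))
```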

Answer 40

Whether our r value deviates from zero enough to be in a critical area of the normal distribution

Answer 41

Perfect positive correlation

Answer 42

No correlation

Answer 43

Perfect negative correlation

Answer 44

As one variable increases, the other variable increases in response

Answer 45

As one variable increases, the other variable decreases in response

Answer 46

The more closely the points adhere to a straight line, the stronger the relationship; the more scattered the points are, the weaker the relationship.

Answer 47

Lots of shared variance

Answer 48

Little shared variance

Answer 49

Similar changes for the other variable

Answer 50

r × r (× 100 for a percentage), i.e. r²
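As a quick worked example of this calculation, assuming the card refers to the percentage of shared variance, the following Python snippet squares a hypothetical correlation and converts it to a percentage.

```python
def shared_variance_percent(r):
    """Square the correlation and express it as a percentage."""
    return r * r * 100


# A correlation of 0.5 in either direction shares 25% of the variance.
print(shared_variance_percent(0.5), shared_variance_percent(-0.5))
```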

Answer 51

Ranks the data from 2 variables. Each value is assigned a rank (e.g. 1st, 2nd, 3rd, etc., with tied values sharing a rank). Pearson’s formula for r is then applied to the ranks to calculate the correlation statistic (rs).
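The ranking procedure on this card can be sketched in Python as follows. Giving tied values the average of the ranks they occupy is a common convention and is an assumption here; the helper names and data are illustrative.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 upwards; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next sorted value is tied.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of the 1-based positions
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson_r(xs, ys):
    """Pearson's r from sums of deviation products."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sqrt(sum((x - mean_x) ** 2 for x in xs) * sum((y - mean_y) ** 2 for y in ys))
    return num / den


def spearman_rho(xs, ys):
    """Apply Pearson's formula to the ranks of each variable."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


print(average_ranks([10, 20, 20, 30]))
print(spearman_rho([1, 2, 3, 4], [1, 4, 9, 16]))
```

The second call uses a curved but strictly increasing relationship, so the ranks of the two variables match exactly even though the raw values are not linearly related.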

Answer 52

Spearman’s rho is less likely to find significance because it is not as powerful

Answer 53

Allows us to look at measures of central tendency, and get a feel for any relevant patterns

Answer 54

Whether our data is normally distributed, and whether differences or relationships are statistically meaningful

Answer 55

Scores fall to the top end of the distribution

Answer 56

Scores fall at the bottom end of the distribution

(85 cards)