STATISTICS REVISION Flashcards

1
Q

rows, columns, cells

A

rows = observations
columns = variables
cells = value of a variable for a specific observation
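
As a rough illustration of this layout, the sketch below stores a tiny data set as a list of records. The names and values are made up for the example.

```python
# Hypothetical data set: each dict is one row (observation),
# each key is one column (variable).
observations = [
    {"respondent": "A", "age": 34, "employed": 1},
    {"respondent": "B", "age": 27, "employed": 0},
    {"respondent": "C", "age": 45, "employed": 1},
]

variables = list(observations[0].keys())  # the columns
cell = observations[1]["age"]             # value of "age" for the second observation

print(len(observations), variables, cell)
```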

2
Q

CATEGORIES of responses - nominal, ordinal, quantitative

A

nominal - response categories cannot be placed in a specific order, e.g. ethnicity
ordinal - response categories can be placed in RANK order, e.g. preference, levels
quantitative (interval and ratio) - responses measured on a continuous scale with rank order, assuming uniform distance between responses, e.g. income, age, temperature

3
Q

measures of central tendency

A

MEAN, MEDIAN, MODE
bimodal = two modes (more than two = multimodal)
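
A minimal sketch of the three measures using Python's standard `statistics` module. The `ages` list is invented for illustration and was chosen to be bimodal.

```python
import statistics

# Hypothetical sample of ages (made up for illustration).
ages = [21, 22, 22, 25, 30, 30, 41]

mean_age = statistics.mean(ages)      # sum of the values divided by n
median_age = statistics.median(ages)  # middle value of the sorted data
modes = statistics.multimode(ages)    # every value tied for most frequent

print(mean_age, median_age, modes)
```

Here `multimode` returns two values, 22 and 30, so this sample is bimodal.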

4
Q

measures of dispersion

A

range = largest value minus smallest value; sensitive to outliers, so it may not represent the data accurately
interquartile range = quartiles split the data into quarters; IQR = Q3 - Q1, shown as the box in a box plot
variance
standard deviation = square root of variance = typical distance of an observation from the mean
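
The four measures can be sketched with the standard library as follows. The `scores` list is hypothetical and includes one deliberate outlier; note that `statistics.quantiles` uses its default "exclusive" method, and other software may compute quartiles slightly differently.

```python
import statistics

# Hypothetical scores; 30 is a deliberate outlier.
scores = [2, 4, 4, 5, 7, 9, 30]

data_range = max(scores) - min(scores)          # pulled up by the outlier
q1, q2, q3 = statistics.quantiles(scores, n=4)  # three cut points = quartiles
iqr = q3 - q1                                   # spread of the middle half
variance = statistics.variance(scores)          # sample variance (divides by n - 1)
sd = statistics.stdev(scores)                   # square root of the variance

print(data_range, iqr, variance, sd)
```

The range is 28 because of the single outlier, while the IQR is only 5, which is why the IQR is the more robust summary here.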

5
Q

VARIANCE

A

based on deviations from the mean - square each deviation from the mean, sum the squared deviations, then divide by n-1
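
The steps can be followed by hand in a short sketch. The `values` list is made up, and the result is compared with `statistics.variance`, which also divides by n - 1.

```python
import statistics

values = [2, 4, 4, 5, 7]  # hypothetical data
n = len(values)
mean = sum(values) / n    # 22 / 5 = 4.4

# Step 1: square each deviation from the mean.
squared_deviations = [(y - mean) ** 2 for y in values]

# Step 2: sum the squared deviations and divide by n - 1.
sample_variance = sum(squared_deviations) / (n - 1)

print(sample_variance, statistics.variance(values))
```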

6
Q

proportions

A

number of observations in a category divided by the total number of observations
- code the variable as numbers: 0 = no, 1 = yes
- add up the values and divide by n (n = number of respondents)
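
A small sketch of the 0/1 coding trick, using invented survey responses: once "yes" is coded as 1 and "no" as 0, the proportion of "yes" answers is just the mean of the coded values.

```python
# Hypothetical yes/no survey responses.
responses = ["yes", "no", "yes", "yes", "no", "yes", "no", "yes"]

# Code the variable as numbers: 0 = no, 1 = yes.
coded = [1 if r == "yes" else 0 for r in responses]

# Add up the values and divide by n (number of respondents).
proportion_yes = sum(coded) / len(coded)

print(proportion_yes)  # 5 of 8 respondents said yes -> 0.625
```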

7
Q

histograms

A

frequency distributions for quantitative variables
value of variable = X AXIS
how often it occurs (frequency) = Y AXIS
continuous variables - as sample size grows (and intervals get narrower) the histogram approaches a smooth curve
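
A text-only sketch of how a histogram is built, assuming made-up ages and an arbitrary interval width of 10: values are grouped into intervals (x axis) and counted (y axis).

```python
from collections import Counter

ages = [23, 27, 31, 35, 36, 38, 41, 44, 52, 58]  # hypothetical data
bin_width = 10                                   # assumed interval width

# Map each value to the start of its interval, then count per interval.
counts = Counter((age // bin_width) * bin_width for age in ages)

for start in sorted(counts):
    # One '#' per observation stands in for the bar height.
    print(f"{start}-{start + bin_width - 1}: {'#' * counts[start]}")
```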

8
Q

probability distributions

A

lists possible outcomes of an event and their probabilities - assigns a probability to each possible value of a random variable
probabilities SUM to 1
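
As a simple example, the sketch below writes out the probability distribution of a fair six-sided die (an assumed example, not from the cards) and checks that the probabilities sum to 1.

```python
from fractions import Fraction

# Probability distribution of a fair six-sided die:
# each possible value of the random variable gets a probability.
distribution = {face: Fraction(1, 6) for face in range(1, 7)}

total = sum(distribution.values())  # must equal 1
expected_value = sum(face * p for face, p in distribution.items())

print(total, expected_value)
```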

9
Q

EMPIRICAL RULE -frequency distributions

A

higher standard deviation = greater variability (typical distance from the mean)
about 68% of observations fall between ȳ - s and ȳ + s (ȳ = mean, s = standard deviation)
about 95% fall between ȳ - 2s and ȳ + 2s
all or nearly all fall between ȳ - 3s and ȳ + 3s
applies to BELL-SHAPED distributions

10
Q

Empirical rule - PROBABILITY DISTRIBUTIONS

A

normal distribution - SYMMETRIC ABOUT THE MEAN
BELL-SHAPED CURVE
about 68% of values within 1 STANDARD DEVIATION of the mean
about 95% of values within 2
about 99.7% of values within 3
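
These percentages can be checked against the normal curve itself. The sketch below uses `statistics.NormalDist` from the standard library; `share_within` is a helper name introduced only for this example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Probability of falling within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

within = {k: share_within(k) for k in (1, 2, 3)}

for k, share in within.items():
    print(f"within {k} SD: {share:.4f}")
```

The results are roughly 0.68, 0.95 and 0.997, matching the rule.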

11
Q

sampling distribution

A

distribution of the sample means from all possible samples of a given size
- using this, we can predict how close a sample mean falls to the population mean

12
Q

central limit theorem

A

as the sample size (n) increases, the sampling distribution of the sample mean approximates the normal distribution, even if the population distribution is not normal
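
A simulation sketch of a sampling distribution, under stated assumptions: the population is an invented, strongly skewed one (exponential with mean 1), and `simulate_sample_means` is a helper written for this example. It shows the sample means centring on the population mean and clustering more tightly as the sample size grows; a histogram of `means` would be needed to see the bell shape itself.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is repeatable

# Hypothetical skewed population (exponential, mean about 1).
population = [random.expovariate(1.0) for _ in range(20_000)]
population_mean = statistics.mean(population)

def simulate_sample_means(sample_size, n_samples=1_000):
    """Draw many samples of one size and return the mean of each."""
    return [
        statistics.mean(random.sample(population, sample_size))
        for _ in range(n_samples)
    ]

summary = {}
for sample_size in (5, 50):
    means = simulate_sample_means(sample_size)
    # Centre and spread of the simulated sampling distribution.
    summary[sample_size] = (statistics.mean(means), statistics.stdev(means))
    print(sample_size, summary[sample_size])
```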

13
Q

confidence intervals

A

a range of values within which a population PARAMETER is believed to fall, with a given level of confidence
from (point estimate - margin of error) to (point estimate + margin of error)
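
A rough sketch of a 95% confidence interval for a mean, using invented ages. It assumes the margin of error is a z-score times the standard error; for a sample this small a t-score would normally be used instead, which gives a slightly wider interval.

```python
import math
import statistics
from statistics import NormalDist

ages = [23, 27, 31, 35, 36, 38, 41, 44, 52, 58]  # hypothetical sample
n = len(ages)

point_estimate = statistics.mean(ages)                  # sample mean
standard_error = statistics.stdev(ages) / math.sqrt(n)  # s / sqrt(n)

z = NormalDist().inv_cdf(0.975)  # about 1.96 for 95% confidence
margin_of_error = z * standard_error

interval = (point_estimate - margin_of_error, point_estimate + margin_of_error)
print(point_estimate, interval)
```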

14
Q

how to interpret confidence interval for a mean

A

say "we are 95% confident that the interval … contains the mean population age", NOT "the population age is between …"

15
Q

STATISTICAL SIGNIFICANCE test

A

uses data to summarise the evidence about a hypothesis by COMPARING point estimates of the parameters with the values predicted by the hypothesis

16
Q

5 parts of the significance test

A

ASSUMPTIONS, HYPOTHESES, TEST STATISTIC, P-VALUE, α-LEVEL (SIGNIFICANCE LEVEL)

17
Q

ASSUMPTIONS - about what?

A

-type of data
-randomization
-population distribution
-sample size

18
Q

null hypothesis?

A

a statement that the parameter takes a particular value

19
Q

Alternative hypothesis

A

a statement that the parameter falls in some alternative range of values - this is the research hypothesis

20
Q

proof by contradiction?

A

a significance test analyses sample evidence about the NULL hypothesis by investigating whether the data contradict it - if the data would be unusual under the null hypothesis, REJECT the null

21
Q

TEST statistic

A

summarises how far the estimate falls from the parameter value in the NULL hypothesis = number of standard errors between the estimate and the null hypothesis value
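
A worked sketch for a test about a mean, with all numbers invented for illustration (null hypothesis: population mean = 50).

```python
import math

# Hypothetical study results.
sample_mean = 52.0  # the estimate
null_value = 50.0   # parameter value stated in the null hypothesis
sample_sd = 8.0
n = 64

standard_error = sample_sd / math.sqrt(n)  # 8 / 8 = 1.0

# Number of standard errors between the estimate and the null value.
test_statistic = (sample_mean - null_value) / standard_error

print(test_statistic)  # 2.0
```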

22
Q

P VALUE

A

probability, assuming the NULL hypothesis is true, that the TEST STATISTIC equals the observed value or a value even more extreme
- SMALL p-value = stronger evidence against the NULL = supports the alternative
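
A sketch of a two-sided p-value, assuming the test statistic follows a standard normal distribution under the null hypothesis. The value 2.0 is hypothetical.

```python
from statistics import NormalDist

test_statistic = 2.0  # hypothetical observed value (a z-score)

# Probability, if the null hypothesis is true, of a test statistic
# at least this far from 0 in either direction.
p_value = 2 * (1 - NormalDist().cdf(abs(test_statistic)))

print(round(p_value, 4))
```

The result is about 0.046, so a value this extreme would occur in fewer than 5% of samples if the null hypothesis were true.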

23
Q

α-level (significance level)

A

reject the null if the p-value falls at or below a pre-specified cut-off point (the α-level, e.g. 0.05) = boundary value
SMALLER α-level = the evidence must be stronger to reject the null
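
The decision rule can be sketched as a one-line comparison. `decide` is a hypothetical helper and the p-value is invented; the point is that the same p-value can lead to rejection at one cut-off but not at a stricter one.

```python
def decide(p_value, alpha):
    """Reject the null hypothesis when the p-value is at or below alpha."""
    return "reject H0" if p_value <= alpha else "do not reject H0"

p_value = 0.03  # hypothetical result

print(decide(p_value, alpha=0.05))  # reject H0
print(decide(p_value, alpha=0.01))  # do not reject H0
```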