STATISTICS REVISION Flashcards

Question 1

Q

rows, columns, cells (in a data set)

Answer

A

rows = observations
columns = variables
cells = value of a variable for a specific observation

Question 2

Q

CATEGORIES of responses - nominal, ordinal, quantitative

Answer

A

nominal - response categories cannot be placed in any specific order (e.g. ethnicity)
ordinal - response categories can BE placed in RANK order, but the distance between them is not defined (e.g. preference, levels)
quantitative (interval and ratio) - responses measured on a numerical scale with rank order, assuming uniform distance between values (e.g. income, age, temperature)

Question 3

Q

measures of central tendency

Answer

A

MEAN, MEDIAN, MODE
bimodal = two modes (two distinct peaks); multimodal = more than two
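
A minimal sketch of the three measures, using Python's standard `statistics` module. The `ages` list is made-up illustration data, not from the course material.

```python
from statistics import mean, median, multimode

ages = [21, 22, 22, 25, 30, 30, 41]  # hypothetical sample

sample_mean = mean(ages)         # sum of the values divided by how many there are
sample_median = median(ages)     # middle value of the sorted data
sample_modes = multimode(ages)   # every value tied for most frequent

print(sample_mean, sample_median, sample_modes)
```

Here two values are tied for most frequent, so this small sample is bimodal.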

Question 4

Q

measures of dispersion

Answer

A

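range = largest value minus smallest value; sensitive to outliers, so it can misrepresent the spread of the data
interquartile range (IQR) = Q3 - Q1; quartiles split the ordered data into quarters; shown in a box plot
variance = average squared deviation from the mean
standard deviation = square root of variance = typical distance of an observation from the mean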
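
The measures above can be sketched as follows. The `scores` list is made up, with one deliberate outlier. Note that textbooks and software use slightly different quartile conventions; this uses the default method of `statistics.quantiles`, which may not match hand calculations exactly.

```python
from statistics import quantiles, stdev, variance

scores = [2, 4, 4, 5, 7, 9, 30]  # hypothetical data; 30 is an outlier

data_range = max(scores) - min(scores)   # driven entirely by the two extremes
q1, q2, q3 = quantiles(scores, n=4)      # quartile cut points (q2 is the median)
iqr = q3 - q1                            # spread of the middle half of the data
s = stdev(scores)                        # sample standard deviation

print(data_range, iqr, round(s, 2))
```

The single outlier inflates the range far more than it moves the IQR, which is why the IQR is the more robust summary.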

Question 5

Q

VARIANCE

Answer

A

based on deviations from the mean - take each observation's difference from the mean, square it, sum the squared differences and divide by n-1 (sample variance)
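
The steps can be traced by hand on a tiny made-up data set. The variable names (`y_bar`, `s2`) are only illustrative; the last line compares the result with `statistics.variance`, which also divides by n-1.

```python
from statistics import variance

values = [4, 8, 6, 2]  # hypothetical observations
n = len(values)

y_bar = sum(values) / n                              # mean = 5.0
squared_devs = [(y - y_bar) ** 2 for y in values]    # 1, 9, 1, 9
s2 = sum(squared_devs) / (n - 1)                     # 20 / 3

print(s2, variance(values))
```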

Question 6

Q

proportions

Answer

A

number of observations in a category divided by the total number of observations
- code the variable as numbers: 0 = no, 1 = yes
- add up the values and divide by n (n = number of respondents); the mean of the 0/1 codes is the proportion of "yes"
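
A short sketch of the 0/1 coding trick, with made-up responses:

```python
responses = ["yes", "no", "yes", "yes", "no"]  # hypothetical survey answers

# Code each response as a number: 1 = yes, 0 = no.
coded = [1 if r == "yes" else 0 for r in responses]

# Adding up the codes counts the "yes" answers; dividing by n gives the proportion.
proportion_yes = sum(coded) / len(coded)

print(proportion_yes)
```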

Question 7

Q

histograms

Answer

A

frequency distributions for quantitative variables
values of the variable = X AXIS
how often they occur (frequency) = Y AXIS
continuous variables - as the sample size grows (and the intervals get narrower) the histogram approaches a smooth curve
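
A rough text-only sketch of how a histogram is built: group the values into intervals and count how many fall in each. The data and the bin width of 10 are arbitrary choices for illustration. The printout is rotated compared with a usual histogram (intervals run down the page, frequency runs across).

```python
from collections import Counter

ages = [18, 21, 22, 25, 27, 31, 33, 34, 38, 45]  # hypothetical data
bin_width = 10

# Map each value to the lower edge of its interval, e.g. 27 -> 20.
counts = Counter((age // bin_width) * bin_width for age in ages)

for lower in sorted(counts):
    print(f"{lower}-{lower + bin_width - 1}: {'#' * counts[lower]}")
```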

Question 8

Q

probability distributions

Answer

A

lists the possible outcomes of an event and their probabilities - assigns a probability to each possible value of a random variable
each probability is between 0 and 1, and the probabilities SUM to 1
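
As an illustration, the probability distribution of a fair six-sided die (an example chosen here, not taken from the cards) can be written as a mapping from each outcome to its probability. Exact fractions are used so the sum comes out as exactly 1.

```python
from fractions import Fraction

# Each face of a fair die has probability 1/6.
die = {face: Fraction(1, 6) for face in range(1, 7)}

total = sum(die.values())  # probabilities of all outcomes must sum to 1
p_even = sum(p for face, p in die.items() if face % 2 == 0)  # P(2, 4 or 6)

print(total, p_even)
```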

Question 9

Q

EMPIRICAL RULE - frequency distributions

Answer

A

higher standard deviation = greater variability (observations lie further from the mean)
for a BELL-SHAPED distribution with sample mean ȳ and standard deviation s:
about 68% of observations fall between ȳ - s and ȳ + s
about 95% fall between ȳ - 2s and ȳ + 2s
all or nearly all fall between ȳ - 3s and ȳ + 3s
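
The rule can be checked roughly on simulated data. This is only a sketch: the data are 10,000 random draws from a normal distribution with an arbitrary mean of 50 and standard deviation of 10, and the exact shares will vary slightly with the seed.

```python
import random
from statistics import mean, stdev

rng = random.Random(0)  # fixed seed so the sketch is repeatable
data = [rng.gauss(50, 10) for _ in range(10_000)]  # hypothetical bell-shaped data

y_bar, s = mean(data), stdev(data)

def share_within(k):
    # Proportion of observations between y_bar - k*s and y_bar + k*s.
    return sum(y_bar - k * s <= y <= y_bar + k * s for y in data) / len(data)

for k in (1, 2, 3):
    print(k, round(share_within(k), 3))
```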

Question 10

Q

Empirical rule - PROBABILITY DISTRIBUTIONS

Answer

A

normal distribution - SYMMETRIC ABOUT THE MEAN
BELL-SHAPED CURVE
about 68% of values within 1 STANDARD DEVIATION of the mean
about 95% within 2
about 99.7% within 3
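
These percentages can be reproduced from the normal curve itself. A small sketch using `statistics.NormalDist` (available in Python 3.8+), taking the probability between -k and +k standard deviations on the standard normal:

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

# Probability of falling within k standard deviations of the mean.
within = {k: standard_normal.cdf(k) - standard_normal.cdf(-k) for k in (1, 2, 3)}

for k, p in within.items():
    print(f"within {k} SD: {p:.4f}")
```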

Question 11

Q

sampling distribution

Answer

A

distribution of the sample means from all possible samples of the same size
- by using this info we can predict how close a sample mean is likely to fall to the population mean
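
For a very small population, every possible sample can be listed, which makes the idea concrete. In this sketch the population of five values and the sample size of 2 are made up, and samples are drawn without replacement.

```python
from itertools import combinations
from statistics import mean

population = [2, 4, 6, 8, 10]  # hypothetical tiny population

# Every possible sample of size 2, and the mean of each one.
sample_means = [mean(sample) for sample in combinations(population, 2)]

print(sorted(sample_means))
print(mean(sample_means), mean(population))
```

The sample means vary from sample to sample, but they are centred on the population mean.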

Question 12

Q

central limit theorem

Answer

A

as the sample size (n) increases, the sampling distribution of the sample mean approximates the normal distribution, whatever the shape of the population distribution
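
A simulation sketch of this idea, under stated assumptions: the population is a strongly right-skewed exponential distribution, 2,000 samples are drawn for each sample size, and skewness (0 for a symmetric bell shape) is used as a crude measure of how far the sample means are from normal. The helper names `sample_mean` and `skewness` are illustrative only.

```python
import random
from statistics import mean, stdev

rng = random.Random(42)  # fixed seed so the sketch is repeatable

def sample_mean(n):
    # Mean of n draws from a right-skewed (exponential) population.
    return mean(rng.expovariate(1.0) for _ in range(n))

def skewness(xs):
    # Average cubed z-score: near 0 for a symmetric distribution.
    m, s = mean(xs), stdev(xs)
    return mean(((x - m) / s) ** 3 for x in xs)

skew = {}
for n in (2, 10, 50):
    means = [sample_mean(n) for _ in range(2000)]
    skew[n] = skewness(means)
    print(n, round(skew[n], 2))
```

The skewness of the sample means shrinks towards 0 as n grows, even though the population itself stays skewed.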

Question 13

Q

confidence intervals

Answer

A

a range of values within which a population PARAMETER is believed to fall, with a given level of confidence (e.g. 95%)
point estimate ± margin of error, i.e. from (point estimate - margin of error) to (point estimate + margin of error)
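
A sketch of a 95% confidence interval for a mean, built from made-up summary statistics (n = 100, sample mean 40, standard deviation 12). It assumes a large sample so that a z-score from the normal distribution can be used for the margin of error; with small samples a t-score would normally replace it.

```python
from math import sqrt
from statistics import NormalDist

n = 100                 # hypothetical sample size
point_estimate = 40.0   # hypothetical sample mean
s = 12.0                # hypothetical sample standard deviation

standard_error = s / sqrt(n)
z = NormalDist().inv_cdf(0.975)      # about 1.96 for 95% confidence
margin_of_error = z * standard_error

lower = point_estimate - margin_of_error
upper = point_estimate + margin_of_error
print(round(lower, 2), round(upper, 2))
```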

Question 14

Q

how to interpret confidence interval for a mean

Answer

A

"We are 95% confident that the interval … contains the population MEAN age", NOT "the population age is between …"

Question 15

Q

STATISTICAL SIGNIFICANCE test

Answer

A

uses data to summarise the evidence about a hypothesis by COMPARING point estimates of the parameters with the values predicted by the hypothesis

Question 16

Q

5 parts of the significance test

Answer

A

ASSUMPTIONS, HYPOTHESES, TEST STATISTIC, P-VALUE, α-LEVEL (SIGNIFICANCE LEVEL)

Question 17

Q

ASSUMPTIONS - about what?

Answer

A

-type of data
-randomization
-population distribution
-sample size

Question 18

Q

null hypothesis?

Answer

A

a statement that the parameter takes a particular value

Question 19

Q

Alternative hypothesis

Answer

A

a statement that the parameter falls in some alternative range of values - this is usually the research hypothesis

Question 20

Q

proof by contradiction ?

Answer

A

a significance test analyses the sample evidence about the NULL hypothesis by investigating whether the data contradict it - if the data would be unusual were the NULL true, REJECT the null

Question 21

Q

TEST statistic

Answer

A

summarises how far the estimate falls from the parameter value in the NULL hypothesis
= number of standard errors between the estimate and the null hypothesis value
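
In code form, the test statistic is the distance between estimate and null value divided by the standard error. All three numbers below are made up for illustration.

```python
estimate = 40.0        # hypothetical sample mean
null_value = 38.0      # hypothetical value of the parameter under the null
standard_error = 1.2   # hypothetical standard error of the estimate

# Number of standard errors between the estimate and the null value.
test_statistic = (estimate - null_value) / standard_error

print(round(test_statistic, 2))
```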

Question 22

Q

P VALUE

Answer

A

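probability, assuming the NULL is true, that the TEST STATISTIC equals the observed value or a value even more extreme in the direction predicted by the alternative
- SMALL p-value = stronger evidence against NULL = supporting alternative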
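
A sketch of turning a test statistic into a p-value, under two assumptions that are not in the card: the test statistic follows a standard normal distribution when the null is true (large sample), and the alternative is two-sided. The observed value of 1.67 is made up.

```python
from statistics import NormalDist

z_observed = 1.67  # hypothetical observed test statistic

# Probability of a value at least this far from 0, in either direction,
# if the null hypothesis is true.
p_value = 2 * (1 - NormalDist().cdf(abs(z_observed)))

print(round(p_value, 3))
```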

Question 23

Q

α-level (significance level)

Answer

A

reject the null if the p-value falls at or below a pre-specified cut-off point (boundary value), commonly 0.05
SMALLER α-level = stronger evidence is required to reject the null
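
The decision rule can be sketched as a small function. The name `decide` and the default α of 0.05 are illustrative choices, and the rule is written as "p-value less than or equal to α", which is a common convention.

```python
def decide(p_value, alpha=0.05):
    # Reject the null only if the p-value is at or below the cut-off.
    return "reject H0" if p_value <= alpha else "do not reject H0"

print(decide(0.03))               # compared with alpha = 0.05
print(decide(0.03, alpha=0.01))   # a smaller alpha demands stronger evidence
```

The same p-value leads to different conclusions under the two cut-offs, which is why α has to be fixed before looking at the data.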