A2 Stats Flashcards

1
Q

describe exponential modelling

A

use logs & coding to convert exponential relationship to linear relationship & use regression line
y = ax^n –> logy = nlogx + loga & plot logx against logy

y=kb^x
logy = xlogb + logk & plot x against logy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

define PMCC

A

product moment correlation coefficient
numerical measure of the type & strength of linear correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the PMCC for a sample & population represented by?

A

for sample: r
for population: ρ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what values is r b/w?

A

-1 ≤ r ≤ 1
r = 1 linear & positive correlation
r = -1 linear & negative correlation
r = 0 no correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does n/a mean in table?

A

data for that day is not available
remove the points from calculations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

describe how to find PMCC using calculator

A

menu –> 6: statistics
2: y = a + bx
enter values for x & y
option –> 4: regression calc
PMCC is r value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

comment on suitability of a linear regression or exponential model for given data

A

e.g. as r is very close to 1, there is strong positive correlation b/w ___________. therefore the data points lie close to a straight line so a linear regression model is suitable for __________ data
so exponential model is suitable for raw data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what constitutes a ‘strong correlation’?

A

generally over 0.6 or less than -0.6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

describe how to find equation of the regression line from coded data

A

‘unlog’
compare with the equation y = mx + c
logb =
loga =
work out a & b
state the equation at the end

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does significance level mean?

A

the chance of incorrectly rejecting H0 when it is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what must the conclusion of a hypothesis test include?

A

accept/reject H0
RELATE TO CONTEXT OF THE Q

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is PMCC hypothesis testing used for?

A

used to determine whether the PMCC for a sample, r, indicates that there is likely to be a linear relationship within the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are the null & alternative hypotheses for PMCC hypothesis testing?

A

the H0 is always that there is no correlation in the population ρ = 0

for positive correlation ρ > 0
for negative correlation ρ < 0
for any correlation ρ ≠ 0 (NB halve the significance level)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe the method of a PMCC hypothesis test

A
  1. state H0: ρ = 0
    & H1
  2. significance level =
  3. n =
    (n is number of pairs of data)
  4. find critical value using the PMCC table in data booklet (one +ve & one -ve for 2-tailed test)
  5. do number line & if r value is outside critical region (see OneNote) then accept H0, but if r value is inside the critical region, reject H0
  6. conclusion
    e.g. as 0.1149 (r-value) < 0.5067 (critical value), it is not in the critical region so we accept H0. there is not sufficient evidence of a positive correlation b/w daily maximum gust & relative humidity (linking to Q)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

for any set notation Q, what is the first step?

A

draw Venn Diagram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

dot method for P(A’ u B’) or P(A’ u B’)

A

for u: add things with at least 1 dot
for n: add things with all the dots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

describe conditional probability

A

the probability of an event can change depending on the outcome of a previous event
the probability that event B occurs, GIVEN that event A has already occured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what is the notation for conditional probability?

A

P(B|A)
the probability that event B occurs, GIVEN that event A has already occured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

usually P(B|A)…

A

≠ P(A|B)

20
Q

in conditional probability, if A & B are independent, what is the formula?

A

P(B|A) = P(B|A’) = P(B)

21
Q

two-way table

A

see OneNote & notes in folder
write down marginal totals

22
Q

what is the addition formula?

A

P(A u B) = P(A) + P(B) - P(A n B)

23
Q

what is the multiplication formula?

A

P(B|A) = P(A n B) / P(A)
must divide by the probability of the 2nd letter

24
Q

describe how conditional probabilities can be represented on a tree diagram

A

see Gordon OneNote

25
Q

binomial distribution

A

a discrete probability distribution

26
Q

normal distribution

A

a continuous probability distribution

27
Q

describe the normal distribution graph

A

symmetrical about the mean (mean = median = mode)
infinite in both directions, the x-axis is an asymptote
area under the graph = 1

28
Q

what percentage of values are within 1, 2 & 3 standard deviations of the mean?

A

68% of values are within 1 standard deviation of the mean
95% of values are within 2 standard deviations of the mean
99.7% of values are within 3 standard deviations of the mean

29
Q

X~N(μ,σ^2)

A

X is normally distributed with population mean of μ
& population variance of σ^2 (σ = standard deviation)

30
Q

describe using calculator for normal distribution

A

only ever use normal CD
the upper/lower limit you chose must be at least 5 standard deviation above or below the mean

31
Q

questions combining normal distribution & binomial distribution

A

see Gordon OneNote

32
Q

describe the inverse normal distribution

A

area means area to the left

33
Q

what is the formula for coding the standardised normal distribution?

A

Z = x - μ / σ
x = raw score
μ = mean
σ = standard deviation
Z = standardised score

34
Q

what are the parameters for standard normal distribution?

A

X~Z(0,1^2)
μ = 0
σ = 1

35
Q

percentage points of the normal distribution table

A

p = probability to the right of the x value on the normal curve = greater than is +ve z value
when P(Z<z), make the z value from the table -ve

36
Q

describe how to find z values on calculator

A

use inverse normal function
area = p (to the LEFT)
μ = 0
σ = 1

37
Q

what are the conditions to be able to model the binomial distribution with the normal?

A
  1. if n(the sample size) is large (> 50)
  2. p(the chance of success) is close to 0.5
    such that np > 10

must apply a continuity correction (using upper/lower bounds)

38
Q

how do you find the parameters of the normal distribution is X~B(n,p) can be approximated as Y~N(μ,σ^2)

A

μ = np
σ = √np(1-p)
in formula book

39
Q

when calculating probabilities using a normal approximation to a binomial distribution what must be applied?

A

continuity correction

for ≤ or ≥, use the bound that includes the integer - think number line, see OneNote

for < or >, use the bound that does not include the integer

40
Q

in Q, what does ‘use a suitable approximation’ mean?

A

use normal to approximate binomial

41
Q

in a Q, sometimes have to do binomial distribution twice

42
Q

what is true if a (parent) population is normally distributed?

A

mean = μ
standard deviation = σ
any sample taken from the population is also normally distributed

43
Q

what is the mean & standard deviation of a sample of a (parent) population?

A

mean = μ
standard deviation = σ / √n
where n is the sample size

44
Q

describe how to do hypothesis testing for normal distribution

A

see Gordon OneNote
it is testing the mean
1. identify the test statistic (x̄)
2. state μ & σ of population & sample
3. state hypotheses H0 & H1
4. either:
- find probability of test statistic &
compare to significance level
- find critical value & see if test statistic
lies in critical region