paper 2 Flashcards

Question 1

Q

NORMAL DISTRIBUTION

What are the features of a normal distribution curve?

Answer

A

a bell shape curve
a single peak
symmetrical about the mean
• 50% above and 50% below the data
most of the data is within 1 s.d. of the mean

Question 2

Q

What proportions of the sample is at which point?

Answer

A

68% = within 1 s.d. of the mean (µ + 1∂ and µ - 1∂)

95% = within 2 s.d. of the mean (µ + 2∂ and µ - 2∂)

99.7% = within 3 s.d. of the mean (µ + 3∂ and µ - 3∂)

Question 3

Q

What are the notations for normal distribution?

Answer

A

µ = mean

∂ = standard deviation

Question 4

Q

What is the notation for a random X that is normally distributed?

Answer

A

X ~ N (µ,∂)

Question 5

Q

What does ∂² mean?

Answer

A

∂² is the standard deviation squared and is called VARIANCE

Question 6

Q

What are the rules for probability of normal distribution?

Answer

A

If X<15 LEAVE IT
(if it is pointing at number (less than))

If X>15 subtract it from 1
(pointing at X (greater than))

if the answer is negative then DO THE OPPOSITE

Question 7

Q

How do you calculate the probability of normal distribution on a calculator?

Answer

A

Press the MENU button and press 7
Press ‘2: Normal CD’
If X>15 (less than) put:
LOWER = -10000000
UPPER = 15 (or whatever number it is)

If X<15 (greater than) put:
LOWER = 15
UPPER = 10000000

Question 8

Q

What are some other points about finding ND on the calculator?

Answer

A

calculator value is ALWAYS LESS THAN

- the same rules still apply about whether to subtract from zero or not

Question 9

Q

INVERSE DISTRIBUTION

What is the inverse normal?

Answer

A

INVERSE NORMAL = Area = probability/percentile

Question 10

Q

INVERSE DISTRIBUTION

What is the inverse normal?

Answer

A

INVERSE NORMAL = Area = probability/percentile
(e.g. 95% = 0.95
Area = 0.95 )

Question 11

Q

How do you find inverse normal on a calculator?

Answer

A

Press the MENU button and press 7
Press ‘3: Inverse Normal’
Then input the area (to the left of the boundary), the standard deviation and the mean

Question 12

Q

EXAMPLE QUESTION OF INVERSE NORMAL

X~N (25,4)

Find ‘a’ given that P(X = 0.27

Answer

A

Input the information into the calculator:
Area :0.27
∂ :2
µ. :25
XInv = 23.77
write as a=23.77

Question 13

Q

EXAMPLE QUESTION 2 OF INVERSE DISTRIBUTION

X~N (25,4)
P(24

Answer

A

1. find P(X<24) with NORMAL distribution
   LOWER: -100000
   UPPER: 24
   ∂: 2
   µ: 25 
                              P=0.30854 (the area on the left of the 24 boundary)

Find P(X

Question 14

Q

CONFIDENCE INTERVALS

What will CONFIDENCE be based on?

Answer

A

THE SIZE OF THE SAMPLE = the larger the size of the sample, the closer the estimate is likely to be to the true population mean

THE VARIANCE = If readings are generally more varied then the estimate will be less reliable

Question 15

Q

How do you you calculate the standard error? what is it?

Answer

A

Standard error = ∂/√n

Standard error is how different the population mean is likely to be from a sample mean

(How different the population mean is from the point estimate)

Question 16

Q

What is the formula for confidence intervals?

Answer

A

x̅ ± 1.96 ∂/√n

(with µ in middle)

x̅ = sample mean
n = sample size 
∂ = population standard deviation

Question 17

Q

How does this formula look written out in full?

Answer

A

x̅ - 1.96 ∂/√n < µ < x̅ + 1.96 ∂/√n = 95%

(the numbers will change based on your level of confidence)

- = lower confidence limit 
\+ = upper confidence limit

Question 18

Q

What are the decimal numbers that are substituted into the formula for different confidence intervals?

Answer

A

90% = 1.64
95% = 1.96
98% = 2.33
99% = 2.57

Question 19

Q

EXAMPLE CONFIDENCE INTERVALS QUESTION

A sample of 16 fish with a mean length in the sample of 28cm. The standard deviation of this length is 4cm. Show a 95% confidence interval for the mean length of the fish in the length.

Answer

A

x̅ = 28
n = 16
∂ = 4
CI = 95%
UPPER = 28 + 1.96 (4/√16)
= 29.96
LOWER = 28 - 1.96 (4/√16)
= 26.04
Confidence interval = 26.04 < µ < 29.96

Question 20

Q

What does PMCC stand for?

Answer

A

Product moment correlation coefficient

Question 21

Q

How is the PMCC notated?

Answer

A

It is usually notated with the letter ‘r’

Question 22

Q

What is the letter ‘r’ (PMCC)

Answer

A

r is a number between -1 and 1
(- 1< r < 1)

\+1 = perfect positive correlation 
-1 = perfect negative correlation 
0 = no correlation

Question 23

Q

How do you calculate the PMCC on the calculator?

Answer

A

Press the MENU button and press 6 (statistics)
Press ‘2: a+bx’
input all the x and y data points into the table
press option (OPTN)
press ‘4: regression calculation)
use ‘r’ for the PMCC

Question 24

Q

can the ‘r’ value be affected by outliers?

Answer

A

yes it can

Question 25

Q

What is the equation for the regression line?

Answer

A

y = a + bx

a = y - intercept 
b = gradient

(substitute the letters from the question into the formula swell as the numbers e.g. if the letters were w and l the equation would be W = a + bl)

Question 26

Q

How do you calculate regression line of the calculator?

Answer

A

Press the MENU button and press 6 (statistics)
Press ‘2: a+bx’
input all the x and y data points into the table
press option (OPTN)
press ‘4: regression calculation)
use ‘a’ and ‘b’ for regression line

Question 27

Q

What do you need to do when answering the question?

Answer

A

write the a and the b value
substitute these numbers into the formula
then answer the question by drawing the line or explaining what it shows

Question 28

Q

MEAN AND STANDARD DEVIATION

How is mean represented and worked out with listed data and frequency?

Answer

A

x̅ = ∑fx / ∑f

x = individual data entries 
f = frequency

Question 29

Q

How is mean represented and worked out with grouped data and frequency?

Answer

A

x̅ = ∑fx / ∑f

x = grouped data MIDPOINTS
f = frequency

Question 30

Q

What is the advantage of the mean?

Answer

A

It is the most used average and uses every item of data.

Question 31

Q

What is the disadvantage of the mean?

Answer

A

It might not be representative if there is an extreme value (affected by outliers)

Question 32

Q

what is standard deviation?

Answer

A

A measure of SPREAD that uses all of the data

- a HIGHER s.d. means that the data is MORE SPREAD OUT (and the opposite if it is low)

Question 33

Q

What is the advantage of using standard deviation?

Answer

A

It uses all of the data

Question 34

Q

What is the disadvantage of using standard deviation?

Answer

A

It takes longer to calculate and is therefore time consuming

Question 35

Q

How do you calculate the standard deviation of a set of LISTED data?

Answer

A

find the mean of the data
Square all of the values SEPARATELY then add them together
use the formula:

√∑x̅i²/n - x̅²

n = the number of values 
x̅² = mean squared

get s.d.

Question 36

Q

How to find the standard deviation of grouped data?

Answer

A

find the mean of the data
find the MIDPOINTs of the group
multiply midpoints by the FREQUENCY
add all of the values up
use formula:

√∑fx²/∑f - x̅²

∑fx² = value from above
∑f = sum of frequency 
x̅² = mean squared

Question 37

Q

What is the variance?

Answer

A

Standard deviation squared (∂²)

Question 38

Q

How do you calculate standard deviation on a calculator? (listed data)

Answer

A

Press the MENU button and press 6 (statistics)
Press ‘1: 1-variable’
Then press ‘SHIFT’ ‘MENU’, go down a page and press ‘3: statistics’
press (2 : OFF)
input your data
then press option (OPTN)
then press 3: 1-variable calc’
find ∂x for standard deviation

Question 39

Q

How do you calculate standard deviation on a calculator? (grouped data)

Answer

A

Press the MENU button and press 6 (statistics)
Press ‘1: 1-variable’
Then press ‘SHIFT’ ‘MENU’, go down a page and press ‘3: statistics’
press (1 : ON)
input your data (for x input the MIDPOINTS and enter the frequencies)
then press option (OPTN)
then press 3: 1-variable calc’
find ∂x for standard deviation

Question 40

Q

What factors do you need to look out for when doing critical analysis?

Answer

A

Is there any data to back up statements made?
Use of vague or emotive language.
Has the writer assumed too much either about the subject matter or the readers knowledge?
how is the sample size and is it proportional to the research that they are doing?
(if a graph) does it have axis/ are the axis misleading?
Is it showing what it is meant to?
Is there errors in the data?
Is it even possible?
Are the scales distorting the data?
Is it the best type of graph?

Question 41

Q

What is the rule for outliers?

Answer

A

AN OUTLIER = an extreme value

it is generally when we’re 1.5 IQRs beyond the lower and upper quantities

Question 42

Q

What is an example of an outlier question?

Answer

A

IQR = 7
UQ = 22
LQ = 15

7 x 1.5 = 10.5
= 22 + 10.5 = 32.5
= 15 - 10.5 = 4.5

Question 43

Q

What is a point estimate?

Answer

A

the process of finding an approximate value of some parameter—such as the mean (average)—of a population from random samples of the population.
point estimation involves the use of sample data to calculate a single value which is to serve as a “best guess” or “best estimate” of an unknown population parameter. (e.g. finding the mean)
knowing that the mean of a sample is called a ‘point estimate’ for the mean of the population

Question 44

Q

How do you calculate point estimate?

Answer

A

A point estimate of the mean of a population is determined by calculating the mean of a sample drawn from the population.
The calculation of the mean is the sum of all sample values divided by the number of values.

Question 45

Q

How do you increase the accuracy of a point estimate?

Answer

A

The accuracy of the point estimate is likely to be improved by increasing the sample size

Question 46

Q

what is the equation for standardising?

Answer

A

= first find the area from the numbers (e.g. 0.45)

∂

N = number in probability

= you then look at the statistical tables