Numerical Measures and Data Representation Flashcards

1
Q

How to work out the Mean [x̄]

A

Sum of data values / number of data values

x̄ = Σx / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you work out the mean[x̄] from a frequency table?

A

Add new column that multiplies number (if needed do midpoint) by frequencies = (n*f)

x̄ = Σ(n*f) / Σf

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Q1 =

A

lower quartile

for grouped data
= 1/4 nth data value

for non-grouped data
= (n+1)/4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Q2 =

A

Median

for grouped data
= 1/2 nth data value

for non-grouped data
= (n+1)/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Q3

A

Upper Quartile

for grouped data
= 3/4 nth data value

for non-grouped data
= ((n+1)/4) *3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In Listed Data - for quartiles - if a decimal…

A

round up !

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In Listed data - for quartiles - if whole…

A

find midpoint with the next one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

For finding Quartiles in Grouped data:

A

INTERPOLATION

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Variance?

A

Shows how spread out data is.
(σ2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Standard Deviation (σ) fb

A

root of variance.

σ = √(Σx2/n) - (Σx/n)2
{mean of squares minus square of means}

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Standard Deviation/ Variance form a frequency table

A

USE CALCULATOR
- menu
- 6:Statistics
-1: 1-variable
- Fill Table (using Midpoints)
- Option
- 3: 1-variable Calc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

For coded data what terms affect the Standard Deviation?

A

only terms multiplied or divided

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Histogram how do you calculate Frequency Density ?

A

frequency / class width

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you draw a boxplot from a cumulative frequency diagram?

A

Find median in CumFreq and trace down to x-axis to get the data correlation
Half median to get LQ and add LQ to Median to get UQ then again map to x-axis to get actual data for each

Top value is most extreme value MAXIMUM
and Bottom value is least extreme value on the boxplot MINIMUM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why use a histogram?

A

data is continuous
no gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Comparing two diagrams what should you talk about?

A
  1. Location
  2. Spread
17
Q

What is the range for ‘r’ in Product Moment Correlation Coefficient?

A

-1<= r <= 1
PMCC measures strength and positivity of correlation - this is why can only go between 1 and -1

18
Q

Regression line

A

Line of Best Fit
y = a + bx

19
Q

What is meant by the term Interpolation?

A

Estimating inside the data range

20
Q

What is meant by Extrapolation?

A

estimating outside the data range

21
Q

When do you change boundaries in grouped data?

A

when there is a gap and it is continuous data

use these new boundaries to find midpoints

22
Q

Frequency Diagram

A

(x,y) <- plot points
x=upper class boundary
y=cumulative frequency

join with a curve

23
Q

frequency polygon

A

(x,y)
x= midpoint
y=frequency density

join up with straight lines

23
Q

Area of bar [histogram]

A

PROPORTIONAL to frequency
(if not to scale find scale factor)
always check scale on histogram

23
Q

What are histograms used for?

A

grouped, continuous data

24
Q

Pros and Cons of Box Plot

A

+ highlights outliers
+easy to compare data sets

-data categorised in 4 sections means detailed analysis is not possible

25
Q

Pros and Cons Histogram

A

+clearly shows shape of distribution

  • doesnt always highlight outliers
  • not easy to estimate quartiles and median
26
Q

Cumulative Frequency Curve Pros and Cons

A

+ makes it easy to find the min, LQ, median, UQ and max

  • doesnt always highlight outliers