SUMMARISING DATA and CORRELATION Flashcards

1
Q

What is an average?

A

An average is a single value used to describe a set of data as a measure of central tendency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 3 type of averages?

A

Mode, Median and Mean.
* The mode is the value appearing most often.
* The median is the middle value when all the values are in order.
* To work out the mean, add up all the values and divide by the number values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is meant by transforming data?

A

When all data values are transformed (increased or decreased) by the same amount or percentage, the averages are transformed by the same amount or percentage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two ways to measure dispersion or spread?

A

Range and Interquartile range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Whats the formula to calculate the range of a data set?

A

Range = largest value - smallest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Whats the formula to calculate the interquartile range of a data set?

A

Interquartile range = (IQR) = (Q3) - lower quartile (Q1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Box plot used to display?

A

Box plots are used to display the minimum value,
the maximum value, the lower quartile, the upper
quartile and the median from a distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the advantages of Mode?

A
  • Easy to find and always a data value
  • Can be used with any type of data
  • Unaffected by open-ended or extreme values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the disadvantages of Mode?

A
  • May not exist
  • Cannot be used to calculate
    a measure of spread
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the advantages of Median?

A
  • Unaffected by extreme values
  • Can be used to help calculate quartiles,
    interquartile range and skew
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the disadvantages of Median?

A
  • May not be a data value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the advantages of Mean?

A
  • Uses all the data
  • Can be used to calculate standard deviation
    and skew
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the disadvantages of Mean?

A
  • Always affected by open-ended or extreme values
  • Rarely a data value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the golden rules when comparing data?

A
  • always work out an average and make a comment
  • always work out a measure of spread and make a comment.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the two types of variables?

A

The explanatory (independent) variable
(the one you change) is plotted on the horizontal (x ) axis.

The response (dependent) variable (the one that
responds to, or depends on, the (explanatory variable) is plotted on the vertical (y ) axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is scatter diagram used for?

A

Scatter diagrams are used to show whether two sets of data are associated. This means there is a relationship between them.

17
Q

Define correlation

A

Correlation describes the association, which can show an increasing or decreasing trend. Correlation
can be strong or weak.

18
Q

Define causal relationship

A

A causal relationship between two variables means that a change in one of the variables directly causes a change in the other variable, such as the number of hours of sunshine and
the temperature, or the age of a car and its value.

19
Q

What does Spearman’s rank correlation coefficient measures?

A

Spearman’s rank correlation coefficient, measures the strength of the correlation
between two sets of data.

The value for Spearman’s rank correlation coeffi cient lies between -1 and +1. The further the value is from zero, the stronger the correlation.