Averages And Spread Flashcards

1
Q

What is an average also called?

A

Measure of central tendency

2
Q

What is x̄?

A

The mean (x bar)

3
Q

What is it called when you have two modes?

A

Bimodal

4
Q

What is Q1?

A

The lower quartile

5
Q

What is Q2?

A

The median

6
Q

What is Q3?

A

The upper quartile

7
Q

What is the median?

A

The middle value when the data is in order.

8
Q

What is the lower quartile?

A

The value one quarter of the way through the data set when it is in order.

9
Q

What is the upper quartile?

A

The value three quarters of the way through the data set when it is in order.
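
As a rough illustration of Q1, Q2 and Q3, the sketch below finds the three quartiles with Python's standard library. The data values are made up, and the "inclusive" method is only one of several quartile conventions, so a textbook or calculator may give slightly different answers.

```python
import statistics

# Hypothetical data set, already in order for readability
data = [1, 3, 5, 7, 9, 11, 13]

# quantiles(..., n=4) returns the three cut points Q1, Q2, Q3.
# method="inclusive" treats the data as the whole population;
# other conventions can give slightly different quartiles.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")

print(q1, q2, q3)                # 4.0 7.0 10.0
print(statistics.median(data))   # 7, the same value as Q2
```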

10
Q

What is a measure of spread?

A

A measure of how spread out the data is.
e.g. range, interquartile range (IQR)

11
Q

What are the advantage and disadvantage of using the range as a measure of spread?

A

It covers the full extent of the data, but it depends only on the largest and smallest values, so it can be affected by extreme values.

12
Q

What are the advantage and disadvantage of using the interquartile range as a measure of spread?

A

It’s not affected by extreme values but only considers the spread of the middle 50% of the data.
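
A small sketch of how one extreme value changes the range but not the interquartile range. The data sets and the helper name `spread` are invented for illustration, and the "inclusive" quartile method is an assumed convention.

```python
import statistics

def spread(values):
    """Return (range, IQR) for a list of numbers."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return max(values) - min(values), q3 - q1

data = [1, 3, 5, 7, 9, 11, 13]
with_extreme = [1, 3, 5, 7, 9, 11, 100]   # largest value replaced by 100

print(spread(data))           # (12, 6.0)
print(spread(with_extreme))   # (99, 6.0): range changes, IQR does not
```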

13
Q

What is standard deviation (σ) equal to?

A

The square root of variance

14
Q

What do standard deviation and variance show?

A

The dispersion of values around the mean. Standard deviation gives a kind of ‘average’ amount by which all the values deviate from the mean.
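
A minimal sketch of that idea, working out the variance as the mean of the squared deviations from the mean and taking its square root. It assumes the population form (dividing by n) and uses made-up data.

```python
import math

data = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical values

mean = sum(data) / len(data)

# Variance: the mean of the squared deviations from the mean
variance = sum((x - mean) ** 2 for x in data) / len(data)

# Standard deviation: the square root of the variance
sd = math.sqrt(variance)

print(mean, variance, sd)   # 5.0 4.0 2.0
```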

15
Q

What is the best measure of spread?

A

Standard deviation / variance, because they use every value in the data set.

16
Q

What does Σx mean?

A

Sum of the x values.

17
Q

What does Σx^2 mean?

A

Sum of the x^2 values.

18
Q

What does σ^2 x mean?

A

Variance

19
Q

What does σx mean?

A

Standard deviation

20
Q

What does n mean?

A

The number of values in the data set.
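
One way the summary values n, Σx and Σx^2 fit together is the shortcut formula variance = Σx^2/n - x̄^2. The sketch below assumes that population formula and uses made-up data; the variable names are only illustrative.

```python
import math

data = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical values

n = len(data)                        # n: number of values
sum_x = sum(data)                    # Σx
sum_x2 = sum(x ** 2 for x in data)   # Σx^2

mean = sum_x / n                     # x̄
variance = sum_x2 / n - mean ** 2    # Σx^2/n - x̄^2
sd = math.sqrt(variance)

print(n, sum_x, sum_x2)   # 8 40 232
print(variance, sd)       # 4.0 2.0
```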

21
Q

What does min(x) mean?

A

Smallest data value

22
Q

What does max(x) mean?

A

Largest data value

23
Q

What is an outlier?

A

An extreme value that lies outside the overall pattern of data.

24
Q

What is a common definition of an outlier?

A

• Greater than Q3 + k(Q3 - Q1)
or
• Less than Q1 - k(Q3 - Q1)
where k is a given constant (often 1.5).

25
Q

What is (Q3 - Q1)?

A

The interquartile range.
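
The outlier rule can be sketched as below. The function name `find_outliers`, the data, the default k = 1.5 and the "inclusive" quartile method are all assumptions for illustration; use whatever k and quartile convention the question gives.

```python
import statistics

def find_outliers(values, k=1.5):
    """Return values below Q1 - k*IQR or above Q3 + k*IQR."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1                # interquartile range, Q3 - Q1
    lower = q1 - k * iqr         # lower boundary
    upper = q3 + k * iqr         # upper boundary
    return [x for x in values if x < lower or x > upper]

data = [1, 3, 5, 7, 9, 11, 100]   # hypothetical values
print(find_outliers(data))        # [100]
```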

26
Q

Are there sometimes outliers that are legitimate values?

A

Yes

27
Q

When outliers are a mistake, what are they called?

A

Anomalies

28
Q

What is the process of removing anomalies from a data set known as?

A

Cleaning the data.

29
Q

What must you do when removing anomalies?

A

Justify why it is an anomaly and not just an outlier.

30
Q

What does a box plot show?

A

• median
• quartiles
• max and min values
• outliers

31
Q

When comparing 2 box plots what should you consider?

A

• median as a measure of average
• IQR as a measure of spread
• outliers
• symmetry

32
Q

What is a symmetrical box plot?

A

When the median is in the middle of the box, so there is no skew.
Q3 - Q2 = Q2 - Q1

33
Q

What is a positive skew box plot?

A

When the median is closer to the left-hand side of the box (nearer Q1).
Q3 - Q2 > Q2 - Q1

34
Q

What is a negative skew box plot?

A

When the median is closer to the right-hand side of the box (nearer Q3).
Q3 - Q2 < Q2 - Q1
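
The three box plot skew rules can be put side by side in a short sketch. The function name `box_plot_skew` and the quartile values are hypothetical.

```python
def box_plot_skew(q1, q2, q3):
    """Classify skew by comparing the two parts of the box."""
    upper_part = q3 - q2   # median to upper quartile
    lower_part = q2 - q1   # lower quartile to median
    if upper_part > lower_part:
        return "positive skew"
    if upper_part < lower_part:
        return "negative skew"
    return "symmetrical"

print(box_plot_skew(4, 7, 10))   # symmetrical
print(box_plot_skew(4, 5, 10))   # positive skew
print(box_plot_skew(4, 9, 10))   # negative skew
```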

35
Q

In a histogram, what is the area of the bar proportional to?

A

The frequency in each class.
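
A sketch of what this means for bar heights, assuming the usual convention that height is frequency density = frequency ÷ class width (so area equals frequency exactly). The classes and the function name `frequency_densities` are made up.

```python
def frequency_densities(classes):
    """Return the bar height for each (low, high, frequency) class."""
    return [freq / (high - low) for low, high, freq in classes]

# Hypothetical grouped data with unequal class widths
classes = [(0, 10, 5), (10, 20, 12), (20, 50, 9)]
heights = frequency_densities(classes)

for (low, high, freq), height in zip(classes, heights):
    area = height * (high - low)   # height x width gives back the frequency
    print(f"{low}-{high}: height {height:.2f}, area {area:.1f}")
```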