Averages And Spread Flashcards

1
Q

What is an average also called?

A

Measure of central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is x̄?

A

The mean (x bar)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is it called when you have two modes?

A

Bimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Q1?

A

The lower quartile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Q2?

A

The median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Q3?

A

Upper quartile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the median?

A

The middle value when the data is in order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the lower quartile?

A

One quarter of the way through a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the upper quartile?

A

Three quarters of the way through the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a measure of spread?

A

A measure of how spread out the data is.
eg range, IQRange

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the advantage and disadvantage of using the range as a measure of spread?

A

Takes all the data into account but can be affected by extreme values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the advantage of using the interquartile range as a measure of spread?

A

It’s not affected by extreme values but only considers the spread of the middle 50% of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is standard deviation (σ) equal to?

A

The square root of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What do standard deviation and variance show?

A

The dispersion of values around the mean. Standard deviation gives a kind of ‘average’ amount by which all the values deviate from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the best measure of spread?

A

Standard deviation / variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does Σx mean?

A

Sum of the x values.

17
Q

What does Σx^2 mean?

A

Sum of the x^2 values.

18
Q

What does σ^2 x mean?

19
Q

What does σx mean?

A

Standard deviation

20
Q

What does n mean?

A

Number in data set.

21
Q

What does min(x) mean?

A

Smallest data value

22
Q

What does max(x) mean?

A

Largest data value

23
Q

What is an outlier?

A

An extreme value that lies outside the overall pattern of data.

24
Q

What is a common definition of an outlier?

A

• Greater than Q3 + k(Q3 -Q1)
Or
• Less than Q1 - k(Q3 - Q1)

25
What is (Q3 - Q1)?
The interquartile range.
26
Are there sometimes outliers that are legitimate values?
Yes
27
When outliers are a mistake, what are they called?
Anomalies
28
What is the process of removing anomalies from a data set known as?
Cleaning the data.
29
What must you do when removing anomalies?
Justify why it is an anomalie and not just an outlier.
30
What does a box plot show?
•median •quartile • max and min values •outliers
31
When comparing 2 box plots what should you consider?
• median as a measure of average • IQR as a measure of spread • outliers • symmetry
32
What is a symmetrical box plot?
When the median is in the middle of the box, there is no skew. *Q3 - Q2 = Q2 - Q1*
33
What is a positive skew box plot?
When the median is to the left of the box. *Q3 - Q2 > Q2 - Q1*
34
What is a negative skew box blot?
When the median is to the right of the box. *O3 - Q2 < Q2 - Q1*
35
In a histogram, what is the area of the bar proportional to?
The frequency in each class.