Chapter 3 Flashcards

1
Q

What is an outlier?

A

An extreme value that lies outside the overall pattern of the data.

2
Q

How do you identify an outlier?

A
  • Either greater than Q₃ + k(Q₃ − Q₁)
  • Or less than Q₁ − k(Q₃ − Q₁)
  • k is often 1.5
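
A minimal sketch of the rule above in Python. The function names and the sample values are made up for illustration, and the quartiles are passed in directly because different methods of finding Q₁ and Q₃ can give slightly different values.

```python
def outlier_bounds(q1, q3, k=1.5):
    """Return the (lower, upper) limits; values outside them are outliers."""
    iqr = q3 - q1  # interquartile range, Q3 - Q1
    return q1 - k * iqr, q3 + k * iqr


def find_outliers(data, q1, q3, k=1.5):
    """Return the values in data that lie outside the limits."""
    lower, upper = outlier_bounds(q1, q3, k)
    return [x for x in data if x < lower or x > upper]


# Illustrative values only: Q1 = 10, Q3 = 20, so the IQR is 10
lower, upper = outlier_bounds(10, 20)
outliers = find_outliers([2, 12, 15, 18, 40, -7], 10, 20)
```

A value exactly on a limit is not counted as an outlier here, since the rule says "greater than" and "less than".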
3
Q

Sometimes outliers can be correct, so why remove them from the data?

A

Keeping an outlier in the data can be misleading if it is clearly an error. Outliers of this kind are called anomalies and should be removed from the data.

4
Q

What does cleaning the data mean?

A

Removing anomalies from the data.

5
Q

What do box plots show?

A

The important features of the data:
- Quartiles
- Max and min values (range)
- Outliers

6
Q

How do you draw a cumulative frequency diagram?

A

x-axis = upper class boundary of each class
y-axis = cumulative frequency
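
A short sketch of how the points for such a diagram could be worked out, assuming each cumulative frequency is plotted against the upper boundary of its class. The function name and the class data are hypothetical.

```python
from itertools import accumulate


def cumulative_frequency_points(upper_boundaries, frequencies):
    """Pair each upper class boundary with the running total of frequencies."""
    return list(zip(upper_boundaries, accumulate(frequencies)))


# Illustrative classes 0-10, 10-20, 20-30, 30-40 with made-up frequencies
points = cumulative_frequency_points([10, 20, 30, 40], [3, 7, 12, 5])
```

Each pair in `points` is an (x, y) coordinate to plot, and the last y-value equals the total frequency.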

7
Q

What type of data is represented in histograms?

A

Grouped continuous data.

8
Q

What is the area of a bar in a histogram proportional to?

A

It is proportional to the frequency of each class.

9
Q

How do you plot a histogram?

A

x-axis = class boundaries (the width of each bar is its class width)
y-axis = frequency density
Bars touch, with no gaps between them

10
Q

How do you calculate frequency density?

A

Frequency density = frequency / class width
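
A small sketch of this calculation in Python, using made-up classes. It also checks that the area of each bar (frequency density × class width) gives back the frequency. The function name and the data are assumptions for illustration.

```python
def frequency_density(frequency, lower, upper):
    """Frequency density = frequency / class width."""
    return frequency / (upper - lower)


# Illustrative classes as (lower boundary, upper boundary, frequency)
classes = [(0, 10, 5), (10, 15, 10), (15, 30, 15)]

densities = [frequency_density(f, lo, hi) for lo, hi, f in classes]

# Area of each bar = frequency density x class width
areas = [d * (hi - lo) for d, (lo, hi, _) in zip(densities, classes)]
```

With these classes the narrowest class has the tallest bar, even though it does not have the largest frequency.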

11
Q

What do you use to compare data sets?

A
  • Measure of location (mean, median, mode)
  • Measure of spread (interquartile range, standard deviation, variance, range)