Chapter 1 ~ Exploring Data Flashcards

1
Q

Distribution

A

Indicates what values a variable takes and the frequency (i.e. how often) with which it takes each of these values
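
As a minimal sketch of this definition, a distribution can be tabulated by listing each value next to its frequency. The data below (number of siblings reported by ten students) are hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical data: number of siblings reported by 10 students.
data = [0, 1, 1, 2, 2, 2, 3, 1, 0, 2]

# Count how often each value occurs, listed in increasing order of value.
distribution = dict(sorted(Counter(data).items()))

for value, frequency in distribution.items():
    print(value, frequency)
```

Each printed row pairs a value the variable takes with the number of times it occurs.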

2
Q

Outlier

A

An individual observation that falls outside the overall pattern of the data shown in the graph

3
Q

Relative frequency histogram

A

Has the same shape as an ordinary (frequency) histogram, except that the vertical axis measures relative frequencies (proportions of the total) instead of frequencies (counts)
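
A rough sketch of the idea: dividing every frequency by the number of observations gives the relative frequencies, so the bar heights keep the same proportions to one another. The data are hypothetical.

```python
from collections import Counter

# Hypothetical data: number of siblings reported by 10 students.
data = [0, 1, 1, 2, 2, 2, 3, 1, 0, 2]
n = len(data)

# Frequencies (counts) for each value, in increasing order of value.
counts = dict(sorted(Counter(data).items()))

# Relative frequency = frequency / total number of observations.
relative = {value: count / n for value, count in counts.items()}

for value, rel in relative.items():
    print(f"{value}: {rel:.1f}")
```

The relative frequencies add up to 1, which makes data sets of different sizes easier to compare.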

4
Q

What are the key features of a histogram?

A

Centre
Spread
Shape

5
Q

What are the three basic shapes of histograms?

A

Symmetric
Skewed right
Skewed left

6
Q

What are the three measures of centre?

A

Mean
Median
Mode
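
The three measures can be computed with Python's standard `statistics` module, as in the sketch below. The data set is hypothetical, and `multimode` (available from Python 3.8) is used so that ties between modes are all reported.

```python
from statistics import mean, median, multimode

# Hypothetical data set with six observations.
data = [2, 3, 3, 5, 7, 10]

centre_mean = mean(data)       # sum of the values divided by their count
centre_median = median(data)   # average of the two middle values when n is even
centre_modes = multimode(data) # list of the most frequent value(s)

print(centre_mean, centre_median, centre_modes)
```

For a data set such as `[2, 3, 3, 5, 7, 7, 9]`, `multimode` returns two values, which is the bimodal case.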

7
Q

Sample mean

A

Arithmetic average (arithmetic mean): the sum of the observations divided by the number of observations

8
Q

Mode

A

Element or elements that occur most often.

9
Q

Median

A

“Middle number” of the data when they have been arranged in increasing order. When the number of observations is even, it is the average of the two middle numbers.

10
Q

Bimodal

A

A data set with two modes

11
Q

Median position formula

A

(n + 1)/2, where n is the number of observations and positions are counted in the ordered data
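
A short sketch of how the position formula might be applied. It assumes positions are counted from 1 in the sorted data, and that a position ending in .5 means averaging the two neighbouring values. The function name `median_by_position` is made up for this example.

```python
def median_by_position(values):
    """Find the median by locating position (n + 1)/2 in the sorted data."""
    if not values:
        raise ValueError("median of an empty data set is undefined")
    ordered = sorted(values)
    n = len(ordered)
    position = (n + 1) / 2  # 1-based position in the ordered list

    if position.is_integer():
        # Odd n: the position points at a single middle value.
        return ordered[int(position) - 1]

    # Even n: the position falls halfway between two values; average them.
    lower = ordered[int(position) - 1]
    upper = ordered[int(position)]
    return (lower + upper) / 2


print(median_by_position([7, 1, 3, 9, 5]))  # n = 5, position 3
print(median_by_position([1, 3, 5, 7]))     # n = 4, position 2.5
```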

11
Q

What are the measures of spread?

A

Range
Interquartile range (IQR)
Five number summary
Variance and standard deviation

12
Q

Range

A

Largest value – smallest value

13
Q

Interquartile range (IQR)

A

Q3 – Q1

14
Q

Five number summary

A

Minimum, Q1, median, Q3, maximum
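
The sketch below computes a five number summary, along with the range and IQR that follow from it. Quartile conventions differ between textbooks and software; this example assumes Q1 and Q3 are the medians of the lower and upper halves of the sorted data, leaving out the overall median when n is odd. The function name and data are hypothetical.

```python
from statistics import median


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for at least two values."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # Assumed convention: skip the middle value itself when n is odd.
    upper = ordered[half + 1:] if n % 2 else ordered[half:]
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])


data = [1, 3, 4, 6, 7, 8, 10, 12, 15]
minimum, q1, med, q3, maximum = five_number_summary(data)

data_range = maximum - minimum  # largest value minus smallest value
iqr = q3 - q1                   # spread of the middle half of the data

print(minimum, q1, med, q3, maximum)
print(data_range, iqr)
```

Other quartile conventions (for example, interpolated quantiles) can give slightly different values for Q1 and Q3 on the same data.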

15
Q

Standard deviation

A

Measures how far the observations are spread out from the mean.

Not resistant to outliers.
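
To illustrate the lack of resistance, the sketch below compares the standard deviation of a small hypothetical data set before and after one value is replaced by an extreme one. It assumes the sample standard deviation (divisor n - 1), which is what `statistics.stdev` computes.

```python
from statistics import median, stdev

# Hypothetical data: the second list replaces one value with an extreme one.
without_outlier = [4, 5, 6, 5, 5]
with_outlier = [4, 5, 6, 5, 50]

sd_without = stdev(without_outlier)  # sample standard deviation (divisor n - 1)
sd_with = stdev(with_outlier)

print(round(sd_without, 2), round(sd_with, 2))

# The median, by contrast, is the same for both lists.
print(median(without_outlier), median(with_outlier))
```

A single extreme value inflates the standard deviation many times over, while the median does not move.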

16
Q

What should be used to describe a symmetric distribution?

A

Mean and standard deviation

17
Q

What should be used to describe a skewed distribution?

A

Median, IQR, five number summary

18
Q

1.5 IQR Rule

A

A data point is considered to be an outlier if it lies more than 1.5 × IQR below Q1 or more than 1.5 × IQR above Q3
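
A minimal sketch of the rule in code. It assumes Q1 and Q3 are the medians of the lower and upper halves of the sorted data (other quartile conventions can shift the cut-offs slightly). The function names and data are hypothetical.

```python
from statistics import median


def quartiles(values):
    """Return (Q1, Q3) as medians of the lower and upper halves."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # Assumed convention: skip the middle value itself when n is odd.
    upper = ordered[half + 1:] if n % 2 else ordered[half:]
    return median(lower), median(upper)


def iqr_outliers(values):
    """Return the values flagged as outliers by the 1.5 IQR rule."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    low_cutoff = q1 - 1.5 * iqr   # anything below this is an outlier
    high_cutoff = q3 + 1.5 * iqr  # anything above this is an outlier
    return [x for x in values if x < low_cutoff or x > high_cutoff]


data = [1, 3, 4, 6, 7, 8, 10, 12, 40]
print(iqr_outliers(data))
```

Here Q1 = 3.5 and Q3 = 11, so the IQR is 7.5 and the cut-offs are -7.75 and 22.25; only 40 lies beyond them.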

19
Q

Boxplot

A

(Aka box and whisker plot)

A graph which displays the five number summary of a set of data

20
Q

Modified boxplot

A

A graph which displays the five number summary of a data set and also indicates whether the data set contains outliers (according to the 1.5 IQR Rule)

21
Q

Side by side boxplots

A

Can be used to compare the distributions of two data sets