Chapter2 Flashcards

1
Q

What are the types of data?

A

Numerical and Categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What data is categorised in Categorical data?

A

Nominal and Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What data is categorised in Numerical data?

A

Discrete and Continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a bar chart used for?

A

To display the frequency distribution of a Categorical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of data does a bar chart use?

A

Categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Mode?

A

The value of a variable that occurs most frequently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Histogram used for?

A

To display the frequency distribution of a Numerical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What type of data does a Histogram use?

A

Numerical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a Stem Plot?

A

The visual display of a numerical data set, an alternate display of a histogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a Dot Plot?

A

A number line with each data point marked by a dot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you describe the distribution of a numerical variable?

A

Shape (symmetric or skewed)
Center (midpoint of the distribution)
spread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Mean?

A

A summary statistic used to locate the centre of a symmetric distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Range?

A

The difference between the smallest and the largest data values

Range = Largest value - Smallest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Standard Deviation?

A

The summary statistic that measures the spread of the data values around the Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Median?

A

The summary statistic that can be used to locate the centre of a distribution. It is the midpoint of the distribution.
If the distribution is clearly skewed or the there are no outliers, the medium is preferred to the mean as a measure of centre.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a Quartile?

A

A summary statistic that divides an ordered data set into four equal groups.

17
Q

What does the IQR mean/is used for?

A

The Interquartile Range gives the spread of the middle 50% of data values in an ordered data set.
The IQR is preferred to the standard deviation as a measure of spread.

18
Q

What is in the Five-Number Summary?

A
Minimum value
The first Quartile
Median
The third Quartile
Maximum value
19
Q

How do you calculate for an Outlier?

A

Lower Fence = Q1 - (1.5xIQR)

Upper Fence = Q3 + (1.5xIQR)

20
Q

What is a Box Plot?

A

A visual display of a five-number summary with adjustments made to display outliers separately when they are present.