Chapter 3 Flashcards

1
Q

Characteristic or measure obtained by using the data values from a sample

A

Statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Characteristic or measure obtained by using all the data values from a specific population

A

Parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Rounding is the last step
Round to one more decimal point than in original data

A

General Rounding Rule

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The sum of the total X values, divided by the total number of values

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Calculated by using sample data. This is a statistic

A

Sample Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Calculated by using all the values in the population. Population is a parameter

A

Population Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When data is in numeric order

A

Data Array

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Midpoint of the data array. Symbol is MD

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The value that occurs most often in a data set

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data set that only has one value that occurs with the greatest frequency

A

Unimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data set that has two values that occur with the same greatest frequency

A

Bimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A data set that has more than two values that occur with the same greatest frequency

A

Multimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

No data value occurs more than once

A

No mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Class with the largest frequency

A

Modal Class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Extremely low data values or extremely high data values in a data set

A

Outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Rough estimate of the middle. Affected by outliers.
(Defined as the sum of the lowest and highest values in the data set divided by 2)

A

Midrange (MR)

17
Q

Mean that considers an additional factor. Used when the values are not all equally represented

A

Weighted Mean

18
Q

Majority of the data values fall to the left of the mean and cluster at the lower end of the distribution; tail is to the right.
Mean is to the right of the median, and mode is to the left of the median.

A

Positively skewed or right-skewed distribution

19
Q

Data values are evenly distributed on both sides of the mean (when the distribution is unimodal, the mean, median, and mode are the same and at the center of the distribution)

A

Symmetric Distribution

20
Q

Majority of the data values fall to the right of the mean and cluster at the upper end of the distribution. Mean is to the left of the median, and mode is to the right of the median.

A

Negatively skewed or left-skewed

21
Q

Highest value minus the lowest value

A

Range (R)

22
Q

Based on the difference or distance each data value is from the mean

A

Data Variation

23
Q

Data variation difference or distance is called

A

Deviation

24
Q

Average of the squares of the distance each value is from the mean

A

Population Variance

25
Q

Square root of the variance

A

Population Standard Deviation

26
Q

Standard deviation divided by the mean. Results are represented as a percentage

A

Coefficient of Variation (CVar)

27
Q

Range used to approximate the standard deviation. Only is an approximation and should be used when distribution is unimodal and roughly symmetric

A

Range Rule of Thumb

28
Q

Specifies the portions of the spread in terms of the standard deviation (The proportion of values from a data set that will fall within k standard deviations of the mean will be at least 1-1/k^2) Can be used to find the minimum percentage of data values that will fall between any two given values

A

Chebyshev’s Theorem

29
Q

When distribution is bell-shaped

A

Empirical Rule

30
Q

Changing the data values to a different scale i.e) Changing data values to Fahrenheit scale

A

Linear Transformation of The Data

31
Q

Z score or Standard Score

A

A value obtained by subtracting the mean from the value and dividing the result by the standard deviation. The symbol is Z.

32
Q

Percentiles

A

Divide the data set into 100 equal groups.

33
Q

Quartiles

A

Divide the distribution into four equal groups

34
Q

Interquartile Range (IQR)

A

The difference between the third and first quartiles

35
Q

Outlier

A

Extremely low data value when compared with the rest of the data values

36
Q

Exploratory data analysis (EDA)

A

data can be organized using a stem and leaf plot. This measure of central currency is the median.

37
Q

Boxplot

A

Graph of a data set obtained by drawing a horizontal line from the minimum data value to Q1, drawing a horizontal line from Q3 to the maximum data value, and drawing a box whose vertical sides pass through Q1 and Q3 with a vertical line inside the box passing through the median or Q2.

38
Q

Modified boxplot

A

Can be drawn and used to check for outliers