Chapter 3: Descriptive Statistics: Numerical Measures Flashcards

1
Q

sample statistics

A

a numerical value used as a summary measure for a sample (ex. the sample mean, x bar, the sample variance, s^2, and the sample standard deviation, s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

population parameter

A

a numerical value used as a summary measure for a population (ex. the population mean, u, the population variance, o^2, and the population standard deviation, o)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

point estimator

A

a sample statistic, such as x bar, s^2, and s, used to estimate the corresponding population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

mean

A

a measure of central location computed by summing the data values and dividing by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

median

A

a measure of central location provided by the value in the middle when the data are arranged in ascending order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

mode

A

a measure of central location, defined as the value that occurs with greatest frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

weighted mean

A

the mean obtained by assigning each observation a weight that reflects its importance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

geometric mean

A

a measure of location that is calculated by finding the nth root of the product of n values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

percentile

A

a value that provides information about how the data are spread over the interval from the smallest to the largest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

pth percentile

A

for a data set containing n observations, the pth percentile divides the data into two parts: approximately p% of the observation are less that the pth percentile and approximately (100-p)% of the observations are greater than the pth percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

quartiles

A

the 25th, 50th, and 75th percentiles, referred to as the first, second (median), and third quartile, respectively. the quartiles can be used to divide a data set into four parts, with each part containing approximately 25% of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

range

A

a measure of variability, defined to be the largest value minus the smallest value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

interquartile range (IQR)

A

a measure of variability, defined to be the difference between the third and first quartiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

variance

A

a measure of variability based on the squared deviations of the data values about the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

standard deviation

A

a measure of variability computed by taking the positive square root of the variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

coefficient of variation

A

a measure of relative variability computed by dividing the standard deviation by the mean and mulptiplying by 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

skewness

A

a measure of the shape of a data distribution. data skewed to the left=negative skewness. skewed to the right=positive skewness. symmetrical data=zero skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

z-score

A

a value computed by dividing the deviation about the mean (xi-x bar) by the standard deviation s. a z-score is referred to as a standardized value and denotes the number of standard deviations xi is from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

chebyshev’s theorem

A

a theorem that can be used to make statements about the proportion of data values that must be within a specified number of standard deviations of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

emperical rule

A

a rule that can be used to compute the percentage of data values that must be within one, two and three standard deviations of the mean for the data that exhibit a bell-shaped distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

outlier

A

an unusually small or large data value

22
Q

five number summary

A

a technique that uses 5 numbers to summarize the data: smallest value, 1st quartile, median, 3rd quartile, largest value

23
Q

box plot

A

a graphical summary of data based on a 5 number summary

24
Q

covariance

A

a measure of linear association between two variables. positive values=positive relationship. negative values=negative relationship.

25
Q

correlation coefficient

A

a measure of linear association between two variables that takes on values between -1 and +1. values near +1 indicate a strong positive linear relationship. values near -1 indicate a strong negative linear relationship. and values near zero lack any liner relationship.

26
Q

a numerical value used as a summary measure for a sample (ex. the sample mean, x bar, the sample variance, s^2, and the sample standard deviation, s)

A

sample statistics

27
Q

a numerical value used as a summary measure for a population (ex. the population mean, u, the population variance, o^2, and the population standard deviation, o)

A

population parameter

28
Q

a sample statistic, such as x bar, s^2, and s, used to estimate the corresponding population parameter.

A

point estimator

29
Q

a measure of central location computed by summing the data values and dividing by the number of observations.

A

mean

30
Q

a measure of central location provided by the value in the middle when the data are arranged in ascending order.

A

median

31
Q

a measure of central location, defined as the value that occurs with greatest frequency.

A

mode

32
Q

the mean obtained by assigning each observation a weight that reflects its importance.

A

weighted mean

33
Q

a measure of location that is calculated by finding the nth root of the product of n values.

A

geometric mean

34
Q

a value that provides information about how the data are spread over the interval from the smallest to the largest value

A

percentile

35
Q

for a data set containing n observations, the pth percentile divides the data into two parts: approximately p% of the observation are less that the pth percentile and approximately (100-p)% of the observations are greater than the pth percentile.

A

pth percentile

36
Q

the 25th, 50th, and 75th percentiles, referred to as the first, second (median), and third quartile, respectively. the quartiles can be used to divide a data set into four parts, with each part containing approximately 25% of the data.

A

quartiles

37
Q

a measure of variability, defined to be the largest value minus the smallest value.

A

range

38
Q

a measure of variability, defined to be the difference between the third and first quartiles

A

interquartile range (IQR)

39
Q

a measure of variability based on the squared deviations of the data values about the mean.

A

variance

40
Q

a measure of variability computed by taking the positive square root of the variance.

A

standard deviation

41
Q

a measure of relative variability computed by dividing the standard deviation by the mean and mulptiplying by 100

A

coefficient of variation

42
Q

a measure of the shape of a data distribution. data skewed to the left=negative skewness. skewed to the right=positive skewness. symmetrical data=zero skewness

A

skewness

43
Q

a value computed by dividing the deviation about the mean (xi-x bar) by the standard deviation s. a z-score is referred to as a standardized value and denotes the number of standard deviations xi is from the mean.

A

z-score

44
Q

a theorem that can be used to make statements about the proportion of data values that must be within a specified number of standard deviations of the mean

A

chebyshev’s theorem

45
Q

a rule that can be used to compute the percentage of data values that must be within one, two and three standard deviations of the mean for the data that exhibit a bell-shaped distribution.

A

emperical rule

46
Q

an unusually small or large data value

A

outlier

47
Q

a technique that uses 5 numbers to summarize the data: smallest value, 1st quartile, median, 3rd quartile, largest value

A

five number summary

48
Q

a graphical summary of data based on a 5 number summary

A

box plot

49
Q

a measure of linear association between two variables. positive values=positive relationship. negative values=negative relationship.

A

covariance

50
Q

a measure of linear association between two variables that takes on values between -1 and +1. values near +1 indicate a strong positive linear relationship. values near -1 indicate a strong negative linear relationship. and values near zero lack any liner relationship.

A

correlation coefficient