Descriptive Statistics Flashcards

1
Q

Bins (numerical data); classes (categorical data)

A

The nonoverlapping groupings of data used to create a frequency distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Box plot

A

A graphical summary of data based on the quartiles of a distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Categorical data

A

Data for which categories of like items are identified by labels or names. Arithmetic operations cannot be performed on this kind of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Coefficient of variation

A

A measure of relative variability computed by dividing the standard deviation by the mean and multiplying by 100.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Correlation coefficient

A

A standardized measure of linear association between two variables that takes on values between —1 and + 1. Values near —1 indicate a strong negative linear relationship, values near +1 indicate a strong positive linear relationship, and values near zero indicate the lack of a linear relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Covariance

A

A measure of linear association between two variables. Positive values indicate a positive relationship; negative values indicate a negative relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cross-sectional data

A

Data collected at the same or approximately the same point in time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

cumulative frequency distribution

A

A tabular summary of quantitative data showing the number of data values that are less than or equal to the upper class limit of each bin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data

A

The facts and figures collected, analyzed, and summarized for presentation and interpretation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Empirical Rule (68-95-99.7)

A

A rule that can be used to compute the percentage of data values that must be within 1, 2, or 3 standard deviations of the mean for data that exhibit a bell-shaped distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Frequency distribution

A

A tabular summary of data showing the number of data values in each of several nonoverlapping bins.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Geometric mean

A

A measure of central tendency that is calculated by finding the nth root of the product of n values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Growth factor

A

The percentage increase of a value over a period of time is calculated using the formula (1 - bake). A value less than 1 indicates negative growth, whereas a value greater than 1 indicates positive growth. The value cannot be less than zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Histogram

A

A graphical presentation of a frequency distribution, relative frequency distribution, or percent frequency distribution of quantitative data constructed by placing the bin intervals on the horizontal axis and the frequencies, relative frequencies, or percent frequencies on the vertical axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Interquartile Range (IQR)

A

The difference between the third and first quartiles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

mean (arithmetic mean)

A

A measure of central tendency computed by summing the data values and dividing by the number of observations.

17
Q

Median

A

A measure of central tendency provided by the value in the middle when the data are arranged in ascending order.

18
Q

Mode

A

A measure of central tendency, defined as the value that occurs with greatest frequency.

19
Q

Observation

A

A set of values corresponding to a set of variables.

20
Q

Outlier

A

An unusually large or unusually small data value.

21
Q

percent frequency distribution

A

A tabular summary of data showing the percentage of data values in each of several nonoverlapping bins.

22
Q

Percentile

A

A value such that approximately p percent of the observations have values less than the pth percentile; hence, approximately (100 - p) percent of the observations have values greater than the pth percentile. The 50th percentile is the median.

23
Q

Population

A

The set of all elements of interest in a particular study.

24
Q

Quantitative data

A

Data for which numerical values are used to indicate magnitude, such as how many or how much. Arithmetic operations such as addition, subtraction, and multiplication can be performed on this data.

25
Q

Quartile

A

The 25th, 50th, and 75th percentiles, referred to as the first quartile, second quartile (median), and third quartile, respectively. These can be used to divide a data set into four parts, with each part containing approximately 25% of the data.

26
Q

Random sampling

A

Collecting a sample that ensures that: (1) each element selected comes from the same population and (2) each element is selected independently.

27
Q

Random variable

A

A quantity whose values are not known with certainty.

28
Q

Range

A

A measure of variability, defined to be the largest value minus the smallest value.

29
Q

Relative frequency distribution

A

A tabular summary of data showing the fraction or proportion of data values in each of several nonoverlapping bins.

30
Q

Sample

A

A subset of the population.

31
Q

Scatter chart

A

A graphical presentation of the relationship between two quantitative variables. One variable is shown on the horizontal axis and the other on the vertical axis.

32
Q

Skewness

A

A measure of the lack of symmetry in a distribution.

33
Q

Standard deviation

A

A measure of variability computed by taking the positive square root of the variance.

34
Q

Time series data

A

Data that are collected over a period of time

35
Q

Variable

A

A characteristic or quantity of interest that can take on different values.

36
Q

Variation

A

Differences in values of a variable over observations.

37
Q

z-score

A

A value computed by dividing the deviation about the mean (x1 - x) by the standard deviation σ. It is referred to as a standardized value and denotes the number of standard deviations that x is from the mean.