Chapter 1 Vocab Flashcards

Question

Outlier

Answer 1

A data point that differs significantly from other observations.

Answer 2

The most common value of a data set.

Answer 3

When the right and left sides of the graph are approximately mirror images of each other.

Answer 4

The tail of the graph is on the right side: the right side of the graph (containing the half of the observations with larger values) is much longer than the left side.

Answer 5

The tail of the graph is on the left side: the left side of the graph (containing the half of the observations with smaller values) is much longer than the right side.

Answer 6

A dot plot in which there is only one peak.

Answer 7

A dot plot in which there are two peaks.

Answer 8

A plot where each data value is split into a "leaf" (usually the last digit) and a "stem" (the other digits). Stems are written in a vertical column from the smallest at the top to the largest at the bottom; no stem is skipped, even if it has no data values. A vertical line is drawn to the right of this column, and each leaf is written to the right of the line, next to its stem. The leaves in each row are arranged in increasing order, moving out from the stem. (Ex: [5|2 4] represents the values 52 and 54.)
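The construction can be sketched in a few lines of Python. This is only an illustration, assuming non-negative whole-number data whose last digit is the leaf; the function name `stemplot` is made up for this example.

```python
def stemplot(values):
    """Return the rows of a stemplot as strings like '5|2 4'.

    Assumes a non-empty list of non-negative integers; the last digit
    is the leaf and the remaining digits are the stem.
    """
    rows = {}
    for value in sorted(values):          # sorting keeps leaves in increasing order
        stem, leaf = divmod(value, 10)    # 52 -> stem 5, leaf 2
        rows.setdefault(stem, []).append(leaf)

    lines = []
    # Walk every stem from smallest to largest so that none is skipped.
    for stem in range(min(rows), max(rows) + 1):
        leaves = " ".join(str(leaf) for leaf in rows.get(stem, []))
        lines.append(f"{stem}|{leaves}")
    return lines


for line in stemplot([54, 52, 78, 73]):
    print(line)
```

The stem 6 still gets a row here even though no value falls in the 60s.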

Answer 9

All digits but the final (ones) digit

Answer 10

The final (ones) digit

Answer 11

A method of spreading out a stemplot so that the shape of the distribution is easier to identify. Each stem is listed twice: leaves 0-4 go on the first copy of the stem and leaves 5-9 go on the second.

Answer 12

A stemplot used for comparing two sets of quantitative data that share the same stems. One data set's leaves are written to the left of the stem, and the other data set's leaves are written to the right.

Answer 13

A diagram consisting of rectangles whose area is proportional to the frequency of a variable and whose width is equal to the class interval. Displays the distribution of a quantitative variable.

Answer 14

The average of the data set. Also known as "x bar" (x̅), written as the letter x with a horizontal line above it. Found by adding all of the data points and dividing by the number of points that were added. Formula in compact notation: x̅ = ∑xi / n
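A quick sketch of the formula in Python, with made-up numbers. The function name `mean` is just for this example.

```python
def mean(data):
    """Add all of the data points and divide by how many there are."""
    return sum(data) / len(data)


print(mean([2, 4, 6, 8]))        # 20 / 4
print(mean([2, 4, 6, 8, 100]))   # one extreme value pulls the mean upward
```

The second call shows why the mean is sensitive to extreme values: adding a single point of 100 moves it from 5.0 to 24.0.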

Answer 15

Describes whether a measure of center is affected by outliers. If it is not strongly affected, it is a resistant measure of center (for example, the median). If it is affected, it is not a resistant measure of center (for example, the mean).

Answer 16

The midpoint of a distribution, the number such that about half the observations are smaller and about half are larger. Arrange all observations in order of size, from smallest to largest. If the number of observations n is odd, the median is the center observation in the ordered list. If the number of observations n is even, the median is the average of the two center observations in the ordered list.
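The odd and even cases can be sketched in Python as follows. This is a minimal illustration with a hypothetical function name, assuming a non-empty list of numbers.

```python
def median(data):
    """Median of a non-empty list, following the odd/even rule above."""
    ordered = sorted(data)                 # smallest to largest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                # odd n: the center observation
    # even n: average of the two center observations
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([5, 1, 3]))       # ordered list is 1, 3, 5
print(median([4, 1, 3, 2]))    # ordered list is 1, 2, 3, 4
```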

Answer 17

Shows the full spread of the data. Calculated by subtracting the smallest number in the data set from the largest number in the data set. It is not resistant: a single outlier can change it greatly.

Answer 18

The median of the observations that are to the left of the median in the ordered list.

Answer 19

The median of the observations that are to the right of the median in the ordered list.

Answer 20

A measure of spread equal to the difference between the third and first quartiles (Q3 - Q1), that is, between the 75th and 25th percentiles.

Answer 21

This rule uses the first and third quartile values as well as the IQR to identify outliers in the data set. The rule is: if a data point is less than Q1 - 1.5 × IQR or more than Q3 + 1.5 × IQR, it is an outlier.
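The rule can be sketched in Python like this. It assumes the quartiles are found as the medians of the lower and upper halves of the ordered list (leaving out the middle value when n is odd); other software may use a slightly different quartile method. The function names and the data are made up for this example.

```python
from statistics import median


def quartiles(data):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    ordered = sorted(data)
    n = len(ordered)
    lower = ordered[: n // 2]          # observations left of the median
    upper = ordered[(n + 1) // 2 :]    # observations right of the median
    return median(lower), median(upper)


def find_outliers(data):
    """Return the points outside Q1 - 1.5*IQR and Q3 + 1.5*IQR."""
    q1, q3 = quartiles(data)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [x for x in sorted(data) if x < low_fence or x > high_fence]


data = [1, 10, 11, 12, 13, 14, 15, 30]
print(quartiles(data))
print(find_outliers(data))
```

For this data, Q1 = 10.5 and Q3 = 14.5, so IQR = 4 and the cutoffs are 4.5 and 20.5; the values 1 and 30 fall outside them.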

Answer 22

A set of 5 numbers that show the overall spread and diversity of a data set. The 5 numbers are: the minimum data point, the first quartile, the median, the third quartile, and the maximum data point (in that order).
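A short Python sketch of the five numbers, again assuming the quartiles are the medians of the lower and upper halves of the ordered list. The function name `five_number_summary` is hypothetical.

```python
from statistics import median


def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    ordered = sorted(data)
    n = len(ordered)
    q1 = median(ordered[: n // 2])          # median of the lower half
    q3 = median(ordered[(n + 1) // 2 :])    # median of the upper half
    return ordered[0], q1, median(ordered), q3, ordered[-1]


print(five_number_summary([7, 1, 4, 2, 6, 3, 5]))
```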

Answer 23

A graph that is formed by the 5 number summary, creating a visual on a number line that shows the quarters of the data set. A boxplot is arranged above a number line, with a central box drawn from the first quartile (Q1) to the third quartile (Q3), and a line inside the box to mark the median. Lines (called whiskers) extend from the box out to the smallest and largest observations that are not outliers. Outliers are marked with a special symbol like an asterisk.

Answer 24

The difference between a data point and the mean of the set: (xi - x̄). It is positive for points above the mean and negative for points below it.

Answer 25

The expectation of the squared deviation of a random variable from its mean. For a sample, it is the "average" of the squared deviations, dividing by n - 1 instead of n. Formula: s² = ∑(xi - x̄)² / (n - 1). Variance is also written as s²x.

Answer 26

The "typical" distance of the values in the data set from the mean. Formula: standard deviation = √variance. Standard deviation is also written as sx.
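Variance and standard deviation can be sketched together in Python. This is an illustration only, using the n - 1 divisor and made-up data; the function names are hypothetical.

```python
import math


def sample_variance(data):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(data)
    x_bar = sum(data) / n
    return sum((x - x_bar) ** 2 for x in data) / (n - 1)


def sample_std(data):
    """Standard deviation: the square root of the variance."""
    return math.sqrt(sample_variance(data))


data = [1, 2, 3, 4, 5]
print(sample_variance(data))
print(sample_std(data))
```

For this data the mean is 3, the squared deviations are 4, 1, 0, 1, 4 (sum 10), and the variance is 10 / 4 = 2.5.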

(50 cards)