AP Stat Vocabulary Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Individuals

A

are objects described by a set of data. Individuals may be people, animals, or things.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Variable

A

any characteristic of an individual. A variable can take different values for different individuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Categorical Variable

A

places an individual into several groups of categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Quantitative Variable

A

takes numerical values for which it makes sense to find an average.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Distribution

A

The ___ of a variable tells us what values the variable takes and how often to take these values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Pie Chart

A

Chart that shows the distribution of a categorical variable as a “pie” whose slices are sized by the counts or percents for the categories.
A pie chart must include all the categories that
make up a whole.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Bar Graph

A

Graph used to display the distribution of a categorical
variable or to compare the sizes of different quantities.
The horizontal axis of a bar graph identifies the categories or quantities being compared.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Two-Way Table

A

Table of counts that organizes data about two categorical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Marginal Distribution

A

the ___ of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Conditional Distribution

A

A ____ of a variable describes the values of that variable among individuals who have a specific value of another variable. There is a separate conditional distribution for each value of the other variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Segmented Bar Graph

A

Graph used to compare the distribution of
a categorical variable in each of several groups. For each group,there is a single bar with “segments” that correspond to the different values of the categorical variable.
The height of each segment is determined by the percent of individuals in the group with that value. Each bar has a total height of 100%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Association

A

There is an _____ between two variables if knowing the value of one variable helps predict the value of the other. If knowing the value of one variable does not help you predict the value of the other, then there is no association between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Side-by-Side Bar Graph

A

Graph used to compare the distribution of a categorical variable in each of several groups. For each value
of the categorical variable, there is a bar corresponding to each group.
The height of each bar is determined by the count or percent of individuals in the group with that value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Frequency Table

A

Table that displays the count (frequency) of observations in each category or class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Relative Frequency Table

A

Table that shows the percents (relative frequencies) of observations in each category or class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Roundoff Error

A

Difference between the calculated approximation of a number and its exact mathematical value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Dotplot

A

Simple graph that shows each data value as a dot above

its location on a number line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Shape

A

When describing a distribution ___ look at the main features. Look for major peaks, clusters of values and obvious gaps, potential outliers, rough symmetry, or clear skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Symmetric (distribution)

A

A graph in which the right and left sides are approximately mirror images of each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Skewed Right

A

A distribution is skewed to the right if the right side of the graph (containing the half of the observations with larger values) is much longer than the left side.

21
Q

Skewed Left

A

It is skewed to the left if the left side of the graph is much longer than the right side.

22
Q

Unimodal

A

A graph of quantitative data with a single peak.

23
Q

Bimodal

A

A graph of quantitative data with two clear peaks.

24
Q

Multimodal

A

A graph of quantitative data with more than two

clear peaks.

25
Q

Stemplot

A

Simple graphical display for fairly small data sets that gives a quick picture of the shape of a distribution while including the actual numerical values in the graph.
Each observation is separated into a stem, consisting
of all but the final digit, and a leaf, the final digit.

26
Q

Splitting Stems

A

Method for spreading out a stemplot that has too

few stems.

27
Q

Back-to-Back Stemplots

A

Plot used to compare the distribution of a quantitative variable for two groups. Each observation in both groups is separated into a stem, consisting of all but the final digit, and a leaf, the final digit.
The stems are arranged in a vertical column with the smallest at the top. The values from one group are plotted on the left side of the stem and the values from the other group are plotted on the right side of the stem. Each leaf is written in the row next to its stem, with
the leaves arranged in increasing order out from the stem.

28
Q

Histogram

A
Graph that displays the distribution of a quantitative variable. The horizontal axis is marked in the units of measurement for the variable. The vertical axis contains the scale of counts or percents. Each bar in the graph represents an equal width class. 
The base of the bar covers the class, and the bar height is the class frequency or relative frequency.
29
Q

Mode

A

Value or class in a statistical distribution having the greatest frequency.

30
Q

Center

A

The center is the median and/or mean of the data.

31
Q

Spread

A

The range of data

32
Q

Range

A

The maximum value minus the minimum value for a set

of quantitative data.

33
Q

Outliers

A

Individual value that falls outside the overall pattern of a

distribution.

34
Q

Mean

A

Arithmetic average. To find the mean of a set of observations, add their values and divide by the number of observations.

35
Q

Median

A

The midpoint of a distribution; the number such that about half the observations are smaller and about half are larger.
To find the median of a distribution: (1) Arrange all observations in order of size, from smallest to largest. (2) If the number of observations n is odd, the median is the center observation in the ordered list. (3) If the number of observations n is even, the median is the average of the two center observations in the ordered list.

36
Q

Inference

A

Drawing conclusions that go beyond the data at hand.

37
Q

Interquartile Range

A

IQR = Q3 – Q1.

38
Q

Five Number Summary

A

Smallest observation, first quartile, median, third quartile, and largest observation, written in order from smallest to largest.
In symbols: Minimum Q1 Median Q3 Maximum

39
Q

Boxplot

A

Graph of the five-number summary. The box spans the quartiles and shows the spread of the central half of the distribution. The median is marked within the box. Lines extend from the box to the smallest and largest observations that are not outliers. Outliers are marked with a special symbol such as
an asterisk

40
Q

Standard Deviation

A

Statistic that measures the typical distance of the values in a distribution from the mean. It is calculated by finding an “average” of the squared distances and then taking the square root.

41
Q

Variance

A

“Average” squared deviation of the observations in a data set from their mean. In symbols,

42
Q

Census

A

Study that attempts to collect data from every individual

in the population.

43
Q

Continuous Variable

A

Variable that takes all values in an interval of numbers. The probability distribution of a continuous random variable is described by a density curve.
The probability of any event is the area under the density curve and above the values of the variable that make up the event.

44
Q

Discrete Variable

A

Takes a fixed set of possible values with
gaps between. The probability distribution of a discrete random variable gives its possible values and their probabilities.
The probability of any event is the sum of the probabilities for the values of the variable that make up the event.

45
Q

Univariate Data

A

When we conduct a study that looks at only one variable, we say that we are working with univariate data. Suppose, for example, that we conducted a survey to estimate the average weight of high school students. Since we are only working with one variable (weight), we would be working with univariate data

46
Q

Bivariate Data

A

When we conduct a study that examines the relationship between two variables, we are working with bivariate data. Suppose we conducted a study to see if there were a relationship between the height and weight of high school students. Since we are working with two variables (height and weight), we would be working with bivariate data

47
Q

Population

A

In a statistical study, the entire group of individuals

we want information about.

48
Q

Sample

A

Subset of individuals in the population from which we

actually collect data.

49
Q

Simpson’s Paradox

A

when averages are taken across different groups, they can appear to contradict the overall averages