Chapters Two-Eight: Vocabulary Flashcards

(38 cards)

0
Q

The What

A

The variables (or labels)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

The Who

A

The subject(s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantitative Variables

A

Data that is measured in units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Area Principle

A

The area populated by a part of the graph that corresponds to the magnitude of the value it represents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Marginal Distribution

A

The frequency distribution of one of the variables in a margin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Conditional Distribution

A

The distribution of one variable for just those cases that satisfy a condition on another variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Distribution

A

1) gives the possible values of a variable

2) relative frequency of each value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Area Principle

A

Each data value should be represented by the same amount of area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Categorical data condition

A

Displaying and describing categorical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Contingency Table

A

Displays counts and, sometimes, percentages of individuals falling into named categories of two or more variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Simpson’s Paradox

A

When averages are taken from different groups and they appear to contradict the overall averages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you describe the shape of a histogram?

A

1) distinguish how many modes
2) determine whether it’s symmetrical or skewed
3) find any outliers or gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mode

A

The hump or high point in a histogram. It can be bimodal or unimodal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Uniform

A

Distribution that is basically even.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Median

A

The middle value of data. Normally used for a skewed graph with the interquartile range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Range

A

The difference between between the maximum and minimum.

16
Q

Interquartile Range

A

Difference between the first and third quartiles.

17
Q

What does the 5-Number Summary describe?

A

The minimum, Q1, the median, Q3, and the maximum

18
Q

Mean

A

Found by summing all of the data by dividing by the count

19
Q

Variance

A

The sum of squared deviations from the mean, divided by n-1

20
Q

Standard Deviation

A

Usually reported with the mean. It is the square root of the variance.

21
Q

Outlier

A

Any point more than 1.5 IQR from either end of the box in a box plot.

22
Q

Far Outlier

A

If a point is 3.0 IQR from either end of the boxplot

23
Q

Standardizing

A

Used to eliminate units. Standardized values can be compared even if the original variables had different units.

24
How do you find a standardized value?
By subtracting the mean an dividing by the standard deviation.
25
Shifting
Adding a constant to each data value adds the same constant to the mean, median, and quartiles. It does not change the standard deviation or IQR.
26
Rescaling
Multiplying each data value by a constant mulitplies both the measures of position and the measures of spread.
27
Parameter
A numerically valued attribute of a model.
28
What does a z-score tell?
How many standard deviations a value is from the mean.
29
Nearly Normal Condition
The model is unimodal and symmetric.
30
What does the association of a plot show?
1) direction 2) form 3) strength
31
Correlation Coefficient
"r" is a numerical measure of the direction and strength of a linear association.
32
Lurking Variable
A variable other than x and y that simultaneously affects both variables, accounting for the correlation between the two.
33
Predicted Value
"y" found for a given x-value in the data. The predicted values (x, ^y) all fit exactly on the line.
34
Residuals
The predicts value subtracted by the original value.
35
Regression to the Mean
Because the correlation is always less that 1.0, the predicted ^y tends to be fewer standard deviations from its mean than "x" was from its mean.
36
Line of Best Fit
Housetop y = B(of 0) + (B(of 1) • x) B(of 0) = y-intercept B(of 1) = slope
37
Slope
Given in "y-units per x-unit." Changes of one unit in x are associated with changes of of B(of 1).