Chapters Two-Eight: Vocabulary Flashcards

0
Q

The What

A

The variables (or labels)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

The Who

A

The subject(s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantitative Variables

A

Data that is measured in units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Area Principle

A

The area populated by a part of the graph that corresponds to the magnitude of the value it represents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Marginal Distribution

A

The frequency distribution of one of the variables in a margin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Conditional Distribution

A

The distribution of one variable for just those cases that satisfy a condition on another variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Distribution

A

1) gives the possible values of a variable

2) relative frequency of each value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Area Principle

A

Each data value should be represented by the same amount of area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Categorical data condition

A

Displaying and describing categorical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Contingency Table

A

Displays counts and, sometimes, percentages of individuals falling into named categories of two or more variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Simpson’s Paradox

A

When averages are taken from different groups and they appear to contradict the overall averages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you describe the shape of a histogram?

A

1) distinguish how many modes
2) determine whether it’s symmetrical or skewed
3) find any outliers or gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mode

A

The hump or high point in a histogram. It can be bimodal or unimodal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Uniform

A

Distribution that is basically even.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Median

A

The middle value of data. Normally used for a skewed graph with the interquartile range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Range

A

The difference between between the maximum and minimum.

16
Q

Interquartile Range

A

Difference between the first and third quartiles.

17
Q

What does the 5-Number Summary describe?

A

The minimum, Q1, the median, Q3, and the maximum

18
Q

Mean

A

Found by summing all of the data by dividing by the count

19
Q

Variance

A

The sum of squared deviations from the mean, divided by n-1

20
Q

Standard Deviation

A

Usually reported with the mean. It is the square root of the variance.

21
Q

Outlier

A

Any point more than 1.5 IQR from either end of the box in a box plot.

22
Q

Far Outlier

A

If a point is 3.0 IQR from either end of the boxplot

23
Q

Standardizing

A

Used to eliminate units. Standardized values can be compared even if the original variables had different units.

24
Q

How do you find a standardized value?

A

By subtracting the mean an dividing by the standard deviation.

25
Q

Shifting

A

Adding a constant to each data value adds the same constant to the mean, median, and quartiles. It does not change the standard deviation or IQR.

26
Q

Rescaling

A

Multiplying each data value by a constant mulitplies both the measures of position and the measures of spread.

27
Q

Parameter

A

A numerically valued attribute of a model.

28
Q

What does a z-score tell?

A

How many standard deviations a value is from the mean.

29
Q

Nearly Normal Condition

A

The model is unimodal and symmetric.

30
Q

What does the association of a plot show?

A

1) direction
2) form
3) strength

31
Q

Correlation Coefficient

A

“r” is a numerical measure of the direction and strength of a linear association.

32
Q

Lurking Variable

A

A variable other than x and y that simultaneously affects both variables, accounting for the correlation between the two.

33
Q

Predicted Value

A

“y” found for a given x-value in the data. The predicted values (x, ^y) all fit exactly on the line.

34
Q

Residuals

A

The predicts value subtracted by the original value.

35
Q

Regression to the Mean

A

Because the correlation is always less that 1.0, the predicted ^y tends to be fewer standard deviations from its mean than “x” was from its mean.

36
Q

Line of Best Fit

A

Housetop y = B(of 0) + (B(of 1) • x)
B(of 0) = y-intercept
B(of 1) = slope

37
Q

Slope

A

Given in “y-units per x-unit.” Changes of one unit in x are associated with changes of of B(of 1).