3: Exploratory Data Analysis: Relationships between variables Flashcards

1
Q

scatterplot

A

shows relationship b/w two quant. variables measured on the same individuals.

values of one variable appear on horiz axis and values of other appear on vert axis. each individual in the data appears as the point in the plot fixed by the values of both variables for that individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

for scatterplot, the explanatory variable always goes on which axis?

A
explanatory variable (if there is one) is on X axis
response variable is on Y
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

two variables are __________ when above average values of one tend to accompany above average values of the other and below average values also tend to occur together

A

two variables are POSITIVELY ASSOCIATED when above average values of one tend to accompany above average values of the other and below average values also tend to occur together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

two variables are ______________ when above-average values of one tend to accompany below average values of the other and vice versa

A

two variables are NEGATIVELY ASSOCIATED when above-average values of one tend to accompany below average values of the other and vice versa

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

strength of a scatterplot relationship is shown by…

A

how closely the points follow a clear form (e.g. line)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

transformation of data

A

we replace the original values by the transformed values and then use the transformed values for our analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

correlation

A

measures the direction and strength of the linear relationship b/w 2 quant variables. r.

if we have data for x and y for n individuals… The means and std deviation of the two variables are xbar and Sx for the x-values and ybar and Ys for the y values, the correlation is:

PASTE FORMULA HERE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

correlation indicates the direction of a linear relationship by its sign. r > 0 for ____ association and r < 0 for ________ association.

A

correlation indicates the direction of a linear relationship by its sign. r > 0 for POSITIVE association and r < 0 for NEGATIVE association.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

correlation is/is not a resistant measure.

A

correlation is not a resistant measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

normal distributions

A
  1. density curve is always on or above the X axis
  2. density curve has exactly 1.0 total area beneath it
  3. normal distrib describes overall pattern of a distribution
  4. area under the curve and above (or below) any value is the relative frequency of all observations that fall in that range
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

center and spread of density curves

  1. mode
  2. median
  3. mean
A
  1. mode = peak point
  2. median = point at which half the total area is on each side
  3. mean - point at which curve would balance if solid
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In Normal Curve….
approx ____% of observations fall within 1 std. dev of mean
approx ____% of observations fall within 2 std. dev of mean
approx ____% of observations fall within 3 std. dev of mean

A
  1. 68
  2. 95
  3. 99.7
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Crosstab measures ____ and ____ variables. Rows are responsible for ___ variable and columns are ____ variable.

A

Crosstab measures CAT and CAT variables. Rows are responsible for RESPONSE variable and columns are EXPLANATORY variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In a scatterplot, the response variable = ___ axis and the explanatory variable = ___ axis

A

In a scatterplot, the response variable = Y axis

and the explanatory variable = X axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

correlation

A

standardized measure of the direction and strength of the linear relationship b/w 2 quant variables

ranges from -1.0 to 1.0

b/c r uses the standardized values of the observations, the correlation doesn’t change when we change units of measurement

correlation ignores distinction b/w response and explanatory variable

correlation r is strongly affected by a few outlying observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly