Statistics 3 Flashcards

1
Q

Association

A

We speak of an association (or: correlation) between two variables if certain values of one variable tend to go with certain values of the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Dependent variable

A

In association analysis, we are interested in whether one or a set of variables help to explain or predict another variable. We call the variable we seek to explain or predict the dependent variable (or also: outcome variable, response variable, Y). We use the term dependent variable because the values of the dependent variable are hypothesized to depend on values of other variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Independent Variable

A

We call the variable(s) that are hypothesized to explain or predict the dependent variable independent variable(s) (or: treatment, stimulus, explanatory variable, X).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Three components of association

A
  • Nature of direction
  • Strength
  • Statistical significance

To establish association means that we establish all three components

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nature of direction

A

Nature of the direction of the relationship between two variables in your sample.

  • Respondents with high education are more likely to turn out in elections. Therefore, education increases electoral participation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Strength

A

Strength of the relationship between two variables in your sample

  • Respondents with high education are twice as likely to participate in elections. Therefore, education strongly increases electoral participation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Statistical methods for establishing associations

A
  • linear regression
  • Pearsons r
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Statistical significance

A

Statistical significance of the relationship in your sample

That is, how likely is it that the association you observe in a sample generalizes to the population (e.g., all UK voters)?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Linear Relationship

A
  • On a scatter plot it is the simplest way of describing a relationship between two quant variables.
  • A straight line, linear relationship.
  • Linear regression gives us the best linear association.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to do a linear function

A

𝒚=𝜶+ 𝜷𝒙

whereby:
𝑦 is the dependent variable;
𝑥 is the independent variable;
𝛼 is the intercept or constant: the value of 𝑦 when 𝑥 = 0;
𝛽 is the slope or gradient: how much 𝑦 changes when 𝑥 increases by 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What tweak must we make to the function?

A

Linear functions are deterministic, but real-world data is messy. To account for this, we say that we explain the expected value of the dependent variable: E(y).

𝑬(𝒚)=𝜶+ 𝜷𝒙

We account for variation around the regression line

Take-home point: linear regression does not make exact predictions; it predicts an average value of Y for a given X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Least squares estimation

A
  • Establishes the line of best fit
  • Least squares estimation establishes the combination of intercept and regression slope which minimize the sum of the squared residuals, often also called the sum of squared errors (SSE)

𝑆𝑆𝐸= ∑(𝑦 − 𝑦̂)𝟐

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Establishing the Nature (or: Direction) of Linear Associations

A

The sign of the slope coefficient (𝜷) tells us about the nature (direction) of linear associations

𝛽>0 = positive relationship (as X increases, so does Y)
𝛽<0 = negative relationship (as X increases, Y decreases)
𝛽=0 = independence (as X increases, Y stays the same)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can learn about the strength of linear associations by interpreting the size of the slope coefficient (𝜷) ?

A
  • 𝜷 gives us the change in Y if X increases by 1
  • Can make sense of that by putting in relation with distributions of X and Y
  • Straightforward to calculate increases (or decreases) of more/less than 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly