W3: RQ for Associations (Page 22-34) Flashcards

1
Q

What is the margin of error in confidence intervals.

Are they symmetrical?

A

Margins of Error

  • Lengths from sample estimates to lower bound and to upper bound.
  • They can be asymmetrical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a two-way contingency table

A

Two-way contingency table:

Frequency counts of belong jointly to each category of one variable and each category of a second variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does a joint cell in a contingency table contain

A

Frequency count of all participants who belong to both variables a and b

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the row and column variable on a contingency table called

A

Marginal Cells.

Note: Marginal Cells will add up to sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What can help us understand a two-way table visually

A

Mosaic Plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a Cramer’s V? Include details such as range and what does it not do?

A
  • Measure of strength of association in contingency table, where at least 1 variable has 3 + categories
  • No direction (No intrsinic ordering to 3 or more categories)
  • Range: 0 to 1
  • Both a sample statistic and population parameter (Though population paramter is unknown, and hence a CI is calculated for the populaton parameter based on the sample statistic)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Does Cramer’s V use words like significant? What are some issues raised in the lecture?

A
  • No use of words like significant
    • Because significant has a strict statistical meaning
  • No statment that says no association
    • Sample data will never tell us the true state of affairs at the population level
  • No P-value
    • Cramar’s V does not provide, CI is more informative as it contains any information in a p-value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an estimator and an estimate

A

Estimator:

  • Function applied to sample statistic to obtain an estimate for a population parameter (Estimate, not a calculation*)
    • _​_Estimate
      • Point: Sample statistic value
      • CI: Interval estimator
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How many estimators are there for the same population parameter. Why are there different estimators?

A
  • We can have different estimators for the same population parameter and obtain different values
    • Point estimate: Different
    • Interval estimate: Different
  • They have different properties and based on different assumptions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can estimators of a population parameter vary in 3 properties: What does a 95% unbiased interval estimator do?

A

Unbias

Captures the true population parameter value 95% of the time on average over the long run

  • Biasness does not depend on sample size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can estimators of a population parameter vary in 3 properties. What does a 95% consistent interval estimator do?

A

Consistency

  • Increasing closer to capturing the true population parameter value 95% of the time on average over the long run as SAMPLE SIZE increase
  • Consistency relates to sample size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can estimators of a population parameter vary in 3 properties. What does a 95% efficient interval estimator do?

A

Efficiency

Produces a more narrow confidence interval on average over the long run compared to some competing estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is the ideal interval estimator. What sometimes happens

A

Ideal Interval Estimator:

  • Unbiased and very efficient
  • Sometimes, a consistency + efficiency is preferred over unbiased + inefficient
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are examples of effect sizes? What is an effect size and how is it estimated? When is it more useful

A
  • Quantitative measure of strength of relationsip between constructed measures.
  • Estimated by sample statistics and can be applied to population parameters.
    • It is more useful when accompanied by a CI.
      • Correlations
      • Cramer’s V
      • Regression coefficients
      • Means
      • Mean differences
      • Standardised mean differences
      • R-Squared
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When both variables in a contingency table contain 2 categories, what is often used to report effect size

A

Odds Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the odds ratio and what does it do. Range? Is it SS or PP

A
  • Ratio of 2 odds formed by considering (a) odds of one category in a variable and (b) odds of one category in another variable
    • Comparision of category in one variable relative to the 2 categories in the other variable
  • Measure of strength of association between 2 variables both containing 2 categories (Only 2x2)
  • Range: 0 to infinity
    • 1 = No association
    • 0 = Reduction in odds (neg corr)
    • +infiity = Increase in odds (pos corr)
  • Sample Statistic and Population Parameter
17
Q

What are odds

A

Probability of one event occuring relative to probability of it not occuring

(p)/(1-p)

18
Q

What happens to odds ratio if category order on one variable was reversed

A

The value will just be a reciprocal

19
Q

When we have an odds ratio <1, what do we usually do

A

Reverse ordering of two categories in one of the variable in a contingency table and recalculate odds ratio (which will >1)

20
Q

How will increasing sample size affect confidence intervals

A

CI will be narrower and more precise.

21
Q

What is the difference between Cramer’s V compared to Chi-Square

A
  • Chi-square only tell us direction/significance of association (not strength)
  • Cramer’s V measures STRENGTH of association (not direction)
22
Q

How do we know which interval estimator to use?

A

Context. Need a stimulation study,

23
Q

Bias:

A

The mean of the sampling distribution is not equal to the population parameter value (irrespective of sample size)

24
Q

What do we look at Mosaic Plots

A

Relative difference in heights of boxes.

Greater difference of same colour = Indicative of an association.

25
Q

Consistency:

A

The mean of the sampling distribution gets closer to the true population parameter value as sample size gets larger.

26
Q

Efficiency:

A

The average width of a confidence interval is smaller for more efficient interval estimator (for a given sample size and irrespective of whether the interval estimator is biased and/or consistent).

27
Q

What is the limitation of a correlation matrix. What should we use instead? to get past this limitaiton

A

Might have too much detail to readily discern patterns. Use correlation plot for pattern identification instead