WEEK 2: CONFIDENCE INTERVAL, CORRELATION AND REGRESSION Flashcards

1
Q

What is a point estimator?

A

It is a single number that estimate plausible value of a population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are characteristics of a good point estimator?

A

*Unbiased
*Consistent
*Precise ( small standard error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Outline examples of point estimators?
What parameters they measure?

A

*Sample variance- Population variance
*Sample standard deviation- Population standard deviation
*Sample mean- Population mean
*Sample mode- Population mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an interval estimator?

A

A pair of point estimators between which the population parameter of interest resides in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a characteristic of a good point interval?

A

*Short
*Contain population parameter (ΞΈ) with high probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a significance level of an interval estimator (a) ?

A

It is a value that is used to determine CI with probability 1-a

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

State the 4 factors that affect the range of Confidence interval range

A

*Confidence level ( 80,90,95,99 probability)
*Number of samples
*Standard deviation of sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the relationship between confidence interval range and width and confidence level?

A

As the confidence level increase, the confidence interval range and width increase.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

State the assumptions for the Confidence interval of the mean

A
  • samples were randomly selected
    *The underlying distribution is Gaussian distribution
    *Samples were independently selected from the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the formula for calculating CI for 95%?

A

To calculate 95% confidence interval:

πœƒΒ± 1.96 βˆ— (π‘ π‘‘π‘Žπ‘›π‘‘π‘Žπ‘Ÿπ‘‘ π‘’π‘Ÿπ‘Ÿπ‘œπ‘Ÿ)

Standard error= standard deviation/ square root of the sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In a random sample of 36 MBBS1 year 1 students, the mean height was 168cm with a standard deviation of 14cm.

Determine a 95% confidence interval for the mean height of all University of Botswana students.

A

πœƒ=168cm, SD=14, sample size (n) =36

95% CI = (168 Β±1.96*standard error)

π‘ π‘‘π‘Žπ‘›π‘‘π‘Žπ‘Ÿπ‘‘ π‘’π‘Ÿπ‘Ÿπ‘œπ‘Ÿ =14/ square root of 36 = 2.33
∴ 95% 𝐢𝐼 = 168 Β± 1.96 βˆ— 2.33
∴ 95% 𝐢𝐼 = 168 Β± 4.57 π‘œπ‘Ÿ [163.43, 172.57]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In a random sample of 100 patients at Princess Marina Hospital (PMH), 11% of the patients are allergic to penicillin.

Determine a 95% confidence interval for the proportion of patients at PMH that are allergic to penicillin.

A

ΖΈ=11% or 0.11, sample size (n) =100
95% CI for proportion = (ΖΈ Β±1.96*standard error for Proportions (SEP))

π‘€β„Žπ‘’π‘Ÿπ‘’ 𝑆𝐸𝑃 =π‘βˆ—(1βˆ’π‘) 𝑛
=0.11βˆ— (1βˆ’0.11)100
= 0.031289

∴ 95% 𝐢𝐼 = 0.11 Β± 1.96 βˆ— 0.031289
∴ 95% 𝐢𝐼 = 0.11 Β± 0.061 π‘œπ‘Ÿ [0.0487, 0.1713]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is standard error?

A

A value indicating error when a sample mean is used as a point estimator of the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

State the formula for calculating standard error

A

π‘ π‘‘π‘Žπ‘›π‘‘π‘Žπ‘Ÿπ‘‘ π‘’π‘Ÿπ‘Ÿπ‘œπ‘Ÿ =π‘ π‘‘π‘Žπ‘›π‘‘π‘Žπ‘Ÿπ‘‘ π‘‘π‘’π‘£π‘–π‘Žπ‘‘π‘–π‘œπ‘›/ π‘ π‘Žπ‘šπ‘π‘™π‘’ 𝑠𝑖𝑧𝑒 square root

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is association between variables?

A

The extent to which any two variables are related or correlated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Correlation?

A

Describes the strength of linear association of variables

17
Q

What do the following signs for correlation mean?
+, -, 0

A

+: perfect positive correlation
-: reverse relationship/ negative association
0: no linear association

18
Q

How to determine whether the correlation is statistically significant if you have r- value without p-value or CI?

A

Use rule of thumb: the modulus of r must be greater than 2 divided by square root of sample size

r> 2/√sample size OR r<- 2/√ sample size

19
Q

Pearson’s correlation coefficient

A

Uses linear relationship

20
Q

Spearmen’s correlation coefficient

A

*Tests monotonic relationship
*Uses non- parametric computation of Pearson’s correlation coefficient that uses ranges to determine the degree of association between variables

21
Q

When to use spearmen’s correlation coefficient?

A
  • If data seems to exhibit non-linear relationship
    *If at least one variable is ordinal
    *If data has outliers
22
Q

How to determine causal relationship between variables?

OR

How would you show that changes in one variable would cause changes in another variable?

A

Use linear regression

Y=mx+c

23
Q

What is the difference between correlation and regression?

A

Correlation: describes the strength of linear association between variables
Regression: expresses the relationship between variables in the form of an equation