Kvant flashcards lecture 2-3

1
Q

Confidence interval

A

a range in which the true parameter value is likely to be found with a certain probability (95%)
Sample mean +- margin of error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Margin of error

A

the maximum expected difference between the true parameter and the estimate.
Calculated by multiplying the Z-score by the standard error.

Provides the range within the which the true parameter is likely to lie, given a certain confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

z-score

A

Gives an idea of how far from the mean a data point is (standard deviation)

Positive z-score: Value is above the mean.
Negative z-score: Value is below the mean.
Z-score of 0: Value equals the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Chi-squared test

A

Used for categorical data to test relationships between variables

A Chi-squared test is a statistical test used to compare observed and expected frequencies in categorical data to determine whether the difference is due to random variation or a real relationship.
Two types:
- Goodness of fit test: Checks if sample data matches a population.
- Test of Independence: Checks for association between two categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Law of large numbers:

A

The Law of Large Numbers (LLN) states that as the sample size increases, the sample mean approaches the population mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Null hypothesis (H0):

A

Assumes no effect or no difference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Alternative hypothesis (H1):

A

Represents a claim contradicting H0. There is an affect or difference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Type I error

A

Rejecting the H₀ when it is actually true (false positive).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Type II error

A

Failing to reject H₀ when H₁ is true (false negative).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Significance level (α)

A

Is a measure of the strength of the evidence that must be present in the sample before rejecting the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

p-value

A

The probability of obtaining a result at least as extreme as the one observed, assuming h0 is true.

The smallest significance level at which H0 would be rejected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Critical value:

A

Defines the upper and lower bounds of a confidence interval

Example:
confidence interval: 95%
Critical Value (Two-Tailed): From the z-table, The critical values: −1.96 and +1.96.
If your z-score is outside: −1.96 and +1.96., reject H0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Z-test

A

Used when the population variance is known, and sample size is large (n > 30).
compares the sample mean to the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

T-test (student’s t-test)

A

Used when the population variance is unknown, and sample size is small (n ≤ 30).
One-sample t-test: compares sample mean to population mean
Two-sample t-test: Compares means between two groups (independent or paired).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Ekstra: Variance unknown, but sample size is large, z or t-test?

A

T-test. u may use the sample variance as an estimate of the population variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Chi-squared test

A

Used for categorical data to test relationships between variables
A Chi-squared test is a statistical test used to compare observed and expected frequencies in categorical data to determine whether the difference is due to random variation or a real relationship.

Two types:
- Goodness of fit test: Checks if sample data matches a population.
- Test of Independence: Checks for association between two categorical variables.