stats Flashcards

1
Q

probability: permutation with repetition. def and formula

A

order does not matter. n**p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

probability: permutation without repetition. def and formula

A

If we have n items total and want to pick p in a certain order. n!/(n - p)!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

probability: combination with repetition. def and formula

A

order does not matter. (n + p - 1)! / p!(n-1)!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

probability: combination without repetition. def and formula

A

n!/((p)!(n - p)!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Type I error in hypothesis testing

A

when we reject a true null hypothesis. False positive. Caused by researchers not choosing correct significance level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Type 2 error hypothesis testing

A

when we accept a false null hypothesis. False negative.. Caused by sample size and sigma

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

probability of making type 2 error in hypothesis testing

A

beta.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

probability of rejecting a false null hypothesis (power of the test)

A

1 - beta, where beta is prob of making type 2 error. power of test can be increased by bigger sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Z-score in hypothesis testing

A

standardized variable obtained := (sample mean - pop mean) / standard_error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

critical value in hypothesis testing

A

value from z-table or t-table for given significance and sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

reject null hypothesis based on z/t score

A

if abs(score) > critical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

reject null hypothesis based on pvalue

A

pvalue < level of significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

pvalue formula and interpretation

A

p = sides * (1 - stats.t.cdf((1+confidence)/2, len(arr)-1)

pvalue is the probability that we observe our sample dataset if the null hypothesis is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

industry standard for t/zscore for rejecting

A

> 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

industry standard for sample size where t-test and z-test can be used interchangeably

A

n > 50

How well did you know this?
1
Not at all
2
3
4
5
Perfectly