Binomial distribution and Hypothesis testing (wk3 3) Flashcards

1
Q

What is Pascals’s triangle?

A

-A tree of numbers which help to identify the probability of situations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a decision tree?

A

Multiple coin toss can be visualised as a connection of branches, which forms a decision tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a binomial distribution (with a coin toss)?

A

-You can calculate the probability of getting heads from tosses. If you write down the probabilities in each node, you get a probability distribution. For this specific case where there are always two choices, this distribution is called the binomial distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is cumulative probability?

A

-When the number of coin tosses become high, it does not make much sense of using the probability of getting the exact number of heads
-Therefore, it can make much more sense to use the probability that the value falls in a certain range, and this is called the cumulative probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is two-tailed? (cumulative probability)

A

Taking the cumulative probability at both ends

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a discrete distribution (through a coin toss)?

A

The coin tossing is a discrete event i.e. you can count how many times something happened. Binomial distribution is a discrete distribution (i.e. distribution is a bunch of numbers located at each count

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a continuous distribution (through a coin toss)?

A

You need a continuous distribution to describe a distribution of a continuous variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a normal distribution?

A
  • Binomial distributions have specific shapes, which becomes clearer as the number of tosses increases
  • The shape is a continuous distribution, called normal distribution
  • Mean and SD describe normal distributions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a binomial test?

A

Testing the statistical significance of deviations from a theoretically expected probability of a binary event. Null hypothesis is that the observed data occurred by the expected probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Name and describe the 3 types of binomial test:

A
  • Observed proportion < Expected proportion. Cumulative probability from 0 to observed
  • Observed proportion > Expected proportion. Cumulative probability from observed to max.
  • Observed proportion (not equal to) Expected proportion. Two-tailed cumulative probability and Same distance from the mean.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a confidence interval?

A

Confidence interval -> A range of plausible values associated with a confidence level (usually 95%). In binomial test, this means that you are 95% certain that the true proportion falls within the CI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a hypothesis test?

A

-In statistics, we test the probability that null hypothesis is true (i.e. there is no effect/ relationship)
-You can never prove something is true but can prove something is false
-It is difficult to make a precise research hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a null hypothesis?

A

A hypothesis against the research question, claiming that there is no difference in the result and the only difference observed are just noise/error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a research/ alternative hypothesis?

A

Research/alternative hypothesis -> Opposite to the null hypothesis claiming that there is a difference in the result.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the process with accepting/ rejecting null hypothesis?

A

Most of analyses in experimental science are focused on testing whether the obtained data can reject the null hypothesis. The significancy is estimated by the probability that the difference occurred by chance, which is called the p-value. The threshold/ alpha level for the significancy is p<0.05.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a p-value and confidence interval?

A

-P-Values -> If the probability is very low we can reject our hypothesis. If the probability is not so low, we cannot reject our hypothesis. These probabilities used to reject hypotheses are called p-values.
P-values -> A probability that your hypothesis is right. A threshold level for the p-value, called alpha-level needs to be defined prior to the analysis. A usual choice is 0.05, i.e.
* If p-value is <0.05 you reject the hypothesis
* If p-value is >0.05 you accept the hypothesis

17
Q

What is an error in stats tests?

A

Since we are dealing with probability, potential error in the test always exist.

18
Q

What is a type-1 and type-2 error?

A
  1. Type 1 (false-positive) -> The vaccine is not effective but you can conclude it is effective (reject the null hypothesis when it is true)
  2. Type 2 (false-negative) -> The vaccine is effective but you can conclude it is not effective (not to reject the null hypothesis when it is false).