Week 3 Flashcards
Probability sum?
Number of sequences with specific outcome / number of all possible sequences
What is probability used for ?
To quantify how likely a set of data was obtained by pure chance
What is binomial distribution?
When there is only two choices in a probability
How is binomial distribution measured ?
Using Pascal’s triangle
Counting the layers = number of tests
Adding the numbers on the layer = possible sequences
And then counting the number of sequences with wanted data
What is cumulative probability?
Probability of observing a number of successes that is less than or equal to a specific number ( value falls within a range )
This is due to the number of tests the probability of it being specific are very low/zero
Binomial distribution sum? What does each mean part mean? Example?
Bi(k | n, q) =
number of sequences with k /
number of all possible sequences
K= specific combination/result
N=amount of tests
Example = Bi( 3 | 4, 0.5 ) = 4 / 16 = 0.25
3 = heads (in H or T) 4 = number of tosses
.5 chance of getting H 16 = possible sequences
Cumulative probability sum? Which does each bit mean?
Bi ( /? ≤ k ≤ ? | n, q)
k= within certain value
n= tests done
q= probability of getting specific eg 50/50 = 0.5q
Difference between discrete and continuous (event) distribution?
Discrete events are when you can count the number of times something happened, continuous is a variable that can change such as weight
Binomial is discrete
What is probability density in continuous distribution?
As in continuous distribution a variable can change, the chance of it being specific such as exactly 60kg is 0. Therefore the probability is the area under the range given such as between 50-60kg but because it is continuous it must be referred to as probability density and not just probability
What is probability density in continuous distribution?
As in continuous distribution a variable can change, the chance of it being specific such as exactly 60kg is 0. Therefore the probability is the area under the range given such as between 50-60kg but because it is continuous it must be referred to as probability density and not just probability
What is the continuous distribution also known as and what two types of numbers can describe it?
Normal distribution - Most fundamental
It can be described using only the Mean and SD
What is two tailed probability ?
Checking how far data deviated from the mean, therefore taking cumulative probability from both ends
What is a Statistical test?
We need to assess whether a given scientific claim/hypotheses is valid or not and the probability of it being valid or due to chance.
What is a p-value?
Probabilities used to reject hypotheses. If it is very low the hypothesis is rejected, if it is higher it cannot be rejected.
What is an alpha level?
A threshold p-value made prior on whether to reject or not, it is commonly 0.05.
If it less than 0.05 it is significant and he null is rejected and an alternative is accepted, If it is above then it is accepted
What is a null hypothesis ? Example?
A hypothesis against the research question usually claiming there is no significant difference between a result and its relationship with something.
Eg the vaccine has no effect on infection rate
What are the 2 errors in stats tests? With examples?
Type 1 : False positive - Rejecting the null hypothesis when its true eg vaccine is not effective but i conclude it is
Type 2 : False negative - Don’t reject/accepting the null when its false eg vaccine is effective but i conclude it isn’t
What is the binomial test/ When is it used?
The binomial test checks whether the observed proportion of successes in your data, of two single categorical variables such as heads/tails, is significantly different from the expected (or hypothesized) proportion.
If the test result is statistically significant, it suggests that the observed proportion is likely different from the expected proportion.
What are confidence intervals?
A range of plausible values associated with confidence level, meaning you are 95% sure that the true proportion fall within this range.
What are confidence intervals?
A range of plausible values associated with confidence level, meaning you are 95% sure that the true proportion fall within this range.
How to determine whether to do a one or two tailed test?
One tailed is for a specific direction, if the result is either greater or less than hypothesised value.
If you want to test any difference with no specific direction or check for deviation used two tailed.
What is binomial probability?
Evaluating the probability of observing a specific number of successes in a fixed number of trials with a hypothesised probability of success
What is hypothesis testing?
Procedure used to evaluate claims on data
1) Make hypotheses
- Null = No difference
- Alternative = difference
2) Make significance level - probability of rejecting null if results are positive/allow for it - usually 0.05
3) Test statistics - use appropriate test, t-test, chi-squared etc. to determine how far from the null.
4) P-value - determines whether results are due too chance / inconsistent or not, then make a decision based off value.
5) Conclusion and Summary
How to calculate expected number of successes in binomial distribution ?
Number of trials x probability of success rate