Probablility Flashcards

1
Q

probability

A

provides mathematical framework for thinking about the uncertainty of future events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

compound events

A

Independence do you like conditional probability knew line random variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Probability density function

A

Cumulative density function knew line quintiles and percentiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Classical Probability distributions

A

binomial (discrete), negative binomial (discrete),Poisson (discrete), Uniform (continuous), normal/gaussian (continuous), exponential (continuous),power law (continuous)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

binomial (discrete)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

negative binomial (discrete)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Poisson (discrete)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Uniform (continuous)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

normal/gaussian (continuous)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

exponential (continuous)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

power law (continuous)

A

tbc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

statistics

A

provide a mathematical framework to collect analyse interpret and present data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

descriptive statistics

A

used to describe and summarise the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

inference statistics

A

used to make predictions by taking samples of data from a population of making generalisations about that population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Central limit theorem CLT

A

if we repeatedly take independent random samples of the size and from population under when Anna is large the distribution of the samples means it will approach a normal distribution. This allows us to make inferences from a sample about a population without needing the characteristics of the whole population. Confidence intervals hypothesis testing and P value analysis are all based on the CLT.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

law of large numbers LLN

A

if an experiment is repeated independently a large number of times the average should be close to the expected value.

17
Q

sampling

A

the process of collection and selection of data the goal is to make a statistical inference about a population from a small set of observations

18
Q

random sampling

A

Every member in your population has equal chance of being sampled

19
Q

stratified sampling

A

the population is split into groups and then members are randomly sampled from each group

20
Q

cluster sampling

A

the population is first split into groups or clusters and then some clusters of randomly selected to be in your sample

21
Q

Systematic sampling

A

every member in your population is ordered into a list you then choose a random point and then select every Kth member

22
Q

sampling errors

A

errors introduced to your data via sampling excuse in the data in some way therefore not reflective of the real world distribution. this can be combated by increasing the sample size and ensuring the sample accurately represents the entire population.

23
Q

hypothesis testing

A

a tool used to determine the probability that a given hypothesis is true

24
Q

hypothesis testing process

A

formulates the null hypothesis
choose a test statistic
calculate the P value
compare the P value to a significant value alpha

25
Q

Z-test

A

test to determine if the sample mean is the same as the population mean

26
Q

T-test (one sample )

A

test to determine if the mean of a normally distributed population is different from a hypothesise value

27
Q

T-test (two-sample )

A

tested determine if the means of two populations are significantly different from one another

28
Q

chi-square test (goodness of fit)

A

test to determine how the observed data fits some given probability distribution

29
Q

chi squared test (for independence )

A

test to determine if two categorical variables are related

30
Q

Correlation

A

describe the relationship between two variables in a context such that one variable affects another

31
Q

Pearson coefficient

A

the degree of the relationship between linear linearly related variables

32
Q

Spearman rank coefficient

A

computed on ranks and depicts monotonic relationships also known as the person correlation coefficient between rank variables

33
Q

linear algebra

A

provides a mathematical framework to operate on matrices