lecture 3: intro to probability and statistics Flashcards

Question 1

Q

what is statistical causality/causation

Answer

A

one thing will directly cause the other, i.e. cause and effect
note: gold standard to determine causality is specific experimental design + randomised studies

Question 2

Q

what is correlation

Answer

A

any statistical relationship whether causal or not, indicates a predictive relationship that can be exploited in practice

Question 3

Q

what is Simpson’s paradox

Answer

A

a phenomenon in which a trend appears in several groups of data but disappears/reverses when these groups are combined

Question 4

Q

what is a random variable

Answer

A

its possible values are numerical outcomes of a random phenomenon, eg. roll dice

Question 5

Q

what are the two types of random variables (RV)

Answer

A

discrete and continuous

Question 6

Q

what is a discrete RV (DRV)

Answer

A

it takes on only a countable number of values

Question 7

Q

what is the probability distribution of a DRV

Answer

A

probability mass function

Question 8

Q

what is a continuous RV (CRV)

Answer

A

it takes on an infinite number of possible values in some interval

Question 9

Q

what is the probability distribution of a CRV

Answer

A

probability density function

Question 10

Q

what are the 3 most important statistics of a RV

Answer

A

expectation, variance and standard deviation

Question 11

Q

what is the main difference in formulas between DRV and CRV

Answer

A

for CRV, the formulas are integrals

Question 12

Q

What is Bayes’ rule

Answer

A

P(y⎮X) 
P(y) P(X⎮y)
= -------------------
P(X)
P(y) P(X⎮y)
= ------------------
∑y P(y) P(X⎮y)

Question 13

Q

parameter estimation look at lecture 3 page 27 to 32

Answer

A

maximum likelihood estimation

Question 14

Q

what is parametric machine learning

Answer

A

a learning model that summarises data with a set of parameters of fixed size
involves 2 steps:
1. select a from for the function eg. normal distribution
2. learn the coefficients for the function from the training data

Question 15

Q

what is non-parametric learning

Answer

A

algorithms that do not make strong assumptions about the form of the mapping function, good when you don’t have a lot of data and no prior knowledge

lecture 3: intro to probability and statistics Flashcards

(15 cards)