Week 9 DSE Flashcards

1
Q

What does likelihood mean?

A

Prob of observing data x if we think that y equals to some number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can bayes rule be used for ?

A

binary, discrete vairable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does fk (X) mean?

A

conditional probability mass function of X (if X discrete) for an observation from class k.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is 𝜋𝑘 ?

A

prior probability that random observation comes from class k

easily estimated under random sampling: just a fraction of the training observations that belong to class k.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the assumption under naive bayes classifier?

A

Assumes that within class k, all p predictors are independent.

ELIMINATES THE NEED FOR JOINT DISTRIBUTION

fk(x)= fk1(x1) x fk2(x2) x fk3(x3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What to do when joint distribution is multivariate normal?

A

Estimate covariance matrix (tough in high dimension)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is another option for quantitative/continuus X besides drawn from normal distribution?

A

Use non parametric density estimation

make a histogram for observation of jth predictor with each class

or use kernel density estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to estimate a qualitative X?
What odes it mean to be qualitative?

A

qualitative means x discrete

count the proportion of training observations for the j-th predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is it called when pi 1 and pi2 both equal 0.5, IF THERE ARE 2 CLASSES

A

called flat prior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What assumption does the naive bayes function in r rely on ?

A

assumption of Normal distribution for quantitative predictors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do we ask ourselves when applying statistical methods?

A

are assumptions valid?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of assuming predictors independent conditional on class?

A

mainly ensures quantitative tractability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a potential source of high variance? How does naive bayes solve it?

A

Estimating joint distribution of predictors when P gets larger becomes a very hard problem requiring a lot of data

simplifies this problem, introducing some bias, but reducing variance

Naive bayes is a relatively simple method with little tendency to overfit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When does naive bayes reduce variance drastically?

A

when N is not large relative to P

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When does naive bayes work best?(slide 29)

A

Works best when data has relatively many predictors so that reliable estimation of joint densities of predictors for each class is hard to achieve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly