Hard ML Problems Flashcards

1
Q

Difficulties of clustering.

A

it is challenging to determine what action to take based on the cluster. You can try to assign a meaning to a cluster, but this can be tricky because the model might not group by criteria that you find intuitive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an alternate approach when clustering?

A

One alternative approach is to label some items before you cluster, and then try to propagate those labels across the entire cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is one option to determine what constitutes as an anomaly to get labeled data?

A

One option is to define a heuristic and use it to label anomalies. However, once you’ve defined this heuristic, you might as well use the heuristic in your production system, since an ML model can’t beat the heuristic used to train it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are correlations?

A

mutual relationships or connections between two or more things

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is causation?

A

one event or factor causing another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is causation hard to determine?

A

it is easy to see that something happened, but much harder to understand why it happened.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

You can’t determine causation from only __ __.

A

observational data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to determine causation?

A

You would need to run an experiment, comparing users who didn’t see the review with similar users who did. In general, you need to intervene in the world—run an experiment—to determine causation; you can’t see it in purely observational data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If you have no __ to train a model, then machine learning cannot help you

A

data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly