Last Quiz Flashcards

1
Q

Why are continuous value functions harder?

A

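You lose the ability to track the outcome of each action individually: with continuous states or actions there are too many to tabulate, so values have to be approximated.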

2
Q

Name the three big nuisance factors

A

Random seed
Hyperparameters
Network architecture

3
Q

Explain why random seed is a nuisance factor

A

Initial policy weights and the exploration actions that follow are randomized, so some agents might “get lucky.”

4
Q

Explain why hyperparameters are a nuisance factor

A

There’s no systematic way to find them yet (e.g. learning rate, reward scaling), but they have a big effect on an algorithm’s success.

5
Q

Explain why network architecture is a nuisance factor

A

People running the same code on different network setups (e.g. layer sizes, activation functions) get different results.

6
Q

What are three ML “cheats”?

A

1) Reporting the max of many trials without the mean and standard deviation
2) Cherry-picking a favorable random seed
3) Using a small sample size
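
As a rough illustration of the honest alternative to these cheats, the sketch below runs several seeds and reports the mean and standard deviation next to the max. `run_trial` is a hypothetical stand-in for a real training run (here it just draws a noisy score), and the number of seeds is an arbitrary assumption.

```python
import random
import statistics


def run_trial(seed: int) -> float:
    """Hypothetical stand-in for one training run: returns a noisy final score."""
    rng = random.Random(seed)
    return rng.gauss(100.0, 15.0)


def summarize(scores):
    """Collect the statistics that should accompany any headline number."""
    return {
        "n": len(scores),
        "mean": statistics.mean(scores),
        "std": statistics.stdev(scores),  # sample standard deviation
        "max": max(scores),
    }


# Use a fixed, pre-declared list of seeds rather than picking the best one.
scores = [run_trial(seed) for seed in range(10)]
summary = summarize(scores)
print(summary)
```

The max alone is always at least as large as the mean, so reporting only the max overstates typical performance.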

7
Q

What did Henderson do?

A

Created a “reproducibility checklist” that ML algorithms have to pass for their results to count as statistically significant

8
Q

What’s the problem with his checklist?

A

People claim to follow it, but they don’t, and compliance is hard to check

9
Q

Why does Q-learning over-estimate?

A

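Scott’s story: continuously training on the same set creates overconfidence. Concretely, the update takes a max over noisy Q estimates, and the same estimates both pick and score the action, so estimation errors push the target upward.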
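
A small simulation can make the over-estimation concrete. In this sketch (an illustrative setup, not taken from the lecture) every action has a true value of 0, but the estimates carry zero-mean Gaussian noise. The function name and parameters are assumptions chosen for the example.

```python
import random


def mean_of_max(n_actions: int, noise_std: float, n_samples: int, seed: int = 0) -> float:
    """Average of max_a Q_est(a) when every true Q(a) is 0 and estimates are noisy."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        # Each estimate is the true value (0) plus zero-mean noise.
        estimates = [rng.gauss(0.0, noise_std) for _ in range(n_actions)]
        total += max(estimates)
    return total / n_samples


bias = mean_of_max(n_actions=5, noise_std=1.0, n_samples=10_000)
print(bias)  # clearly positive, even though every true value is 0
```

With a single action there is nothing to maximize over and the average stays near 0. The bias comes from taking the max across several noisy estimates.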

10
Q

How does ORB-SLAM work?

A

It recognizes features and uses their parallax across frames to locate itself in the world

11
Q

What is feature mapping?

A

Creating feature descriptors and comparing feature vectors to assess motion
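
The comparison step might look like the sketch below. It assumes binary descriptors (the kind ORB produces) stored as Python integers and matched by Hamming distance. The helper names and the `max_distance` threshold are hypothetical choices for illustration.

```python
def hamming(a: int, b: int) -> int:
    """Number of bits that differ between two binary descriptors."""
    return bin(a ^ b).count("1")


def match_features(desc_prev, desc_curr, max_distance=2):
    """Pair each descriptor in the previous frame with its nearest one in the current frame."""
    matches = []
    for i, d in enumerate(desc_prev):
        # Brute-force nearest neighbour under Hamming distance.
        j = min(range(len(desc_curr)), key=lambda k: hamming(d, desc_curr[k]))
        if hamming(d, desc_curr[j]) <= max_distance:
            matches.append((i, j))
    return matches


# Toy 8-bit descriptors; real descriptors are much longer.
prev = [0b10110010, 0b01001101]
curr = [0b01001111, 0b10110010]
print(match_features(prev, curr))  # [(0, 1), (1, 0)]
```

Once features are paired, the change in their image positions between frames is what gets used to assess motion.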

12
Q

What is the pinhole camera model?

A

That all rays of light reaching the image plane pass through a single point (the pinhole, or center of projection).
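
In code, the model reduces to dividing by depth. The sketch below assumes an ideal camera with focal length `f` in pixels, a principal point `(cx, cy)`, and no lens distortion. The function name and defaults are illustrative.

```python
def project(point, f, cx=0.0, cy=0.0):
    """Project a 3D point (X, Y, Z) in camera coordinates to pixel coordinates (u, v)."""
    X, Y, Z = point
    if Z <= 0:
        raise ValueError("point must be in front of the camera (Z > 0)")
    # Similar triangles through the pinhole: image position scales with 1/Z.
    u = f * X / Z + cx
    v = f * Y / Z + cy
    return u, v


print(project((1.0, 2.0, 4.0), f=100.0))  # (25.0, 50.0)
print(project((2.0, 4.0, 8.0), f=100.0))  # (25.0, 50.0), same ray through the pinhole
```

Two points on the same ray land on the same pixel, which is why a single image cannot recover depth on its own.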

13
Q

What is Goodhart’s law?

A

When a measure becomes a target, it ceases to be a good measure

14
Q

What is double Q-learning?

A

Like an actor-critic where the critic criticizes the other actor: two Q-functions are kept, and the action chosen by one is evaluated by the other.
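
A minimal tabular sketch of one update step, assuming two dictionaries of Q-values keyed by `(state, action)`. The function name, the default `alpha` and `gamma`, and the toy transition are assumptions for illustration.

```python
import random
from collections import defaultdict


def double_q_update(qa, qb, state, action, reward, next_state, actions,
                    alpha=0.1, gamma=0.99, rng=random):
    """Apply one tabular double Q-learning update in place."""
    # Flip a coin to decide which table is updated this step.
    if rng.random() < 0.5:
        select, evaluate = qa, qb
    else:
        select, evaluate = qb, qa
    # The table being updated picks the greedy next action...
    best = max(actions, key=lambda a: select[(next_state, a)])
    # ...but the other table supplies the value of that action.
    target = reward + gamma * evaluate[(next_state, best)]
    select[(state, action)] += alpha * (target - select[(state, action)])


qa = defaultdict(float)
qb = defaultdict(float)
double_q_update(qa, qb, state=0, action=1, reward=1.0, next_state=1, actions=[0, 1])
print(qa[(0, 1)], qb[(0, 1)])  # exactly one of the two tables has moved toward the reward
```

Because the table that selects the action is not the one that scores it, noise in one table is less likely to inflate the target.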
