AI Flashcards

1
Q

What is an “environment”?

A

An enviroment is where a AI will be performing it’s tasks. Like a maze

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the Bellman Equation?

A

It writes the value of a decision problem at a certain point in time in terms of the payoff from some initial choices and the value of the remaining decision problem that results from those initial choices

The bellman equation assigns points with the best outcome has the highest points and worst outcome has the lovest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the Markov Decision Process (MDP) ?

A

It is the process where outcomes are partly random and partly under the control of the decision maker. It provieds a mathematical framwork for decision making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an “Agent?”

A

an agent is our Artificial intelligence that will perform the actions inside the environment. It will learn from the feedback of the environment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an plan

A

a plan is like a treasure map for AI. This indicate what direction the agent should proceed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is deterministic search?

A

A deterministic search gives a pre set of probability that the agent will perform a certain action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is deterministic search?

A

A deterministic search gives a pre set of probability that the agent will perform a certain action.
This means that if the agent choses an action then it will perform that action 100% of the time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is non-deterministic search?

A

Thisis when we have an environment that mimics a real world application. This means that if the agent chooses an action that there are more random variables in play.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is non-deterministic search?

A

Thisis when we have an environment that mimics a real world application. This means that if the agent chooses an action that there are more random variables in play.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the Markov property?

A

It is when the future state only depends on the state you are in now. Not the states that where before the present state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does stochastic mean?

A

It means that there is some randomness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is “Living penalty”?

A

Living penalty is when the agent is getting rewards while performing actions that will take it closer to the goal. It is called “living penalty” beacuse the reward is given as a negative number, rather than a positive. The incentive is then to complete the goal as fast as possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Q-learning intuition?

A

Q-learning is a reinforcement learning technique used in machine learning. The goal of Q-Learning is to learn a policy, which tells an agent what action to take under what circumstances. It does not require a model of the environment and can handle problems with stochastic transitions and rewards, without requiring adaptations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is Q different from V ?

A

We are looking for the value of each action rather than the value of each state.
Meaning it will look at what action is more lucrative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why do we use the letter Q?

A

Probably beacuse the word Quality.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is temporal difference?

A

Temporal difference (TD) learning is an approach to learning how to predict a quantity that depends on future values of a given signal.