AI Flashcards

Question 1

Q

What is an “environment”?

Answer

A

An environment is where an AI performs its tasks, for example a maze.

Question 2

Q

What is the Bellman Equation?

Answer

A

It writes the value of a decision problem at a certain point in time in terms of the payoff from some initial choices and the value of the remaining decision problem that results from those initial choices.

The Bellman equation assigns a value to each state: states closest to the best outcome get the highest values, and states closest to the worst outcome get the lowest.
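
As a rough sketch of this idea, the deterministic form of the Bellman equation, V(s) = max over actions of [R(s, a) + gamma * V(s')], can be applied repeatedly until the values settle. The four-cell corridor, the reward of +1 at the goal, and the discount factor `GAMMA` are all assumptions made for illustration and do not come from the card.

```python
# Hypothetical corridor: cells 0..3, cell 3 is the goal (terminal).
# Reaching the goal pays +1; every other move pays 0.
GAMMA = 0.9  # assumed discount factor
N_CELLS = 4
GOAL = 3

def next_state(state, action):
    """Deterministic move: action is -1 (left) or +1 (right), clipped to the corridor."""
    return min(max(state + action, 0), N_CELLS - 1)

def reward(state, action):
    """+1 for stepping onto the goal, 0 otherwise."""
    return 1.0 if next_state(state, action) == GOAL else 0.0

def bellman_values(sweeps=50):
    """Repeatedly apply V(s) = max_a [R(s, a) + GAMMA * V(s')]."""
    values = [0.0] * N_CELLS  # the terminal cell keeps value 0
    for _ in range(sweeps):
        for s in range(N_CELLS):
            if s == GOAL:
                continue
            values[s] = max(
                reward(s, a) + GAMMA * values[next_state(s, a)]
                for a in (-1, +1)
            )
    return values

values = bellman_values()
print([round(v, 2) for v in values])  # [0.81, 0.9, 1.0, 0.0]
```

Cells closer to the goal end up with higher values, which is what lets the agent tell good positions from bad ones.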

Question 3

Q

What is a Markov Decision Process (MDP)?

Answer

A

It is a process in which outcomes are partly random and partly under the control of the decision maker. It provides a mathematical framework for modeling decision making.
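
To illustrate the "partly random" part, the value of an action in an MDP can be computed as a probability-weighted sum over its possible outcomes. This is only a sketch: the probabilities, rewards, next-state values, and the discount factor `GAMMA` below are made-up numbers, and `outcomes_up` is a hypothetical name.

```python
GAMMA = 0.9  # assumed discount factor

# Hypothetical outcomes of choosing "up" in one state:
# (probability, immediate reward, value of the state the agent lands in).
outcomes_up = [
    (0.8, 0.0, 1.0),   # moves up as intended
    (0.1, 0.0, 0.5),   # slips to the left
    (0.1, -1.0, 0.0),  # slips to the right into a penalty cell
]

def expected_action_value(outcomes, gamma=GAMMA):
    """Sum of probability * (reward + gamma * next-state value) over all outcomes."""
    return sum(p * (r + gamma * v_next) for p, r, v_next in outcomes)

q_up = expected_action_value(outcomes_up)
print(round(q_up, 3))  # 0.665
```

The decision maker controls which action is chosen; the probabilities describe the part that is left to chance.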

Question 4

Q

What is an “agent”?

Answer

A

An agent is the artificial intelligence that performs actions inside the environment. It learns from the feedback the environment gives it.
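
The interaction between agent and environment can be sketched as a loop: the agent picks an action, the environment returns a new state and a reward, and the agent records that feedback. The classes `CorridorEnvironment` and `RandomAgent` are hypothetical and exist only for this illustration; this agent acts randomly and does not learn yet.

```python
import random

class CorridorEnvironment:
    """Hypothetical environment: cells 0..length-1, with the goal at the right end."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        """Apply action (-1 or +1) and return (next_state, reward, done)."""
        self.state = min(max(self.state + action, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done

class RandomAgent:
    """Placeholder agent: acts at random and only accumulates the rewards it sees."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.total_reward = 0.0

    def act(self, state):
        return self.rng.choice((-1, +1))

    def observe(self, reward):
        self.total_reward += reward

def run_episode(env, agent, max_steps=1000):
    """Run one episode and return the number of steps taken."""
    state = env.reset()
    for step_count in range(1, max_steps + 1):
        action = agent.act(state)
        state, reward, done = env.step(action)
        agent.observe(reward)  # feedback from the environment
        if done:
            return step_count
    return max_steps

env = CorridorEnvironment()
agent = RandomAgent()
steps = run_episode(env, agent)
print(steps, agent.total_reward)
```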

Question 5

Q

What is a plan?

Answer

A

A plan is like a treasure map for the AI. It indicates which direction the agent should move in from each position.
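
One simple way to picture a plan is as a lookup table from position to direction. The 2x3 grid, the arrows in `plan`, and the helper `follow_plan` are hypothetical and only serve as an illustration.

```python
# Hypothetical plan for a 2x3 grid: each non-goal cell (row, col) maps to a direction.
plan = {
    (0, 0): "right", (0, 1): "right", (0, 2): "down",
    (1, 0): "right", (1, 1): "right",
}
GOAL = (1, 2)
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

def follow_plan(start, plan, goal, max_steps=20):
    """Follow the arrows from start until the goal is reached; return the visited cells."""
    path = [start]
    cell = start
    while cell != goal and len(path) <= max_steps:
        d_row, d_col = MOVES[plan[cell]]
        cell = (cell[0] + d_row, cell[1] + d_col)
        path.append(cell)
    return path

path = follow_plan((0, 0), plan, GOAL)
print(path)  # [(0, 0), (0, 1), (0, 2), (1, 2)]
```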

Question 6

Q

What is deterministic search?

Answer

A

In a deterministic search, the outcome of each action is fixed in advance: once the agent chooses an action, there is no randomness in what happens.

Question 7

Q

What is deterministic search?

Answer

A

In a deterministic search, the outcome of each action is fixed in advance.
This means that if the agent chooses an action, it will perform that action 100% of the time.

Question 8

Q

What is non-deterministic search?

Answer

A

This is when the environment mimics a real-world application. If the agent chooses an action, random variables are in play, so the agent may end up doing something other than what it intended.
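
The difference between the deterministic and the non-deterministic case can be sketched with a single slip probability. The slip model below (veering to a perpendicular direction) and the value 0.2 are assumptions for illustration, not something the cards specify.

```python
import random

# Assumed slip model: the intended move happens with probability 1 - slip,
# otherwise the agent veers to one of the two perpendicular directions.
PERPENDICULAR = {
    "up": ("left", "right"),
    "down": ("left", "right"),
    "left": ("up", "down"),
    "right": ("up", "down"),
}

def actual_move(intended, slip, rng):
    """Return the direction the agent really moves in."""
    if rng.random() < slip:
        return rng.choice(PERPENDICULAR[intended])
    return intended

rng = random.Random(42)
deterministic = [actual_move("up", 0.0, rng) for _ in range(1000)]
stochastic = [actual_move("up", 0.2, rng) for _ in range(1000)]

print(deterministic.count("up") / 1000)  # 1.0: with slip 0 the agent always goes up
print(stochastic.count("up") / 1000)     # roughly 0.8
```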

Question 10

Q

What is the Markov property?

Answer

A

It is when the future state depends only on the state you are in now, not on the states that came before it.

Question 11

Q

What does stochastic mean?

Answer

A

It means that there is some randomness.

Question 12

Q

What is a “living penalty”?

Answer

A

A living penalty is a small reward the agent receives for every action it takes on the way to the goal. It is called a “living penalty” because the reward is given as a negative number rather than a positive one. The incentive is then to complete the goal as fast as possible.
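
A small calculation shows why a living penalty rewards speed. The penalty of -0.04 per step and the goal reward of +1 are assumed values chosen for illustration.

```python
LIVING_PENALTY = -0.04  # assumed reward for every step that does not reach the goal
GOAL_REWARD = 1.0       # assumed reward for the step that reaches the goal

def episode_return(steps_to_goal):
    """Total (undiscounted) reward for an episode that reaches the goal on its last step."""
    return LIVING_PENALTY * (steps_to_goal - 1) + GOAL_REWARD

short_path = episode_return(4)
long_path = episode_return(10)
print(round(short_path, 2), round(long_path, 2))  # 0.88 0.64
```

The shorter path collects fewer penalties, so it ends with the higher total reward.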

Question 13

Q

What is the intuition behind Q-learning?

Answer

A

Q-learning is a reinforcement learning technique used in machine learning. The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances. It does not require a model of the environment and can handle problems with stochastic transitions and rewards without requiring adaptations.
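
A minimal tabular sketch of Q-learning is shown here. The corridor environment, the hyperparameters `ALPHA`, `GAMMA`, and `EPSILON`, and the epsilon-greedy exploration are assumptions for illustration. Note that the agent never sees the rules inside `step`; it only observes the states and rewards that come back.

```python
import random

N_STATES, GOAL = 5, 4                  # hypothetical corridor, goal at the right end
ACTIONS = (-1, +1)                     # left, right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2  # assumed learning rate, discount, exploration rate

def step(state, action):
    """Environment: return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if next_state == GOAL else 0.0
    return next_state, reward, next_state == GOAL

def greedy(q, state, rng):
    """Best known action in this state, breaking ties at random."""
    best = max(q[(state, a)] for a in ACTIONS)
    return rng.choice([a for a in ACTIONS if q[(state, a)] == best])

def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore sometimes, otherwise exploit the current estimates.
            if rng.random() < EPSILON:
                action = rng.choice(ACTIONS)
            else:
                action = greedy(q, state, rng)
            next_state, reward, done = step(state, action)
            best_next = 0.0 if done else max(q[(next_state, a)] for a in ACTIONS)
            # Q-learning update: move the estimate toward reward + discounted best next value.
            q[(state, action)] += ALPHA * (reward + GAMMA * best_next - q[(state, action)])
            state = next_state
    return q

q = train()
# Learned policy: the highest-valued action in each non-goal cell.
policy = [max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(GOAL)]
print(policy)  # expected to prefer +1 (right) in every cell
```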

Question 14

Q

How is Q different from V?

Answer

A

With Q, we look at the value of each action in a state, whereas V is the value of the state itself.
This means Q lets the agent compare which action is more lucrative.
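
The link between the two can be sketched in a few lines: once every action in a state has a Q-value, the state's value is the value of its best action. The numbers in `q_values` are made up for illustration.

```python
# Hypothetical Q-values for one state: action -> estimated value.
q_values = {"up": 0.62, "down": 0.10, "left": 0.35, "right": 0.71}

# V(s) is the value of the best action available in s.
state_value = max(q_values.values())
best_action = max(q_values, key=q_values.get)
print(best_action, state_value)  # right 0.71
```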

Question 15

Q

Why do we use the letter Q?

Answer

A

Probably because it stands for “quality”: Q measures the quality of an action.

Question 16

Q

What is temporal difference?

Answer

A

Temporal difference (TD) learning is an approach to learning how to predict a quantity that depends on future values of a given signal.
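
In Q-learning, one common form of the temporal difference is the gap between the new estimate (reward plus discounted best next value) and the old estimate. The function `td_update` and the default values for `alpha` and `gamma` are assumptions for this sketch.

```python
def td_update(q_old, reward, best_next_q, alpha=0.1, gamma=0.9):
    """Return (new_estimate, td_error) for one observed transition."""
    # TD error: what we now think the action is worth, minus what we thought before.
    td_error = reward + gamma * best_next_q - q_old
    # Move the old estimate a fraction alpha of the way toward the new one.
    return q_old + alpha * td_error, td_error

new_q, error = td_update(q_old=0.5, reward=0.0, best_next_q=1.0)
print(round(error, 3), round(new_q, 3))  # 0.4 0.54
```

A positive error means the action turned out better than predicted, so its estimate is nudged upward; a negative error nudges it downward.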