Week 5 Flashcards
What is supervised learning in machine learning?
Supervised learning is a machine learning technique where models learn from labeled data (training set) and apply this knowledge to new, unseen data (test set) with the goal of function approximation and classification.
What types of methods are commonly used in supervised learning?
Common methods in supervised learning include linear regression, logistic regression, and Support Vector Machine (SVM).
What is unsupervised learning in machine learning?
Unsupervised learning involves learning from unlabeled data to find hidden structures within the dataset.
What is the goal of unsupervised learning?
The goal of unsupervised learning is data description and pattern recognition.
What methods are utilized in unsupervised learning?
Methods used in unsupervised learning include K-Means clustering, neural networks, and principal component analysis (PCA).
How does feedback differ between supervised and unsupervised learning?
In supervised learning, instructive feedback is used to guide the learning process, whereas unsupervised learning typically does not use feedback.
What does reinforcement learning focus on?
Reinforcement learning focuses on the interaction between an agent and its environment.
How does an agent interact with the environment in RL?
The agent selects actions, and the environment provides evaluative feedback based on those actions.
What is the sequence of events in a reinforcement learning process?
The sequence in RL involves state, action, reward, new state, and so on, as exemplified by the SARSA algorithm.
What does SARSA stand for in reinforcement learning?
SARSA stands for State-Action-Reward-State-Action, which is a sequence that describes how an agent learns from the consequences of its actions in an environment.
What is the Markov property in the context of Markov Decision Processes?
The Markov property states that the conditional probability distribution for the system at the next time step depends only on the current state, not on the sequence of events that preceded it.
How do the effects of actions in a Markov Decision Process relate to the state and history?
In an MDP, the effects of actions depend only on the current state of the system and not on the prior history of how the agent arrived at that state.
What do deterministic transitions mean in a Markov Decision Process?
Deterministic transitions in an MDP mean that the next state of the system is determined with certainty given the current state and action taken.
What are stochastic transitions in a Markov Decision Process?
Stochastic transitions in an MDP involve probability, indicating that the next state is not determined with certainty and can be one of several possible states.
what are the elements of the environment en agent of reinforced learning?
What is the Law of Effect according to Thorndike (1944)?
The Law of Effect states that actions followed by satisfaction will be strengthened, while those that produce discomfort will be weakened.
What is learning defined as in the provided material?
Learning is defined as updating your expectations based on new information or experiences.
What happens if you can perfectly predict the future according to the learning model?
If the actual outcome matches the expected outcome, no adjustment of expectations is necessary.
How should expectations be adjusted when predictions are not perfect?
When the actual outcome does not match the expected outcome (prediction error exists), expectations should be adjusted accordingly to improve future predictions.
explain why the right calculations are more efficient.
the left calculation requires more memory capacity over time, while the right calculation is minimal and constant.
why would you use **weighted-average ** (learning rate) instead of a normal average?