Week 1: Data and KNN Flashcards
What is machine learning approach?
Programming an algorithm to automatically learn from data, or from experience, uncover patterns in data, building autonomous agents
What should be emphasized in machine learning?
- Predictive performance
- Scalability
- Autonomy
Why might you want to use a learning algorithm?
- Hard to code solution by hand (vision, speech)
- System needs to adapt to a changing environment (spam detection)
- Want the system to perform better than human programmers
- Privacy/ fairness (ranking search results)
How does machine learning perform compared to humans?
It may perform better or worse than humans
Define artificial intelligence
- A subfield of CS that refers to computer programs that can solve problems humans are good at
- E.g vision, natural language
Define machine learning
A subfield of AI focused on learning (tuning parameters) from data
Define neural networks
Parametric model used in ML loosely based on biological neurons
What is deep learning?
Neural networks with multiple layers
What is data science?
An emerging field which applies ml techniques to domain-specific problems
What are some machine learning domains?
- Computer vision
- Speech recognition
- Natural Language Processing
- Recommender system
- Games
Types of machine learning
- Supervised learning
- Semi-supervised learning
- Reinforcement learning
- Unsupervised learning
What is supervised learning
- They have labeled examples of the correct behavior
- Predict unknown values of the data using other known data
- Classification (is this A or B?)
- Anomaly detection (is this weird?)
- Regression (how much/ how many)
What is semi-supervised learning
Utilizes both labeled and unlabeled data
What is reinforcement learning
Learning system which interacts with the world and learns to maximize a scalar reward signal
What is unsupervised learning
- No labeled examples, instead looking for interesting patterns in the data
- Find human interpretable and previously unknown patterns that describe the unlabeled data
- Clustering (how is data organized)
- Association rule mining (are these related?)