Python with IBM Flashcards
Coursera
What is the Machine Learning technique of Regression/Estimation?
Predicting continuous values, e.g. predicting the price of a house from its characteristics or estimating the CO2 emission from a car’s engine.
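As a rough illustration of regression, the sketch below fits a straight line to a handful of made-up engine-size and CO2 values using NumPy least squares. The numbers and the helper name `predict_co2` are assumptions for illustration, not course data.

```python
import numpy as np

# Made-up illustrative data: engine size (litres) and CO2 emission (g/km).
engine_size = np.array([1.0, 1.5, 2.0, 2.5, 3.0, 3.5])
co2 = np.array([120.0, 145.0, 170.0, 195.0, 220.0, 245.0])

# Fit co2 ~ slope * engine_size + intercept by ordinary least squares.
slope, intercept = np.polyfit(engine_size, co2, deg=1)

def predict_co2(size):
    """Return the predicted (continuous) CO2 value for an engine size."""
    return slope * size + intercept

predicted = predict_co2(2.2)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, prediction={predicted:.2f}")
```

The output is a number on a continuous scale rather than a category, which is what separates regression from classification.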
What is the Machine Learning technique of Classification?
Predicting the class or category of a case, often as a yes-or-no answer, e.g. whether a cell is benign or malignant.
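One simple way to picture classification is a k-nearest-neighbours vote. The sketch below is a minimal NumPy version under assumed, made-up cell measurements; the feature names and the function `classify` are hypothetical.

```python
import numpy as np

# Made-up training cases: [clump thickness, uniformity of cell size].
X_train = np.array(
    [[1, 1], [2, 1], [3, 2], [8, 7], [9, 8], [10, 10]], dtype=float
)
y_train = np.array(
    ["benign", "benign", "benign", "malignant", "malignant", "malignant"]
)

def classify(case, k=3):
    """Label a case by majority vote among its k nearest training cases."""
    distances = np.linalg.norm(X_train - np.asarray(case, dtype=float), axis=1)
    nearest = np.argsort(distances)[:k]
    labels, counts = np.unique(y_train[nearest], return_counts=True)
    return labels[np.argmax(counts)]

label_a = classify([2, 2])
label_b = classify([9, 9])
print(label_a, label_b)
```

The prediction is always one of the known labels, never a value in between.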
What is the Machine Learning technique of Clustering?
Finding the structure of data; summarization; grouping of similar things, e.g. finding similar patients or segmenting customers in the banking field.
What is the Machine Learning technique of Associations?
Associating frequently co-occurring items or events, e.g. grocery items that are usually bought together by a particular customer.
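The idea of co-occurrence can be sketched by counting how often each pair of items appears in the same basket. The baskets and the `min_support` threshold below are made up for illustration; real association-rule mining adds more measures than this.

```python
from collections import Counter
from itertools import combinations

# Made-up shopping baskets.
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "cereal"},
    {"bread", "butter", "cereal"},
]

# Count every unordered pair of items that shares a basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Support = fraction of baskets containing the pair (threshold is an assumption).
min_support = 0.5
frequent_pairs = {
    pair: count / len(baskets)
    for pair, count in pair_counts.items()
    if count / len(baskets) >= min_support
}
print(frequent_pairs)
```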
What is the Machine Learning technique of Anomaly detection?
Discovering abnormal and unusual cases, e.g. credit card fraud detection.
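A very small sketch of anomaly detection: flag values that sit far from the median, measured in units of the median absolute deviation (MAD). This is one simple heuristic among many, and the amounts and `threshold` are assumptions for illustration.

```python
from statistics import median

# Made-up card transaction amounts; the last one is deliberately unusual.
amounts = [20, 25, 22, 19, 24, 21, 23, 500]

center = median(amounts)
# Median absolute deviation: a spread measure that outliers barely affect.
mad = median(abs(a - center) for a in amounts)

# Flag amounts lying more than `threshold` MADs from the median.
threshold = 5
anomalies = [a for a in amounts if abs(a - center) / mad > threshold]
print(anomalies)
```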
What is the Machine Learning technique of Sequence mining?
Predicting the next events, e.g. using the click stream to predict what a user will click next.
What is the Machine Learning technique of Dimension Reduction?
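Reducing the size of the data, typically by reducing the number of features (dimensions) while keeping most of the information.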
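Principal component analysis (PCA) is one common dimension-reduction method. The sketch below implements it with a plain NumPy SVD on made-up data in which the third feature is almost a copy of the first, so two components should retain nearly all of the variance.

```python
import numpy as np

rng = np.random.default_rng(0)

# Made-up data: 100 samples, 3 features; feature 3 nearly duplicates feature 1.
x1 = rng.normal(size=100)
x2 = rng.normal(size=100)
X = np.column_stack([x1, x2, x1 + 0.01 * rng.normal(size=100)])

# PCA via SVD: center the data, then project onto the top principal directions.
X_centered = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
X_reduced = X_centered @ Vt[:2].T  # keep 2 of the 3 dimensions

# Share of the total variance carried by each component.
explained = S**2 / np.sum(S**2)
print(X_reduced.shape, explained.round(4))
```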
What is the Machine Learning technique of Recommendation systems?
Associating people’s preferences with those of others who have similar tastes, and recommending new items to them.
Parts of a Machine Learning Pipeline
Data preprocessing -> Train/test split -> Algorithm setup -> Model fitting -> Prediction -> Evaluation -> Model export
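The pipeline stages above can be sketched end to end as follows. This is a minimal illustration on made-up data using only NumPy and `pickle`; the data, the choice of linear regression, and the helper `add_intercept` are assumptions. In practice a library such as scikit-learn offers ready-made versions of these steps.

```python
import pickle
import numpy as np

rng = np.random.default_rng(42)

# Made-up data: y depends linearly on two features plus a little noise.
X = rng.uniform(0, 10, size=(200, 2))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 5.0 + rng.normal(scale=0.1, size=200)
X[::50, 0] = np.nan  # simulate a few missing measurements

# 1. Data preprocessing: drop rows that contain missing values.
complete = ~np.isnan(X).any(axis=1)
X, y = X[complete], y[complete]

# 2. Train/test split: shuffle the row indices and hold out 20% for testing.
indices = rng.permutation(len(X))
n_test = len(X) // 5
test_idx, train_idx = indices[:n_test], indices[n_test:]
X_train, X_test = X[train_idx], X[test_idx]
y_train, y_test = y[train_idx], y[test_idx]

# 3. Algorithm setup: linear regression with an intercept column.
def add_intercept(features):
    return np.column_stack([np.ones(len(features)), features])

# 4. Model fitting: least-squares solution on the training set only.
coef, *_ = np.linalg.lstsq(add_intercept(X_train), y_train, rcond=None)

# 5. Prediction on data the model has not seen.
y_pred = add_intercept(X_test) @ coef

# 6. Evaluation: mean squared error and R^2 on the test set.
mse = np.mean((y_test - y_pred) ** 2)
r2 = 1 - np.sum((y_test - y_pred) ** 2) / np.sum((y_test - y_test.mean()) ** 2)

# 7. Model export: serialize the fitted parameters, then restore them.
exported = pickle.dumps({"coef": coef})
restored = pickle.loads(exported)
print(f"mse={mse:.4f}, r2={r2:.4f}, coef={restored['coef'].round(2)}")
```

Evaluating on the held-out test rows, rather than on the training rows, gives an estimate of how the model behaves on new data.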
What are the 2 types of Supervised Learning?
Classification and Regression
What is Classification?
The process of predicting discrete class labels or categories, e.g. yes or no.
What is Regression?
The process of predicting a continuous value.
What are the Unsupervised learning techniques?
- Dimension reduction
- Density estimation
- Market basket analysis
- Clustering
What is Clustering?
Clustering is the grouping of data points or objects that are somehow similar. It is used for:
* Discovering structure
* Summarization
* Anomaly detection
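To make the grouping idea concrete, here is a minimal k-means sketch in NumPy on two made-up, well-separated blobs of points. The function names `init_centers` and `k_means`, the farthest-first initialization, and the data are assumptions for illustration; k-means is only one of several clustering algorithms.

```python
import numpy as np

rng = np.random.default_rng(1)

# Made-up, unlabeled data: two well-separated blobs of 50 points each.
group_a = rng.normal(loc=[0, 0], scale=0.5, size=(50, 2))
group_b = rng.normal(loc=[5, 5], scale=0.5, size=(50, 2))
points = np.vstack([group_a, group_b])

def init_centers(data, k):
    """Farthest-first start: begin at data[0], then add the farthest point."""
    centers = [data[0]]
    while len(centers) < k:
        dists = np.min(
            [np.linalg.norm(data - c, axis=1) for c in centers], axis=0
        )
        centers.append(data[np.argmax(dists)])
    return np.array(centers)

def k_means(data, k, n_iter=10):
    """Alternate between assigning points and moving centers to their mean."""
    centers = init_centers(data, k)
    for _ in range(n_iter):
        # Distance from every point to every center, shape (n_points, k).
        dists = np.linalg.norm(data[:, None, :] - centers[None, :, :], axis=2)
        labels = np.argmin(dists, axis=1)
        for j in range(k):
            members = data[labels == j]
            if len(members):  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return labels, centers

labels, centers = k_means(points, k=2)
print(centers.round(2))
```

No labels are supplied anywhere; the two groups emerge from the distances between the points alone.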
What is the difference between Supervised and Unsupervised Learning?
Supervised Learning
Classification: Classifies labeled data
Regression: Predicts trends using previous labeled data
Has more evaluation methods than unsupervised learning
Controlled environment
Unsupervised Learning
Clustering: Finds patterns and groupings from unlabeled data
Has fewer evaluation methods than supervised learning
Less controlled environment