ML Flashcards
Artificial Intelligence
Program that can sense, reason, act & adapt
Machine Learning
Algorithms where performance improve as they re exposed to more data
Deep Learning
Multilayered neural networks learn from lots of data
Types of analytics in order of complexity
Descriptive
Predictive
Prescriptive
Ability to process large data because of ________. Using:
1.
2.
3.
Infrastructure
- CLoud Services
- GPU: Handles graphic rendering tasks
- TPU: Gives high performance & pure efficiency when running tensor flow
Types of ML (3 types)
Supervised - training data
Unsupervised - no training data
Reinforcement - interaction, +/-ve feedback
Summary of what to use:
Continuous = …
Discrete = …
…Accuracy
…Confusion Matrix
Support Vector Machines use
Discriminative classifier
Clear margin of separation between categories
Used in small clean data sets
Don’t suffer from over fitting as much as other methods
When should you not use Support Vector Machines
large data sets because required training time is higher in noisy data sets
Decision trees
Used for regression lasts - continous variable decision trees
Used for classification - Categorical variable decision trees
Why use them?
→ involves stratifying or segmenting the predictor space spare into # of regions
→ good for non-linear data.
Random Forests
Belong in the general category of Ensemble Methods
Increases predictive accuracy but sometimes at the expense of explainability
Ways to increase accuracy
Bagging:
Learns from each other independently in parallel & combines them for determining model average
Boosting:
Learns sequentially & adaptively to increase model predictions of learning algorithm
Pruning
Decreases size of decision trees by removing sections of the tree that are non-critical & redundant to classify instances.
Decreases of the final classifier .’. increase accuracy by decrease of overfitting
Why use Unsupervised Learning
Easier to obtain unlabelled data
Takes place in real time
Decrease complexity in comparison to supervised
finds all kinds of unknown patterns in data
K-means cluster analysis
Help with data-driven insights
Deep domain knowledge in required
No right or wrong matter for interpretation