10 ML Algorithms Flashcards
Example of Ensemble ML Methods?
- Bayesian averaging
- bagging
- boosting
- error-correcting output coding
What are the main clustering methodologies?
- Centroid-based algorithms
- Connectivity-based algorithms
- Density-based algorithms
- Probabilistic
- Dimensionality Reduction
- Neural networks / Deep Learning
What is Singular Value Decomposition?
In linear algebra, SVD is a factorization of a real complex matrix. For a given m * n matrix M, there exists a decomposition such that M = UΣV, where U and V are unitary matrices and Σ is a diagonal matrix.
PCA is actually a simple application of SVD.
Top supervised ML algorithms:
1) Support Vector Machines
2) Ensemble Methods
3) Logistic Regression
4) Ordinary Least Squares Regression
5) Naïve Bayes Classification
6) Decision Trees.
Machine learning algorithms can be divided into 3 broad categories — 1, 2 ,3
supervised learning, unsupervised learning, and reinforcement learning.
What is Reinforcement learning?
Between supervised and unsupervised. There is some form of feedback available for each predictive step or action, but no precise label or error message.
What are Ensemble ML Methods?
Learning algorithms that construct a set of classifiers and then classify new data points by taking a weighted vote of their predictions
Applications of Independent Component Analysis (ICA):
- digital images,
- document databases,
- economic indicators
- psychometric measurements.
What is Principal Component Analysis?
PCA is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components.
Naïve Bayes Classification examples:
- email as spam or not spam
- news article about technology, politics, or sports
- Used for face recognition software
What is clustering?
Clustering is the task of grouping a set of objects such that objects in the same group (cluster) are more similar to each other than to those in other groups.
What is an Independent Component Analysis (ICA)?
- ICA is a statistical technique for revealing hidden factors that underlie sets of random variables, measurements, or signals.
- ICA is related to PCA, but it is a much more powerful technique that is capable of finding the underlying factors of sources when these classic methods fail completely.
What is the advantage of SVM?
- biggest problems that have been solved using SVMs
- large-scale image classification
- Used when the number of features is big
how do ensemble methods work and why are they superior to individual models?
- They average out biases
- They reduce the variance
- They are unlikely to over-fit
What is the advantage of Decision Trees?
As a method, it allows you to approach the problem in a structured and systematic way to arrive at a logical conclusion.