Intro To Machine Learning Flashcards
What is machine learning?
A field of study that gives the computer the ability to learn from data without being explicitly programmed
What is Scikit-learn?
General purpose machine learning library. This is probably the most widely used machine learning library
What is XGboost
Library primarily used as a gradient boosting machine learning algorithm framework ( advanced machine learning algorithm). This is a commonly used Library in machine learning and industry
What is LightGBM?
Another library primarily used as a gradient machine learning algorithm, exactly like XGBoost
What is NLTK( natural language toolkit)?
Among other things, this is a toolkit that helps in understanding text and enhancing some machine learning models
What is TensorFlow?
Deep learning framework that easily allows for complete customization of deep learning algorithm, butbhas a massive learning curve. This framework can also do traditional machine learning, but it requires a lot of knowledge of how individual machine learning algorithms work
What is PyTorch?
Deep learning framework that feels familiar to most Python developers. It can act as a replacement for NumPy. Since PyTorch has a very similar interface to NumPy, Python developers can migrate to it relatively easily. This framework can also do traditional machine learning, but it requires a lot of knowledge of how individual machine learning algorithms work, though less than tensorflow
What is Keras?
Deep learning framework. You can think of it as a high level wrapper of many different deep learning frameworks similar to how seaborn is a wrapper of matplotlib
What are the two broad types of machine learning?
Supervised Learning and unsupervised learning
What is supervised learning?
The most common form of machine learning is supervised learning. In sickit-learn, a supervised learning algorithm learns the relationship between your features matrix and your target vector to make predictions
What’s a features matrix and target vector?
A features is just one property of the data that is represented as a column. A target is the column of the dataset you want to make predictions for
What is Regression?
Predict a continuous value. This is considered a regression problem. This means that your target vector contains continuous quantities like home prices
What is Classification?
Predict a categorical value. This is considered a classification problem. This means your target vector contains categorical quantities like different flower species
What is unsupervised learning?
In machine learning, you aren’t always trying ti predict a value. Sometimes your goal is to find some structure in your dataset. Unsupervised learning is when you train an algorithm without giving it answers for example in your dataset. No target vector
What is Target (y)?
The target is the column we are trying to predict. In this case the “charges” column is the target.