Google ML Engineer Flashcards
What kinds of problem benefit from ML
- Identification (Classification)
- Prediction
- Grouping (Clustering)
What problem does ML Solve
- Identification (Classification)
- Prediction
- Grouping (Clustering)
What are technical success metrics
- Accuracy (Positive match/total)
- Precision (True Positive match / (False positive + True positive)
- Recall (True Positive match / (True positive + False negative)
What are different types of model
- Decision Tree
- Deep Learning
- Regression
What are the steps in ML ops pipeline?
- Data extraction
- Data validation
- Data preparation
- Model training
- Model evaluation
- Model validation
Which of the following comes first in a machine-learning pipeline?
- Model evalution
- Data extraction
- Data preparation
- Model training
Data extraction
Which of the following is not a kind of data preparation task?
- Addressing missing data
- Removing unwanted data
- Integrating data
- Evaluating model performance
Evaluating model performance
Which one isn’t the target feature of the label dataset
- Attributes of instances
- Structured tables
- Images
- Random data
Random data
What are target values of label dataset
- Class or category
- Value to predict
What are the classification algorithms
- Logistic Regression
- Decision Trees (End of decision trees are classifications), e.g. Random trees
- Naive Bayes (It uses statistic and probability, it use to collate and coappear for classification)
- Neural Networks (Deep learning network)
- Nearest Neighbor (How near or far points are in space for classification)
- Ensemble methods (These are different methods)
Is logistic regression a classification algorithm or regression algorithm
Classification algorithm
Give a example of decision tree algorithm
Random forest
Which algorithm is based on How near or far points are in space and what type of algorithm it is?
- Nearest Neighbor
- Classification algorithm
Which are the three regression algorithm
- Linear Regression (Learn formula from series of values which represent the strait line and predict based on formula)
- Decision Tree Regression (Using the structure of decision trees)
- Polynomial Regression (Like to learn a function, but instead of the straight line, it could be a curve)
You have data on the price of cars sold over the past two years. You have data on the sale price, age of the car, mileage, interior features, gas mileage, and several other features. You want to use this data to predict the sale prices of other cars. What kind of ML problem is this?
- Classification
- Regression
- Reinforcement learning
- Unsupervised learning
This is a regression problem because we are trying to predict a continuous value.