Common AI Workloads Flashcards by Maria A

What features and capabilities does Azure Machine Learning provide?

Automated Machine Learning

Azure Machine Learning designer

Data and compute management

Pipelines

How well did you know this?

Not at all

Perfectly

What is labelling?

The process of identifying raw data (images, text files, audio, etc.) and adding one or more meaningful and informative labels to provide context for machine learning.

How well did you know this?

Not at all

Perfectly

What is unsupervised learning?

Unsupervised learning is a subcategory of ML defined by its use of unlabelled datasets to train models that discover hidden patterns or data groupings without human intervention.

How well did you know this?

Not at all

Perfectly

What is supervised learning?

Supervised learning is a subcategory of ML defined by its use of labelled datasets to train models that classify data or predict outcomes precisely.

How well did you know this?

Not at all

Perfectly

What are two examples of supervised learning models?

Classification and Regression.

How well did you know this?

Not at all

Perfectly

What is an example of unsupervised learning models?

Clustering.

How well did you know this?

Not at all

Perfectly

What is a dataset?

A collection of data.

How well did you know this?

Not at all

Perfectly

What’s the difference between unsupervised ML labelling and supervised ML labelling?

With supervised ML, labelling is prerequisite to produce training data and each piece of data will generally be labelled by a human.

With unsupervised ML, labelling is produced by the computer and may not be human readable.

How well did you know this?

Not at all

Perfectly

What is regression?

A form of machine learning that is used to predict a numeric label based on an item’s features.

How well did you know this?

Not at all

Perfectly

What is time series forecasting?

Regression with a time-series element, that predicts numeric values at a future point in time.

How well did you know this?

Not at all

Perfectly

What is classification?

A form of machine learning that is used to predict which category, or class, an item belongs to.

How well did you know this?

Not at all

Perfectly

What is clustering?

A form of machine learning that is used to group similar items into clusters based on their features.

How well did you know this?

Not at all

Perfectly

What is a ground truth?

A properly labelled dataset used as the objective standard to train and assess a given model.

The accuracy of the trained model is dependant on the accuracy of the ground truth.

How well did you know this?

Not at all

Perfectly

What are the stages of the ML pipeline and what are they for?

Pre-processing - preparing data and feature engineering before passing the data to an ML model for training or inference.

Post processing - translating the output of a ML model back into a human readable format

Training - the process of training the model

Serving - the process of deploying the model to an endpoint to be used for inference

Inference - Invoking a ML model by sending a request and expecting back a prediction.

How well did you know this?

Not at all

Perfectly

What is data cleaning?

The process of correcting errors within a dataset.

How well did you know this?

Not at all

Perfectly

What is data reduction?

Reducing the volume of data, or applying dimensionality reductions to reduce the dimensions of inputted vectors

How well did you know this?

Not at all

Perfectly

What is feature engineering?

Study These Flashcards

Transforming data into numerical values (vectors) to be ingested by a ML model.

What is sampling?

Study These Flashcards

Balancing a dataset to be uniform across labels by adding or removing records.

How are features and labels used in ML?

Study These Flashcards

ML uses features to predict labels.

What is a training dataset?

Study These Flashcards

A training dataset is used to train an ML model.

What is a validation dataset?

Study These Flashcards

A validation dataset is used to estimate the accuracy of an ML model.

What is an algorithm in ML?

Study These Flashcards

A procedure run on data to create an ML model.

How do ML algorithms work?

Study These Flashcards

By performing pattern recognition. They learn from data or are fit on a dataset.

What are the 5 model evaluation metrics for classification?

Study These Flashcards

Accuracy

Precision

Recall

F1 Score

AUC

What does MAE measure?

The average difference between predicted values and true values. The lower this value is, the better the model is predicting.

What does RMSE measure?

The average square root of the mean squared difference between predicted values and true values. When compared to the MAE, a larger difference indicates greater variance in the individual errors.

What unit are MAE and RMSE based on?

The same unit as the label.

What is RSE?

A relative metric based on the differences between predicted values and true values.

What is RAE?

A relative metric based on the absolute differences between predicted values and true values.

What is the range of RSE and RAE?

0 to 1. The closer to 0 the metric is, the better the model is performing.

What can you use RSE and RAE for and why?

As the metrics are relative, they can be used to compare models where the labels are in different units.

What does accuracy measure?

The ratio of correct predictions (true positives + true negatives) to the total number of predictions.

What does precision measure?

The fraction of positives correctly identified. | True positives / true positives + false positives

What does recall measure?

The fraction of classified positives that were actually positives. (true positives / true positive + false positives)

What is F1 Score?

An overall metric that essentially combines precision and recall.

What is AUC?

Area under curve. It is the metric that measures the area under the ROC curve. It can be any value from 0 to 1. The larger the AUC, the better the model is performing.

What is knowledge mining?

A discipline in AI that uses a combination of intelligent services to quickly learn from vast amounts of information.

What would you use knowledge mining for?

Content research Auditing, risk, and compliance management Business process management Customer support and feedback analysis Digital asset management Contract management

What is feature selection?

The process of deciding which relevant original features to include and which irrelevant features to exclude for predictive modelling.

What is the difference between feature selection and dimensionality reduction?

In feature selection, the original features don't change. In dimensionality reduction new features are created from original features.

Common AI Workloads Flashcards

(40 cards)