AI / ML Flashcards
____ is a broad field for the development of intelligent systems capable of performing tasks that typically require human intelligence.
AI
Identify the AI component described below:
Collect vast amount of data.
a) Data Layer
b) ML Framework or Algorithm Layer
c) Model Layer
d) Application Layer
data layer
Identify the AI component described below:
data scientists and engineer work together to understand use cases, requirements, and frameworks that can solve them
a) Data Layer
b) ML Framework or Algorithm Layer
c) Model Layer
d) Application Layer
ML framework
Identify the AI component described below:
implement a model and train it, we have the structure, the parameters and functions, optimizer function
a) Data Layer
b) ML Framework or Algorithm Layer
c) Model Layer
d) Application Layer
model layer
Identify the AI component described below:
how to serve the model and its capabilities for your users
a) Data Layer
b) ML Framework or Algorithm Layer
c) Model Layer
d) Application Layer
application layer
____ is a type of AI for building methods that allow machines to learn.
Machine Learning
With Machine Learning, ____ is leveraged to improve computer performance on a set of task.
data
With Machine Learning, you make ____ based on data used to train the model.
predictions
True/False: In machine learning, no explicit programming rules are created, data is given to the algorithm.
t
What function does the Transformer Model provide?
Ability to process a sentence as a while instead of word by word. Faster and more efficient text processing (less training time).
What is a transformer-based LLM?
powerful models that can understand and generate human-like text. Trained on vast amounts of text data from the internet, books and other sources where they learn patterns and relationships between words and phrases.
A ____ model can take in a mix of audio, image and text and output a mix of video, image and text.
multi-modal
Recap differences between AI vs ML vs DL vs Gen AI:
Sometimes we know “if this happens, then do that” (AI)
Sometimes we’ve seen a lot of similar things before, and we classify them (ML)
Sometimes we haven’t seen something before, but we have “learned” a lot of similiar concepts, so we can make a decision (DL)
Sometimes, we get creative and based on what we’ve learned, we can generate content (GenAI)
GPT (Generative Pre-trained Transformer) - generate human text or computer code based on input prompts. (for language)
BERT (Bidirectional Encoder Representations from Transformers) - similar intent to GPT, but reads the text in two directions. Makes it good for language translations. (for language)
RNN (Recurrent Neural Network) - meant for sequential data such as time-series or text, useful in speech recognition, time-series prediction.
ResNet (Residual Network) - Deep Convolutional Neural Network (CNN) used for image recognition tasks, object detection and facial recognition. (for images)
SVM (Support Vector Machine) - ML algorithm for classification and regression.
WaveNet - model to generate raw audio waveform, used in Speech Synthesis. (for audio)
GAN (Generative Adversarial Network) - models used to generate synthetic data such as images, videos or sounds that resemble the training data. (for data augmentation)
XGBoost (Extreme Gradient Boosting) - an implementation of gradient boosting
____ data includes both input features and corresponding output labels.
Labeled: for exmple an image of animals where each is labeled according to the animal type
With labeled data, you are able to use ____ learning where the model is trained to map inputs to known outputs.
supervised
____ data includes only input features without any output labels.
Unlabeled: for example an image of cats and dogs with no label identifying what is cat and what is dog.
With unlabeled data, you are able to use ____ learning where the model tries to find patterns or structures in the data.
unsupervised
____ data is organized in a structured format, often in rows and columns.
structured; can be tabular data in rows and columns, or time series data (data points collected or recorded at successive points in time)
There are other types of structured data.
____ doesn’t follow a specific structure and is often text-heavy or multimedia content.
Unstructured
Examples of unstructured ____ data can include articles, social media posts, or customer reviews.
text
Examples of unstructured ____ data is data in the form of images which can vary widely in format and content.
image
Supervised Learning needs ____ data; its very powerful, but difficult to perform on millions of datapoints.
labeled
With a Regression ML algorithm using supervised learning, we can make numerical ____ based on input data.
predictions
With a Regression ML algorithm, the output variable is ____, meaning it can take any value within a range.
continuous
A use case for ____ ML algorithm is when the goal is to predict a quanity or real value.
Regression
With a Classification ML algorithm using supervised learning, we try to predict the ____ label of input data.
categorical
With a Classification ML algorithm, the output variable is ____, meaning it falls into a specific category or class.
discrete
A use case for ____ ML algorithm are scenarios where decisions or predictions need to be make between distinct categories (fraud, image classification, etc)
Classification
Examples of the supervised learning classification algorithm can include binary, multiclass and multi-label classification. Identify each below:
a) classify emails as “spam” or “not spam”
b) classify animals in a zoo as “mammal”, “bird”, “reptile”
c) assign multiple lables to a movie, like “action” or “comedy”
a) binary classification
b) multiclass classification
c) multi-label classification
With supervised learning for a ML algorithm, typically you use 60%-80% of your dataset for ____.
training
With supervised learning for a ML algorithm, typically you use 10%-20% of your dataset for ____.
validation; used to rune model parameters and validate performance
With supervised learning for a ML algorithm, typically you use 10%-20% of your dataset for ____.
testing; used to evaluate the final model performance
____ engineering is the process of using domain knowledge to select and transform raw data into meaningful features.
Feature; an example would be converting a birth date to an age
Identify the techniques of feature engineering used with supervised learning:
Extracting useful information from raw data, such as deriving age from date of birth.
a) feature extraction
b) feature selection
c) feature transformation
feature extraction
Identify the techniques of feature engineering used with supervised learning:
Selecting a subset of relevant features, like choosing important predictors in a regression model.
a) feature extraction
b) feature selection
c) feature transformation
feature selection
Identify the techniques of feature engineering used with supervised learning:
Transforming data for better model performance, such as normalizing numerical data.
a) feature extraction
b) feature selection
c) feature transformation
feature transformation