Section 8 AI and ML Flashcards

Question 1

Q

What is Artificial Intelligence (AI)?

Answer

A

AI is a broad field focused on developing intelligent systems capable of tasks requiring human intelligence, such as perception, reasoning, learning, problem-solving, and decision-making.

Question 2

Q

What are some key use cases of AI?

Answer

A

Use cases include computer vision (self-driving cars, facial recognition), fraud detection, and intelligent document processing (IDP).

Question 3

Q

What are the main layers of an AI system?

Answer

A

Data Layer (collecting vast amounts of data)\n2. Machine Learning Framework Layer (defining ML frameworks and algorithms)\n3. Model Layer (training the AI model)\n4. Application Layer (serving the model to users)

Question 4

Q

What is Machine Learning (ML)?

Answer

A

ML is a subset of AI where machines learn from data to improve performance on tasks without explicit programming.

Question 5

Q

What are common ML tasks?

Answer

A

Regression (predicting continuous values) and classification (categorizing data points into groups).

Question 6

Q

What is the difference between AI and ML?

Answer

A

AI is a broad field that includes ML, while ML is a method within AI that enables computers to learn from data without explicit rules.

Question 7

Q

What is Deep Learning?

Answer

A

A subset of ML that uses artificial neural networks with multiple hidden layers to process complex data patterns.

Question 8

Q

How does Deep Learning work?

Answer

A

Deep learning models use layers of neurons (input, hidden, and output) to process data, learn patterns, and adjust connections to improve predictions.

Question 9

Q

What are Neural Networks?

Answer

A

Neural networks are AI models inspired by the human brain, consisting of interconnected neurons that process and learn from data.

Question 10

Q

What is Generative AI (GenAI)?

Answer

A

A subset of deep learning where models generate new content (e.g., text, images) by learning from large datasets.

Question 11

Q

What is a Foundation Model?

Answer

A

A large, pre-trained AI model that can be adapted for various tasks, such as GPT models for text generation.

Question 12

Q

What are Transformer Models?

Answer

A

A deep learning architecture that processes entire sequences efficiently, enabling advanced NLP tasks like ChatGPT.

Question 13

Q

What are Multi-Modal Models?

Answer

A

AI models that process multiple types of inputs (e.g., text, images, audio) and generate diverse outputs.

Question 14

Q

How does Generative AI differ from Traditional AI?

Answer

A

Traditional AI classifies or predicts based on existing data, while GenAI creates new content, such as text or images.

Question 15

Q

What is GPT

Answer

A

Generative Pre-trained Transformer, a model that generates human text or computer code based on input prompts.

Question 16

Q

What is BERT

Answer

A

Bidirectional Encoder Representations from Transformers, a language model that reads text in two directions, useful for translation.

Question 17

Q

What is RNN

Answer

A

Recurrent Neural Network, a neural network for processing sequential data like time series and speech recognition.

Question 18

Q

What is ResNet

Answer

A

Residual Network, a deep convolutional neural network (CNN) used for image recognition tasks like object detection and facial recognition.

Question 19

Q

What is SVM

Answer

A

Support Vector Machine, an ML algorithm used for classification and regression tasks.

Question 20

Q

What is WaveNet

Answer

A

A model used to generate raw audio waveforms, commonly used in speech synthesis.

Question 21

Q

What is GAN

Answer

A

Generative Adversarial Network, a model for generating synthetic data like images, videos, or sounds that resemble training data.

Question 22

Q

What is XGBoost

Answer

A

Extreme Gradient Boosting, an implementation of gradient boosting used for regression and classification tasks.

Question 23

Q

What is labeled data?

Answer

A

Labeled data includes both input features and output labels, allowing for supervised learning.

Question 24

Q

What is an example of labeled data?

Answer

A

Images of animals labeled as ‘dog’ or ‘cat’.

Question 25

Q

What is unlabeled data?

Answer

A

Unlabeled data has input features but no output labels, requiring unsupervised learning.

Question 26

Q

What is an example of unlabeled data?

Answer

A

A collection of images without any labels, where the algorithm must find patterns.

Question 27

Q

What is structured data?

Answer

A

Data that is organized in a structured format, often in rows and columns, such as tabular data.

Question 28

Q

What is an example of structured data?

Answer

A

A customer database with columns like Customer_ID, Name, Age, and Purchase_Amount.

Question 29

Q

What is an example of time series data?

Answer

A

Stock prices recorded over time.

Question 30

Q

What is unstructured data?

Answer

A

Data that does not follow a specific structure, often text-heavy or multimedia content.

Question 31

Q

What is an example of unstructured data?

Answer

A

Text reviews, social media posts, or images.

Question 32

Q

Why is having good training data important?

Answer

A

Poor-quality data (garbage in) leads to poor model performance (garbage out).

Question 33

Q

What is supervised learning?

Answer

A

A type of learning where an algorithm maps inputs to known outputs using labeled data.

Question 34

Q

What is unsupervised learning?

Answer

A

A type of learning where an algorithm finds patterns in unlabeled data without explicit labels.

Question 35

Q

What is supervised learning?

Answer

A

Supervised learning is a type of machine learning where a model learns to map inputs to outputs using labeled data.

Question 36

Q

Why is labeled data important in supervised learning?

Answer

A

Labeled data allows the model to learn the correct output for given inputs, making supervised learning powerful but sometimes difficult due to the challenge of obtaining large labeled datasets.

Question 37

Q

What is regression in supervised learning?

Answer

A

Regression is used to predict a continuous numeric value based on input data, such as predicting weight based on height.

Question 38

Q

What is an example of linear regression?

Answer

A

Predicting a person’s weight based on their height using a straight line that best fits the trend in the data.

Question 39

Q

What is classification in supervised learning?

Answer

A

Classification is used to predict a categorical label (e.g., classifying animals as dogs, cats, or giraffes based on height and weight).

Question 40

Q

What is the key difference between regression and classification?

Answer

A

Regression predicts a continuous value (e.g., house price), while classification predicts a category (e.g., spam or not spam).

Question 41

Q

What are examples of regression use cases?

Answer

A

Predicting house prices, stock market trends, and weather forecasting.

Question 42

Q

What are examples of classification use cases?

Answer

A

Email spam detection, image recognition, fraud detection, and medical diagnostics.

Question 43

Q

What are the three main data splits in supervised learning?

Answer

A

Training set (60-80%), Validation set (10-20%), Test set (10-20%).

Question 44

Q

Why is a validation set used?

Answer

A

To fine-tune the model and optimize hyperparameters before final testing.

Question 45

Q

What is feature engineering?

Answer

A

Feature engineering is the process of transforming raw data into meaningful features to improve model performance.

Question 46

Q

What is an example of feature engineering?

Answer

A

Converting a birthdate column into an age column to make it more useful for machine learning models.

Question 47

Q

What are the three main types of feature engineering?

Answer

A

Feature Extraction (e.g., deriving age from birthdate) 2. Feature Selection (e.g., choosing important variables) 3. Feature Transformation (e.g., normalizing data for better performance).

Question 48

Q

What is the difference between binary and multi-class classification?

Answer

A

Binary classification predicts two categories (e.g., spam or not spam), while multi-class classification predicts more than two categories (e.g., mammal, bird, reptile).

Question 49

Q

What is multi-label classification?

Answer

A

Multi-label classification allows multiple categories per input (e.g., a movie can be both ‘action’ and ‘comedy’).

Question 50

Q

What is unsupervised learning?

Answer

A

Unsupervised learning is a type of machine learning where the algorithm finds patterns and structures in unlabeled data without explicit supervision.

Question 51

Q

What are common techniques in unsupervised learning?

Answer

A

Clustering, association rule learning, and anomaly detection.

Question 52

Q

What is clustering in unsupervised learning and name any algorithm used for this purpose??

Answer

A

Clustering is the process of grouping data points based on similarities, such as customer segmentation.

One of the Algorithm Used for this is - K Means Clustering

Question 53

Q

What is an example of clustering?

Answer

A

Grouping customers based on purchasing behaviors to create targeted marketing campaigns.

Question 54

Q

What is association rule learning and name any algorithm used for this purpose?

Answer

A

Association rule learning identifies relationships between items, such as frequently bought-together products in a supermarket.

One of the Algorithm Used for this is - Apriori

Question 55

Q

What is an example of association rule learning?

Answer

A

The Apriori algorithm finds that people who buy bread often also buy butter, helping supermarkets optimize product placement.

Question 56

Q

What is anomaly detection technique and name any algorithm used for this purpose?

Answer

A

Anomaly detection is the process of identifying data points that differ significantly from normal patterns, often used in fraud detection.

One of the Algorithm Used for this is - Isolation forests, one-class SVM, and autoencoders are popular anomaly detection methods.

Question 57

Q

What is an example of anomaly detection learning?

Answer

A

It detects unusual transactions (outliers) that differ from normal patterns, helping identify potential fraud.

Question 58

Q

What is semi-supervised learning?

Answer

A

Semi-supervised learning combines a small amount of labeled data with a large amount of unlabeled data to improve learning efficiency.

Question 59

Q

What is pseudo-labeling in semi-supervised learning?

Answer

A

Pseudo-labeling is the process where a model trained on labeled data assigns labels to unlabeled data, which is then used for further training.

Question 60

Q

What is the benefit of semi-supervised learning?

Answer

A

It reduces the cost of labeling large datasets while still achieving high model accuracy by leveraging both labeled and unlabeled data.

Question 61

Q

What is self-supervised learning?

Answer

A

Self-supervised learning is a type of machine learning where a model generates its own pseudo-labels from unlabeled data to solve supervised learning tasks.

Question 62

Q

How does self-supervised learning differ from unsupervised learning?

Answer

A

Unlike unsupervised learning, self-supervised learning generates labels from the data itself, enabling it to solve tasks typically handled by supervised learning.

Question 63

Q

What is an example of self-supervised learning?

Answer

A

Language models like GPT use self-supervised learning by predicting missing words in text, learning grammar, structure, and meaning without human-labeled data.

Question 64

Q

What are pre-text tasks in self-supervised learning?

Answer

A

Pre-text tasks are simple, self-generated tasks that a model solves to learn patterns in data, such as predicting missing words or the next word in a sentence.

Answer 65

A

Examples include predicting the next word in a sentence, filling in missing words, reconstructing occluded images, or predicting future frames in a video.

Answer 66

A

A type of machine learning where an agent learns to make decisions by performing actions in an environment and maximizing cumulative reward.

Answer 67

A

The learner or decision-maker in the environment.

Answer 68

A

The external system that the agent interacts with.

Answer 69

A

Choices made by the agent, such as moving up, down, left, or right in a maze.

Answer 70

A

Feedback provided by the environment based on the agent’s actions.

Answer 71

A

-1 for a step, -10 for hitting a wall, and +100 for reaching the exit in a maze.

Answer 72

A

The current situation of the environment, which the agent observes before taking an action.

Answer 73

A

The strategy used by the agent to determine actions based on the current state.

Answer 74

A

Through many simulations, the agent learns from past mistakes and updates its policy to maximize cumulative rewards.

Answer 75

A

Observe the state, choose an action, transition to a new state, receive a reward, and update the policy.

Answer 76

A

Gaming (chess, Go), robotics (navigation, object manipulation), finance (portfolio management), healthcare (treatment optimization), and autonomous vehicles (path planning).

Answer 77

A

The agent improves by trial and error, refining its policy over many attempts.

Answer 78

A

A technique that incorporates human feedback into reinforcement learning to align AI models with human goals, wants, and needs.

Answer 79

A

Data collection, 2. Supervised fine-tuning, 3. Building a separate reward model, 4. Optimizing the language model with a reward-based model.

Refer Image from AWS

steps in image
1.

https://aws.amazon.com/what-is/reinforcement-learning-from-human-feedback/

Answer 80

A

A set of human-generated prompts and ideal responses are gathered to train the model.

Answer 81

A

Humans rank different AI-generated responses, helping the model learn human preferences.

Answer 82

A

It serves as an automated evaluator of AI-generated responses, replacing the need for continuous human judgment.

Answer 83

A

Learning from Human Feedback (RLHF):

Step 1: Supervised Fine-Tuning (SFT)
Goal: Train a base LLM (Large Language Model) using human demonstration data.

Process:

Collect human-labeled prompts and responses.

Fine-tune the base model using supervised learning to align with human-like responses.

Step 2: Training a Reward Model (RM)
Goal: Develop a model that evaluates AI-generated responses based on human preference.

Process:

A fine-tuned model (SFT) generates multiple responses to the same prompt.

Humans rank these responses to indicate preference.

A separate reward model (RM) is trained based on human ranking.

Step 3: Optimizing Policy using Proximal Policy Optimization (PPO)
Goal: Improve the model’s response generation by optimizing the policy using the reward model.

Process:

The fine-tuned model (SFT) generates responses to new prompts.

The reward model (RM) evaluates and assigns rewards.

The model updates its policy through reinforcement learning using PPO (Proximal Policy Optimization).

This iterative process ensures that the AI generates more human-aligned, contextually appropriate responses over time.

Answer 84

A

Overfitting occurs when a model performs well on training data but poorly on evaluation data because it memorizes noise instead of learning the underlying pattern.

Answer 85

A

Underfitting happens when a model performs poorly on both training and evaluation data because it is too simple to capture the underlying patterns in the data.

Answer 86

A

Bias is the error between the predicted value and the actual value, often caused by incorrect assumptions in the model.

Answer 87

A

Variance represents how much a model’s predictions change when trained on different datasets. High variance indicates overfitting.

Answer 88

A

High bias is caused by overly simplistic models that fail to capture the complexity of the data, leading to underfitting.

Answer 89

A

High variance is caused by overly complex models that fit training data too closely and fail to generalize well to new data.

Answer 90

A

Bias can be reduced by using a more complex model, adding more relevant features, or improving the data quality.

Answer 91

A

Variance can be reduced by simplifying the model, using fewer features, or increasing training data size.

Answer 92

A

A balanced model has low bias and low variance, meaning it generalizes well to unseen data.

Answer 93

A

It indicates underfitting, where the model is too simple and fails to capture data patterns.

Answer 94

A

It indicates overfitting, where the model memorizes training data but performs poorly on unseen data.

Answer 95

A

It indicates a poor model that neither captures patterns well nor generalizes properly.

Answer 96

A

It indicates a well-balanced model that effectively captures data patterns and generalizes well.

Answer 97

A

Overfitting can be detected if the model has high accuracy on training data but significantly lower accuracy on test data.

Answer 98

A

Underfitting can be detected if the model has low accuracy on both training and test data.

Answer 99

A

The bias-variance tradeoff refers to the challenge of balancing bias and variance to achieve optimal model performance.

Answer 100

A

“Confusion matrix

Answer 101

A

“A table used to evaluate classification models by comparing actual vs. predicted values.”

Answer 102

A

“Precision = True Positives / (True Positives + False Positives). Measures how many predicted positives were actually correct.”

Answer 103

A

“When false positives are costly.”

Answer 104

A

“Recall = True Positives / (True Positives + False Negatives). Measures how many actual positives were correctly identified.”

Answer 105

A

“When false negatives are costly.”

Answer 106

A

“A metric that balances precision and recall: F1 = 2 * (Precision * Recall) / (Precision + Recall).”

Answer 107

A

“Accuracy = (True Positives + True Negatives) / Total Predictions. Often not used for imbalanced datasets.”

Answer 108

A

“A metric that evaluates classification models by plotting true positive rate vs. false positive rate.”

Answer 109

A

“The model is perfect in distinguishing classes.”

Answer 110

A

“The model performs no better than random chance.”

Answer 111

A

MAE(Mean Absolute Error), MAPE(Mean Absolute Percentage Error), RMSE(Root Mean Squared Error), R-squared

Answer 112

A

“The average absolute difference between actual and predicted values.”

Answer 113

A

“Measures prediction error as a percentage of actual values.”

Answer 114

A

“A metric that squares errors before averaging to penalize larger errors more heavily.”

Answer 115

A

“A measure of how well input features explain the variance in the target variable. Closer to 1 means a better model.”

Answer 116

A

“A dataset where each category has an equal or similar number of instances.”

Answer 117

A

Inferencing is when a model makes predictions based on new data.

Answer 118

A

Real-time inferencing, batch inferencing, and edge inferencing.

Answer 119

A

Inferencing where predictions are made instantly, prioritizing speed over perfect accuracy.

Answer 120

A

Chatbots, recommendation systems, fraud detection, self-driving cars.

Answer 121

A

Inferencing where large amounts of data are processed at once, prioritizing accuracy over speed.

Answer 122

A

Data analysis, report generation, medical imaging, financial forecasting.

Answer 123

A

Inferencing done on edge devices with limited computing power, often in areas with poor internet connectivity.

Answer 124

A

Low latency, offline capability, reduced cloud dependency.

Answer 125

A

Limited computing power, making it difficult to run large models like LLMs.

Answer 126

A

Hosting the model on a remote server and accessing it via API calls.

Answer 127

A

Higher latency and requires an internet connection, but allows for more powerful models.

Answer 128

A

Compact AI models optimized for edge devices with low computational power.

Answer 129

A

Speed vs. accuracy, compute power vs. latency, online vs. offline capability.

Answer 130

A

1️⃣ Identify Business Problem →
2️⃣ Frame as ML Problem →
3️⃣ Collect & Prepare Data →
4️⃣ Feature Engineering →
5️⃣ Model Training →
6️⃣ Hyperparameter Tuning →
7️⃣ Model Evaluation →

If Business Goals Not Met 🔄 Go Back to Data Collection / Feature Engineering

If Business Goals Met → Proceed

8️⃣ Model Testing →
9️⃣ Deployment →

🔄 Model Monitoring & Debugging →

🔄 Retrain with New Data → Repeat Process

This cycle ensures continuous improvement and adaptation of the ML model

Answer 131

A

It transforms data into useful features for better model performance.

Answer 132

A

Enhance data, perform data augmentation, or improve features.

Answer 133

A

To ensure the model remains accurate and adapts to changes.

Answer 134

A

The process of enhancing a dataset by adding more data points or variations.

Answer 135

A

To understand data structure, visualize relationships, and compute statistics.

Answer 136

A

Settings that define the model structure and learning process, set before training begins.

Answer 137

A

Learning rate, batch size, number of epochs, regularization.

Answer 138

A

How fast the model incorporates new data.

Answer 139

A

Faster convergence but risk of overshooting optimal solution.

Answer 140

A

More precise convergence but slower training.

Answer 141

A

Number of training examples used per iteration to update model weights.

Answer 142

A

More stable learning but requires more time to compute.

Answer 143

A

Faster training but may lead to less stable updates.

Answer 144

A

How many times the model iterates over the entire training dataset.

Answer 145

A

Underfitting – model does not learn enough from the data.

Answer 146

A

Overfitting – model learns training data too well but fails on new data.

Answer 147

A

Adjusts balance between simple and complex models.

Answer 148

A

Increasing regularization reduces overfitting.

Answer 149

A

Model performs well on training data but poorly on new data.

Answer 150

A

Small training data, too many epochs, overly complex model.

Answer 151

A

Increase training data size, early stopping, data augmentation, regularization.

Answer 152

A

Increase the training data size.

Answer 153

A

Finding the best hyperparameter values to optimize model performance.

Answer 154

A

Grid search, random search, SageMaker AMT.

Answer 155

A

Improves accuracy, reduces overfitting, enhances generalization.

Answer 156

A

When a problem has a deterministic solution that can be computed exactly using traditional programming.

Answer 157

A

Calculating the probability of drawing a specific card from a known deck.

Answer 158

A

ML provides approximations, while deterministic code gives exact answers.

Answer 159

A

Whether the problem requires exact solutions or can tolerate approximation.