CAIC 9.4 Flashcards

Question 1

Q

What are some common marketing campaigns and advertising tactics used by retailers?

Answer

A

Direct marketing emails and digital advertisements

These tactics aim to attract customers with incentives or discounts based on demographics.

Question 2

Q

What is the primary goal of marketing campaigns in retail?

Answer

A

To achieve a high conversion rate while minimizing advertising costs and reducing customer disturbances.

Question 3

Q

How do ML models optimize marketing campaigns?

Answer

A

By using customer data and demographic factors to identify potential customers and determine appropriate messaging and incentives.

Question 4

Q

What is the purpose of customer segmentation in marketing?

Answer

A

To understand different customer segments and improve the effectiveness of marketing campaigns.

Question 5

Q

What is highly personalized marketing?

Answer

A

Marketing that creates individual profiles using behavior data to generate customized campaigns.

Question 6

Q

Fill in the blank: ML approaches to user-centric targeted marketing predict the conversion rate, known as _______.

Answer

A

[conversion probability]

Question 7

Q

What is contextual advertising?

Answer

A

A targeted marketing technique that displays ads relevant to the content on a web page.

Question 8

Q

How does ML assist in contextual advertising?

Answer

A

By identifying the context of an ad to ensure appropriate placement.

Question 9

Q

What is generative AI’s role in targeted marketing?

Answer

A

To create dynamically personalized content tailored to individual customer preferences.

Question 10

Q

Why is understanding consumer perception crucial for retail businesses?

Answer

A

It significantly impacts their success and helps in monitoring brand reputation.

Question 11

Q

What techniques do retailers use to assess customer sentiment?

Answer

A

Soliciting feedback and monitoring social media channels.

Question 12

Q

What is sentiment analysis?

Answer

A

A text classification problem that determines whether sentiment is positive, negative, or neutral.

Question 13

Q

What are common algorithms used for sentiment analysis?

Answer

A

ML algorithms, including deep learning-based algorithms.

Question 14

Q

How do retailers use sentiment analysis?

Answer

A

To gain insights into customer preferences and identify areas for improvement.

Question 15

Q

What do retailers rely on for inventory planning and demand forecasting?

Answer

A

To manage inventory costs while maximizing revenue and avoiding out-of-stock situations.

Question 16

Q

What limitations do traditional demand forecasting methods have?

Answer

A

Limitations in accuracy and reliability.

Question 17

Q

Which techniques are retailers turning to for improved demand forecasting?

Answer

A

Statistical and ML techniques such as regression analysis and deep learning.

Question 18

Q

Fill in the blank: Deep learning-based algorithms can produce accurate demand forecasts by incorporating multiple _______.

Answer

A

[data sources]

Question 19

Q

What is the system architecture of an autonomous vehicle composed of?

Answer

A

Perception and localization, decision and planning, control.

Question 20

Q

What role does perception play in autonomous driving?

Answer

A

It gathers information about surroundings and determines the vehicle’s position.

Question 21

Q

What sensors are used in the perception stage of autonomous vehicles?

Answer

A

RADAR, LIDAR, cameras, and ultrasonic systems.

Question 22

Q

What is the function of the decision and planning stage in autonomous vehicles?

Answer

A

To control the vehicle’s motion and behavior based on perception data.

Question 23

Q

What is the role of AI/ML in the decision and planning stage?

Answer

A

To analyze data and determine the optimal path for the vehicle.

Question 24

Q

What is the purpose of the control module in autonomous driving?

Answer

A

To translate decisions into physical actions that control the vehicle.

Question 25

Q

What techniques can be applied in the control module of autonomous vehicles?

Answer

A

Adaptive control systems and reinforcement learning techniques.

Question 26

Q

What does ADAS stand for?

Answer

A

Advanced Driver Assistance Systems.

Question 27

Q

What features do ADAS technologies provide?

Answer

A

Detecting potential hazards, issuing warnings, and taking corrective actions.

Question 28

Q

What is the objective function in ML optimization?

Answer

A

A business metric aimed at minimizing or maximizing discrepancies between projected and actual values.

Question 29

Q

What is the purpose of optimizers in ML?

Answer

A

To find optimal model parameters for minimizing the objective function.

Question 30

Q

What is gradient descent?

Answer

A

An iterative approach for optimizing neural networks and ML algorithms.

Question 31

Q

Fill in the blank: The learning rate is a hyperparameter that controls the magnitude of _______ in ML optimization.

Answer

A

[parameter updates]

Question 32

Q

What is gradient descent?

Answer

A

An iterative approach for optimizing neural networks and ML algorithms by calculating the rate of error change associated with input variables.

Gradient descent updates model parameters step by step to reduce error.

Question 33

Q

What is the learning rate in gradient descent?

Answer

A

A hyperparameter that controls the magnitude of parameter updates at each iteration.

It allows fine-tuning of the optimization process.

Question 34

Q

What are the key steps in the gradient descent optimization process?

Answer

A

Initialize W randomly
Calculate error using W
Compute the gradient of the error
Update W based on the gradient
Repeat until gradient is zero

This indicates that the optimal value of W has been reached.

Question 35

Q

What is the normal equation in ML?

Answer

A

An alternative optimization technique that provides a one-step analytical solution for calculating coefficients in linear regression models.

Unlike gradient descent, it does not require iterative updates.

Question 36

Q

What are the two primary types of ML tasks?

Answer

A

Classification
Regression

Classification involves categorizing data, while regression involves predicting continuous values.

Question 37

Q

What is overfitting in machine learning?

Answer

A

When a trained model learns the training data too well but fails to generalize to new, unseen data.

Simpler algorithms with fewer parameters may help prevent overfitting.

Question 38

Q

What factors should be considered when selecting a ML algorithm?

Answer

A

Problem type
Dataset size
Number and nature of features
Computational requirements
Interpretability of results
Assumptions about data distribution

These factors aid in making informed decisions for algorithm selection.

Question 39

Q

What does linear regression aim to estimate?

Answer

A

The output value by calculating the weighted sum of input variables assuming a linear relationship.

This is expressed through a linear function of coefficients and input variables.

Question 40

Q

What is the goal of logistic regression?

Answer

A

To estimate the probability of an event occurring, effectively separating classes of data points.

It uses a logistic function to map input variables to a probability score.

Question 41

Q

What is a decision tree?

Answer

A

A hierarchical model that splits data based on features to classify or predict outcomes.

It uses algorithms like the Gini index and information gain for splitting.

Question 42

Q

What is a key advantage of decision trees?

Answer

A

Their ability to capture non-linear relationships and interactions between features.

Decision trees can handle both numerical and categorical features.

Question 43

Q

What is a limitation of decision trees?

Answer

A

They can be prone to overfitting, especially with a large number of features and noisy data.

Overfitting occurs when the model memorizes training data but performs poorly on unseen data.

Question 44

Q

How does a random forest improve upon decision trees?

Answer

A

By combining the decisions of multiple trees to enhance overall performance.

It utilizes majority voting for classification or averaging for regression.

Question 45

Q

What is gradient boosting?

Answer

A

A sequential algorithm that aggregates results from different trees, where each tree corrects the errors of the previous one.

It differs from random forests, which use parallel independent trees.

Question 46

Q

What is a key advantage of gradient boosting?

Answer

A

It excels in handling imbalanced datasets and can achieve higher performance with proper tuning.

It allows for custom loss functions, enhancing flexibility in modeling.

Question 47

Q

What is a limitation of gradient boosting?

Answer

A

It lacks parallelization capabilities, making it slower in training compared to algorithms that can be parallelized.

This sequential nature can hinder efficiency.

Question 48

Q

What is the main advantage of gradient boosting?

Answer

A

It has the potential to achieve higher performance than other algorithms when properly tuned.

Question 49

Q

What custom feature does gradient boosting support?

Answer

A

Custom loss functions.

Question 50

Q

What is a limitation of gradient boosting related to data?

Answer

A

It is sensitive to noisy data, including outliers.

Question 51

Q

What is XGBoost?

Answer

A

A widely-used implementation of gradient boosting.

Question 52

Q

How does XGBoost improve training times?

Answer

A

It enables training a single tree across multiple cores and CPUs.

Question 53

Q

What techniques does XGBoost use to mitigate overfitting?

Answer

A

Powerful regularization techniques.

Question 54

Q

What are some other popular variations of gradient boosting trees?

Answer

A

LightGBM
CatBoost

Question 55

Q

What does K-NN stand for?

Answer

A

K-Nearest Neighbors.

Question 56

Q

What is the underlying assumption of K-NN?

Answer

A

Similar items tend to have close proximity to each other in the feature space.

Question 57

Q

How does K-NN classify a new data point?

Answer

A

By majority voting among the K nearest neighbors.

Question 58

Q

What is a key advantage of K-NN?

Answer

A

Its simplicity and lack of need for training or tuning with hyperparameters.

Question 59

Q

What challenge does K-NN face as the number of data points increases?

Answer

A

Predictions can become slower.

Question 60

Q

What is a limitation of K-NN regarding dimensionality?

Answer

A

It is not suitable for high-dimensional datasets.

Question 61

Q

What does an artificial neuron do?

Answer

A

Processes inputs from another neuron, transforms them, and sends output.

Question 62

Q

What does the activation function in an artificial neuron do?

Answer

A

Modifies the output of the linear function.

Question 63

Q

What is a Multi-Layer Perceptron (MLP)?

Answer

A

A neural network that stacks multiple layers of neurons.

Question 64

Q

What is the purpose of backpropagation in neural networks?

Answer

A

To adjust the weights of each neuron based on the contribution to the error.

Answer 65

A

Tabular data
Images
Text

Answer 66

A

Grouping items together based on shared attributes.

Answer 67

A

To group similar data points together in clusters.

Answer 68

A

It is sensitive to the initial placement of centroids.

Answer 69

A

A sequence of data points recorded at successive time intervals.

Answer 70

A

The long-term direction of the data.

Answer 71

A

Repeating patterns within a fixed interval.

Answer 72

A

Statistical properties like mean and variance remain constant over time.

Answer 73

A

Analyzing and predicting time series data.

Answer 74

A

Autoregressive
Moving average
Differencing

Answer 75

A

A state-of-the-art forecasting algorithm based on neural networks.

Answer 76

A

Its black-box nature lacks interpretability.

Answer 77

A

An essential machine learning tool used for personalized recommendations.

Answer 78

A

The black-box nature of the deep learning model, which lacks interpretability and transparency.

Answer 79

A

DeepAR performs poorly when the dataset is small.

Answer 80

A

To predict a user’s preference for items based on user or item attribute similarities or user-item interactions.

Answer 81

A

Retail
Media and entertainment
Finance
Healthcare

Answer 82

A

The preferences and behaviors of similar users.

Answer 83

A

It can provide highly personalized recommendations matched to each user’s unique interests.

Answer 84

A

Collaborative models struggle when new users or items with no ratings are introduced.

Answer 85

A

A technique that involves learning vector representations for both users and items in the user-item interaction matrix.

Answer 86

A

To approximate the original user-item interaction matrix by predicting missing entries.

Answer 87

A

Multi-Armed Bandit.

Answer 88

A

To dynamically explore and exploit different recommendations to optimize user experience.

Answer 89

A

Striking the right balance between exploration and exploitation.

Answer 90

A

The ability of computers to interpret and understand visual representations, such as images and videos.

Answer 91

A

Object identification
Image classification
Text detection
Face recognition
Activity detection

Answer 92

A

Convolutional Neural Network (CNN).

Answer 93

A

Feature extraction from input images.

Answer 94

A

To reduce the dimensionality of the extracted features.

Answer 95

A

Max pooling
Average pooling

Answer 96

A

Signals from initial inputs diminish as they traverse through multiple layers.

Answer 97

A

Skip connections that allow signals to bypass certain layers.

Answer 98

A

The relationship between computers and human language.

Answer 99

A

Document classification
Topic modeling
Speech-to-text conversion
Language translation
Reading comprehension

Answer 100

A

Counts the number of times a word appears in a text.

Answer 101

A

TF (Term Frequency)
IDF (Inverse Document Frequency)

Answer 102

A

A technique used to generate low-dimensional representations for words or sentences that capture semantic meaning.

Answer 103

A

Embedding captures the semantic meaning of words, while BOW and TF-IDF create large and sparse input vectors.

Answer 104

A

A technique used to generate low-dimensional representations (mathematical vectors) for words or sentences that capture the semantic meaning of the text.

Answer 105

A

Words or sentences with similar semantic meanings are closer to each other than those with different meanings.

Answer 106

A

A metric that measures how similar two vectors are by calculating the cosine of the angle between them.

Answer 107

A

They offer more meaningful representations of the underlying text compared to other techniques like simple word counts.

Answer 108

A

Thomas Mikolov created Word2Vec in 2013.

Answer 109

A

CBOW (Continuous Bag of Words)
Continuous-Skip-Gram

Answer 110

A

It tries to predict a word for a given window of surrounding words.

Answer 111

A

It tries to predict surrounding words for a given word.

Answer 112

A

To run across running text and choose one of the words as the target while the rest serve as inputs.

Answer 113

A

A straightforward one-hidden-layer MLP network.

Answer 114

A

The actual embeddings for the words.

Answer 115

A

They can be readily used for tasks like text classification or entity extraction.

Answer 116

A

It produces a fixed embedding representation for each word, disregarding contextual variations in meaning.

Answer 117

A

Bidirectional Encoder Representations from Transformers.

Answer 118

A

Predicting randomly masked words in sentences
Predicting the next sentence from a given sentence

Answer 119

A

It generates context-aware embeddings that consider surrounding words.

Answer 120

A

It generates embeddings at subword levels.

Answer 121

A

Token embedding.

Answer 122

A

A transformer.

Answer 123

A

A self-attention layer
A feed-forward network layer

Answer 124

A

It calculates the strength of the connection between one token and all other tokens in the input sentence.

Answer 125

A

Question answering
Named entity extraction
Text summarization

Answer 126

A

It indicates its effectiveness in various NLP tasks when it was released.

Answer 127

A

A technique used to adapt the pre-trained model for specific tasks.