CAIC 9.3 Flashcards

1
Q

What are some marketing tactics employed by retailers?

A

Direct marketing emails, digital advertisements, incentives, discounts based on demographics

These tactics aim to attract potential customers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the primary goal of marketing campaigns in retail?

A

Achieve a high conversion rate while minimizing advertising costs and reducing customer disturbances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What role do ML models play in marketing campaigns?

A

Optimize the effectiveness of marketing campaigns by identifying potential customers and appropriate messaging.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is customer segmentation in marketing?

A

Understanding different customer segments to improve marketing campaign effectiveness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is unsupervised clustering in customer segmentation?

A

Grouping customers based on demographic data without predefined labels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Fill in the blank: Highly personalized _______ can improve conversion rates.

A

marketing campaigns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What type of data is used to create accurate individual profiles for targeted marketing?

A

Historical transaction data, response data to historical campaigns, social media data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is contextual advertising?

A

A targeted marketing technique that displays ads relevant to the content on a web page.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does generative AI enhance targeted marketing?

A

Creates dynamically personalized content tailored to individual customer preferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is understanding consumer perception crucial for retail businesses?

A

It significantly impacts their success and brand reputation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is sentiment analysis?

A

A text classification problem that determines whether sentiment is positive, negative, or neutral.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do ML algorithms assist in sentiment analysis?

A

By training models to detect sentiment in text data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What techniques do retailers use to assess customer sentiment?

A

Soliciting feedback, monitoring social media channels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the purpose of inventory planning and demand forecasting in retail?

A

Manage inventory costs while maximizing revenue and avoiding out-of-stock situations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What limitations do traditional methods for demand forecasting have?

A

Accuracy and reliability issues.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What statistical techniques are retailers using for demand forecasting?

A

Regression analysis, deep learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Fill in the blank: ML-based forecasting models can generate both _______ and probabilistic forecasts.

A

point forecasts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the significance of AI and ML in the automotive industry?

A

Improving efficiency, safety, and customer experience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is one of the most significant applications of AI and ML in the automotive industry?

A

Autonomous driving.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the three main stages of the system architecture in autonomous vehicles?

A

Perception and localization, decision and planning, control.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the role of the perception stage in autonomous driving?

A

Gathering information about surroundings and determining the vehicle’s position.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What types of sensors are used in the perception stage of autonomous vehicles?

A

RADAR, LIDAR, cameras.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does the decision and planning stage in autonomous driving do?

A

Controls the motion and behavior of the vehicle based on perception data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How do AI and ML enhance the path planning process in autonomous vehicles?

A

By analyzing real-time data, traffic patterns, and user inputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the function of the control module in autonomous driving?

A

Translates decisions into physical actions controlling the vehicle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is Adaptive Control in the context of autonomous vehicles?

A

Dynamically adjusts control inputs based on sensor data and real-time feedback.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What are Advanced Driver-Assistance Systems (ADAS)?

A

Technologies that enhance driving safety and experience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What features do ADAS include?

A

Lane departure warning, automatic emergency braking.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is the primary role of ML solutions architects?

A

Identify suitable data science solutions and design technology infrastructure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What do ML algorithms learn by optimizing?

A

An objective function, also known as a loss function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is the purpose of optimization in ML?

A

Minimize or maximize an objective function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Fill in the blank: The learning rate is a hyperparameter that controls the _______ of parameter updates.

A

magnitude

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is gradient descent?

A

An iterative approach for optimizing ML algorithms by calculating error changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is gradient descent?

A

An iterative approach for optimizing neural networks and ML algorithms by calculating the rate of error change associated with each input variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is the role of the learning rate in gradient descent?

A

Controls the magnitude of parameter updates at each iteration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

List the key steps in the gradient descent optimization process.

A
  • Initialize the value of W randomly
  • Calculate the error (loss)
  • Compute the gradient of the error
  • Update W to reduce the error
  • Repeat until the gradient is zero
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is the normal equation in the context of ML algorithms?

A

A one-step analytical solution for calculating the coefficients of linear regression models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What are the primary types of ML tasks discussed?

A
  • Classification
  • Regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Fill in the blank: Classification algorithms are suitable for tasks where the goal is to ______ data into distinct classes.

A

[categorize]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is overfitting in machine learning?

A

When a trained model learns the training data too well but fails to generalize to new, unseen data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What factors should be considered when selecting a ML algorithm?

A
  • Problem type
  • Dataset size
  • Number and nature of features
  • Computational requirements
  • Interpretability of results
  • Assumptions about data distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is linear regression?

A

A method that utilizes a linear function of a set of coefficients and input variables to predict a scalar output.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What does logistic regression estimate?

A

The probability of an event or outcome, such as transaction fraud or passing an exam.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

True or False: Logistic regression is suitable for problems with complex non-linear relationships.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is a decision tree in machine learning?

A

A model that divides data hierarchically based on rules, leading to similar data points following the same decision path.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

List the advantages of decision trees.

A
  • Capture non-linear relationships
  • Handle both numerical and categorical features
  • Minimal preprocessing required
  • Highly interpretable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is a limitation of decision trees?

A

Prone to overfitting, especially with a large number of features and noisy data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is the role of the Gini index in decision trees?

A

Measures the probability of misclassification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is random forest?

A

An ensemble method that combines the decisions of multiple decision trees to improve overall performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is bagging in the context of random forests?

A

A technique that involves using the same sample multiple times in a single tree to make the model more generalized.

51
Q

List the advantages of random forests over decision trees.

A
  • Improved accuracy
  • Reduced overfitting
  • Robust to outliers
  • Feature importance estimation
52
Q

What is gradient boosting?

A

A sequential approach that aggregates results from different trees, where each tree corrects the errors of the previous one.

53
Q

List the advantages of gradient boosting.

A
  • Handles imbalanced datasets
  • Potential for higher performance with tuning
  • Supports custom loss functions
  • Captures complex relationships in data
54
Q

What is a limitation of gradient boosting?

A

Lacks parallelization capabilities, making it slower in training compared to parallelizable algorithms.

55
Q

What is the primary advantage of gradient boosting?

A

It has the potential to achieve higher performance than other algorithms when properly tuned.

56
Q

What does gradient boosting support that adds flexibility in modeling?

A

Custom loss functions.

57
Q

What is a limitation of gradient boosting related to data?

A

It is sensitive to noisy data, including outliers.

58
Q

What is XGBoost?

A

A widely-used implementation of gradient boosting that focuses on speed and performance.

59
Q

What key improvement does XGBoost offer over traditional gradient boosting?

A

It enables training a single tree across multiple cores and CPUs for faster training times.

60
Q

Name two other popular variations of gradient boosting trees.

A
  • LightGBM
  • CatBoost
61
Q

What is K-NN primarily used for?

A

Both classification and regression tasks.

62
Q

What is the underlying assumption of the K-NN algorithm?

A

Similar items tend to have close proximity to each other in the feature space.

63
Q

How does K-NN determine the class label for a new data point?

A

Through majority voting among the K nearest neighbors.

64
Q

What is one advantage of K-NN?

A

Its simplicity and lack of the need for training or tuning with hyperparameters.

65
Q

What is a significant limitation of K-NN related to dataset size?

A

As the number of data points increases, the complexity grows, making predictions slower.

66
Q

What is an artificial neuron modeled after?

A

The learning process of the human brain.

67
Q

What is the role of the activation function in an artificial neuron?

A

It modifies the output of the linear function to capture non-linear relationships.

68
Q

What does MLP stand for in the context of neural networks?

A

Multi-Layer Perceptron.

69
Q

What is the purpose of backpropagation in neural networks?

A

To adjust the weights of each neuron to optimize the training objective.

70
Q

What types of tasks can MLP handle?

A
  • Classification
  • Regression
71
Q

What is clustering in data mining?

A

Grouping items together based on their shared attributes.

72
Q

What is the K-means clustering algorithm used for?

A

To group similar data points together in clusters.

73
Q

What is a significant drawback of K-means clustering?

A

Selecting the optimal number of clusters (K) can be subjective and challenging.

74
Q

What are the three important characteristics of time series data?

A
  • Trend
  • Seasonality
  • Stationarity
75
Q

What does the trend of a time series indicate?

A

The long-term direction of the data, whether it shows an overall increase or decrease.

76
Q

What is ARIMA used for?

A

Analyzing and predicting time series data.

77
Q

What does the ‘I’ in ARIMA stand for?

A

Integrated, referring to the differencing of the time series to achieve stationarity.

78
Q

What is a major advantage of using DeepAR for time series forecasting?

A

It captures complex non-linear relationships and can utilize multivariate datasets.

79
Q

What is a significant drawback of DeepAR?

A

Its black-box nature makes it less interpretable compared to simpler statistical methods.

80
Q

What does DeepAR model in marketing?

A

Multiple variables simultaneously

DeepAR provides accurate predictions and insights for marketing campaigns.

81
Q

What is a significant drawback of DeepAR?

A

Black-box nature, lacks interpretability and transparency

This makes forecasts difficult to explain compared to simpler statistical methods.

82
Q

What is a major disadvantage of DeepAR regarding data?

A

Data-hungry nature, performs poorly with small datasets

83
Q

What is a recommender system?

A

An ML technology that predicts user preferences for items

It relies on user or item attribute similarities or user-item interactions.

84
Q

In which industries has the recommender system gained widespread adoption?

A
  • Retail
  • Media and entertainment
  • Finance
  • Healthcare
85
Q

What is collaborative filtering?

A

A recommendation algorithm based on similarities in user preferences

It analyzes collective experiences to recommend items.

86
Q

What is one major benefit of collaborative filtering?

A

Provides highly personalized recommendations

87
Q

What is the cold-start problem in collaborative filtering?

A

Struggles with new users or items with no ratings

88
Q

What technique is commonly used in collaborative filtering for recommendations?

A

Matrix factorization

89
Q

What is matrix factorization’s role in collaborative filtering?

A

Scalability benefits for large catalogs

It sacrifices some interpretability due to latent factor modeling.

90
Q

What is a Multi-Armed Bandit (MAB) approach?

A

A recommendation system inspired by trial and error

91
Q

How do MAB algorithms learn?

A

Through online learning without pre-existing training data

92
Q

What is the balance that MAB algorithms strive to achieve?

A

Exploration and exploitation

93
Q

What does computer vision refer to?

A

Ability of computers to interpret visual representations

94
Q

What are some tasks performed by computer vision?

A
  • Object identification
  • Image classification
  • Text detection
  • Face recognition
  • Activity detection
95
Q

What is a Convolutional Neural Network (CNN)?

A

A deep learning architecture for processing image data

96
Q

What does a convolutional layer do in a CNN?

A

Extracts features from input images

97
Q

What is max pooling in CNNs?

A

A technique that selects the maximum value from outputs

98
Q

What is the vanishing gradient problem?

A

Signals diminish as they traverse through multiple layers

99
Q

How does ResNet address the vanishing gradient problem?

A

By implementing a layer-skipping technique

100
Q

What is Natural Language Processing (NLP)?

A

Focuses on the relationship between computers and human language

101
Q

What are some tasks performed in NLP?

A
  • Document classification
  • Topic modeling
  • Speech recognition
  • Language translation
102
Q

What are the two widely used methods for representing word relevance before embeddings?

A
  • Bag of Words (BOW)
  • Term Frequency-Inverse Document Frequency (TF-IDF)
103
Q

What is the main idea behind Bag of Words (BOW)?

A

Counts the number of times a word appears in a text

104
Q

What does TF-IDF measure?

A

Importance of a word in a document and across all documents

105
Q

What is embedding in NLP?

A

Generates low-dimensional representations capturing semantic meaning

106
Q

What is embedding?

A

A technique used to generate low-dimensional representations (mathematical vectors) for words or sentences, capturing the semantic meaning of the text.

107
Q

What is the underlying idea of embedding?

A

Words or sentences with similar semantic meanings tend to occur in similar contexts.

108
Q

In a multi-dimensional space, how are semantically similar entities represented?

A

The mathematical representations of semantically similar entities are closer to each other than those with different meanings.

109
Q

What metric is often used to measure the similarity of embedding vectors?

A

Cosine similarity.

110
Q

What does the embedding vector represent?

A

The intrinsic meaning of the word, with each dimension representing a specific attribute associated with the word.

111
Q

Why have embeddings become crucial in NLP tasks?

A

They offer more meaningful representations of the underlying text compared to techniques like simple word counts.

112
Q

Who created Word2Vec and in what year?

A

Thomas Mikolov in 2013.

113
Q

What are the two techniques supported by Word2Vec for learning embeddings?

A
  • CBOW (Continuous Bag of Words)
  • Continuous-skip-gram.
114
Q

What does CBOW try to predict?

A

A word for a given window of surrounding words.

115
Q

What does continuous-skip-gram try to predict?

A

Surrounding words for a given word.

116
Q

What is the training dataset for CBOW generated from?

A

A sliding window across running text.

117
Q

How is the training problem formulated in Word2Vec?

A

As a multi-class classification problem where the model learns to predict classes (words in the vocabulary) for the target word.

118
Q

What type of network is used to train Word2Vec embeddings?

A

A one-hidden-layer MLP (Multi-Layer Perceptron) network.

119
Q

What does the output of the MLP network represent in Word2Vec?

A

A probability distribution for the target words.

120
Q

What is the purpose of using embeddings in downstream tasks?

A

To serve as features for tasks such as text classification or entity extraction.

121
Q

What is a common approach to using embeddings in NLP tasks?

A

Using pre-trained embeddings.

122
Q

What limitation does Word2Vec have regarding word meanings?

A

It produces a fixed embedding representation for each word, disregarding contextual variations in meaning.

123
Q

What are contextualized word embeddings?

A

Embeddings that take into account the surrounding words or overall context in which a word appears.

124
Q

What advantage do contextualized embeddings provide?

A

They capture the diverse meanings a word can have, enabling more accurate context-specific analyses.