CAIC 9.4 Flashcards

1
Q

What are some common marketing campaigns and advertising tactics used by retailers?

A

Direct marketing emails and digital advertisements

These tactics aim to attract customers with incentives or discounts based on demographics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the primary goal of marketing campaigns in retail?

A

To achieve a high conversion rate while minimizing advertising costs and reducing customer disturbances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do ML models optimize marketing campaigns?

A

By using customer data and demographic factors to identify potential customers and determine appropriate messaging and incentives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the purpose of customer segmentation in marketing?

A

To understand different customer segments and improve the effectiveness of marketing campaigns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is highly personalized marketing?

A

Marketing that creates individual profiles using behavior data to generate customized campaigns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Fill in the blank: ML approaches to user-centric targeted marketing predict the conversion rate, known as _______.

A

[conversion probability]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is contextual advertising?

A

A targeted marketing technique that displays ads relevant to the content on a web page.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does ML assist in contextual advertising?

A

By identifying the context of an ad to ensure appropriate placement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is generative AI’s role in targeted marketing?

A

To create dynamically personalized content tailored to individual customer preferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is understanding consumer perception crucial for retail businesses?

A

It significantly impacts their success and helps in monitoring brand reputation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What techniques do retailers use to assess customer sentiment?

A

Soliciting feedback and monitoring social media channels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is sentiment analysis?

A

A text classification problem that determines whether sentiment is positive, negative, or neutral.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are common algorithms used for sentiment analysis?

A

ML algorithms, including deep learning-based algorithms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do retailers use sentiment analysis?

A

To gain insights into customer preferences and identify areas for improvement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What do retailers rely on for inventory planning and demand forecasting?

A

To manage inventory costs while maximizing revenue and avoiding out-of-stock situations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What limitations do traditional demand forecasting methods have?

A

Limitations in accuracy and reliability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Which techniques are retailers turning to for improved demand forecasting?

A

Statistical and ML techniques such as regression analysis and deep learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Fill in the blank: Deep learning-based algorithms can produce accurate demand forecasts by incorporating multiple _______.

A

[data sources]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the system architecture of an autonomous vehicle composed of?

A

Perception and localization, decision and planning, control.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What role does perception play in autonomous driving?

A

It gathers information about surroundings and determines the vehicle’s position.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What sensors are used in the perception stage of autonomous vehicles?

A

RADAR, LIDAR, cameras, and ultrasonic systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the function of the decision and planning stage in autonomous vehicles?

A

To control the vehicle’s motion and behavior based on perception data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the role of AI/ML in the decision and planning stage?

A

To analyze data and determine the optimal path for the vehicle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the purpose of the control module in autonomous driving?

A

To translate decisions into physical actions that control the vehicle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What techniques can be applied in the control module of autonomous vehicles?

A

Adaptive control systems and reinforcement learning techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What does ADAS stand for?

A

Advanced Driver Assistance Systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What features do ADAS technologies provide?

A

Detecting potential hazards, issuing warnings, and taking corrective actions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the objective function in ML optimization?

A

A business metric aimed at minimizing or maximizing discrepancies between projected and actual values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is the purpose of optimizers in ML?

A

To find optimal model parameters for minimizing the objective function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is gradient descent?

A

An iterative approach for optimizing neural networks and ML algorithms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Fill in the blank: The learning rate is a hyperparameter that controls the magnitude of _______ in ML optimization.

A

[parameter updates]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is gradient descent?

A

An iterative approach for optimizing neural networks and ML algorithms by calculating the rate of error change associated with input variables.

Gradient descent updates model parameters step by step to reduce error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is the learning rate in gradient descent?

A

A hyperparameter that controls the magnitude of parameter updates at each iteration.

It allows fine-tuning of the optimization process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What are the key steps in the gradient descent optimization process?

A
  • Initialize W randomly
  • Calculate error using W
  • Compute the gradient of the error
  • Update W based on the gradient
  • Repeat until gradient is zero

This indicates that the optimal value of W has been reached.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is the normal equation in ML?

A

An alternative optimization technique that provides a one-step analytical solution for calculating coefficients in linear regression models.

Unlike gradient descent, it does not require iterative updates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What are the two primary types of ML tasks?

A
  • Classification
  • Regression

Classification involves categorizing data, while regression involves predicting continuous values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is overfitting in machine learning?

A

When a trained model learns the training data too well but fails to generalize to new, unseen data.

Simpler algorithms with fewer parameters may help prevent overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What factors should be considered when selecting a ML algorithm?

A
  • Problem type
  • Dataset size
  • Number and nature of features
  • Computational requirements
  • Interpretability of results
  • Assumptions about data distribution

These factors aid in making informed decisions for algorithm selection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What does linear regression aim to estimate?

A

The output value by calculating the weighted sum of input variables assuming a linear relationship.

This is expressed through a linear function of coefficients and input variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is the goal of logistic regression?

A

To estimate the probability of an event occurring, effectively separating classes of data points.

It uses a logistic function to map input variables to a probability score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is a decision tree?

A

A hierarchical model that splits data based on features to classify or predict outcomes.

It uses algorithms like the Gini index and information gain for splitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is a key advantage of decision trees?

A

Their ability to capture non-linear relationships and interactions between features.

Decision trees can handle both numerical and categorical features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is a limitation of decision trees?

A

They can be prone to overfitting, especially with a large number of features and noisy data.

Overfitting occurs when the model memorizes training data but performs poorly on unseen data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

How does a random forest improve upon decision trees?

A

By combining the decisions of multiple trees to enhance overall performance.

It utilizes majority voting for classification or averaging for regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is gradient boosting?

A

A sequential algorithm that aggregates results from different trees, where each tree corrects the errors of the previous one.

It differs from random forests, which use parallel independent trees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is a key advantage of gradient boosting?

A

It excels in handling imbalanced datasets and can achieve higher performance with proper tuning.

It allows for custom loss functions, enhancing flexibility in modeling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is a limitation of gradient boosting?

A

It lacks parallelization capabilities, making it slower in training compared to algorithms that can be parallelized.

This sequential nature can hinder efficiency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is the main advantage of gradient boosting?

A

It has the potential to achieve higher performance than other algorithms when properly tuned.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What custom feature does gradient boosting support?

A

Custom loss functions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is a limitation of gradient boosting related to data?

A

It is sensitive to noisy data, including outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is XGBoost?

A

A widely-used implementation of gradient boosting.

52
Q

How does XGBoost improve training times?

A

It enables training a single tree across multiple cores and CPUs.

53
Q

What techniques does XGBoost use to mitigate overfitting?

A

Powerful regularization techniques.

54
Q

What are some other popular variations of gradient boosting trees?

A
  • LightGBM
  • CatBoost
55
Q

What does K-NN stand for?

A

K-Nearest Neighbors.

56
Q

What is the underlying assumption of K-NN?

A

Similar items tend to have close proximity to each other in the feature space.

57
Q

How does K-NN classify a new data point?

A

By majority voting among the K nearest neighbors.

58
Q

What is a key advantage of K-NN?

A

Its simplicity and lack of need for training or tuning with hyperparameters.

59
Q

What challenge does K-NN face as the number of data points increases?

A

Predictions can become slower.

60
Q

What is a limitation of K-NN regarding dimensionality?

A

It is not suitable for high-dimensional datasets.

61
Q

What does an artificial neuron do?

A

Processes inputs from another neuron, transforms them, and sends output.

62
Q

What does the activation function in an artificial neuron do?

A

Modifies the output of the linear function.

63
Q

What is a Multi-Layer Perceptron (MLP)?

A

A neural network that stacks multiple layers of neurons.

64
Q

What is the purpose of backpropagation in neural networks?

A

To adjust the weights of each neuron based on the contribution to the error.

65
Q

What are the types of data MLP can handle?

A
  • Tabular data
  • Images
  • Text
66
Q

What is clustering in data mining?

A

Grouping items together based on shared attributes.

67
Q

What is the K-means algorithm used for?

A

To group similar data points together in clusters.

68
Q

What is a drawback of K-means related to initial conditions?

A

It is sensitive to the initial placement of centroids.

69
Q

What is a time series?

A

A sequence of data points recorded at successive time intervals.

70
Q

What does trend refer to in time series analysis?

A

The long-term direction of the data.

71
Q

What does seasonality capture in time series data?

A

Repeating patterns within a fixed interval.

72
Q

What is stationarity in time series?

A

Statistical properties like mean and variance remain constant over time.

73
Q

What is ARIMA used for?

A

Analyzing and predicting time series data.

74
Q

What are the three components of ARIMA?

A
  • Autoregressive
  • Moving average
  • Differencing
75
Q

What is DeepAR?

A

A state-of-the-art forecasting algorithm based on neural networks.

76
Q

What is a significant drawback of DeepAR?

A

Its black-box nature lacks interpretability.

77
Q

What is a recommender system?

A

An essential machine learning tool used for personalized recommendations.

78
Q

What is a significant drawback of DeepAR?

A

The black-box nature of the deep learning model, which lacks interpretability and transparency.

79
Q

What is a major disadvantage of DeepAR regarding data?

A

DeepAR performs poorly when the dataset is small.

80
Q

What is the primary function of a recommender system?

A

To predict a user’s preference for items based on user or item attribute similarities or user-item interactions.

81
Q

In what industries has the recommender system gained widespread adoption?

A
  • Retail
  • Media and entertainment
  • Finance
  • Healthcare
82
Q

What does collaborative filtering rely on to make recommendations?

A

The preferences and behaviors of similar users.

83
Q

What is one of the major benefits of collaborative filtering?

A

It can provide highly personalized recommendations matched to each user’s unique interests.

84
Q

What is the cold-start problem in collaborative filtering?

A

Collaborative models struggle when new users or items with no ratings are introduced.

85
Q

What is matrix factorization in the context of collaborative filtering?

A

A technique that involves learning vector representations for both users and items in the user-item interaction matrix.

86
Q

What is the main goal of matrix factorization?

A

To approximate the original user-item interaction matrix by predicting missing entries.

87
Q

What does MAB stand for in recommendation systems?

A

Multi-Armed Bandit.

88
Q

What is the basic principle behind MAB-based recommendation systems?

A

To dynamically explore and exploit different recommendations to optimize user experience.

89
Q

What do MAB algorithms struggle with?

A

Striking the right balance between exploration and exploitation.

90
Q

What does computer vision refer to?

A

The ability of computers to interpret and understand visual representations, such as images and videos.

91
Q

What are some tasks performed by computer vision?

A
  • Object identification
  • Image classification
  • Text detection
  • Face recognition
  • Activity detection
92
Q

What architecture is specifically designed for processing image data?

A

Convolutional Neural Network (CNN).

93
Q

What is a key role of the convolutional layer in a CNN?

A

Feature extraction from input images.

94
Q

What is the function of the pooling layer in a CNN?

A

To reduce the dimensionality of the extracted features.

95
Q

What are the two commonly used pooling techniques in CNNs?

A
  • Max pooling
  • Average pooling
96
Q

What is the vanishing gradient problem in CNNs?

A

Signals from initial inputs diminish as they traverse through multiple layers.

97
Q

What does ResNet use to address the vanishing gradient problem?

A

Skip connections that allow signals to bypass certain layers.

98
Q

What is the focus of Natural Language Processing (NLP)?

A

The relationship between computers and human language.

99
Q

What are some common tasks within NLP?

A
  • Document classification
  • Topic modeling
  • Speech-to-text conversion
  • Language translation
  • Reading comprehension
100
Q

What does the Bag-of-Words (BOW) model do?

A

Counts the number of times a word appears in a text.

101
Q

What are the two components of TF-IDF?

A
  • TF (Term Frequency)
  • IDF (Inverse Document Frequency)
102
Q

What is embedding in the context of NLP?

A

A technique used to generate low-dimensional representations for words or sentences that capture semantic meaning.

103
Q

How does embedding differ from BOW and TF-IDF?

A

Embedding captures the semantic meaning of words, while BOW and TF-IDF create large and sparse input vectors.

104
Q

What is embedding?

A

A technique used to generate low-dimensional representations (mathematical vectors) for words or sentences that capture the semantic meaning of the text.

105
Q

What does the proximity of embeddings in multi-dimensional space indicate?

A

Words or sentences with similar semantic meanings are closer to each other than those with different meanings.

106
Q

What is cosine similarity?

A

A metric that measures how similar two vectors are by calculating the cosine of the angle between them.

107
Q

Why are embeddings widely adopted in NLP tasks?

A

They offer more meaningful representations of the underlying text compared to other techniques like simple word counts.

108
Q

Who created Word2Vec and in what year?

A

Thomas Mikolov created Word2Vec in 2013.

109
Q

What are the two techniques used for learning embeddings in Word2Vec?

A
  • CBOW (Continuous Bag of Words)
  • Continuous-Skip-Gram
110
Q

Describe the CBOW technique.

A

It tries to predict a word for a given window of surrounding words.

111
Q

Describe the Continuous-Skip-Gram technique.

A

It tries to predict surrounding words for a given word.

112
Q

What is the purpose of a sliding window in the CBOW technique?

A

To run across running text and choose one of the words as the target while the rest serve as inputs.

113
Q

What is the structure of the MLP network used for training Word2Vec embeddings?

A

A straightforward one-hidden-layer MLP network.

114
Q

What do the weights of the hidden layer represent after training Word2Vec?

A

The actual embeddings for the words.

115
Q

What is a key advantage of using embeddings as features for downstream tasks?

A

They can be readily used for tasks like text classification or entity extraction.

116
Q

What is the limitation of Word2Vec in terms of word meanings?

A

It produces a fixed embedding representation for each word, disregarding contextual variations in meaning.

117
Q

What does BERT stand for?

A

Bidirectional Encoder Representations from Transformers.

118
Q

What are the two main tasks BERT performs?

A
  • Predicting randomly masked words in sentences
  • Predicting the next sentence from a given sentence
119
Q

How does BERT improve upon Word2Vec?

A

It generates context-aware embeddings that consider surrounding words.

120
Q

What additional capability does BERT have regarding embeddings?

A

It generates embeddings at subword levels.

121
Q

What is the term for learning embeddings using input tokens in BERT?

A

Token embedding.

122
Q

What architectural component does BERT primarily use?

A

A transformer.

123
Q

What are the two components of each encoder in a transformer?

A
  • A self-attention layer
  • A feed-forward network layer
124
Q

What does the self-attention layer in a transformer do?

A

It calculates the strength of the connection between one token and all other tokens in the input sentence.

125
Q

What NLP tasks can BERT be used for?

A
  • Question answering
  • Named entity extraction
  • Text summarization
126
Q

What is the significance of BERT achieving state-of-the-art performance?

A

It indicates its effectiveness in various NLP tasks when it was released.

127
Q

What is fine-tuning in the context of using a pre-trained BERT model?

A

A technique used to adapt the pre-trained model for specific tasks.