3_2: Market Analytics: Analysing and Predicting aggregated Demand and Competiton Flashcards

1
Q

What is Customer Segmentation?

A

Segmentation: slicing a pie
is the process of dividing a heterogeneous customer base into distinct groups based on shared characteristics, behaviors, or needs

–>Groups are homogenous within and **heterogenous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name the 3 difficulties of customer segmentation?

A

Finding actionable outcome: is the result meaningful for the business need=

Choice of method: there is no method that is priori preferable to others

Iterative approach: much time and rounds of data collection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

RFM analysis is used to predict …?

A

used to predict the response rate and profitability generated by marketing campaings

–>tool to identify a organization´s best customers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the meaning behind “RFM”?

A

Recency (R): numbe rof time units that have passed since last purchased

Frequency (F): average number of purchases per time unit

Monetary value (M): total doolar amount spent per time unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

For what is the RFM method used

A
  • it is used to identify the most engaged customers based on the observation that recency, frequency, and monetary metrics are often correlated with the probability of response and lifetime value
  • common to use same discrete scoring for all three metrices –>three dimensional cube
  • targeting decisions by selecting a subset of segment from the RFM cube
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

4 Steps of the RFM Analysis

A
  1. recency: sort databse in terms of most recent transactions and score your customers
  2. frequency: re-sort the dartabase on frequency
  3. Monetary value: Re-sort the database on sales dollar volume (monetary value)
  4. Selection use the three columns and each customers total score
    –>highest scores are the best customers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Advantages and Disadvantages of RFM Analysis?

A

Pros:
- Valuable for short term financial orientation
- requires no marketing strategy
- Fast, simple, and easy to use, exlain and implement

Cons:
- Liimited marketing usage as iit is only about engagement
- does not measure the factors that impact customer behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Behavioral segmentation?

A

Goal to understand customer behavior (marketing orientation)

  • uses behavioral data instead of financial data
  • –>Customer core behavior doen´t change
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Supervised learning (Classification)?

A

Group affiliation: is known
Goal: to predict outcome data from independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

From RFM Analysis to Behaviroal Segmentation

Picture

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Unsupervides learning (clustering)?

A

Group affiliation: unkown
Goal: discover grouppings from data structue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two methods of distance-based clustering?

A

Hierachircal clusteirng
K-Mean-based clustering

–>minimize the discance between the group member while max. distance to members of other groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the two methods of Model-based clustering?

General description of Model-based clustering?

A

Model-based clustering
Latent class analyis

–>Model data so that the observed variance can be represented by a small group with specific distrib. characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does Heararchical clustering work?

A

observations are group acc. to their similarity (distance matrix) clust method used complete linkage method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the steps in Hierachical Clustering?
(Distance-based clustering)

A
  1. Calculation of distance between the observation by Euclidean distance dissimilarity matrix
  2. the model uses the complete linkage method, comparing distance between all group members
  3. Output dendogram which is interpreted by height and where observations are joined
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Dendogram: Hierarchical clusteirng
(distance-based clustering)

A

At the lowest level, the groups are combined into smaller groups that are relatively similar.
–>These groups are sucessively combiine with less similar groups

Height = dissiminlarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is k-Mean-based clustering?

A

(also k-mean clustering) = find groups based of sum-squares deviation from the multivariate center of the assigned group

–>centers need to be specified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the steps in (k-)mean-based clustering?

A
  1. Choose number of clusters and maximum distance
    –>requires numeric data
  2. Find observation for cluster 1
  3. Take second obersavtion if far enough from 1 –>Cluster
  4. > Take next observation and compare with 1 and 2 (ggf. cluster 3)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What do k-means cluster plots show?
What are the limitations

A

whether it is possible to differentiate groups based on key variables

Limitation:
K-means requires arbitrary specification of clusters (use different values for k)
–>difficult to determine whether one solution is better than the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the problem with K-means cluster plots?

A

Difficult whether one sultion is better than another
–>Repeat analysis for several number of clusters to compare the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How can the outcome of the k- mean based model be tested?

Distance based

A
  1. Check mean values by ussing aggregate()
  2. Plot k-mean cluster to chech if it is possible to differentiate groups based on key variables
  3. Alternatively plot two continous variable by segment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How can the solution of the model based clusteirng be tested? ( mclust())
How to compare between models?

A
  1. Check mean values by using aggregate()
  2. PLot model based clusters
  3. use other values for G, and compare the model outputs:
    - Log liklihood –>less negative
    - BIC –>lowest value
    –>also used for model comparisoon
23
Q

How can the solution of the Laten class analysis be tested?

A
  1. Check mean values using aggregate()
  2. Plot the LCA clusters
  3. compare predicted class memberships
24
Q

How can the solution of hierarchical clustering be tested?

Distance based clustering

A
  • Zooming in and focusing on certain branches of the dendogram
  • use the ceophentic correlation coefficient (CPC) = measure the correlation between the original dissimilarty and the cophentic distances
  • CPCC close to 1 –>strong positive correlation
25
Q

What is the output of model-based clustering ?

A

Either shows the optimal number of clusters if g was not predetermined
- BIC
- Log-Liklihood

26
Q

Key facts about model-based clustering?
(mclust)

A
  • observation come from groups with different statistical distributions –>algorithm try to find best set of such underlying distribution
  • it clusters as being drawn from a mixture of normal distribution
  • Can only be used with numerical data
27
Q

What is the Laten Class Analysis (LCA)?
(Model-based Clustering)

A

differences are attributable to unobserved groups that one wishes to uncover (nclass is predetermined) –>Bayesian technique

–>Goal: estimate probabilities of membership in each class and assing individual to their most likely class

28
Q

Steps in Latent-class analysis?
(Model-based clustering)

A
  1. Variable scores are caused by the hidden groups
  2. LCA posits a latent variable that maximizes liklihood of obserrving the scorces and the variables
  3. It creates a probability of each observation belonging to each segment
  4. Segment with highest probability is the segment where most observations are placed
29
Q

Advantages of LCA?

A
  • Possible for complex data
  • Provides optimal number of clusters
  • Provided indicator for variables
  • Segment probability score
30
Q

What are diagnostics to test for statistical fit? ( Latent class analysis - model based clustering)

A
  • BIC:Bayes information criterion (lower values better)
  • Error rate (better if lower)
  • Negative log likelihood (better if less negative)
31
Q

Clustering process in R?

A
  1. Transofrm to right format
  2. Compute distance matrix
  3. Apply clustering method
  4. Analyze groups
  5. Examine solution in the model and apply
32
Q

How does Classification work?
(Supervised learning)

A

= use observations with known status to derive predictors which then can be applied to new observations

33
Q

Classification process in R

A
  1. Colllcecct data (group membership is known)
  2. Splitting data: Trainsing set with 50-80% of obserations and test set with 20-50%
  3. Build prediction model: identify predictors from training data
  4. Assess performance by applying predictors to test dataset
34
Q

How can the performance of the Naiive Bayes Classification be assed?

A
  1. Considering raw agreement rate
    mean(raw$Segment == predition) = 0.92 –>92% correct prediction
  2. compare performance against random chance using ARI
    - 1 = perfect agreement, 0 = random -1 = complete disagreement
  3. asses performance for each different class using Confusion Matrix
    - actual segment is left (rows)
    - predicted (columns
35
Q

How can the performance of the Random forrest be assed?

A
  1. compare performance against random chance using ARI (adjustedRandIndex)
    1= perfect agreement, 0 = random -1 = complete disagreement
  2. asses performance for each different class using Confusion Matrix using test data
    - actual segment is left (rows)
    - predicted (columns
36
Q

How can the outcome of the Naive Bayes Classification be tested?

A
  1. use test data to predict() values based on trained model using test data
  2. Revview the segment frequencies and compare to the inital a-priori frequencies based on the training data
37
Q

How can the outcome of the Random Forest be tested?

A
  1. use test data to predict() values based on trained model using test data
  2. plot the clusters based on test data
38
Q

What does Naive Bayes Classification do? (Supervised learning)

A

= Training data is used to learn probability of class membership as a function of each predictor variable considered independently –>using bayes rule

–>starts with observed probabilities of vairbales conditiona on segments found in the training data
—>only uses one model

39
Q

What is the Random forrest classification (compared to Naive bayes classification? (Supervised learning)

A

Instead of unsing a sinlge model, it builds and ensemlbe of models that jointly classify the data by fitting many classification trees (forrest)

–>not providing class membership

40
Q

What is the Class imbalance? and how can it be resolved in RandomForrest models?

A

using randomForest for prediction the model might generate values with 90% being in one group –>imbalance

–>Resolving:
- looking at frequency table of the training data, to see the group allocation –>pick smallest
- SSet sampsize= “value” in randonForest model

41
Q

What is meant by importance analysis in the Random Forest model?

A

the model uses many predictor variables, thus it is useful to know the importance of different classification variables

–>randomForest( importance = TRUE)

42
Q

Advantage of Random forrest model?

A
  • many classification trees(format) instead of one model
  • More accurate because more models are applied
  • Useful to estimate the importance of predictor variables
43
Q

Classification trees are used for?

A

used to predict a categorical (and usually) binary dependent varible

44
Q

What are the steps of classification trees?

A
  1. Identify most effect prediction variable for predicting the binary dependent variable
  2. Stat wit root node with all combinations and then use independent variables to split the root node to create most improvement in class separation
45
Q

Classification trees:
What is a Pure decision node?

A

Pure decision node: all data points associated with that node have same value of dependent variable

46
Q

Classification trees:
What is Impurity?

A

Impurity (unreinheit): assess the degree of impurity or heterogeneity within a subset of data in the classification tree.

impurity of a split: is the weighted average of impurities for the nodes involved in that split

47
Q

Classification trees:
What is entropy?

A

–>Entropy is a measure of impurity or disorder

  • Entropy is calculated using conditional probability
  • Always between 0 and 1
    –>**Lower entropy –>decreasing impurity!
48
Q

What is Collaborative filtering?

A

filter and predict choices based on other people behavior
–>Typically using rating matrixes

Idea: people with similar preferences tend to like similar items

49
Q

What are the two method of collaborative filtering?

A
  1. Neighborhood based method: prediict unknown rating by using the nearest neighbor approach
  2. Model-based method: use more complex, predictive models
50
Q

Collaborative filtering: baseline estimates:

A

A rating matrix exhibts string user and item biases

  • one can account for these systematic user and item effects:

The model caputures only the average user and item effect but it can help to absorb the biases and isolate the signal that represent user item interactions

51
Q

Advantages Collaboorative filtering?

A

Advantages:
- Make recommendations without any additional information about catalog items
- helps to produce non-trivial recommendations
- rating captures human tastes and judgments

52
Q

What are the disadvantes of Collaborative filtering?

A

Disadvantages:
- difficult to build reliable prediction models with trustworthy rating
- content filtering is biased towards popular items and standard choices
- cold start problem does not work for new users
- Product standardization is difficult
- Arbitrary assumptions

53
Q

Advantages of User-based filtering?

A
  • easy to program
  • more attractive when users are personally familiar
54
Q

Advantage of Item-based filtering?

A
  • preferred when customers are not familiar
  • item-based matrix of correlation is more stable over time
  • matrix needs to be updated less often