Marketing Analytics Flashcards

1
Q

Why segment the market?

A

Value may be more significant to the consumer; better consumer experience; more cost effective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is mass marketing?

A

Offer 1 thing to the whole mass

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is segment-based marketing?

A

Offer different offerings for each segment - ex. Group A vs. Group B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is one-to-one marketing?

A

Offer different offerings to each customer - ex. A -> Customer 1, B -> Customer 2, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 4 bases to segment consumer markets?

A
  1. Demographic: age, gender, income, ethnicity, etc.
  2. Geographical: world region, country, city vs. rural, etc.
  3. Behavioral: usage, loyalty, etc.
  4. Psychographic: lifestyles, beliefs, attitudes, interests, personality, values, etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the OCEAN model for segmenting markets?

A

O = openness - do they enjoy new experiences?
C = conscientiousness - do they prefer plans and order?
E = extraversion - do they like spending time with others?
A = agreeableness - do they put other people’s needs before theirs?
N = neuroticism - do they tend to worry a lot?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How would you calculate the distance between respondents? (3 steps)

A
  1. Decide on the clustering variables & importance of each of the characteristics
  2. Select a measure of (dis)similarity using the Euclidean distance
  3. Select a clustering method like hierarchical clustering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Euclidean Distance formula?

A

Euclidean distance (De) of (x,y) = √ (x2 – x1)^2 + √ (y2 – y1)^2
You need a data matrix to be able to compute the distance matrix (distance matrix is the values from Euclidean distance formula). This would be used if you wanted to calculate the distance between consumer perspectives (for example, importance of innovation vs. constant communication)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the 3 key criteria for actionable segmentation?

A
  1. Distinctive: customers within a segment are similar but differ from customers in other segments
  2. Substantial: sufficient large to create value
  3. Accessible: ability to reach customers within segments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the intercept in a linear regression stand for?

A

It is the value of the dependent variable when both independent variables are 0. Ex. it’s the value of sales when radio and tv advertising is 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the p-value stand for?

A

It is the measure of statistical significance, so if the p-value is below 0.05 then the pattern or relation we find is statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the adjusted r-squared show?

A

It shows the variation in intercept vs. independent variables. Ex. sales vs. radio and tv advertising.
This is a measure of the goodness of fit of the model therefore the higher it is, the better you can explain your marketing mix.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an elasticity?

A

It’s the % change in response variable for a 1% change in the predictor variable
Ex. the % change in sales for a 1% change in advertising spending.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you use the ratio of elasticities method? (3 steps)

A
  1. Sum elasticities
  2. Compute ratio of elasticities
  3. Multiply ratio of elasticities by total budget to spend & make recommendation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you compute elasticities from a linear regression model?

A

Advertising elasticity = advertising estimate (from linear regression) * (baseline advertising / baseline sales)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is underfitting?

A

low R-squared and low predictive accuracy

17
Q

What is overfitting?

A

high R-squared and low predictive accuracy

If a model has a high R-squared and a low predictive accuracy, the model may be tailored to the specific data but fails to generalise outside the sample

18
Q

What happens when a firm always spends the same amount on facebook campaigns and instagram campaigns in the same weeks?

A

Both variables will be perfectly correlated (ie. 1) and the model will not be able to distinguish between the impact of the two perfectly correlated variables

So, even if predictors are highly correlated (ie. higher than .80), the model can suffer from ‘multicollinearity’ which reduces the accuracy of the estimates because the observed effect might be overstated

19
Q

How do you deal with multicollinearity?

A
  1. Calculate correlations between predictor variables
  2. If the correlations are high, you can reformulate the variables where possible
    Ex. you can sum advertising spendings across channels (ie. online vs. offline)
20
Q

Omitted variable bias exists if two conditions are met:

A
  1. an omitted variable affects your response variable
    AND
  2. the omitted variable correlates with one or more predictor variables
21
Q

What do we know about carryover effects? When can they happen?

A

In reality, consumer response to advertising can be delayed, therefore not accounting for carryover effects can cause advertising elasticities to be under-valued.

22
Q

How do you measure the effect of x (ex. advertising) beyond the current time period?

A

Adstock equation

Ex. Adstock(t) = coefficient(t) + lambda *Adstock(t-1)

For example, if lambda = 0.3, then adstock from one time period ago still had a 30% effect in the current time period

23
Q

What is synergy effects?

A

The combined used of marketing mix instruments

24
Q

Why use predictive analytics in marketing?

A

To leverage historical data to identify the likelihood of future outcomes.

It also helps us predict how consumers will behave and then we can predict what/how to change the marketing strategy for them.

25
Q

What is the difference between Statistical Modeling vs. Machine Learning?

A

Statistical Modeling makes statistical inference about relationships between variables using historical data
Ex. Linear regression model to understand return on marketing investment

Machine Learning models outperform statistical models in terms of predictive accuracy. They vary in interpretability

26
Q

What are the two types of Machine Learning algorithms?

A

Supervised vs. Unsupervised

Unsupervised learning: given set of inputs (no output) and algorithm looks for patterns and structures in data
Ex. Hierarchical clustering

Supervised learning: given set of inputs and outputs and algorithm learns to generate outputs from inputs
Ex. Classical And Regression Tree (CART)

27
Q

Explain how overfitting can occur in a model?

A

If a model has a high r-squared but a low predictive accuracy, the model may be tailored to the specific data, but fails to generalize outside the sample.

Including too many predictors in the training sample can result in poor predictive performance.

28
Q

What happens when we don’t include a predictor variable that drives response?

A
  1. Lower r-squared
  2. Higher MAPE
  3. Biased estimates
29
Q

What does CART stand for and what are they used for?

A

Classification And Regression Tree (CART)

  1. Classification Tree
    - Suitable for discrete outcomes
    - Ex: Predicting customer churn (0/1)
  2. Regression Tree
    - Suitable for continuous outcomes
    - Ex: Predicting brand sales
30
Q

Can a classification tree model account for:
- Non-linear effects?
- Synergy effects?
- Carryover effects?
- Multicollinearity?
- Predictive accuracy?

A
  • Yes
  • Yes
  • Yes
  • Multicollinearity no bc mutually exclusive
  • Yes, more accurate than the linear regression model bc it takes into account non-linear effects
31
Q

What are the advantages of using a classification tree? (3)

A
  1. Easy to implement
  2. No need to specify relation in advance
    • Implicitly performs variable selection
    • Can automatically account for nonlinear relationships and interaction effects
  3. Predictive accuracy
32
Q

What is the major disadvantage of using a classification tree and what is the solution?

A

Disadvantage = Overfitting, harder to generalize data

Solution = Random forest

33
Q

What is a random forest?

A
  • Random forests combine results from multiple trees
  • They select a random set of predictors to build multiple trees, and then average the results
  • This leads to more generalizable data and higher predictive accuracy
34
Q

What are the benefits of A/B testing in marketing? (4)

A
  1. Increase website traffic
  2. Reduce bounce rates
  3. Increase conversion rates
  4. Reduce cart abandonment
35
Q

What are some limitations of A/B testing? (3)

A
  1. A/B testing can help making data-driven decisions on marketing mix decisions (ie. discount) as well as on seemingly subjective questions of design, color, layout of marketing content
  2. Power lays in ability to easily and continuously test what is working for your brand or company in the market
  3. However, A/B testing is not free of costs (ie. set up, opportunity costs, analysis)
36
Q

What is the solution for multivariate testing?

A

Where A/B testing tests different versions of a web page, multivariate testing tests multiple elements on a web page

37
Q

What is algorithmic bias in marketing?

A

Algorithmic bias in marketing happens when the algorithms used to make marketing decisions favor one group of people over another, often unfairly.

Imagine you’re at a big party, and the bouncer only lets in people wearing blue shirts. That’s a form of bias, right?

  • Can occur by incorporating potentially discriminating characteristics (ie. gender)
  • Can also occur when excluding discriminating characteristics (ie. If you can’t entirely avoid gender in your model bc it correlates with behaviour then if you used behaviour and not gender, then an omitted bias can occur
38
Q

How can you mitigate algorithmic bias?

A
  1. Bias detection: does the algo produce the same outcome for identical customers differing on characteristics like gender or race?
  2. Diversity: does programming team have different background, experiences/perspectives?
  3. Incorporate fairness into algos: can we optimize our desired outcome and fairness?