Clustering Flashcards

1
Q

What is cluster analysis?

A

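A statistical technique for data that exhibit natural groupings. It assigns observations to clusters so that members of the same cluster are more similar to one another than to members of other clusters.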

2
Q

Can you give some examples of measures of similarity?

A

Correlation coefficients, distance measures, and association coefficients

3
Q

What is one way of measuring distance?

A

Euclidean distance: the square root of the sum of squared differences between two observations across all variables
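
As a minimal sketch of that definition, the function below computes the Euclidean distance between two observations. The name `euclidean_distance` and the two sample customers are illustrative assumptions, not part of the original card.

```python
import math


def euclidean_distance(a, b):
    """Square root of the sum of squared differences across all variables."""
    if len(a) != len(b):
        raise ValueError("both observations need the same number of variables")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Two hypothetical customers, each described by two variables.
customer_a = (1.0, 2.0)
customer_b = (4.0, 6.0)
print(euclidean_distance(customer_a, customer_b))  # 5.0, i.e. sqrt(3**2 + 4**2)
```

Because the differences are squared and summed, variables on larger scales dominate the result, so variables are often standardized before distances are computed.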

4
Q

What are the basic steps involved in cluster analysis?

A

  1. Formulate the problem and select the variables to use as the basis for clustering
  2. Compute the distances between customers
  3. Apply the clustering procedure to the distance measures
  4. Decide on the number of clusters
  5. Map and interpret the clusters, then draw conclusions
5
Q

What is K-means clustering?

A

It is a popular clustering algorithm because it is simple and fast. The user must specify the number of clusters before running the algorithm

6
Q

Can you go through the steps of k-means clustering?

A

1. Choose the number of clusters k
2. Generate k random points as cluster centroids
3. Assign each point to the nearest cluster centroid
4. Recompute each cluster centroid (the average of all the points in the cluster)
5. Repeat the two previous steps until some convergence criterion is met. Usually the convergence criterion is that the assignment of customers to clusters has not changed over multiple iterations
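
The steps above can be sketched in a few lines of Python. This is an illustrative sketch, not a reference implementation: the names `k_means`, `max_iter`, and `seed` are assumptions, the initial centroids are drawn from the data points themselves (one common way to generate "random points"), and the loop stops as soon as a single pass leaves the assignments unchanged.

```python
import math
import random


def k_means(points, k, max_iter=100, seed=0):
    """Cluster a list of equal-length numeric tuples into k groups."""
    rng = random.Random(seed)
    # Step 2: use k distinct data points as the initial centroids (assumption).
    centroids = rng.sample(points, k)
    assignment = None
    for _ in range(max_iter):
        # Step 3: assign each point to the index of its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Step 5: stop once the assignments no longer change.
        if new_assignment == assignment:
            break
        assignment = new_assignment
        # Step 4: move each centroid to the mean of its assigned points.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return centroids, assignment


# Hypothetical customers measured on two variables, forming two loose groups.
data = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, labels = k_means(data, k=2)
print(labels)
```

Which group receives label 0 or 1 depends on the random initialization, so only the grouping itself is meaningful, not the label values.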

7
Q

What is the main issue with K-means clustering?

A

It does not provide an estimate of the number of clusters to use

8
Q

What is the elbow criterion?

A

It is a way of determining the number of clusters to use. It states that you should choose the number of clusters at which adding another cluster no longer adds much information, for example when the reduction in within-cluster variation levels off
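
One way to make the criterion concrete is to track the within-cluster sum of squares (WCSS) for each candidate number of clusters and stop where the improvement becomes small. The sketch below is a heuristic under stated assumptions: the helper names, the 10% threshold `min_relative_gain`, and the WCSS values are all made up for illustration, and in practice the elbow is often judged by eye from a plot.

```python
def within_cluster_ss(points, centroids, assignment):
    """Sum of squared distances from each point to its own cluster centroid."""
    return sum(
        sum((x - c) ** 2 for x, c in zip(p, centroids[a]))
        for p, a in zip(points, assignment)
    )


def elbow_k(wcss_by_k, min_relative_gain=0.10):
    """Smallest k after which one more cluster cuts WCSS by less than the threshold."""
    ks = sorted(wcss_by_k)
    for k, next_k in zip(ks, ks[1:]):
        current = wcss_by_k[k]
        gain = current - wcss_by_k[next_k]
        if current == 0 or gain / current < min_relative_gain:
            return k
    return ks[-1]


# Made-up WCSS values for k = 1..5, for illustration only.
wcss = {1: 1000.0, 2: 400.0, 3: 150.0, 4: 140.0, 5: 135.0}
print(elbow_k(wcss))  # 3: going from 3 to 4 clusters reduces WCSS by under 10%
```

The `wcss_by_k` dictionary would normally be filled by running the clustering once per candidate k and passing each result to `within_cluster_ss`.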

9
Q

What criteria are frequently used to evaluate the effectiveness of a segmentation scheme?

A

Identifiability: the extent to which managers can recognize segments in the marketplace
Substantiality: satisfied if segments represent a large enough portion of the market to ensure profitable customization of the marketing program
Accessibility: the extent to which managers can reach the identified segments through marketing campaigns
Actionability: the needs of the target segment are consistent with the goals and core competencies of the firm

10
Q

What is it called when you describe the clusters?

A

Profiling
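
A common way to profile clusters is to summarize each one by the average of its variables. The sketch below assumes each customer is a dictionary of numeric variables and that cluster labels are already available; the function name `profile_clusters` and the sample data are hypothetical.

```python
from collections import defaultdict


def profile_clusters(rows, assignment):
    """Return {cluster: {variable: mean}} for rows given as dicts of numbers."""
    grouped = defaultdict(list)
    for row, cluster in zip(rows, assignment):
        grouped[cluster].append(row)
    return {
        cluster: {
            var: sum(r[var] for r in members) / len(members)
            for var in members[0]
        }
        for cluster, members in grouped.items()
    }


# Hypothetical customers and cluster labels, for illustration only.
customers = [
    {"age": 25, "spend": 100.0},
    {"age": 35, "spend": 300.0},
    {"age": 60, "spend": 50.0},
]
labels = [0, 0, 1]
print(profile_clusters(customers, labels))
```

Comparing these per-cluster averages, along with variables that were not used for clustering, is what turns numeric cluster labels into segment descriptions.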

11
Q

How would you categorize cluster analysis under source, methodology and objective?

A

Source: primary
Methodology: quantitative
Objectives: descriptive
