2. Multivariate Data Analysis Flashcards

1
Q

types of multivariate statistical techniques

A

regression analysis
factor analysis
cluster analysis
discriminant analysis
canonical correlation analysis
structural equation modeling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

cluster analysis

A

used to group data points with similar characteristics together
= segment a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

common uses of clustering in marketing

A

customer segmentation
product recommendation
market analysis

= targeted campaigns
= more effective marketing effo rts + increased customer satisfaction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

hierarchical clustering

A

popular clustering algorithm that groups similar data points together in a hierarchical structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ward’s method

A

type of linkage criteria used to determine the similarities between clusters when merging them

  • agglomerative (bottom-up approach)
  • attempts to create more evenly sized clusters
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Squared Euclidean Distance

A

distance metric to determine the similarity between data points or clusters

  • squared in order to place greater weight on objects that are farther apart
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

factor analysis

A

used to explore the underlying structure of a set of variables
- by identifying groups of variables that are highly correlated and have a shared variance

= simplify data + identify key drivers of consumer behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

uses of factor analysis in marketing

A

used to identify consumer preferences and brand perceptions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when is factor analysis valid?

A

when it passes both:
1. Kaiser-Meyer-Olkin (KMO) Test
2. Bartlett’s Test of Sphericity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Bartlett’s Test of Sphericity

A

used to determine if data is appropriate for factor analysis

  • it tests the null hypothesis (assumes 0 correlation) - tests if the correlation matrix is the same as the identity matrix
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bartlett’s Test of Sphericity - interpretation

A

p-value at 0.00 (insignificant)
- smaller than significance level (0.05 or 0.001)
- null hypothesis can be rejected
- items correlate w/ each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Kaiser-Meyer-Olkin (KMO) Test

A

measure of sampling adequacy

  • shows to which extent SPSS can find underlying dimensions
  • indicates how much of the volatility in the variables may be attributable to underlying causes (if correlation is sufficiently high)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Kaiser-Meyer-Olkin (KMO) Test - interpretation

A

values range from 0 - 1
- pass if >0,5
- should be above 0,6 for a good fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

why did we use the Squared Euclidean distance?

A

data barrier: only 2 segments instead of 4
- two metrics were symmetrical so they cancelled themselves out = merging segments

square euclidean: turn values into +ve outcomes for them not to nulify by being symmetrical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Chat GPT results survey

A

study showed:
- very high level for WillUseUni
- generally positive attitude towards using it (except 2)

(correlation b/w finding unethical w/ higher willuseuni)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

recommendations to unis

A

skills accelerators (how to cite, how to determine value, when to use it, how to assess reliability…etc.)
= data literacy AI

transparent communication of rules

17
Q

dendrogram

A

hows the hierarchical relationship between the data points
- y-axis: distance b/w clusters