Question 1

What is different about MDS compared to clustering? How are they similar?

Accepted Answer

- MDS is older - MDS has an explicit model (clustering has weak/no model) SIMILAR: both analyse distance (dissimilarities) and both can do variables OR cases

Question 2

How do you represent MDS?

Accepted Answer

- geometrical picture - each case/variable is a point in space - 1D: line - 2D: chart ^ can do more than this, but 1/2 usually preferred because easier to visualise on page

Question 3

What are the 3 types of MDS?

Accepted Answer

- CLASSICAL: simplest, 1 proxmity, matrix, interval data (often ratio) - NON-METRIC: most common, 1 distance matrix, assumes ordinal only - >1 MATRIX: can create matrix for each subject, unweighted = replicated MDS, weighted = individual diffs MDS

Question 4

What is the MDS problem?

Accepted Answer

- locate points in space to represent variables, so that distances b/w points is similar to original distances - need to find: coordinates aj and ak of points j and k on the mth of r dimensions - so that distance djk is distance b/w j and k, in r-dimensional Euc space

Question 5

What is the goal of MDS?

Accepted Answer

- dimension reduction - n variables, you have an n-dimensional space - want to "shrink" down from many data points to fewer dimensions - AIM: make data more understandable

Question 6

What are the advantages and issues with dimension reduction?

Accepted Answer

- issues: always lose something - good: very useful simplification of the data - can oversimplify! (can lose too much info)

Question 7

How do we define distance in MDS?

Accepted Answer

- we let the distance d be some function of r (where r = original distance) - djk = f (rjk )

Question 8

What is the difference between classic and modern MDS?

Accepted Answer

- classic: function is linear regression, djk = a + b.rjk - modern: rank-ordered function (i.e. want rank order of distances to be the same as original)

Question 9

How does MDS work?

Accepted Answer

- start with points located randomly in space (of a dimensionality chosen by the user) - math procedures 'move' points around to minimise stress - many iterations > until rank order of their distances 'best' matched the rank order of the data (i.e. until stress is the lowest point)

Question 10

What is stress? What situations produce higher stress?

Accepted Answer

- badness of fit. How well does MDS representation fit data - Kruskal - less than 0.15 is good fit - more variables = higher stress; higher dimensions = low stress - want to use #dimensions that are required for acceptable stress value

Question 11

What did Kruskal do? What is good about this?

Accepted Answer

- devised a method of rank-order transformations - monotone regression - distance of the proximity matrix converted into rank order - no need for assumptions of ratio scales

Question 12

What are the 2 MDS methods in SPSS?

Accepted Answer

- ALSCAL: method of steepest descent - PROXSCAL: iterative majorisation > use this

Question 13

Why use PROXSCAL?

Accepted Answer

- doesn't need a good starting point - always converges to minimum stress

Question 14

How do you know when to stop?

Accepted Answer

- stress doesn't change by larger than a preset criterion - stress value reaches a preset min value - program has reached set no. of iterations

Question 15

What are the subjective vs. statistical interpretation of MDS?

Accepted Answer

- subjective: look at diagram to see patterns and how they group together > MORE COMMON - statistical: regression of dimensional coordinates of an MDS solution against other variables

Lecture 11 - MDS Flashcards

(26 cards)