Chapter 10 Flashcards

1
Q

Cluster, potential class

A

a collection of data objects
similar to one another within the same group
dissimilar to the objects in other groups

2
Q

cluster analysis, clustering, data segmentation…

A

finding similarities between data according to the characteristics found in the data, and grouping similar data objects into clusters.

3
Q

unsupervised learning

A

no predefined classes, i.e. learning by observations vs. learning by examples (supervised)
a stand-alone tool to get insight into data
a preprocessing step for other algorithms

4
Q

Examples of clustering

A

biology: animal kingdom (kingdom, class, order)
economic science: market research

5
Q

summarization

A

preprocessing for regression, PCA, classification, and association analysis

6
Q

compression

A

image processing: vector quantization

7
Q

finding k-nearest neighbors

A

localizing search to one or a small number of clusters

8
Q

outlier detection

A

outliers are often viewed as those far away from any cluster

9
Q

KNN

A

simplest model: k=1 takes the closest value, k=2 the two closest entries
distance calculation: Euclidean distance, Manhattan distance
challenge: high-dimensional data (3D, 4D, …)
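As a minimal sketch of this card's idea (the function and variable names are mine, not from the deck), a k-NN classifier with both distance measures might look like:

```python
from math import sqrt

def euclidean(a, b):
    # straight-line distance between two points
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    # sum of absolute coordinate differences ("city block" distance)
    return sum(abs(x - y) for x, y in zip(a, b))

def knn_classify(query, data, labels, k=1, dist=euclidean):
    # rank training points by distance to the query, keep the k closest,
    # and return the majority label among them
    ranked = sorted(range(len(data)), key=lambda i: dist(query, data[i]))
    top = [labels[i] for i in ranked[:k]]
    return max(set(top), key=top.count)
```

With k=1 this simply returns the label of the single closest point; in high-dimensional data distances tend to concentrate and the ranking becomes less informative, which is the challenge the card alludes to.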

10
Q

scalability

A

clustering all the data instead of only samples, which can lead to biased results

11
Q

ability to deal with different types of attributes

A

numerical, binary, categorical, ordinal, linked, and mixture of these

12
Q

constraint based clustering

A

user may give inputs on constraints

use domain knowledge to determine input parameters

13
Q

partitioning criteria

A

single level vs hierarchical partitioning

14
Q

separation of clusters

A

exclusive (one customer belongs to one region) vs. non-exclusive (one document may belong to more than one class)

15
Q

similarity measure

A

distance based vs. connectivity based

16
Q

clustering space

A

full space vs. subspaces

17
Q

good partitioning

A

objects in the same cluster are close or related to each other, whereas objects in different clusters are far apart or very different.

18
Q

typical methods

A

k-means, k-medoids; work well for finding spherical-shaped clusters in small to medium size databases

19
Q

hierarchical approach

A

create a hierarchical decomposition of data objects

20
Q

agglomerative bottom up approach

A

starts with each object forming a separate group; successively merges groups until all merge into one cluster or a termination condition holds

21
Q

divisive top down approach

A

starts with all objects in the same cluster; successively splits into smaller clusters until each object is in its own cluster or a termination condition holds

22
Q

density based approach

A

distance based clustering methods can find spherical-shaped clusters but encounter difficulty in discovering clusters of arbitrary shapes
based on the notion of density
continue to grow a cluster as long as the density in the neighborhood exceeds some threshold

23
Q

centroid

A

the center of a cluster

24
Q

k-means

A

each cluster is represented by the center of the cluster; the centroid of a cluster is the mean value of the points within the cluster
iteratively improves the within-cluster variation

25
Q

iterative relocation

A

the process of iteratively reassigning objects to clusters to improve the partitioning

26
Q

k-means algorithm, four steps

A

1. partition objects into k nonempty subsets
2. compute seed points as the centroids of the clusters of the current partitioning
3. assign each object to the cluster with the nearest seed point
4. iterate (go back to step 2) to improve the within-cluster variation

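The four steps above can be sketched in Python (a minimal version assuming random initialization and squared Euclidean distance; all names are mine, not from the deck):

```python
import random

def kmeans(points, k, iters=100, seed=0):
    # step 1: pick k initial seed points at random (one common, simple choice)
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        # step 3: assign each object to the cluster with the nearest seed point
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k),
                    key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            clusters[j].append(p)
        # step 2 (next round): recompute each centroid as the mean of its cluster
        new = [tuple(sum(xs) / len(xs) for xs in zip(*cl)) if cl else centroids[i]
               for i, cl in enumerate(clusters)]
        if new == centroids:   # step 4: stop when the partitioning no longer changes
            break
        centroids = new
    return centroids, clusters
```

Each pass reassigns objects and recomputes the means, so the within-cluster variation can only decrease or stay the same, which is why the iteration terminates.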
27
Q

k-means strength

A

efficient

28
Q

k-means weakness

A

applicable only to objects in a continuous n-dimensional space
need to specify k in advance
sensitive to noisy data and outliers

29
Q

variations of k-means differ in

A

selection of the initial k means
dissimilarity calculations
strategies to calculate cluster means

30
Q

k-modes

A

replacing means of clusters with modes
using a frequency-based method to update modes of clusters

31
Q

k-medoids

A

instead of taking the mean value of the objects in a cluster as a reference point, medoids can be used: the medoid is the most centrally located object in a cluster

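A minimal sketch of the k-medoids idea (naive initialization, `math.dist` for Euclidean distance; names are mine, not from the deck):

```python
from math import dist

def k_medoids(points, k, iters=20):
    # the reference point of each cluster is an actual object (the medoid):
    # the most centrally located object in the cluster
    medoids = list(points[:k])   # naive initialization: the first k objects
    for _ in range(iters):
        # assign every object to its nearest medoid
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda c: dist(p, medoids[c]))
            clusters[j].append(p)
        # make each cluster's new medoid the object with the smallest
        # total distance to the other objects in that cluster
        new = [min(cl, key=lambda m: sum(dist(m, q) for q in cl)) for cl in clusters]
        if new == medoids:
            break
        medoids = new
    return medoids, clusters
```

Because the medoid is a real data object rather than a mean, a single extreme outlier shifts it far less than it would shift a centroid.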
32
Q

hierarchical methods

A

grouping data objects into a hierarchy or tree of clusters
the hierarchy is useful for data summarization and visualization

33
Q

agglomerative

A

organizes objects into a hierarchy using a bottom-up strategy: start with individual objects as clusters, iteratively merge them to form larger and larger clusters; the single remaining cluster becomes the hierarchy root
the merging step: find the two clusters that are closest and combine them to form one cluster

34
Q

divisive

A

employs a top-down strategy: let all the given objects form one cluster, iteratively split it into smaller subclusters, and recursively partition those clusters into smaller ones until each cluster at the lowest level contains only one object

35
Q

multiphase clustering

A

integrate hierarchical clustering with other clustering methods

36
Q

AGNES (agglomerative nesting)

A

introduced in Kaufmann and Rousseeuw
implemented in statistical packages
merge nodes that have the least dissimilarity; eventually all nodes belong to the same cluster

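The agglomerative merging step can be sketched like this (a single-link variant, where cluster dissimilarity is the smallest member-to-member distance; names and the stopping rule "stop at target_k clusters" are mine, not from the deck):

```python
from math import dist

def agnes(points, target_k):
    # start with each object forming its own cluster
    clusters = [[p] for p in points]
    while len(clusters) > target_k:
        # find the pair of clusters with the least dissimilarity
        # (single link: smallest distance between any two members) ...
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(dist(a, b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        # ... and merge that pair into a single cluster
        _, i, j = best
        clusters[i].extend(clusters.pop(j))
    return clusters
```

Running the loop all the way down to one cluster reproduces the full AGNES hierarchy: every merge is one level of the tree.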
37
Q

DIANA (divisive analysis)

A

implemented in statistical analysis packages
inverse order of AGNES: eventually each node forms a cluster on its own

38
Q

density based clustering methods

A

partitioning and hierarchical methods are designed to find spherical-shaped clusters

39
Q

main features

A

discover clusters of arbitrary shape
handle noise
one scan
need density parameters as termination condition

40
Q

DBSCAN: density based spatial clustering of applications with noise

A

the density of an object o can be measured by the number of objects close to o
it finds core objects, i.e. objects that have dense neighborhoods, and connects core objects and their neighborhoods to form dense regions as clusters

41
Q

density reachable

A

a point p is density reachable from a point q if there is a chain of points from q to p, each directly density reachable from the previous one

42
Q

density connected

A

a point p is density connected to a point q if there is a point o such that both p and q are density reachable from o

43
Q

DBSCAN algorithm

A

arbitrarily select a point p
retrieve all points density reachable from p
if p is a core point, a cluster is formed
if p is a border point, no points are density reachable from p, and DBSCAN visits the next point of the database
continue until all points have been processed

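The steps above can be sketched in Python (a minimal version; the label encoding and all names are mine, not from the deck):

```python
from math import dist

def dbscan(points, eps, min_pts):
    # labels: None = unvisited, -1 = noise, 0, 1, 2, ... = cluster id
    labels = {i: None for i in range(len(points))}

    def neighbors(i):
        # the eps-neighborhood of point i (includes i itself)
        return [j for j in range(len(points)) if dist(points[i], points[j]) <= eps]

    cid = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        nbrs = neighbors(i)
        if len(nbrs) < min_pts:
            labels[i] = -1           # not a core point: tentatively noise
            continue
        cid += 1                      # i is a core point: a new cluster is formed
        labels[i] = cid
        queue = [j for j in nbrs if j != i]
        while queue:                  # grow the cluster through density-reachable points
            j = queue.pop()
            if labels[j] == -1:       # border point previously marked as noise
                labels[j] = cid
            if labels[j] is not None:
                continue
            labels[j] = cid
            jn = neighbors(j)
            if len(jn) >= min_pts:    # j is also a core point: expand its neighborhood
                queue.extend(jn)
    return labels
```

Points that end up labeled -1 are exactly those far from any dense region, which matches the earlier card on outlier detection.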
44
Q

high intra-class similarity

A

cohesive within clusters

45
Q

low inter-class similarity

A

distinctive between clusters

46
Q

the quality of a clustering method depends on

A

the similarity measure used by the method
its implementation
its ability to discover some or all of the hidden patterns

47
Q

dissimilarity/similarity metric

A

similarity is expressed in terms of a distance function, typically a metric

48
Q

quality of clustering

A

there is usually a separate quality function that measures the goodness of a cluster
it is hard to define "similar enough" or "good enough"; it is highly subjective

49
Q

extrinsic

A

supervised, i.e. the ground truth is available
compare a clustering against the ground truth using a certain clustering quality measure

50
Q

intrinsic

A

unsupervised, i.e. the ground truth is unavailable
evaluate the goodness of a clustering by considering how well the clusters are separated and how compact the clusters are