Quiz #2 Flashcards
Exam Prep
Which of these is a method employed by the k-means algorithm to mitigate the effects of the random initialization trap? A. Random initialization escape B. K-means++ C. Centroid placement D. K-medoids
B. K-means++
In Association rules, a collection of one or more items is known as \_\_\_\_\_\_\_\_\_\_\_\_\_\_. A. a set of items B. a ruleset C. a set of rules D. an itemset
D. an itemset
K-Means clustering is useful in creating non-spherical clusters.
True
False
False
The increased likelihood that a rule occurs in a dataset relative to its typical rate of occurrence is known as \_\_\_\_\_\_\_\_\_\_. A. Lift B. Count C. Support D. Confidence
A. Lift
As we discussed in class, the elbow method makes use of the Within Cluster Sum of Squares (WCSS) metric to suggest the appropriate value for “k”. If we keep increasing the value for “k”, what will happen to the value for WCSS?
A. The value for WCSS will tend towards 0.
B. The value for WCSS will tend towards 1.
C. The value for WCSS will eventually become negative.
D. The value for WCSS will grow infinitely.
A. The value for WCSS will tend towards 0.
TID Items Bought T1 bread, milk, beer T2 bread, diaper, beer, eggs T3 milk, diaper, beer, coke T4 bread, milk, diaper, beer T5 bread, milk, diaper, coke
What is the support of the itemset {beer, coke} in the dataset above?
A. 4
B. 0.4
C. 0.2
D. 1
C. 0.2
The Amelia package in R is useful for dealing with \_\_\_\_\_\_\_\_\_\_\_ data. A. imbalanced B. skewed C. missing D. aggregate
C. missing
Association rules imply causality in that they explain why item B is bought whenever item A is bought.
True
False
False
Good clustering will produce clusters with \_\_\_\_\_ inter-class similarity and \_\_\_\_\_\_ intra-class similarity. A. low, low B. low, high C. high, low D. high, high
B. low, high
The anti-monotone property of support states that the support of an itemset is \_\_\_\_\_\_\_\_\_ than that of its subsets. A. always more B. always less C. sometimes more D. sometimes less
B. always less
D. sometimes less
One of the strengths of association rules is that they are easy to understand.
True
False
True
K-Means clustering only works with numeric data.
True
False
True
Which of these is NOT a method used in choosing the appropriate value for "k"? A. Elbow Method B. A priori knowledge C. Gap statistic D. Ankle Method
D. Ankle Method
Which of these is a distance measure employed by k-means clustering? A. Euclidean distance B. Centroid distance C. Manhattan distance D. Cluster distance
A. Euclidean distance
A clustering method in which every object belongs to every cluster with a membership weight that goes between 0 (if it absolutely doesn’t belong to the cluster) and 1(if it absolutely belongs to the cluster) is known as \_\_\_\_\_\_\_ clustering. A. overlapping B. partitional C. hierarchical D. fuzzy
D. fuzzy