Chapter 31 Flashcards

Question 1

Q

What are supervised mining techniques

Answer

A

The inputs are given by end user/user

Question 2

Q

What are unsupervised mining techniques

Answer

A

The inputs are not given by end user/user. We do not how much classes and properties there and we are not guiding how to do data mining.

Question 3

Q

What is similarity / dissimilarity ratio

Answer

A

The match/mismatch ratio of the matrix sets the target evolutionary distance

Question 4

Q

What is time complexity of similarity matrix

Answer

A

n(square) x m

Question 5

Q

What are main types of data mining

Answer

A

1- Supervised

2- Unsupervised

Question 6

Q

What are types of supervised data mining

Answer

A

Bayesian modeling
Decision Tree
Neural network etc

Question 7

Q

What are 2 types of unsupervised data mining

Answer

A

1- One-way clustering

2- Two-way clustering

Question 8

Q

What is one-way clustering

Answer

A

When we cluster a data matrix, we use all attributes and do rows clustering. It gives global view of data matrix.

Question 9

Q

What is two-way clustering

Answer

A

We use columns and rows clustering in two-way in data matrix. It gives local view of data matrix

Question 10

Q

What is min-max “distances” in clustering

Answer

A

Records are grouped with similarity constraint. In clustering, the intra-distance should be maximum e.g. clustering of employees in company with similar salary. Young people cluster is far away with old people cluster.

Question 11

Q

How to identify association in records

Answer

A

Map the association in distance matrix. So we can quantify records with more similarity.

Question 12

Q

What is numeric and non-numeric attributes

Answer

A

Numeric attributes are with numeric values and non-numeric attributes are with non-numeric values.

Question 13

Q

Can graph be stored in matrix form

Answer

A

Yes. Matrix is a data structure that can store a graph.

Question 14

Q

What is binary matrix

Answer

A

The matrix that has values 0 and 1

Question 15

Q

What are 2 methods to find clusters in matrix

Answer

A

Graph portioning (Separate vertices which have more connectivity and less connectivity)
Click detection

Question 16

Q

What is classification

Answer

A

Classification is a data mining function that assigns items in a collection to target categories or classes.

Question 17

Q

How classification works

Answer

A

We take data set and convert it into 2 sets.
1. Training set
2. Test set
Training set is testify on test set and get 2 classes of it. So we can classify data.

Question 18

Q

Clustering vs cluster detection

Answer

A

First do clustering and then do cluster detection. (note: once we have clusters then we can know how much number of clusters exists in system)

Question 19

Q

What is K means cluster detection technique

Answer

A

K means clustering techniques use a mean point to categorize values in clusters. It is fast technique.

Question 20

Q

What is mean point in clustering

Answer

A

The point in a cluster which defines in which cluster the value falls

Question 21

Q

Does k means clustering supervised

Question 22

Q

Does k means clustering converse