Lecture2 Flashcards

1
Q

What properties does the following attribute possess? Nominal

A

distinctness

2
Q

What properties does the following attribute possess? Ordinal

A

distinctness & order

3
Q

What properties does the following attribute possess? Interval

A

distinctness, order, & addition

4
Q

What properties does the following attribute possess? Ratio

A

distinctness, order, addition, & multiplication
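As a rough illustration of the difference between interval and ratio attributes, the sketch below uses temperature, which is an assumed example and not taken from the cards. Celsius is treated as an interval scale and Kelvin as a ratio scale; the variable names are hypothetical.

```python
# Assumed example: Celsius is an interval scale, Kelvin is a ratio scale.
def celsius_to_kelvin(c):
    return c + 273.15

a_c, b_c = 10.0, 20.0
a_k, b_k = celsius_to_kelvin(a_c), celsius_to_kelvin(b_c)

# Differences (addition/subtraction) mean the same thing on both scales.
diff_c = b_c - a_c
diff_k = b_k - a_k

# Ratios (multiplication/division) are only meaningful on the ratio scale:
# 20 C is not "twice as hot" as 10 C, even though 20 / 10 == 2.
ratio_c = b_c / a_c
ratio_k = b_k / a_k
```

The two differences agree, while the two ratios do not, which is why multiplication is listed only for ratio attributes.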

5
Q

What type of attributes do all distance-based data mining techniques expect?

A

ratio type attributes (usually floating point numbers)
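A minimal sketch of one common distance measure, Euclidean distance, assuming every attribute is a floating point number. The function name `euclidean_distance` is hypothetical, and the cards do not say which distance measure the lecture used.

```python
import math

def euclidean_distance(p, q):
    """Euclidean distance between two objects given as equal-length lists of floats."""
    if len(p) != len(q):
        raise ValueError("objects must have the same number of attributes")
    # Sum the squared per-attribute differences, then take the square root.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

d = euclidean_distance([0.0, 0.0], [3.0, 4.0])
```

Subtracting and squaring attribute values only makes sense when the values are numeric, which is one way to read the answer above.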

6
Q

What is record data?

A

Data that consists of a collection of records, each of which consists of a fixed set of attributes.

7
Q

What is a data matrix?

A

If data objects have the same fixed set of numeric attributes, then the data objects can be thought of as points in a multi-dimensional space, where each dimension represents a distinct attribute. Such data can be represented by an m by n matrix, where there are m rows, one for each object, and n columns, one for each attribute.
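The m by n layout can be sketched with plain nested lists. The numbers below are made-up illustration values, and a real project would more likely use an array library.

```python
# Each inner list is one data object (row); each position is one attribute (column).
data_matrix = [
    [5.1, 3.5, 1.4],
    [4.9, 3.0, 1.4],
    [6.2, 2.9, 4.3],
    [5.9, 3.0, 5.1],
]

m = len(data_matrix)     # number of objects (rows)
n = len(data_matrix[0])  # number of attributes (columns)

# Reading one attribute across all objects means reading one column.
first_attribute = [row[0] for row in data_matrix]
```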

8
Q

Describe how document data is represented in data mining.

A

Each document becomes a ‘term’ vector (row). Each term is a component (attribute) of the vector. The value of each component is the number of times the corresponding term occurs in the document.
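A small sketch of building term vectors, assuming whitespace tokenization and lowercasing (both simplifications chosen here, not stated on the card). The helper name `term_vectors` and the sample documents are hypothetical.

```python
from collections import Counter

def term_vectors(documents):
    """Return (vocabulary, rows), where rows[i][j] counts vocabulary[j] in documents[i]."""
    tokenized = [doc.lower().split() for doc in documents]
    # The vocabulary fixes the order of the vector components (attributes).
    vocabulary = sorted({term for tokens in tokenized for term in tokens})
    rows = []
    for tokens in tokenized:
        counts = Counter(tokens)
        rows.append([counts[term] for term in vocabulary])
    return vocabulary, rows

docs = ["team play ball", "ball score score", "team team win"]
vocabulary, rows = term_vectors(docs)
```

Most entries in each row are zero, which is why document data is often described as a sparse data matrix.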

9
Q

What type of data is the following?

A

Ordered data

(also spatio-temporal data)

10
Q

What type of data is the following?

A

Record data

11
Q

What type of data is the following?

A

Data matrix

12
Q

What type of data is the following?

A

Document data

13
Q

What type of data is the following?

A

Transaction data

14
Q

What type of data is the following?

A

Graph data

15
Q

What type of data is the following?

A

Chemical data

16
Q

What type of data is the following?

A

Ordered data

17
Q

What type of data is the following?

A

Ordered data

(genomic sequence data)

18
Q

What are the three types of data sets?

A

Graph

Ordered

Record

19
Q

What are the three types of record data sets?

A

Data matrix

Document data (sparse data matrix)

Transaction data

20
Q

What are the two types of graph data sets?

A

Data with relationships among objects (WWW)

Data with Objects that are graphs (molecular compounds)

21
Q

What are 4 types of ordered data?

A

Spatial data

Temporal data

Sequential data

Sequence data (genetic sequence data)

22
Q

What is transaction data?

A

A special type of record data where

each record (transaction) involves a set of items

e.g. grocery store

transaction = products purchased by a customer

products purchased = items
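The grocery store example might be sketched as a mapping from transaction ID to a set of items. The IDs, item names, and the helper `count_transactions_with` are all hypothetical.

```python
# Each transaction is a set of items; unlike a data matrix row, sets can differ in size.
transactions = {
    1: {"bread", "coke", "milk"},
    2: {"beer", "bread"},
    3: {"beer", "coke", "diaper", "milk"},
}

def count_transactions_with(transactions, item):
    """Count how many transactions contain the given item."""
    return sum(1 for items in transactions.values() if item in items)

bread_count = count_transactions_with(transactions, "bread")
```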

23
Q

What is graph data (data with relationships among objects)?

A

The relationships among objects convey important information.

Data objects are mapped to nodes of the graph.

Relationships among objects are captured by the links.

e.g. web pages
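One possible representation of the web page example is an adjacency list, where each page (node) maps to the pages it links to. The page names and the helper `in_degree` are hypothetical.

```python
# Nodes are web pages; each list holds the outgoing links of that page.
links = {
    "index.html": ["about.html", "blog.html"],
    "about.html": ["index.html"],
    "blog.html": ["index.html", "about.html"],
}

def in_degree(links, page):
    """Count how many pages link to the given page."""
    return sum(1 for targets in links.values() if page in targets)

index_in = in_degree(links, "index.html")
```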

24
Q

What is chemical data (data with objects that are graphs)?

A

Objects have structure: each object contains sub-objects that have relationships with one another.

e.g. molecules in chemistry

25
Q

What is ordered data?

A

Attributes have relationships that involve order in time or space.

26
Q

What is sequential (temporal) data?

A

An extension of record data, where each record has a time associated with it.

27
Q

What is sequence data?

A

A type of ordered data: a data set that consists of a sequence of individual entities.

e.g. genetic information

28
Q

What is spatial data?

A

Some objects have spatial attributes such as positions or areas.

e.g. weather data

29
Q

What is spatial autocorrelation?

A

Objects that are physically close tend to be similar.

30
Q

What is temporal autocorrelation?

A

If two measurements are close in time, then the values of the measurements are often similar.