Lecture 2 Flashcards

1
Q

What properties does the following attribute possess? Nominal

A

distinctness

2
Q

What properties does the following attribute possess? Ordinal

A

distinctness & order

3
Q

What properties does the following attribute possess? Interval

A

distinctness, order, & addition

4
Q

What properties does the following attribute possess? Ratio

A

distinctness, order, addition, & multiplication
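
A minimal sketch of what the four property sets allow in practice. The example attributes (eye color, clothing size, Celsius temperature, length) are illustrative assumptions, not taken from the lecture.

```python
# Nominal (distinctness): only equality tests are meaningful.
eye_a, eye_b = "brown", "blue"
same_color = (eye_a == eye_b)

# Ordinal (distinctness, order): values can also be ranked.
sizes = ["small", "medium", "large"]  # assumed ordering, smallest first
is_larger = sizes.index("large") > sizes.index("small")

# Interval (adds addition): differences are meaningful, ratios are not.
temp_diff_celsius = 20.0 - 10.0  # "10 degrees warmer" makes sense

# Ratio (adds multiplication): ratios are meaningful as well.
length_ratio = 4.0 / 2.0  # "twice as long" makes sense
```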

5
Q

What type of attributes do all distance-based data mining techniques expect?

A

Ratio-type attributes (usually floating-point numbers)

6
Q

What is record data?

A

Data that consists of a collection of records, each of which consists of a fixed set of attributes.

7
Q

What is a data matrix?

A

If data objects have the same fixed set of numeric attributes, then the data objects can be thought of as points in a multi-dimensional space, where each dimension represents a distinct attribute. Such data can be represented by an m by n matrix, where there are m rows, one for each object, and n columns, one for each attribute.
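
A small sketch of the idea, assuming a made-up data set of three objects with two numeric attributes each. Because every row is a point in n-dimensional space, a distance between two objects is well defined.

```python
import math

# Hypothetical data matrix: m = 3 objects (rows), n = 2 attributes (columns).
data_matrix = [
    [1.0, 2.0],
    [4.0, 6.0],
    [1.0, 2.5],
]

m = len(data_matrix)      # number of objects (rows)
n = len(data_matrix[0])   # number of attributes (columns)

# Euclidean distance between the first and second object.
d01 = math.dist(data_matrix[0], data_matrix[1])
```

Here `m` is 3, `n` is 2, and `d01` is the straight-line distance between the points (1, 2) and (4, 6).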

8
Q

Describe how document data is represented in data mining.

A

Each document becomes a "term" vector (a row). Each term is a component (attribute) of the vector. The value of each component is the number of times the corresponding term occurs in the document.
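
A minimal sketch of building term vectors, assuming two made-up documents and simple whitespace tokenization (real systems usually do more preprocessing). The helper name `term_vector` is hypothetical.

```python
from collections import Counter

# Hypothetical documents.
documents = ["the cat sat on the mat", "the dog sat"]

# The vocabulary fixes the order of the vector components (attributes).
vocabulary = sorted({term for doc in documents for term in doc.split()})

def term_vector(doc, vocabulary):
    """Return the count of each vocabulary term in one document."""
    counts = Counter(doc.split())
    return [counts[term] for term in vocabulary]

# One row per document, one column per term.
term_vectors = [term_vector(doc, vocabulary) for doc in documents]
```

Most entries are zero once the vocabulary grows, which is why document data is often described as a sparse data matrix.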

9
Q

What type of data is the following?

A

Ordered data

(also spatio-temporal data)

10
Q

What type of data is the following?

A

Record data

11
Q

What type of data is the following?

A

Data matrix

12
Q

What type of data is the following?

A

Document data

13
Q

What type of data is the following?

A

Transaction data

14
Q

What type of data is the following?

A

Graph data

15
Q

What type of data is the following?

A

Chemical data

16
Q

What type of data is the following?

A

Ordered data

17
Q

What type of data is the following?

A

Ordered data

(genomic sequence data)

18
Q

What are the three types of data sets?

A

Graph

Ordered

Record

19
Q

What are the three types of record data sets?

A

Data matrix

Document data (sparse data matrix)

Transaction data

20
Q

What are the two types of graph data sets?

A

Data with relationships among objects (WWW)

Data with objects that are graphs (molecular compounds)

21
Q

What are 4 types of ordered data?

A

Spatial data

Temporal data

Sequential data

Sequence data (genetic sequence data)

22
Q

What is transaction data?

A

A special type of record data in which each record (transaction) involves a set of items.

e.g. a grocery store:

transaction = the products purchased by a customer

items = the individual products purchased
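
A small sketch of the grocery-store example, assuming made-up transaction IDs and products. Each transaction is stored as a set because only which items were bought matters here. The helper name `count_containing` is hypothetical.

```python
# Hypothetical transactions: transaction ID -> set of purchased items.
transactions = {
    1: {"bread", "milk"},
    2: {"bread", "diapers", "eggs"},
    3: {"milk", "diapers", "bread"},
}

def count_containing(item, transactions):
    """Count how many transactions include the given item."""
    return sum(1 for items in transactions.values() if item in items)

bread_count = count_containing("bread", transactions)
milk_count = count_containing("milk", transactions)
```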

23
Q

What is graph data (data with relationships among objects)?

A

The relationships among objects convey important information.

Data objects are mapped to nodes of the graph.

Relationships among objects are captured by the links.

e.g. web pages
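
A minimal sketch of the web-page example, assuming three made-up pages. Pages are the nodes and hyperlinks are the links, stored as an adjacency list. The helper name `in_degree` is hypothetical.

```python
# Hypothetical web graph: page -> pages it links to.
links = {
    "home.html": ["about.html", "news.html"],
    "about.html": ["home.html"],
    "news.html": ["home.html", "about.html"],
}

def in_degree(page, links):
    """Count how many pages link to the given page."""
    return sum(page in targets for targets in links.values())

home_in = in_degree("home.html", links)
news_in = in_degree("news.html", links)
```

The counts come from the links alone, which is the sense in which the relationships among objects carry information.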

24
Q

What is chemical data (data with objects that are graphs)?

A

Objects have structure: each object contains sub-objects that have relationships to one another.

e.g. molecules in chemistry

25
What is ordered data?
Data in which the attributes have relationships that involve order in time or space.
26
What is sequential (temporal) data?
An extension of record data, where each record has a time associated with it.
27
What is sequence data?
A type of ordered data in which the data set consists of a sequence of individual entities, e.g. genetic information.
28
What is spatial data?
Data in which some objects have spatial attributes, such as positions or areas, e.g. weather data.
29
What is spatial autocorrelation?
Objects that are physically close tend to be similar.
30
What is temporal autocorrelation?
If two measurements are close in time, then the values of the measurements are often similar.
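
One common way to quantify this is the lag-1 sample autocorrelation, sketched below. This measure is an assumption added for illustration and is not named in the lecture. The function name and both series are made up.

```python
def lag1_autocorrelation(values):
    """Sample autocorrelation between each value and the one after it."""
    n = len(values)
    mean = sum(values) / n
    # Covariance-like sum over consecutive pairs.
    numerator = sum(
        (values[i] - mean) * (values[i + 1] - mean) for i in range(n - 1)
    )
    # Total squared deviation from the mean.
    denominator = sum((v - mean) ** 2 for v in values)
    return numerator / denominator

# Hypothetical series: one changes smoothly, one jumps at every step.
smooth = [1, 2, 3, 4, 5, 6, 7, 8]
alternating = [1, 8, 1, 8, 1, 8, 1, 8]

r_smooth = lag1_autocorrelation(smooth)
r_alternating = lag1_autocorrelation(alternating)
```

A positive result means neighboring measurements tend to be similar, which is the case for `smooth`. The `alternating` series gives a negative result.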