Lecture 2 Flashcards
What properties does the following attribute possess? Nominal
distinctness
What properties does the following attribute possess? Ordinal
distinctness & order
What properties does the following attribute possess? Interval
distinctness, order, & addition
What properties does the following attribute possess? Ratio
distinctness, order, addition, & multiplication
What type of attributes do all distance-based data mining techniques expect?
ratio type attributes (usually floating point numbers)
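A minimal sketch of why distance-based techniques want numeric attributes: the distance is computed by subtracting and squaring attribute values. Euclidean distance is used here as one common choice; the function name and the sample points are assumptions for illustration only.

```python
import math

def euclidean_distance(a, b):
    """Euclidean distance between two equal-length numeric vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of attributes")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Hypothetical objects, each described by two floating-point attributes.
p = [3.0, 4.0]
q = [0.0, 0.0]
d = euclidean_distance(p, q)
print(d)
```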
What is record data?
Data that consists of a collection of records, each of which consists of a fixed set of attributes.
What is a data matrix?
If data objects have the same fixed set of numeric attributes, then the data objects can be thought of as points in a multi-dimensional space, where each dimension represents a distinct attribute. Such data can be represented by an m by n matrix, where there are m rows, one for each object, and n columns, one for each attribute.
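A minimal sketch of an m by n data matrix, using a plain list of lists. The values are made up for illustration; in practice an array library would usually hold the matrix.

```python
# Hypothetical data: 3 objects (rows), each described by 2 numeric attributes (columns).
data_matrix = [
    [1.0, 2.0],
    [3.5, 0.5],
    [2.0, 4.0],
]

m = len(data_matrix)      # number of objects (rows)
n = len(data_matrix[0])   # number of attributes (columns)

# Column 0 holds the first attribute for every object.
column_0 = [row[0] for row in data_matrix]
print(m, n, column_0)
```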
Describe how document data is represented in data mining.
Each document becomes a ‘term’ vector (row). Each term is a component (attribute) of the vector. The value of each component is the number of times the corresponding term occurs in the document.
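A minimal sketch of building term vectors by counting words. The two documents are invented, and splitting on whitespace is an assumed, simplistic tokenizer.

```python
from collections import Counter

# Hypothetical documents.
docs = [
    "data mining finds patterns in data",
    "graph data links objects",
]

# Assumed tokenizer: lowercase, split on whitespace.
tokenized = [doc.lower().split() for doc in docs]

# One vector component (attribute) per distinct term, in sorted order.
vocabulary = sorted({term for tokens in tokenized for term in tokens})

# Each document becomes a row of term counts.
term_vectors = []
for tokens in tokenized:
    counts = Counter(tokens)
    term_vectors.append([counts[term] for term in vocabulary])

print(vocabulary)
print(term_vectors)
```

Most counts are zero once the vocabulary grows, which is why document data is described as a sparse data matrix.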
What type of data is the following?

Ordered data
(also spatio-temporal data)
What type of data is the following?

Record data
What type of data is the following?

Data matrix
What type of data is the following?

Document data
What type of data is the following?

Transaction data
What type of data is the following?

Graph data
What type of data is the following?

Chemical data
What type of data is the following?

Ordered data
What type of data is the following?

Ordered data
(genomic sequence data)
What are the three types of data sets?
Graph
Ordered
Record
What are the three types of record data sets?
Data matrix
Document data (sparse data matrix)
Transaction data
What are the two types of graph data sets?
Data with relationships among objects (WWW)
Data with Objects that are graphs (molecular compounds)
What are 4 types of ordered data?
Spatial data
Temporal data
Sequential data
Sequence data (genetic sequence data)
What is transaction data?
A special type of record data where
each record (transaction) involves a set of items
e.g. grocery store
transaction = products purchased by a customer
products purchased = items
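A minimal sketch of the grocery store example, with each transaction stored as a set of items. The transaction IDs, the items, and the helper name are hypothetical.

```python
# Hypothetical transactions: transaction ID -> set of items purchased.
transactions = {
    1: {"bread", "milk"},
    2: {"bread", "diapers", "eggs"},
    3: {"milk", "diapers", "bread"},
}

def count_containing(item):
    """Number of transactions whose item set contains the given item."""
    return sum(1 for items in transactions.values() if item in items)

print(count_containing("bread"), count_containing("milk"))
```

Sets fit here because a transaction records which items were bought, not their order.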
What is graph data (data with relationships among objects)?
The relationships among objects convey important information.
Data objects are mapped to nodes of the graph.
Relationships among objects are captured by the links.
e.g. web pages
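A minimal sketch of the web page example as an adjacency list: pages are nodes and hyperlinks are directed links. The page names are hypothetical.

```python
# Hypothetical hyperlinks: (source page, target page).
links = [("home", "about"), ("home", "blog"), ("blog", "about")]

# Map each node to the list of nodes it links to.
graph = {}
for source, target in links:
    graph.setdefault(source, []).append(target)
    graph.setdefault(target, [])  # make sure pages with no outgoing links appear as nodes

print(graph)
```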
What is chemical data (data with objects that are graphs)?
Objects have structure: each object contains sub-objects that have relationships with one another.
e.g. molecules in chemistry
What is ordered data?
Attributes have relationships that involve order in time or space.
What is sequential (temporal) data?
An extension of record data, where each record has a time associated with it.
What is sequence data?
A type of ordered data: a data set that consists of a sequence of individual entities.
e.g. genetic information
What is spatial data?
Some objects have spatial attributes such as positions or areas.
e.g. weather data
What is spatial autocorrelation?
Objects that are physically close tend to be similar.
What is temporal autocorrelation?
If two measurements are close in time, then the values of the measurements are often similar.
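A minimal sketch of one common way to quantify this, the lag-1 sample autocorrelation: it compares each measurement with the one that follows it. Choosing this statistic is an assumption for illustration, and the series below are made up.

```python
def lag1_autocorrelation(series):
    """Lag-1 sample autocorrelation of a numeric series."""
    n = len(series)
    mean = sum(series) / n
    deviations = [x - mean for x in series]
    denominator = sum(d * d for d in deviations)
    if denominator == 0:
        raise ValueError("series is constant; autocorrelation is undefined")
    # Products of each deviation with the deviation of the next measurement.
    numerator = sum(deviations[t] * deviations[t + 1] for t in range(n - 1))
    return numerator / denominator

# Hypothetical series: neighbors are similar in the first, dissimilar in the second.
smooth = [1, 2, 3, 4, 5, 6]
alternating = [1, -1, 1, -1]
r_smooth = lag1_autocorrelation(smooth)
r_alternating = lag1_autocorrelation(alternating)
print(r_smooth, r_alternating)
```

A positive value means measurements close in time tend to be similar; a negative value means they tend to differ.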