lec 1(done) Flashcards

1
Q

why data mining?

A

The explosive growth of data

We need automated analysis of massive data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Major sources of abundant data:

A

1-Business: e-commerce, transactions, stocks, product descriptions…
2-Science: Remote sensing, bioinformatics, scientific experiments, …
3-Society and everyone: news, social networks, digital cameras, YouTube ..

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data Mining

A

Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

an example of data mining:

A

Group together similar documents returned by search engine according to their context (e.g. Amazon rainforest)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is Data view, knowledge view , method view and application view:

A

Data view:Kinds of data to be mined

Knowledge view (Data mining functions):
Kinds of knowledge or patterns to be discovered

Method view:
Kinds of techniques utilized

Application view:
Kinds of applications adapted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Relational database system

A

Relational database system is a collection of tables with ER(entity-relationship) for modeling and SQL for querying.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is a Data warehouse?

A

Data warehouse is a repository of information collected from multiple sources, stored under a unified schema at a single site in order to facilitate management decision making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Transactional database

A

A file where each record represents a transaction

such as a customer’s purchase: sales (transID, list of item IDs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Other Kinds of Data (Advanced datasets)

A
Data streams and sensor data
Spatial data 
Time-series data, temporal data, sequence data
Graphs, social networks data
Object-relational databases
Multimedia database
Text databases
The World-Wide Web
How well did you know this?
1
Not at all
2
3
4
5
Perfectly