w12d2 - Olap Data mining Flashcards

1
Q

Main characteristics of an Olap application

A
  • Fast (Deliver to the user in about 5 seconds)
  • Analysis (copes with business logic, and relevant statistical analysis)
  • shared (system implements all security requirements for confidentiality)
  • Multidimentional (system must provide multidimentional views of the data, including full support for hierarchies an multiple hierarchies)
  • information (all needed data and derived information is available)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

OLAP is used to ________
of _______
and of _______

A

produce reports
what is (especially trends)
what might be (by extrapolating and forecasting)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

normalization and other database design techniques focus on _______
Whereas reports usually _______

A

designing single records

combine multiple records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A traditional report takes ______
that describes data in _______
The third dimension can be described by

A

a grid structure
up to 3 dimensions
The contents of cells at different {x,y} coordinates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
Dimensions provide \_\_\_\_\_\_\_\_\_
come from \_\_\_\_\_\_\_\_\_\_\_\_\_
may or may not have \_\_\_\_\_\_\_\_\_
may have \_\_\_\_\_\_\_\_\_
may share \_\_\_\_\_\_\_\_
A
Different ways of looking at a set of data
Different attributes of data
Particular ordering
Their own subdimensions
Data with other dimensions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Temporal dimension includes
Events occur at
Activities occur over

A

activities and events

  • a specific time only
  • a range of times (Starting and ending event)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Temporal dimension

A

Ordered linearly in terms of time from the start of an organization or activity up to the present and beyond to the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Temporal dimension includes a wide range of

A

granularities (often different records)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Customer dimension often composed of

A

a number of discrete customers without a required ordering principle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Customer dimension can be ordered based on

A

name, number, other dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Use of coding within customer-numbers may also provide additional dimensions such as

A

customer type dimension

vendor dimension

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

location dimension is actually _________
Location may be limited to ________
Location is usually ordered relative to _____

A

multidimentional
a grouping of addresses based on some characteristic
other locations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

financial dimension generally contains ______
And is ordered _______

Measures for financial dimension

A

numeric information
linearly

exact $ values, being within some range, some # of sales slips

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Sub-dimensions

A

involve choices between different types of measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Different attributes are from different sub dimensions if

A

Neither attribute is an instance of the other attribute

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Granularities

A

Involve choices between different units for a given type of measure within a dimension

17
Q

Different attributes are of different granularities if ________
While normalization avoids ____________
It may not deal ______________

A
  • one attribute can be converted with or without loss of some exactness into the other attribute
  • multiple granularities of numeric values
  • other types of granularites
18
Q

Olaps start with data in a ________

And progressivley reduces it by _______

A

data warehouse
slicing(removing whole columns)
Dicing(removing records based on attribute values or ranges of attribute values)
Consolidating (combining data by using finer granularities)
Drilling-down (examining more detailed granularities)
Processing (adding or replacing attributes based on the results of performing various computations on the data)

19
Q

Olap should allow users to easily work with the data by

A

Extracting copies of the data warehouse
Identifying dimensions in the data
selecting and working with dimension, subdimensions, and granularities,
Slicing, dicing, consolidating, drilling-down and processing data
Save the results form multiple stages for further processing
producing reports

20
Q
Data mining goes beyond \_\_\_\_\_\_\_\_
To \_\_\_\_\_\_\_\_
It operates on \_\_\_\_\_\_\_\_\_\_
It looks for \_\_\_\_\_\_\_\_\_\_
And suggests
A

Answering user questions
Identifying questions that users should consider
Data warehouses with good metadata
Trends and correlations across dimensions,
Models and visualizations that can be explored by the user

21
Q

Data mining techniques

A

Predictive modelling
Database segmentation
Link Analysis
Deviation Detection

22
Q

Predictive modelling

A

Classification - identifying groups based on common properties
Value Prediction - Extrapolating trends based on historical data

23
Q

Database segmentation

A

clustering based on multiple properties

24
Q

Link analysis

A

establishing associations between linked reports

25
Q

Deviation detection

A

Identifying records that deviate from the norm

26
Q

To be successful with olap and data mining

A

We need more than just tools

We need to know some things about the data

27
Q

Actual database contains

A

user data
data records in tables
metadata linked to individual data records in other tables
Data types and schemas including data types, constraints, relationships, permissions

28
Q

User programs typically

A

only interact with user data

leave data schema interations to the dba and dbms

29
Q

olap and data mining programs need to interact with

A

the actual data warehouse that contains user data
- data records in tables
- metadata linked to individual data records in other tables
Meta information that describes the attributes of the database in terms of data definitions, dimensions, granularities, transformations and other allowable types of processing