Article - Multidimensional Database Technology Flashcards
When is a multidimensional data model better suitable than a relational data model?
- When the objective is to analyze data rather than to perform online transactions.
- Key factor in the interactive analysis of large amounts of data for decision-making purposes.
How is data categorized in multidimensional models?
- As facts with associated numerical measures
Fact: a purchase | Measure: purchase amount/price - As textual dimensions that characterize the facts
Fact: A puchase | Textual dimensions: Purchase time & location
What three application areas are there for multidimensional data models?
*Means: In which processes are they used?
- Data warehouses
Large repositories that integrate data from several sources in an enterprise for analysis - Online analytical processing (OLAP)
Finding overall trends by aggregating data search queries - Data mining applications
To discover knowledge by searching for unknown patterns and relationships in multidimensional databases
What is a pivot table?
A two-dimensional spreadsheet with associated subtotals and totals that supports viewing more complex data by nesting several dimensions on the x- or y-axis and displaying data on multiple pages.
Why are spreadsheets not adequate for managing and storing multidimensional data?
They tie data storage to tightly to the presentation - they do not seperate the structural information from the desired views of the information.
Why are relational databases and SQL databases not adequate for managing and storing multidimensional data?
- The use of aggregation functions can be difficult
- It is harder to combine all the dimensions you are interested in
What is a multidimensional database/data warehouse built up?
A collection of related cubes
Why can cubes easily manage the addition of new dimension values?
Because dimensions in a cube are first-class, built-in concepts with associated domains
How many dimensions can a cube have?
Unlimited, but current tools start having performance problems with more than 10 dimensions.
When do cubes become more sparse?
As there are more dimensions added to the cube and when the level of detail (granularity) becomes finer
How many dimensions can you view of a cube?
Generally, only two or three. But it can show up to 4 low-cardinality dimensions by nesting one dimension within another on the axis.
What is an important use of dimensions in a multidimensional database?
To use dimensions to provide as much context as possible for facts.
Why are data redundancies sometimes okay in multidimensional databases?
It is okay if it increases the data’s information value.
-> Since the data is derived from other sources, and not born into the multidimensional database, redundancy problems related to updates can be managed more readily
Where is in general redundancy in a multidimensional database?
In the dimensions, not the facts.
What is the relation of hierarchy and dimension?
Dimensions are used for selecting and aggregating data at the desired level of detail. These levels of details are part of the hierarhy (time -> year, month, day).
-> Sometimes define multiple hierarchies for a dimension (fiscal year and calender year)
How do you avoid duplicate definitions when defining multiple hierarchies for dimension?
The metadata of a cube defines the dimension hierarchy.
How can you avoid adding dimensions to a cube if you want to add extra information?
Add a property that you initially wanted as a dimension as a level under an existing dimension:
Dimension: product –> you add the level of package size instead of making package-size an dimension.
What kind of ordering is available in a multi-dimensional database?
- Typically no ordering
- Only ordering is by hierarchy
- Ordering can happen by cumalative information, such as total sales to date.