10/16 Class Flashcards

1
Q

OLAP

A

online analytical process
main goal: to support ad-hoc but complex queries
Places key performance indicators(measures) into context(dimensions):
measures are pre-aggregated
data retrieval is significantly faster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

OLTP

A

online transaction process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why OLAP

A

dimension modeling is a natural presentation of data for business analytics.
OLAP Technology is very fast
Most reports run within 1-3 seconds
Speed advantage substantial in highly aggregated reports such as multi-year trends
Without OLAP, the burden is on the developer to extract relevant data and build aggregations
Pre-calculated results
Produces consistent information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Roll up(drill-up)

A

summarize data by climbing up a concept hierarchy or by reducing dimensions
example:drill up from city to state
drill up by reducing the location dimension

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Drill down

A

analyze more detailed data by moving down a concept hierarchy or by adding dimension(s)
example:
drill down from city to dealer
drill down by adding the time dimension

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

slice

A

creates a slice form the cube by choosing a single value for one of the dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

dice

A

creates sub cube from the cub by choosing a two or more values for one or more of the dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

hadoop

A
big data warehouse
 volume
 velocity
 variety
hadoop - apache open source software for reliable, scalable, distributed computing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

mapreduce

A

programming model invented at google in 2012
read in input and produce a key value pair
example:
key value

map workers: certain computers are used for this
reduce worker: certain computers only do this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

exam review

A

understand the difference between bitmap and B+tree
ETL
policies for data warehouse maintenance is on maintenance slide, user driven policy vs warehouse driven policy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly