SQL theory Flashcards

1
Q

Database the same as excel?

A

DB not comparable to Excel, because databse tables are related to eachother.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

database?

A

A collection of related tables, deisgned, maintained and utilized by multiple users with software to update and query the data.

consists of: data, hardware, software and users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

DBMS

A

Software that controls the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Relational database allows?

A
  1. Data to be grouped into tables

2. sets relationships between tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is superkey

A

key that uniquely identifies each row

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a candidate key

A

superkey without unnecessary attributes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why choose a seperate data warehouse?

A
  1. historic data (normal db do not store this)
  2. data consolidation (dq requires aggregation and summarization from multiple heterogenous sources
  3. data quality: different sources use inconsistent data represenations, codes and formats (that why ETL)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A data ware house is: (4)

A
  1. subject oriented: not focused on daily operations but on analysis for decision makers
  2. integrated: constructed by integrating multiple, heterogeneous data sources
  3. time variant: time horizon for DW is significantly larger than a regular db
  4. non-volatile: operational updates do not occur in dw
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

DW development methods:

A
  1. data mart approach (bottom up: simple, smaller dw’s)
    (independent data marts, data mart bus architecture, canned dw, consistency achieved through conformed dimensions)
  2. enterprise dw (top-down: centralized, more data)
    (independent dw, hub and spoke data warehouse, federated dw)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

ETl loading methods

A
  1. integral load

2. incremental/delta load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ETL generations

A
  1. do it yourself
  2. code generators
  3. ETL engines
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

OLAP density

A

in dw typically 25%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

dashboard designed on three levels

A
  1. perception of elements in environment (what is happening around the company)
  2. comprehension of current situation (benchmarking)
  3. projection of future status
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is data?

A

Raw facts; internal or external; structured or unstructured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is information

A

Organized data that has meaning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is knowledge?

A

processed data or information that is applicable to a business decision problem

17
Q

ETL includes:

A

extraction data from production database
Transformation; transforming the extracted data from its source form to its target form
(transfer, cleaning, integration, aggregation)
Load: putting the data in the data warehouse