Exam 1 Flashcards

1
Q

___ is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies.

A

BI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

______ is data that cannot be stored or processed easily using traditional tools/means.

A

Big data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A(n) _____ is a major component of a BI system that holds source data.

A

Data Warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A(n)_____ is a major component of a BI system that is often browser based and often presents a portal or dashboard.

A

User Interface

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

OLAP

A

Online Analytical Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In Chapter 1our authors defined three types of business analytics. Identify the three types and the two questions typically answered by each of the three.

A
Descriptive - 
     What happened?
     Why is it happening?
Predictive -
     What will happen?
     why will it happen?
Prescriptive -
     What should i do?
     why should i do it?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

_____ statistics is about drawing conclusions about population based on sample data.

A

Inferential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

______ is used to charecterize the relationship between explanatory (input) and response (output) variables.

A

Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

_____ refers to the use of visual representations to explore, make sense of, and communicate data.

A

Data visualization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

the fundemental challenge of _____ design is to display all the required information on a single screen, clearly and without distraction, in manner that can be assimilated quickly

A

dashboard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

KPI

A

Key Preformance Indicator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In chapter 2, our authors defined four main tasks in their summary of data preprocessing tasks and methods. First, briefly describe why data preprocessing is necessary and then identify the four main tasks.

A
real world data is dirty, misaligned, overly complex, and inaccurate.
     data:
          consolidation
          cleaning
          transforming
          reduction
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

common features of a data warehouse

A

single version of truth
checked for quality
transaction system of Independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

common features of the semantic model

A

inventory of data elements
common business names
pre-defined joins
standardized calculations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

common features of a presentation layer

A

multiple tools available
choose based on purpose
all point at same data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

ETL

A

Extract Transform Load

17
Q

A(n) _____ the most commonly used and simplest style of dimensional modeling, contains a fact table surrounded by an connected to several dimension tables.

A

Star Schema

18
Q

Star Schema

A
  • most commonly used and the simplest style of dimensional modeling
  • Contains fact table surrounded by and connected to several dimension tables
19
Q

briefly describe some of the defining characteristics of Kimbell’s data warehouse development approach:

A

Dimensional model/star-schema
incremental
high development difficulty

20
Q

_____ is the nontrivial process of identifying valid, novel, potentially useful, and ultimatly understandable patterned in data stored in structured databases

A

Data mining

21
Q

CRISP DM

A

Cross Industry Standard Process for Data Mining

22
Q

identify the three types of patterns used in data mining

A

assosiation
prediction
segmentation
sequential

23
Q

identify the 6 steps in the CRISP DM process

A
  1. business understanding
  2. data understanding
  3. data preparation
  4. model building
  5. testing and evaluation
  6. deployment