Exam 2 - Chapter 4 Flashcards

1
Q

Guest speaker

A

2 questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

THE ROLE OF THE ANALYST

- Transform data into ______

A

Transform data into Information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

THE ROLE OF THE ANALYST

- Master the ______ and _____ needed to provide insights from data

A

Master the technologies and tools needed to provide insights from data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

THE ROLE OF THE ANALYST

- _____ a deep understanding of the ________ _______ and _______

A

Maintain a deep understanding of the business environment and requirements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

THE ROLE OF THE ANALYST

- _____ ______ with data engineers and provide them with _______ and ____ ___

A

Maintain relationships with data engineers and provide them with requirements and data needs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

THE ROLE OF THE ANALYST

- Be able to select the _______ _______ method based on ______ _____ and ______ ______

A

Be able to select the appropriate analytical method based on business objectives and data structures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

pyramid of the role of the analyst

A

top - the business - receiver of information and knowledge
middle - the analysts - providers of methodology
bottom - the data warehouse - provider of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Requirements for analysts can be summed up into 3 major competencies:

A
  1. Business Competencies
  2. Method Competencies
  3. Data Competencies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Business Competencies

A

Business competencies

 Understand the business requirements and desired insights

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Method Competencies

A

Method Competencies

Understand and be able to apply the correct analytical tools and procedures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data Competencies

A

Data Competencies

Understand the data you have, the data you need, and how to bridge the gap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

SELECTING THE ANALYTICAL METHOD

Question 1-3

A

Question 1: Determine with the process owner whether the quantitative analytical competencies or the data manager and report developer competencies are required.
Question 2: Determine whether hypothesis‐driven analytics, or data‐driven analytics can be expected to render the best decision support.
Question 3: Determine whether the data‐driven method has the objective of examining the correlation between one given dependent variable and a large number of other variables, or whether the objective is to identify different kinds of structures in data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

EXAMPLES OF DATA – ORIENTED COMPETENCIES

A
Ad Hoc Reports
Manual Reports
Automated Reports
      On Demand
      Event Driven
Self-service Reporting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

HYPOTHESIS TESTING (STATISTICAL METHODS) – EXAMPLE QUESTIONS

A

Comparing data to a benchmark
Comparing data to each other
Are there specific factors about the line that impact the average fill level of the bottles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

DATA MINING STEPS WITH TARGET VARIABLES

A

Step 1: Create the models
Step 2: Evaluate the models
Step 3: Use the models on new data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Classifies objects

A

Classifies objects into a set of pre-specified object classes (or categories) based on the values of relevant object attributes (features) and objects’ class labels

17
Q

BENEFITS OF CLASSIFICATION

A
Identifying the class by a single or a small number of data attributes (e.g., gender, age) is manageable by human decision makers, but not when the number of attributes or the number of instances is large.
Estimating/predicting the class or category of action recipient supports time and cost-effective decision making
18
Q

MOTIVATING BUSINESS QUESTIONS, COSTS AND BENEFITS

- How do we identify mobile phone service CUSTOMERS who are likely to churn (switch to another courier)?

A

MOTIVATING BUSINESS QUESTIONS, COSTS AND BENEFITS
Churn or not: classes of customers
Which customers: identified by customers’ attribute-value information – e.g., age, income, gender, services subscribed, service utilization, etc.

19
Q

MOTIVATING BUSINESS QUESTIONS, COSTS AND BENEFITS

- so what?

A

Potential actions – increase or decrease customer service for customers likely to churn
Costs – increased service cost or loss of loyal customers
Benefits - reduced churn rate or service cost

20
Q

DATA MINING WITH NO TARGET VARIABLES

A
Data reduction models
    Principal Component Analysis (PCA)
Clustering models
    K-Means
Market basket models
    Association Rule Mining
21
Q

An object (e.g., a customer) has a list of variables (e.g., attributes of a customer such as age, spending, gender etc.)

A

An object (e.g., a customer) has a list of variables (e.g., attributes of a customer such as age, spending, gender etc.)

22
Q

When measuring similarity between objects we measure similarity between _____ of objects.

A

When measuring similarity between objects we measure similarity between variables of objects.

23
Q

object

- We use distance function to ____ [1 sentence] _____ [2 sentence]

A

We use distance function to measure dissimilarity between variables. Thus the further the distance between any two objects the more dissimilar they are.

24
Q

A distance matrix :

A

A distance matrix can be created with objects as indexes and distance between objects as elements.

25
Q

WHY DO WE CARE?

A

Clusters are groups of similar entities
They make exhibit very different behavior
This may give us the ability to train and apply classification / regression models to separate clusters an increase overall accuracy