Week 2 Flashcards
Data Science
the exploration and quantitative analysis of all available structured and unstructured data to develop understanding, extract knowledge and formulate actionable results
Business Intelligence
strategies and technologies used by enterprises for the data analysis of business information.
CRISP-DM
provides useful input on ways to frame analytics problems and is a popular approach for data mining. Its six phases are: business understanding, data understanding, data preparation, modeling, evaluation, and deployment.
Framing a Decision
outline what decision is being considered, why it is important, what data is needed, and who will provide input. Corresponds to the Business Understanding, Data Understanding, and Data Preparation phases of CRISP-DM.
Analyzing a Decision
what kind of analytical approach is needed, what does it show, and what does it mean. Corresponds to the Modeling phase of CRISP-DM.
Implementing a Decision
how do I make use of the decision, what can I expect, what else should be considered, and how do I “sell” the result. Corresponds to the Evaluation and Deployment phases of CRISP-DM.
Data Modeling Blocks
1. Data, 2. Build Model, 3. Infer hidden variables, 4. Predict & Explore
Interpretation Error and Inconsistencies
Taking the values in your data for granted, and differences between data sources and the company’s standardized values.
Cleansing Data
Fixing interpretation errors and inconsistencies: data entry errors, redundant whitespace, capital letter mismatches, outliers, missing values, different units of measurement, different levels of aggregation, deviations from a code book, and impossible values (sanity checks).
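A few of these cleansing steps can be sketched with pandas. The column names and values below are made up for illustration; mean imputation is just one of several strategies for missing values.

```python
import pandas as pd
import numpy as np

# Hypothetical raw data with common quality problems
df = pd.DataFrame({
    "city": ["  Boston", "boston ", "NEW YORK", "New York"],
    "sales": [120.0, np.nan, 95.0, 88.0],
})

# Redundant whitespace: strip leading/trailing spaces
df["city"] = df["city"].str.strip()

# Capital letter mismatching: normalize casing
df["city"] = df["city"].str.title()

# Missing values: impute with the column mean (one simple strategy)
df["sales"] = df["sales"].fillna(df["sales"].mean())

# Sanity check: impossible values (negative sales)
assert (df["sales"] >= 0).all()
```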
Integrating Data
Combining data from different data sources. Joining/Appending Data, Appending Tables, Using Views to Simulate Data Joins and Appends, Enriching Aggregated Measures.
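Joining and appending can be sketched in pandas; the `orders`/`customers` tables here are hypothetical. `merge` enriches observations from one table with another (joining), while `concat` stacks rows (appending).

```python
import pandas as pd

# Hypothetical tables from two data sources
orders = pd.DataFrame({"customer_id": [1, 2, 3], "amount": [50, 75, 20]})
customers = pd.DataFrame({"customer_id": [1, 2], "region": ["East", "West"]})

# Joining: enrich each order with customer information
enriched = orders.merge(customers, on="customer_id", how="left")

# Appending/stacking: add this month's orders below last month's
orders_feb = pd.DataFrame({"customer_id": [4], "amount": [30]})
all_orders = pd.concat([orders, orders_feb], ignore_index=True)
```

With `how="left"`, orders without a matching customer (id 3 here) are kept, with `region` left missing.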
Transforming Data
shaping data into the form a model requires, e.g. reducing the number of variables or turning categorical variables into dummy variables.
Data Retrieval
data stored within the company, data from outside the organization, and data quality checks.
Data Preparation
fix problems in the data; create derived variables.
Exploratory Data Analysis
the use of graphical techniques to gain an understanding of your data and the interactions between variables.
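A minimal first pass at EDA, with made-up data: summary statistics describe each variable, and the correlation matrix hints at interactions between variables (a scatter plot would typically follow to inspect the relationship graphically).

```python
import pandas as pd

# Hypothetical dataset for a first exploratory pass
df = pd.DataFrame({
    "hours_studied": [1, 2, 3, 4, 5],
    "exam_score": [52, 58, 65, 71, 80],
})

# Summary statistics: distribution of each variable
print(df.describe())

# Interactions between variables: pairwise correlation
print(df.corr())
```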
Joining
enriching an observation from one table with information from another.
Appending/Stacking
Adding the observations of one table to those of another table.
Dummy Variables
Can only take two values: true (1) and false (0). Used to indicate the presence or absence of a categorical effect that may explain the observation.
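One common way to create dummy variables is pandas' `get_dummies`, which produces one 0/1 indicator column per category; the `color` column here is a hypothetical example.

```python
import pandas as pd

# Hypothetical categorical variable to encode
df = pd.DataFrame({"color": ["red", "blue", "red", "green"]})

# One indicator (dummy) column per category: 1/True if present, 0/False if absent
dummies = pd.get_dummies(df["color"], prefix="color")
print(dummies)
```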
Unsupervised Learning
Algorithm does not have past data cases with inputs and the output of interest identified; it “attempts” to learn something interesting about the data on its own.
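Clustering is a typical unsupervised example: no labels are given, yet the algorithm finds structure. A minimal sketch with scikit-learn's k-means and fabricated 2-D points:

```python
import numpy as np
from sklearn.cluster import KMeans

# Unlabeled data: two obvious groups, but no output variable is provided
X = np.array([[1.0, 1.0], [1.2, 0.9], [8.0, 8.0], [8.1, 7.9]])

# k-means "attempts" to discover the grouping on its own
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
```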
Data Partitioning
Training 60%, Validation 30%, Test 10%.
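A 60/30/10 split can be produced with two calls to scikit-learn's `train_test_split`: first carve off 60% for training, then split the remaining 40% into validation and test (0.75 × 40% = 30%, 0.25 × 40% = 10%). The feature matrix below is a placeholder.

```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(100).reshape(-1, 1)  # hypothetical feature matrix, 100 rows
y = np.arange(100)                 # hypothetical target

# First split off the 60% training set...
X_train, X_rest, y_train, y_rest = train_test_split(
    X, y, train_size=0.6, random_state=0)

# ...then divide the remaining 40% into validation (30%) and test (10%)
X_val, X_test, y_val, y_test = train_test_split(
    X_rest, y_rest, train_size=0.75, random_state=0)
```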
Technical Data Scientist
designs solution from scratch.
Business Data Scientist
monitors and applies the solution rather than designing it from scratch. Not as technically deep as a Technical Data Scientist.
Databases
structured with defined schema. Items are organized as a set of tables with columns and rows. Transactional.
Data marts
stores data from a data warehouse; a subject-oriented, partitioned segment of an enterprise data warehouse.
Data Warehouses
exists on top of databases and is used for business intelligence. Consumes data from databases and creates a layer optimized for data analytics. Schema is applied on import.
Data Lakes (Big Data)
centralized repository of structured and unstructured data. Stores raw data without structure (schema); no ETL or transformation jobs are required before loading.