week 2 Flashcards

Question 1

Q

According to the reading, the output of a data mining exercise largely depends on:

Answer

A

The programming language used.

The data scientist.

The quality of the data.*

The scope of the project.

Question 2

Q

When data are missing in a systematic way, you can simply extrapolate the data or impute the missing data by filling in the average of the values around the missing data.

Question 3

Q

What is an example of a data reduction algorithm?

Answer

A

Prior Variable Analysis.

Cojoint Analysis.

A/B Testing.

Principal Component Analysis.*

Question 4

Q

After the data are appropriately processed, transformed, and stored, what is a good starting point for data mining?

Answer

A

Data Visualization.*

Machine learning.

Creating a relational database.

Non-parametric methods.

Question 5

Q

In-sample forecast is the process of formally evaluating the predictive capabilities of the models developed using observed data to see how effective the algorithms are in reproducing data.

Question 6

Q

Prior Variable Analysis and Principal Component Analysis are both examples of a data reduction algorithm.

Question 7

Q

After the data are appropriately processed, transformed, and stored, machine learning and non-parametric methods are a good starting point for data mining.

Question 8

Q

According to the reading, the output of a data mining exercise largely depends on the skills of the data scientist carrying out the exercise.

Question 9

Q

What should you do when data are missing in a systematic way?

Answer

A

Determine who was managing the database.

Determine the impact of missing data on the results and whether missing data can be excluded from the analysis.*

Determine the average of the values around the missing data.

Extrapolate the data.

Question 10

Q

When data are missing in a systematic way, you can simply extrapolate the data or impute the missing data by filling in the average of the values around the missing data.

week 2 Flashcards

(10 cards)