week 2 Flashcards

1
Q

According to the reading, the output of a data mining exercise largely depends on:

A

The programming language used.

The data scientist.

The quality of the data.*

The scope of the project.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When data are missing in a systematic way, you can simply extrapolate the data or impute the missing data by filling in the average of the values around the missing data.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of a data reduction algorithm?

A

Prior Variable Analysis.

Cojoint Analysis.

A/B Testing.

Principal Component Analysis.*

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

After the data are appropriately processed, transformed, and stored, what is a good starting point for data mining?

A

Data Visualization.*

Machine learning.

Creating a relational database.

Non-parametric methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In-sample forecast is the process of formally evaluating the predictive capabilities of the models developed using observed data to see how effective the algorithms are in reproducing data.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Prior Variable Analysis and Principal Component Analysis are both examples of a data reduction algorithm.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

After the data are appropriately processed, transformed, and stored, machine learning and non-parametric methods are a good starting point for data mining.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

According to the reading, the output of a data mining exercise largely depends on the skills of the data scientist carrying out the exercise.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What should you do when data are missing in a systematic way?

A

Determine who was managing the database.

Determine the impact of missing data on the results and whether missing data can be excluded from the analysis.*

Determine the average of the values around the missing data.

Extrapolate the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When data are missing in a systematic way, you can simply extrapolate the data or impute the missing data by filling in the average of the values around the missing data.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly