Data Science Flashcards

1
Q

regression imputation

A

has the opposite problem of mean imputation. A regression model is estimated to predict observed values of a variable based on other variables, and that model is then used to impute values in cases where the value of that variable is missing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

mean imputation

A

is a method in which the missing value on a certain variable is replaced by the mean of the available cases. This method maintains the sample size and is easy to use, but the variability in the data is reduced, so the standard deviations and the variance estimates tend to be underestimated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Imputation

A

the assignment of a value to something by inference from the value of the products or processes to which it contributes.Exp using the MEAN to find each value for each column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Extension To Imputation

A

In this approach, we impute the missing values, as before. And, additionally, for each column with missing entries in the original dataset, we add a new column that shows the location of the imputed entries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Multiple Imputation

A

is essentially an iterative form of stochastic imputation. However, instead of filling in a single value, the distribution of the observed data is used to estimate multiple values that reflect the uncertainty around the true value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly