1.8 Preparing for analysis Flashcards
Before conducting a statistical analysis you need to check your data for eight things:
- Accuracy of data entry,
- Missing data,
- Outliers,
- Normality,
- Linearity, homoscedasticity, and homogeneity of variance,
- Independence,
- Multicollinearity and singularity (MANOVA and multiple regression), and
- Other assumptions.
Missing data may be addressed through a range of approaches such as
list-wise deletion, mean substitution, expectation-maximization, and multiple imputation.
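A minimal sketch in Python (pandas) of the two simplest of these approaches, run on a small hypothetical data frame; expectation-maximization and multiple imputation need dedicated routines (available in packages such as scikit-learn or statsmodels) and are not shown here.

```python
import numpy as np
import pandas as pd

# Hypothetical data with one missing value on 'anxiety'
df = pd.DataFrame({
    "anxiety": [12.0, 15.0, np.nan, 9.0, 14.0],
    "stress":  [20.0, 25.0, 18.0, 16.0, 22.0],
})

# List-wise deletion: drop any case with a missing value on any variable
listwise = df.dropna()

# Mean substitution: replace each missing value with that variable's mean
mean_substituted = df.fillna(df.mean(numeric_only=True))

print(listwise)
print(mean_substituted)
```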
As defined by Tabachnick and Fidell (2019, p. 63), an outlier is
“a case with such an extreme value on one variable (a univariate outlier) or such a strange combination of scores on two or more variables (multivariate outlier) that it distorts statistics”
If not identified and addressed, outliers can lead to
Both Type I and Type II errors.
There are several ways that outliers can be addressed, including
- ignoring (non-influential) data points (univariate, multivariate),
- deleting individual data points, if the sample size can accommodate this (univariate, multivariate),
- running the analysis with and without the outlier/s to justify retaining the outlier/s (univariate, multivariate),
- modifying the data to reduce bias through winsorizing or trimming (univariate; see the sketch after this list), and
- transforming data for large data sets (univariate; can be extremely complex for multivariate).
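A minimal sketch of winsorizing and trimming with SciPy, run on a small hypothetical set of scores containing one obvious univariate outlier:

```python
import numpy as np
from scipy.stats import mstats, trim_mean

scores = np.array([4, 5, 5, 6, 6, 7, 7, 8, 8, 31])  # 31 is a univariate outlier

# Winsorizing: the most extreme 10% of cases at each tail are pulled in
# to the nearest retained value (31 becomes 8; the lowest 4 becomes 5)
winsorized = mstats.winsorize(scores, limits=[0.10, 0.10])

# Trimming: the mean is computed after discarding 10% of cases at each tail
trimmed_mean = trim_mean(scores, proportiontocut=0.10)

print(winsorized)
print(trimmed_mean)
```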
Occasionally, new multivariate outliers may be identified following deletion of the original outliers. This happens because once you remove an outlier, the data set becomes more consistent and new data points will become
extreme points
Distributional information, such as skewness and kurtosis values, can provide indicators of
symmetry and peakedness of a variable’s distribution
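For example, skewness and kurtosis can be computed with SciPy; the variable below is simulated from an exponential distribution, which has a long right tail and is therefore positively skewed:

```python
import numpy as np
from scipy.stats import skew, kurtosis

rng = np.random.default_rng(42)
x = rng.exponential(scale=2.0, size=500)  # simulated, positively skewed variable

print(skew(x))      # > 0 indicates positive skew (tail towards higher scores)
print(kurtosis(x))  # excess kurtosis: > 0 is leptokurtic, < 0 is platykurtic
```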
Skewness relates to the
symmetry of the distribution
Positive skew is depicted when most scores are clustered at the
lower end of the distribution
Kurtosis refers to the
peakedness of the distribution
A distribution with positive kurtosis is described as ________ and a distribution with negative kurtosis is described as:
leptokurtic; platykurtic
Screening the residuals for normality is common practice when conducting data analyses for
ungrouped data
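A brief sketch of this practice using statsmodels and SciPy on simulated regression data: fit the model, extract the residuals, and screen them (Shapiro-Wilk is used here as one common choice; a histogram or Q-Q plot of the residuals is an equally reasonable check):

```python
import numpy as np
import statsmodels.api as sm
from scipy.stats import shapiro

rng = np.random.default_rng(1)
x = rng.normal(size=100)
y = 2.0 * x + rng.normal(size=100)  # simulated outcome

# Fit a simple regression and take its residuals
model = sm.OLS(y, sm.add_constant(x)).fit()
residuals = model.resid

# Shapiro-Wilk: a significant p-value suggests the residuals are not normal
stat, p = shapiro(residuals)
print(stat, p)
```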
Linearity (straight-line relationships between variables) can be observed graphically through
bivariate scatterplots
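For example, a bivariate scatterplot can be drawn with matplotlib on simulated data; a roughly elliptical cloud of points suggests the straight-line relationship the assumption requires:

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(7)
x = rng.normal(size=200)
y = 1.5 * x + rng.normal(scale=0.8, size=200)  # simulated linear relationship

plt.scatter(x, y, alpha=0.6)
plt.xlabel("Predictor")
plt.ylabel("Outcome")
plt.title("Bivariate scatterplot for checking linearity")
plt.show()
```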
For ungrouped data, the assumption of homoscedasticity refers to
the assumption in regression analysis that the variances of the residuals are fairly consistent (i.e. similar) across the continuum of scores on the predictor variable
For grouped data, homogeneity of variance is
the assumption that the variance of one variable is stable (i.e. relatively similar) at all levels of another variable
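One common check of this assumption for grouped data is Levene's test; a minimal sketch with SciPy on three simulated groups (the groups and scores are hypothetical):

```python
import numpy as np
from scipy.stats import levene

rng = np.random.default_rng(3)
group_a = rng.normal(loc=50, scale=10, size=40)
group_b = rng.normal(loc=55, scale=10, size=40)
group_c = rng.normal(loc=60, scale=10, size=40)

# Levene's test: a significant p-value suggests the group variances differ
stat, p = levene(group_a, group_b, group_c)
print(stat, p)
```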
There are two types of independence assumptions often referred to in statistics, which are
Independence of Observations and Independence of Residuals/Errors
Independence of Observations requires each participant to
participate only once in the research and as such only contribute one set of data
the assumption of Independence of Residuals/Errors is that the
errors in your model are not related to each other
the Durbin-Watson test statistic is used to
assess for serial correlations (autocorrelation) of errors
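A minimal sketch with statsmodels on simulated regression data; as a rough rule of thumb, values near 2 suggest no serial correlation, while values towards 0 or 4 suggest positive or negative autocorrelation respectively:

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

rng = np.random.default_rng(5)
x = rng.normal(size=100)
y = 3.0 + 0.5 * x + rng.normal(size=100)  # simulated outcome

model = sm.OLS(y, sm.add_constant(x)).fit()

# Durbin-Watson statistic computed on the model residuals
print(durbin_watson(model.resid))
```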
Multicollinearity and singularity are
problems with a correlation matrix that occur when variables are too highly correlated. With multicollinearity, the variables are very highly correlated (say, above .80); with singularity, the variables are redundant: one of the variables is a combination of two or more of the other variables.
Investigation of Tolerance and Variance Inflation Factors can help determine
whether multicollinearity is a problem within your sample
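A sketch of how VIF (and Tolerance, its reciprocal) might be computed with statsmodels; the predictors x1, x2 and x3 are simulated, with x2 deliberately constructed to correlate highly with x1:

```python
import numpy as np
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools.tools import add_constant

rng = np.random.default_rng(9)
x1 = rng.normal(size=200)
x2 = 0.9 * x1 + rng.normal(scale=0.3, size=200)  # highly correlated with x1
x3 = rng.normal(size=200)

X = add_constant(pd.DataFrame({"x1": x1, "x2": x2, "x3": x3}))

for i, name in enumerate(X.columns):
    if name == "const":
        continue  # the intercept term is not of interest here
    vif = variance_inflation_factor(X.values, i)
    tolerance = 1.0 / vif  # Tolerance is the reciprocal of the VIF
    print(name, round(vif, 2), round(tolerance, 2))
```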
the assumption of sphericity relates to
repeated measures ANOVA and mixed model ANOVA designs
Sphericity assumes that
variances of the differences between data taken from the same participant are equal
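The idea can be illustrated directly by computing the variance of each set of difference scores; the repeated-measures data below are hypothetical, and formal tests of sphericity (e.g. Mauchly's test) are provided by dedicated statistics packages:

```python
import numpy as np
from itertools import combinations

# Hypothetical repeated-measures data: rows are participants, columns are conditions
scores = np.array([
    [8, 10, 13],
    [6,  9, 11],
    [7,  8, 12],
    [9, 12, 14],
    [5,  7, 10],
])

# Sphericity holds when the variances of these difference scores are roughly equal
for i, j in combinations(range(scores.shape[1]), 2):
    diff = scores[:, i] - scores[:, j]
    print(f"var(condition {i + 1} - condition {j + 1}) = {diff.var(ddof=1):.2f}")
```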
Field (2018, p. 283) suggests that nonparametric statistics based on ranks are not affected by
small sample sizes, extreme scores, and outliers, and they do not require a normally distributed sample
Allen, Bennett & Heritage (2018) suggest that non-parametric tests should be used with
ordinal data, and/or where the sample is not normally distributed
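For example, the Mann-Whitney U test (a rank-based alternative to the independent-samples t-test) can be run with SciPy; the two groups below are simulated from a skewed distribution:

```python
import numpy as np
from scipy.stats import mannwhitneyu

rng = np.random.default_rng(11)
group_a = rng.exponential(scale=2.0, size=30)  # skewed, non-normal scores
group_b = rng.exponential(scale=3.0, size=30)

# Mann-Whitney U compares the groups using ranks rather than raw scores
stat, p = mannwhitneyu(group_a, group_b)
print(stat, p)
```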