Lecture 1, 2 and 3 Flashcards

Question 1

Q

What are the 6 reasons for conducting exploratory data analysis?

Answer

A

Checking for data entry errors
Obtaining a thorough descriptive analysis of your data
Examining patterns that are not otherwise obvious
Analysing and dealing with missing data
Checking for outliers
Checking assumptions

Question 2

Q

What are the two options for dealing with data entry errors?

Answer

A

Remove data

2. Make “educated guess” about what was intended

Question 3

Q

How to examine patterns that are not otherwise obvious?

Answer

A

Stem and leaf plots

* box and whisker plots

Question 4

Q

What does screening and cleaning involve?

Answer

A

computing new variables from existing ones
recording variables
dealing with missing data

Question 5

Q

How to check for data entry errors in categorical/nominal variables?

Answer

A

Frequencies command

Question 6

Q

How to check for data entry errors in continuous/ scale variables?

Answer

A

The outliers option in the explore command

Question 7

Q

What the normality assumption?

Answer

A

Assumed that your data comes from population that is normally distributed.

Question 8

Q

What does homogeneity of variance assume?

Answer

A

Assumed that, if your data is to be divided into groups, the level of variability in the groups will be approximately equal (e.g., not significantly different)

Question 9

Q

What are the four ways normality is tested?

Answer

A

Visual inspection of histograms and stem and leaf plots
Visual inspection of normality and detrended normality plots
Normality tests
Skewedness divided by SE skewness

Question 10

Q

What are two reasons to recode data?

Answer

A

Reducing numbers of groups

2. Reverse scoring