Lecture 1, 2 and 3 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the 6 reasons for conducting exploratory data analysis?

A
  1. Checking for data entry errors
  2. Obtaining a thorough descriptive analysis of your data
  3. Examining patterns that are not otherwise obvious
  4. Analysing and dealing with missing data
  5. Checking for outliers
  6. Checking assumptions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two options for dealing with data entry errors?

A
  1. Remove data

2. Make “educated guess” about what was intended

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to examine patterns that are not otherwise obvious?

A
  • Stem and leaf plots

* box and whisker plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does screening and cleaning involve?

A
  • computing new variables from existing ones
  • recording variables
  • dealing with missing data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to check for data entry errors in categorical/nominal variables?

A

Frequencies command

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to check for data entry errors in continuous/ scale variables?

A

The outliers option in the explore command

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What the normality assumption?

A

Assumed that your data comes from population that is normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does homogeneity of variance assume?

A

Assumed that, if your data is to be divided into groups, the level of variability in the groups will be approximately equal (e.g., not significantly different)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the four ways normality is tested?

A
  1. Visual inspection of histograms and stem and leaf plots
  2. Visual inspection of normality and detrended normality plots
  3. Normality tests
  4. Skewedness divided by SE skewness
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are two reasons to recode data?

A
  1. Reducing numbers of groups

2. Reverse scoring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly