Word Doc Week 2 Flashcards
1
Q
What should I do before conducting any data analysis?
A
- “Screen & Clean” Data
- SPSS can not tell if data is ridiculous
- Only the researcher knows this so I need to make sure I screen the data for errors
2
Q
Six reasons for conducting exploratory data analysis
A
- Checking for data entry errors
- Obtaining a thorough descriptive analysis of your data.
- Examining patterns that are not otherwise obvious
- Analysing and dealing with missing data
- Checking for outliers
- Checking assumptions
3
Q
EXPLORE command in SPSS
A
- The explore command is SPSS covers all bases.
4
Q
Who is John Tukey
A
- A very practical statistician
- Language and techniques of EDA developed by him
5
Q
What does Screening and Cleaning involve
A
- Computing new variables from existing ones
- Recoding variables
- Dealing with missing data
6
Q
Checking Data Entry Errors
A
- Use the frequencies command to check for data entry errors in categorical/nominal variables
- Use the outliers option in the explore command to check for data entry erros in continuous/scale variables
7
Q
Options for Dealing with Data Entry Errors
A
- Remove data
- Make ‘educated guesses’ about what was intended
8
Q
Obtaining a thorough descriptive analysis of data
A
- The explore command provides more descriptive stats than any other procedure
- Multiple measures of central tendency
- Multiple measures of variability
- Quantitative measures of shape
- Confidence intervals
- Percentiles
9
Q
Examining Patterns that are not otherwise obvious
A
- Stem and Leaf Plots
- Box and Whisker Plots
10
Q
Analysing and dealing with missing data
A
Do you leave it out or do you substitute in the mean?
11
Q
Outliers
A
- Are they legitimate, or are they error?
- Do you keep them? Do you discard them?
- Balancing act, as with so much in data analysis
12
Q
Assumptions
A
- All parametric procedures (e.g., t-tests, ANOVA’s, correlation) operate under certain assumptions
- Main two:
- Normality
- Assumes that your data comes from population that is normally distributed
- Homogeneity of variance
- Assumes that, if your data is to be divided into groups, the level of variability in the groups will be approximately equal (i.e., not significantly different).
- Normality
13
Q
Normality is tested in four ways:
A
- Visual inspection of histograms and stem and leaf plots
- Visual inspection of normality and detrended normality plots
- Normality tests
- Skewness divided by SE skewness
14
Q
Subjective Normality Tests
A
- Visual inspection of histograms and stem and leaf plots
- Visual inspection of normality and detrended normality plots
- not influenced by sample size
15
Q
Objective Normality Tests
A
- Normality tests
- Skewness divided by SE skewness
- Objective but influenced by sample size
- With a large sample, even trivial deviations from normality will indicate a violation of the assumption