Methods of Representing Quantitative Data Flashcards
What different formats can you use when encoding data?
Stack data
Give nominal data categories
What to histograms show?
Distribution
What is positive skew, in terms of mode, median and mean.
Mode
What is a bimodal distribution?
2 modes
What are the origins of Exploratory Data Analysis?
1970s - John Tukey
What is EDA about?
Understanding, rather than confirming results
Idea that there’s no point employing complex techniques on poorly understood data
What are the features of EDA?
- Emphasizes importance of repeated examination of data
- Uncovers data structure and enhances pattern recognition
- Uses simple numerical summary (dispersion + centrality)
- Imaginative Graphical Display - clarify data variability
- Resistance (summaries not affected by anomalies)
- Robustness (not unduly effected by assumptions)
- More significance to extreme values - interesting
- More cautious approach
- Fits into mixed methods approaches
What are the uses of EDA?
Helps formulate hypotheses & evaluate models
Identify major features of data
Useful when data is inaccurate
Might rule out doing confirmatory analysis
What are the 5 steps involved in Aggregate Data Analysis?
1) Set context and Research Questions
2) Collect Data and Calculate Rates (turn nominal into ratio)
3) Data Processing (manipulate in minitab/excel)
4) Checking and describing data (obtain summary statistics)
5) Drawing Pictures (Boxplots/Histograms/Advanced features)