week 3 Flashcards
2 components of data visualization
1- how to read graph
2- what insights to generate
what is scatter plot
simple plot (x,y) to display values for typically 2 variables
geographical visulization graph is
vertices associated with locations (x/y axis associated with longitude and latitude)
Heat maps are
2 dimensionally represent your data
colour maps
summarize hundreds of rows of data like how polygons reps a state on usa map
advatnage visualize over summary stats
convey results not just one dimensional data
o Simple stats like mean etc. provide 1 type of info
how does ggplot work and whats it advantage
work by adding layers to plots, easier to code multiple functions
o Advantages: Better images visualizations and easier to code
3 elements of ggplot are at least
1- dataframe
2- aes mapping:how variables in data frame, how they need to be plotted, where you define x and y axis including colour etc
3- geom object: determine how values are shown graphically, scatter, line polygon
common issue/ important things with graphs
o Use labels consistently
o Colour should convey a message
o Make easy to read
o Keep alignment of labels consistent
o Have a sequence or order
o Too many label points= too hard to read
scale properly
make sure visual rep correct
pick graph with most insights