Chapter 10 Flashcards
Values of two variables, each plotted on its own axis to exhibit possible correlation
scatter plot
The number of observations in one particular category divided by the grand total of observations.
proportion
general direction something is going
trend
numerical data with an equal and definitive ration between each point and absolute”zero” is the point of origin
ratio data
observation taken on a variable at different points in time
time series data
brief summaries (or factoids) of a dataset that provide a representation of the dataset as a whole
descriptive statistics
what data visualization should you use for the trend of N.I over the past 5 years
bar/line chart
what data visualization should you use for emphasizing proportions
pie chart
what data visualization should you use for ratio data (data emphasizing zero)
bar/line chart
what data visualization should you use for numerical data
bar/line chart
what data visualization should you use for a companies sales versus sales returns
bar/line chart
what data visualization should you use for categorical data
pie chart
what type of data analytics is a scatter plot
diagnostic or predictive
what type of data analytics is time series data
predictive
what type of data is tables displaying sensitivity analysis
prescriptive
what type of data is pie charts
descriptive
what type of data is a breakeven graph
prescriptive
what type of data is a line and bar chart
descriptive
what type of data is a box plot
diagnostic
what type of data is a histogram
descriptive
Line charts should only be shown in ________ order to emphasize ________
chronological, trend
Pie charts lend themselves well to _________ data showing proportions.
categorical
____________ are observations on values taken on by a variable at different points in time.
time series data
_________ are brief summaries (or factoids) of a dataset that provide a representation of the dataset as a whole.
descriptive stats
________ show values of two variables each plotted on its own axis to exhibit potential ________
scatterplots, correlation
correlations, scatter plots, regression results, and time series analysis are generally associated with _________ analytics
predictive
diagnostic analytics includes which type of visualization
A. box plots
B. scatter plots
C. pivot charts
D. time series
box plots
Scatter plots are more meaningful if they include a _______.
trend line
An outlier showing on a box plot will typically appear ________ the maximum or minimum value within the expected range.
outside
Accounting figures and money are most likely to be shown as __________
ratio data
horizontal axis
typically used for labels
vertical axis
usually used for values (aggregated sum or average of what we are measuring)
scale
the scale by which the values are measured
tick marks
increments of the scale that are listed on the vertical axis
data series
every observation in a chart, represents raw data visually
data points
each data series is made up of individual data points
What is typically captured on the vertical axis in a data visualization?
values
What is typically captured on the horizontal axis in a data visualization?
Labels for categorical data
What is bar charts useful for
time series data
How must time series data be listed
chronological order
line charts are useful for
communicating trends
what are pie charts good for?
categorical data and showing proportions
pie charts are rarely used if there are more than how many categories
4
pie charts are bad for…
time series data
What charts are the best for showing data over time
line or bar charts
what are dashboards good for
showing multiple visualizations in one place
what are histograms
type of bar chart but instead of representing categories they represent bins or intervals
Does histograms give you categorical data?
no only numerical data
Difference between bar charts and histograms when it comes to data shown?
Bar charts show a variety of descriptive stats (avg, count, total) where histograms show just counts
what are histograms useful for
showing frequency of groups of data
why are outliers and anomalies important
show if something stand out and are part of diagnostic analytics
what are the two ways anomalies and outliers can be addressed
“eye ball” the data or construct a box plot