6.2: Distinguishing Among Chart Types Flashcards

1
Q

What are the four key questions one should consider when determining the best method for visualizing results?

A

What business question are you trying to answer?

Who is the target audience for the visualization?

Which type of data is being visualized, categorical or numerical?

What type of analysis have you performed, exploratory or confirmatory?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define exploratory business analytics and provide examples of activities involved in this type of analysis.

A

Exploratory business analytics involves initial descriptive and diagnostic analytical investigations to summarize and explain performance.

Activities in exploratory analytics include describing past performance, exploring summary performance statistics, identifying anomalies and outliers, checking assumptions, and detecting patterns using historic data.

It generates questions for further investigation and hypotheses for testing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is confirmatory business analytics, and what is its primary focus?

A

Confirmatory business analytics is conducted after the data analyst understands the relationships in the data. Its primary focus is to use statistics to judge the likelihood (probability) of a future event or outcome occurring.

This type of analysis includes predictive and prescriptive analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is understanding the type of analysis performed (exploratory or confirmatory) crucial in choosing an appropriate data visualization method?

A

Understanding the type of analysis (exploratory or confirmatory) is crucial because it determines the focus and purpose of the visualization.

Exploratory analysis requires visualizations that aid in exploring patterns, anomalies, and summarizing past performance, whereas confirmatory analysis needs visualizations that support predictions and probabilities for future events or outcomes.

Choosing the right visualization method aligns with the specific analytical goals, ensuring effective communication of insights to the intended audience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the three most frequently used types of charts for visualizing categorical data, and how do they represent the data?

A

The three most frequently used types of charts for visualizing categorical data are:

Pie charts: Circular graphs where each slice represents the category’s proportion of the whole dataset.

Bar charts: Data visualizations showing categorical data using vertical/horizontal bars with varying heights/lengths proportional to the values they represent.

Stacked bar charts: Bar charts with bars subdivided into different categories, representing proportions within each category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why are bar charts often considered easier to interpret than pie charts for visualizing categorical data?

A

Bar charts are often considered easier to interpret than pie charts because our brains are better at comparing the height of columns (or lengths of horizontal bars) than comparing the sizes of pie slices, especially when proportions are relatively similar.

Bar charts provide a clearer visual representation of the differences in proportions between categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In the context of categorical data visualization, how does adding percentages to a pie chart improve its interpretability?

A

Adding percentages to a pie chart helps improve interpretability by providing numerical context to the proportions represented by each slice.

It makes it easier for viewers to understand the exact proportions of each category relative to the whole dataset.

However, even with percentages, bar charts are often preferred as they allow for a quicker and more intuitive comparison of proportions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does a stacked bar chart represent, and how does it differ from a regular bar chart?

A

A stacked bar chart represents categorical data using bars that are subdivided into different categories.

Each bar is segmented into sections, each representing a specific category, and the total length of the bar represents the whole dataset.

Unlike a regular bar chart where each bar represents a single category, a stacked bar chart shows the composition of each category in relation to the total, allowing for a detailed comparison of proportions within categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the recommended rule of thumb regarding the number of categories for which pie charts are suitable, and why is this guideline important?

A

The recommended rule of thumb is to avoid using pie charts when there are more than five categories.

This guideline is important because as the number of categories increases, it becomes increasingly difficult to distinguish between the size of the slices, especially when there are many skinny slices of pie. Pie charts with numerous categories can be challenging to interpret accurately.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a 100 percent stacked bar chart represent, and how does it differ from a regular stacked bar chart?

A

A 100 percent stacked bar chart represents the proportion of each category as a percentage of the whole dataset.

It differs from a regular stacked bar chart in that the total height of each bar in a 100 percent stacked bar chart always equals 100%, making it easier to compare the relative proportions of categories across different bars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Provide examples of situations where stacked bar charts are particularly useful for data visualization.

A

Stacked bar charts are particularly useful when you need to:

Compare parts of the whole over time: For example, comparing product sales by type over multiple years (e.g., 2025 vs. 2026 sales).

Analyze data across different locations: For instance, comparing product sales at different branches or stores within a company.

Visualize proportions: Showing how different categories contribute to the whole, such as the distribution of product sales among various types of products.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a tree map, and in what situations is it useful for data visualization?

A

A tree map is a visualization technique that utilizes size and color to represent the proportional size of values using physical space.

Tree maps are useful when there are many categories that need to be visualized.

They are helpful in revealing patterns and identifying outliers.

However, they are not ideal for representing precise numbers or proportions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When is a symbol map useful, and what does it represent in the context of data visualization?

A

A symbol map is useful for expressing categorical data proportions across geographic areas such as states or countries.

It represents the relative proportions of categorical data in different geographic regions.

Symbols, like circles in the example, are used to visualize the data, with varying sizes indicating different levels of the data variable.

Symbol maps are effective for comparing relative proportions across different geographic regions but may lack precision for detailed analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does a word cloud work, and in what scenario is it particularly beneficial for data analysis?

A

A word cloud is created based on the frequency of words mentioned in a text data set.

The higher the frequency of a word, the larger and bolder it appears in the word cloud.

Word clouds are useful for identifying commonly used words in text data, making them beneficial for analyzing open-ended responses in surveys or textual content.

They help quickly identify prevailing themes or sentiments within the text. Common English words can be filtered out to focus on significant terms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a heat map, and how does it represent categorical data? Provide an example scenario where a heat map is advantageous.

A

A heat map is a graphical representation of categorical data that uses different colors or shades of the same color to indicate different values.

Darker colors represent lower values, while lighter colors represent higher values.

Heat maps are advantageous for comparing the intensity of a phenomenon across different categories.

For example, a heat map can display weekday unit sales by employee, where darker red indicates fewer units sold and darker blue indicates more units sold.

Heat maps allow for quick visual comparison of performance metrics, making them useful for data-driven decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the fundamental difference between continuous data and discrete data, and can you provide examples of each?

A

Continuous data are numerical data that can take on any numerical value, including decimals, indicating an infinite set of values between any two observations.

Examples include height, weight, and currency.

Discrete data, on the other hand, are numerical data limited to whole-number (integer) values, meaning there is a finite set of values between any two observations.

An example of discrete data is the number of products in inventory, where only whole numbers are feasible (e.g., 0 items, 1 item, 2 items, and so on).

17
Q

What are box-and-whisker plots and histograms used for in data visualization, and what information do they provide about a data set?

A

Box-and-whisker plots and histograms are used to visualize the distribution of a data set and identify outliers.

Box-and-whisker plots provide information about the spread and skewness of the data, including the median, quartiles, and potential outliers.

Histograms show the frequency distribution of a continuous data set, representing the data’s shape and central tendencies.

18
Q

How do line charts and scatterplots differ in terms of the types of data they represent, and what insights can be gained from each?

A

Line charts are suitable for showing data changes or trend lines over time, especially for visualizing continuous data.

They are useful for identifying trends and patterns.

Scatterplots, on the other hand, display two variables on separate axes and are useful for identifying correlations between variables and detecting trends.

Scatterplots help in understanding the relationship between two numerical variables and identifying potential outliers.

19
Q

What is the primary difference between filled (geographic) maps and symbol maps in data visualization?

A

Filled (geographic) maps use shading, tinting, or patterns to display how a value differs across geographic regions, such as states or countries.

The intensity of the shading or color represents the variation in the data. S

ymbol maps, on the other hand, utilize symbols like circles or markers to compare relative proportions of categorical data across different geographic areas.

Filled maps provide a visual representation of the data variation through color intensity, making them useful for comparing data across geographical regions.

20
Q

Figure: Summery of Chart Types

A
21
Q
A