Data Vusualizati8b Flashcards

1
Q

Best chart type to show changes over time

A

Line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Best chart type to show changes over time

A

Line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Nightingale coxcomb

A

Like an exploded pie chart. 1850d. Shows sizes of pie slices better. Shows multiple axis per section

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data driven

A

Classic grapgs abd charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Conceptual

A

Explain a process, abstract concept or idea. Like an org chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Column charts

A

use size to contrast and compare two or more values, using height or lengths to represent the specific values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Heat maps

A

Similar to bar charts, heatmaps also use color to compare categories in a data set. They are mainly used to show relationships between two variables and use a system of color-coding to represent different values. The following heatmap plots temperature changes for each city during the hottest and coldest months of the year.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Pie charts

A

circular graph that is divided into segments representing proportions corresponding to the quantity it represents, especially when dealing with parts of a whole.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Pie charts

A

circular graph that is divided into segments representing proportions corresponding to the quantity it represents, especially when dealing with parts of a whole.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Scatterplots

A

show relationships between different variables. Scatterplots are typically used for two variables for a set of data, although additional variables can be displayed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Does your data have only one numeric variable?

A

If you have data that has one, continuous, numerical variable, then a histogram or density plot are the best methods of plotting your categorical data. Depending on your type of data, a bar chart can even be appropriate in this case. For example, if you have data pertaining to the height of a group of students, you will want to use a histogram to visualize how many students there are in each height range:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Are there multiple datasets?

A

For cases dealing with more than one set of data, consider a line or pie chart for accurate representation of your data. A line chart will connect multiple data sets over a single, continuous line, showing how numbers have changed over time. A pie chart is good for dividing a whole into multiple categories or parts. An example of this is when you are measuring quarterly sales figures of your company. Below are examples of this data plotted on both a line and pie chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Are there multiple datasets?

A

For cases dealing with more than one set of data, consider a line or pie chart for accurate representation of your data. A line chart will connect multiple data sets over a single, continuous line, showing how numbers have changed over time. A pie chart is good for dividing a whole into multiple categories or parts. An example of this is when you are measuring quarterly sales figures of your company. Below are examples of this data plotted on both a line and pie chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Are you measuring changes over time?

A

A line chart is usually adequate for plotting trends over time. However, when the changes are larger, a bar chart is the better option. If, for example, you are measuring the number of visitors to NYC over the past 6 months, the data would look like this:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Do relationships between the data need to be shown?

A

When you have two variables for one set of data, it is important to point out how one affects the other. Variables that pair well together are best plotted on a scatterplot. However, if there are too many data points, the relationship between variables can be obscured so a heat map can be a better representation in that case. If you are measuring the population of people across all 50 states in the United States, your data points would consist of millions so you would use a heat map. If you are simply trying to show the relationship between the number of hours spent studying and its effects on grades, your data would look like this:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the 4 elements of successful visualization?

A

Information: reflects the conclusion you’ve drawn from the data, which you will communicate with visualization

Story: adds meaning to the data and makes it interesting

Goals: makes the data usable and useful

Visual form: creates both beauty and structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

5 seconds best practice

A

Your audience should know what they are observing within five seconds of being shown a data visualization. Visuals should be clear and easy to follow.

In the five seconds after that, your audience should understand the conclusion your visualization is making—even if they aren’t familiar with your research.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Simplicity best practice

A

As long as it’s not misleading, you should visually represent only the data that your audience needs to understand your findings. Including irrelevant data may confuse, distract, or overwhelm your audience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the 9 principles of design?

A

Balance
Emphasis
Movement
Pattern
Repetition
Proportion
Rhythm
Variety
Unity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Balance

A

The design of a data visualization is balanced when the key visual elements, like color and shape, are distributed evenly. This doesn’t mean that you need complete symmetry, but your visualization shouldn’t have one side distracting from the other. If your data visualization is balanced, this could mean that the lines used to create the graphics are similar in length on both sides, or that the space between objects is equal. For example, this column chart (also shown below) is balanced; even though the columns are different heights and the chart isn’t symmetrical, the colors, width, and spacing of the columns keep this data visualization balanced. The colors provide sufficient contrast to each other so that you can pay attention to both the motivation level and the energy level displayed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Emphasis

A

Your data visualization should have a focal point, so that your audience knows where to concentrate. In other words, your visualizations should emphasize the most important data so that users recognize it first. Using color and value is one effective way to make this happen. By using contrasting colors, you can make certain that graphic elements—and the data shown in those elements—stand out.

For example, you will notice a heat map data visualization below from The Pudding’s “Where Slang Comes From” article. This heat map uses colors and value intensity to emphasize the states where search interest is highest. You can visually identify the increase in the search over time from low interest to high interest. This way, you are able to quickly grasp the key idea being presented without knowing the specific data values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Movement

A

Movement can refer to the path the viewer’s eye travels as they look at a data visualization, or literal movement created by animations. Movement in data visualization should mimic the way people usually read. You can use lines and colors to pull the viewer’s attention across the page.

For example, notice how the average line in this combo chart (also shown below) draws your attention from left to right. Even though this example isn’t moving, it still uses the movement principle to guide viewers’ understanding of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Pattern

A

You can use similar shapes and colors to create patterns in your data visualization. This can be useful in a lot of different ways. For example, you can use patterns to highlight similarities between different data sets, or break up a pattern with a unique shape, color, or line to create more emphasis.

In the example below, the different colored categories of this stacked column chart (also shown below) are a consistent pattern that makes it easier to compare book sales by genre in each column. Notice in the chart that the Fantasy & Sci Fi category (royal blue) is increasing over time even as the general category (green) is staying about the same.

24
Q

Repetition

A

Repeating chart types, shapes, or colors adds to the effectiveness of your visualization. Think about the book sales chart from the previous example: the repetition of the colors helps the audience understand that there are distinct sets of data. You may notice this repetition in all of the examples we have reviewed so far. Take some time to review each of the previous examples and notice the elements that are repeated to create a meaningful visual story.

25
Q

Proportion

A

Proportion is another way that you can demonstrate the importance of certain data. Using various colors and sizes helps demonstrate that you are calling attention to a specific visual over others. If you make one chart in a dashboard larger than the others, then you are calling attention to it. It is important to make sure that each chart accurately reflects and visualizes the relationship among the values in it. In this dashboard (also shown below), the slice sizes and colors of the pie chart compared to the data in the table help make the number of donuts eaten by each person the focal point.

26
Q

Rhythm

A

This refers to creating a sense of movement or flow in your visualization. Rhythm is closely tied to the movement principle. If your finished design doesn’t successfully create a flow, you might want to rearrange some of the elements to improve the rhythm.

27
Q

Variety

A

Your visualizations should have some variety in the chart types, lines, shapes, colors, and values you use. Variety keeps the audience engaged. But it is good to find balance since too much variety can confuse people. The variety you include should make your dashboards and other visualizations feel interesting and unified.

28
Q

Unity

A

The last principle is unity. This means that your final data visualization should be cohesive. If the visual is disjointed or not well organized, it will be confusing and overwhelming.

29
Q

5 phases of design thinking

A

Empathize: Thinking about the emotions and needs of the target audience for the data visualization

Define: Figuring out exactly what your audience needs from the data

Ideate: Generating ideas for data visualization

Prototype: Putting visualizations together for testing and feedback

Test: Showing prototype visualizations to people before stakeholders see them

30
Q

Empathize

A

Thinking about the emotions and needs of the target audience for the data visualization

31
Q

Define

A

Figuring out exactly what your audience needs from the data

32
Q

Ideate

A

Generating ideas for data visualization

33
Q

Prototype

A

Putting visualizations together for testing and feedback

34
Q

Test

A

Showing prototype visualizations to people before stakeholders see them

35
Q

Headline

A

A headline is a line of words printed in large letters at the top of a visualization to communicate what data is being presented. It is the attention grabber that makes your audience want to read more. Here are some examples:

Which Generation Controls the Senate?: This headline immediately generates curiosity. Refer to the subreddit post in the dataisbeautiful community, r/dataisbeautiful, on January 21, 2021.

Top 10 coffee producers: This headline immediately informs how many coffee producers are ranked. Read the full article: bbc.com/news/business-43742686.

36
Q

Subtitle

A

supports the headline by adding more context and description. Adding a subtitle will help the audience better understand the details associated with your chart. Typically, the text for subtitles has a smaller font size than the headline.

37
Q

Label

A

label in a visualization identifies data in relation to other data. Most commonly, labels in a chart identify what the x-axis and y-axis show. Always make sure you label your axes. We can add “Months (January - June 2020)” for the x-axis and “Average Monthly Rents ($)” for the y-axis in the average rents chart.

This is an unfinished stacked line chart, headline, subtitle, and newly added labels for the x and y axes.
Data can also be labeled directly in a chart instead of through a chart legend. This makes it easier for the audience to understand data points without having to look up symbols or interpret the color coding in a legend.

We can add direct labels in the average rents chart. The audience can then identify the data for Oceanside in yellow, the data for Carlsbad in green, and the data for Vista in blue.

38
Q

Annotations

A

briefly explains data or helps focus the audience on a particular aspect of the data in a visualization.

Suppose in the average rents chart that we want the audience to pay attention to the rents at their highs. Annotating the data points representing the highest average rents will help people focus on those values for each city.

39
Q

Gnatt charts

A

Gantt charts demonstrate the duration of events or activities on a timeline.

40
Q

Tableau: highlight charts

A

Charts with conditional formatting

41
Q

Box plots

A

Box plots, also known as box and whisker charts, illustrate the distribution of values along a chart axis. Refer to the steps to build a box plot.

42
Q

Bullet graphs

A

Bullet graphs compare a primary measure with another and can be used instead of dial gauge charts. Review the steps to build a bullet graph.

43
Q

Data ink ratio

A

The data-ink entails focusing on the part of the visual that is essential to understanding the point of the chart. Try to minimize non-data ink like boxes around legends or shadows to optimize the data-ink ratio.

44
Q

Data ink ratio

A

The data-ink entails focusing on the part of the visual that is essential to understanding the point of the chart. Try to minimize non-data ink like boxes around legends or shadows to optimize the data-ink ratio.

45
Q

Avoid cutting off the y axis

A

Changing the scale on the y-axis can make the differences between different groups in your data seem more dramatic, even if the difference is actually quite small.

46
Q

Avoid the misleading use of a dual x axis

A

Using a dual y-axis without clearly labeling it in your data visualization can create extremely misleading chart

47
Q

Avoid artificially limiting the scope of the x axis

A

If you only consider the part of the data that confirms your analysis, your visualizations will be misleading because they don’t take all of the data into account.

48
Q

Avoid misleading grouping

A

It is important to make sure that the way you are grouping data isn’t misleading or misrepresenting your data and disguising important trends and insights.

49
Q

Avoid totals not adding up

A

If you are using a part-to-whole visual like a pie chart to explain your data, the individual parts should add up to equal 100%. If they don’t, your data visualization will be misleading.

50
Q

Avoid hiding trends in cumulative charts

A

Creating a cumulative chart can disguise more insightful trends by making the scale of the visualization too large to track any changes over time.

51
Q

Story telling steps

A
  1. Engage the audience.
  2. Create Compelling visuals
  3. Tell the story in an interesting way
52
Q

Sorry telling with visuals

A

Provide context, show your work(allows them to do some analysis)
And present the conclusion

53
Q

Know thy audience

A

What role do they play
What are their expectations
What do they hope to get from the presentation
What is their stake in the product

54
Q

Mccandless

A

General to specific. Start with high level information and work your way into the deepest specifics

55
Q

Mccandless

A

General to specific