Unit 8 Flashcards

1
Q

To create an interactive visualization that allows a user to change the lag of one time series in order to compare it to a second time series, we create a calculated field such as “Lagged SOI” with the formula: Lookup(SUM([SOI]), [Lag (Months)]) and a parameter called “Lag (Months)”.

How do we add the control for Lag (Months)?

A

Right click Lag (Months) and select Show Parameter Control
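As a rough illustration of what the lagged field computes, the Python sketch below mimics the row-offset behavior of a LOOKUP-style table calculation on a plain list. The function name `lookup`, the sample values, and the use of `None` for rows with no source value are assumptions made for this example; they are not part of the card.

```python
def lookup(values, offset):
    """Return a list where position i holds values[i + offset], or None if out of range."""
    n = len(values)
    return [values[i + offset] if 0 <= i + offset < n else None for i in range(n)]


soi = [0.4, 0.2, -0.1, -0.3, 0.1]  # hypothetical monthly values
lag_months = -2                    # stands in for the "Lag (Months)" parameter
lagged_soi = lookup(soi, lag_months)
```

Changing `lag_months` plays the role of moving the parameter control: each setting shifts the series by a different number of rows.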

2
Q

Given the data below on a single hospital, plotting the Deaths variable alone produces the graph below, which clearly shows that the number of deaths is increasing each year. However, this graph does not reflect changes in the number of patient visits per year. What is an effective method in Tableau to normalize the data based on the number of patient visits?

A

Define a calculated field “Deaths per 1000” with the formula [Deaths]/[Patient Visits]*1000 and replace Deaths in the graph with “Deaths per 1000”.

Yes. If you do this for the data above you will see that the rate is 4% of patient visits per year. The death rate does not change at all through the years shown.
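The same normalization can be sketched outside Tableau. The figures below are hypothetical (they are not the data from the card) and were chosen so that raw deaths rise while the rate stays flat; `per_thousand` is an illustrative helper name.

```python
def per_thousand(deaths, visits):
    """Deaths per 1000 patient visits (same quantity as [Deaths]/[Patient Visits]*1000)."""
    return deaths * 1000 / visits


# (year, deaths, patient visits): hypothetical values
hospital_years = [(2010, 40, 1000), (2011, 60, 1500), (2012, 80, 2000)]
rates = {year: per_thousand(deaths, visits) for year, deaths, visits in hospital_years}
```

In this made-up data the raw count doubles, yet every year works out to the same rate per 1000 visits.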

3
Q

Select the statement that best describes the difference between temporal event analysis and time series analysis

A

Time series analysis deals best with point events collected at regular time intervals, whereas temporal event analysis can deal with point and interval events collected at irregular times.

4
Q

Which of the following graphs represent multivariate time series analyses?

A

Lung Cancer Deaths Male, Female, and Total

?Profit and Sales
?Lung Cancer

5
Q

Which of the graphs below show evidence of seasonality (a cycle that repeats every year)?

A

A - Cycle Plot Lung Cancer Deaths
C - Segmented Time Series by Year Superstore Sales

6
Q

Indicate whether each dataset is an example of time series data or temporal event data

A

A dataset containing one blood pressure and one weight measure every day for a month for 100 patients
Time series
A dataset containing patient logs of approximate start and end times at which they exercised, approximate meal time and calories consumed, and blood pressure taken approximately once in the morning and once in the evening. There are 14 to 60 days of data for each patient.
Temporal event
A dataset containing the time of ED Admission, time of ordering and administering antibiotics, time of ordering and obtaining a blood culture, time of reading and result of mean arterial pressure taken at varying intervals, and time of admission to the hospital.
Temporal event

7
Q

Which approach(es) would be effective for visually comparing the rate of change at which people in Ohio vs. Texas are being diagnosed with influenza like syndrome? Assume that we have data on number of diagnoses each week by state for August 2013 through February 2014. Select all that apply.

A

A line graph with one line for each state, where each point plots the % difference from the previous month's value

A line graph with one line for each state, where the y axis (showing number of diagnoses) uses a logarithmic scale
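A small sketch may help show why both answers work. The counts below are hypothetical and the helper names are illustrative: two series that differ tenfold in magnitude but grow at the same rate produce identical percent changes and identical step sizes on a log scale.

```python
import math


def percent_change(series):
    """Percent difference of each point from the previous point."""
    return [(curr - prev) / prev * 100 for prev, curr in zip(series, series[1:])]


def log_steps(series):
    """Vertical distance between consecutive points on a log10 axis."""
    return [math.log10(curr) - math.log10(prev) for prev, curr in zip(series, series[1:])]


ohio = [100, 200, 400]       # hypothetical diagnosis counts
texas = [1000, 2000, 4000]   # ten times larger, same growth rate

ohio_pct, texas_pct = percent_change(ohio), percent_change(texas)
ohio_log, texas_log = log_steps(ohio), log_steps(texas)
```

On a linear axis the Texas line would look far steeper; with either transformation the two lines show the same rate of change.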

8
Q

Select all of the questions below that are amenable to temporal event analysis

A

What proportion of patients who are suspected of having sepsis receive the appropriate sequence of care in the first three hours, and of those who do not receive that sequence, is there a common pattern of deviation? Note: The required sequence is A. Measure lactate levels; B. Obtain blood cultures prior to antibiotics; and C. Administer broad spectrum antibiotics.

?

9
Q

Suppose you have data showing the number of patients arriving to an ED by day for 5 years. You are interested in understanding whether ED arrivals vary by day of week (e.g., are there always more on Monday or Friday throughout the years?) as well as whether there is any trend over the years for each day (e.g., did visits on Monday go up or down over the years?). If you could use just one graph, choose the most effective:

A

A cycle plot showing days (Monday through Sunday) and for each day the years on the X axis, and number of arrivals on the Y axis. Thus the graph has 7 lines, one for each day, where each line has 5 points, one for each year of data.
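The data arrangement behind such a cycle plot can be sketched as follows. The records are hypothetical, and averaging the arrivals within each weekday and year is an assumption of this example (the card only says "number of arrivals"); `cycle_plot_series` is an illustrative name.

```python
from collections import defaultdict
from datetime import date

DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday", "Sunday"]


def cycle_plot_series(records):
    """Return {day name: [(year, mean arrivals), ...]} with years in ascending order."""
    totals = defaultdict(lambda: [0, 0])  # (day name, year) -> [sum, count]
    for day, arrivals in records:
        key = (DAY_NAMES[day.weekday()], day.year)
        totals[key][0] += arrivals
        totals[key][1] += 1
    series = {name: [] for name in DAY_NAMES}
    for (name, year), (total, count) in sorted(totals.items()):
        series[name].append((year, total / count))
    return series


# Hypothetical daily ED arrivals
records = [
    (date(2020, 1, 6), 50), (date(2020, 1, 13), 54), (date(2021, 1, 4), 60),   # Mondays
    (date(2020, 1, 10), 70), (date(2021, 1, 8), 66),                           # Fridays
]
series = cycle_plot_series(records)
```

Each entry in `series` corresponds to one of the seven lines in the plot, with one point per year.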

10
Q

Select the most accurate statement(s) regarding time series

A
  • For time series data collected at regular intervals with no missing values, line graphs are one of the most accurate and effective means of display
  • For time series collected at irregular times, it is best to use a dot plot

?For time series collected at regular intervals, but with some missing data, it is best to use a line graph with missing line segments to indicate missing data

11
Q

Which of the following statements are true about the color coding used in Graph A? Note that for Graph A all Target bars are green and all Units Using Process bars are Red. Select all that apply.

A

Color is a redundant code in graph A, because the bars are also labeled with the name of the variable

Graph A does not effectively …

Graph A may be somewhat misleading because green typically means good and red bad, so at a glance a user might assume that all the green bars mean good performance and the red bars bad performance

12
Q

You have a number of units using a new quality improvement process for one year in three different hospitals. Each hospital has a different target number of units that were supposed to adopt the new process during the year. What is one way to normalize values in order to compare the hospitals’ relative performance toward meeting the goal?

A

Calculate the percentage of goal met by each hospital. Yes, this normalizes all hospital performance metrics to a single comparable number.
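A minimal sketch of this normalization, using hypothetical hospital names and unit counts (none of these values come from the card):

```python
def percent_of_goal(actual, target):
    """Actual units as a percentage of the target number of units."""
    return actual * 100 / target


# hospital -> (units using process, target): hypothetical values
hospitals = {"Hospital A": (8, 10), "Hospital B": (15, 20), "Hospital C": (5, 4)}
performance = {name: percent_of_goal(actual, target)
               for name, (actual, target) in hospitals.items()}
```

Hospital B converted the most units in this made-up data, yet it ranks last once each hospital is measured against its own target.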

13
Q

The most direct way to display deviations is by calculating and plotting (such as on a bar chart) the difference between the comparative measure and the performance measure. For example, we could calculate and plot the target number of units - actual number of units. However, when we do this, the underlying raw data is not visible on the display. Which of the following are effective ways to supply this data? Select all that apply.

A

?For interactive visualizations, provide a tooltip on each performance measure and target that shows the raw performance and comparative measures.

  • Use labels to overlay
14
Q

Select all of the problems below that apply to the specific Pareto chart above.

A

Visual clutter from many small values

Handling values that vary greatly in magnitude

15
Q

Based on the Pareto chart above, select the smallest number of diseases that account for over 60% of total insurer costs

A

Hypertension
Diabetes
Acid reflux

16
Q

Suppose that you have two time series: one measuring influenza-like syndrome (ILS) diagnoses per week and another measuring the number of tweets mentioning flu-like symptoms. You notice that when the number of tweets increases, the number of diagnoses increases about a week later. Select the most accurate statement(s) regarding this data:

A

Number of tweets is a leading indicator of number of ILS diagnoses

To visualize the relationship we could plot both time series on a single graph with number of tweets plotted with a lag of -1 week (so that the values for Week 5 on the graph would actually be the number of tweets from Week 4, etc.)

17
Q

Of the options below, what is the most effective visualization strategy for detecting possible lagged covariation in the two time series shown on this graph?

A

Create a dashboard showing the graph above, plus a scatterplot of SOI vs. Recruitment. Add a linear regression line to the scatterplot and modify the above graph to allow the user to vary the lag of one of the variables (SOI or Recruitment) to observe the correlation at different lags.

Yes, this technique will allow the user to “shift” one of the time series lines left or right and observe whether this increases or decreases the correlation as shown in the scatterplot.
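The idea of shifting one series and checking the correlation can be sketched numerically. The two series below are hypothetical (the second is simply the first delayed by two steps), and the function names are illustrative; this is not the SOI and Recruitment data from the card.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def correlation_at_lag(leader, follower, lag):
    """Correlate leader[t - lag] with follower[t] for a non-negative lag."""
    if lag == 0:
        return pearson(leader, follower)
    return pearson(leader[:-lag], follower[lag:])


series_a = [1, 3, 2, 5, 4, 6, 8, 7, 9, 10]   # hypothetical leading series
series_b = [0, 0] + series_a[:-2]            # same values, delayed by two steps

by_lag = {lag: correlation_at_lag(series_a, series_b, lag) for lag in range(4)}
best_lag = max(by_lag, key=by_lag.get)
```

Scanning `by_lag` is the numeric counterpart of dragging the lag control and watching the scatterplot tighten around the regression line.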

18
Q

Which of the following graphs represent univariate time series analyses?

A

Total Children in Mother-Only Households

19
Q

The aspect ratio of a time series graph can make it difficult to accurately perceive patterns. Following Tufte’s advice, which of the following graphs has the best aspect ratio?

A

? Time Series 3?

NOT TS2!!!

20
Q

Out of the following event sequence simplification methods, select the ones available in EventFlow

A

Search and replace
Time windowing
Removing an event from the display
Event renaming

21
Q

You have a worksheet with four variables: Hospital, Units Using Process (the actual number of units converted to a new process), Target (the number of units that was set as a goal for converting to the new process), and Missed Target (True if a hospital missed–fell below–its target, otherwise False).
In Graph A above all Target bars are green and all Units Using Process bars are red. To create graph A, above, you must

A

drag Hospital to Columns

Measure Names to Columns

Measure Values to Rows

then color the graph by dragging Measure Names to Color

and finally filter Measure Names so that bars are shown only for Units Using Process and Target.

22
Q

Select the most accurate statement(s) regarding bullet graphs

A

Few recommends using a red dot to the left of the text label to indicate performance measures that did not meet their target.
To color code the qualitative performance regions, Few recommends using the lighter color intensity for favorable states and the darker color intensity for poor states.

23
Q

Which of the following questions best represents a deviation analysis?

A

How does patient waiting time for each clinic compare to the national average waiting time?
Right! You want to know how each clinic’s waiting times differ from the national average. The main information of interest here is the deviation—not the raw value.

24
Q

If we want to know which region has the lowest proportion of patients in an age range, what is the most effective graph? For example, consider the question: which region has the lowest proportion of patients in the 41-54 age range?

A

Grouped Bar Chart

25
Q

For which question would it be best to use a Pareto chart?

A

You know the percentage of patients who were readmitted and the reason for their readmission and you want to know the smallest number of readmission reasons that account for a total of 80% of readmissions

This is exactly the kind of question a Pareto chart is designed to answer.
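The question a Pareto chart answers visually can also be computed directly. The sketch below uses hypothetical readmission counts and an illustrative function name: it ranks the reasons from largest to smallest and accumulates them until the cumulative share reaches the threshold.

```python
def smallest_set_reaching(counts, threshold_pct):
    """Return the fewest categories, largest first, whose share reaches threshold_pct."""
    total = sum(counts.values())
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    chosen, running = [], 0
    for reason, count in ranked:
        chosen.append(reason)
        running += count
        if running * 100 >= threshold_pct * total:  # integer comparison avoids rounding issues
            break
    return chosen


# Hypothetical readmission counts by reason
readmissions = {"Infection": 45, "Medication error": 25, "Fall": 15,
                "Dehydration": 10, "Other": 5}
top_reasons = smallest_set_reaching(readmissions, 80)
```

On the chart, this corresponds to reading off where the cumulative line crosses the 80% mark.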

26
Q

The percent difference of each point in a time series from the previous point.

A

Rate of Change

27
Q

Consecutive points that fall above 3 standard deviations of the mean.

A

Exceptions
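One way to flag such points, as a sketch only: the example below marks values more than three standard deviations above the mean. Using the population standard deviation of the whole series is an assumption, the data are hypothetical, and the code flags individual points without checking that they are consecutive.

```python
import statistics


def exceptions(series, n_sd=3):
    """Return indexes of points lying more than n_sd standard deviations above the mean."""
    mean = statistics.mean(series)
    sd = statistics.pstdev(series)
    return [i for i, value in enumerate(series) if value > mean + n_sd * sd]


weekly_counts = [10] * 19 + [100]   # hypothetical series with one extreme value
flagged = exceptions(weekly_counts)
```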

28
Q

Repetitive patterns in a time series.

A

Cycles

29
Q

The overall direction of movement of the time series over the entire data set: rising, falling, or staying the same.

A

Trend

30
Q

The extent to which two time series are correlated, possibly at some non-zero lag.

A

Co-Variation

31
Q

The amount of random changes above and below the main trend throughout the time series.

A

Variability