Skill Belt Study Guide Flashcards

1
Q

What is Tableau’s ‘Golden Circle?

A

The Tableau WHY: Tableau amplifies human intellect. People are smart. People are creative. We enable people to unlock their ideas and contribute in ways they consider to be the highest use of their skills, intellect, and capabilities. When this happens, they improve their lives, their organizations, and the world. The Tableau HOW: We do this by making software that helps people. Regardless of technical skill, see and understand their data. This liberates people’s natural curiosity and creative energy. It enables them to have conversations with their data that were impossible before- leading to discover that collectively transform their organization. The Tableau WHAT: We make this software for people everywhere, at every level of the organization. This allows business and IT professionals to work together in beautiful symbiosis. This is transforming the way companies think.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is Tableau breaking out of the Business Intelligence Box?

A

As Henry Ford said “If I had asked people what they wanted, they have said faster horses.” Tableau is different than traditional BI because we put humans at the center of the center of design instead of the database. This is a fundamental shift in thought in BI; Tableau doesn’t let itself be defined by pre-existing ideas in business intelligence like limiting who can access data and create and share insight from data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does Tableau help its users find success?

A

Today we consider Tableau successful when the software fades into the background, and you are in flow. Discoveries happen naturally; you’re thinking about the problem, not the product in your hand. We strive to enable this data Zen for everyone, at every level of an organization. Now let’s be clear – we’re not trying to make everyone a data scientist. We love data scientists, but exploration and curiosity shouldn’t belong to a select few with “data” in their titles. What we’re trying to do is enable the data enthusiast in all of us. Because when people have access to facts and data and information, even people who claim they have no interest in numbers, suddenly they can see the world very differently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why does Tableau stick to the 25% maintenance fee on its software?

A

Tableau invests a significantly higher percentage of top line revenue into Research & Development than pretty much all other major software vendors. This has a direct impact on our clients and why our products continue to provide significant value for our clients. Secondly, we continue to receive high accolades our customer service and support as noted by Gartner in the Magic Quadrant (http://www.tableausoftware.com/reports/gartner-quadrant-2014) with above average scores for customer experience. Lastly, because of our investment in R&D, again as noted by Gartner, we have the lowest migration complexity with a much higher percentage of clients moving to our new versions, and much faster than other technologies, reducing the overall cost of ownership. Our standard level of support and maintenance fees allows us to continue to innovate and be a partner for our customers in the long term.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain the services or process layout of a Tableau Server?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a highly available installation of Tableau Server?

A

Installing Tableau Server to make it highly available means adding redundant machine(s) with an inactive Tableau Server installed into a cluster with an active machine of Tableau Server. If the active Tableau Server fails, the inactive redundant machine(s) will become active. A highly available solution ensures an instance of Tableau Server can withstand a failure of a single machine. At least 3 machines are recommended to implement high availability for Tableau 9.0 Mariner.

Bonus tech information:

Tableau 9.0 uses Apache Zookeeper to create a single source of truth and decision making working with Cluster Controller. Cluster Controllers communicate to Zookeeper to tell it which machine is the active, primary Tableau Server and which is the inactive secondary Tableau Server. In previous versions PostgreSQL tracked the status of machine, but it could fail itself. Plus each machine effectively watched the other machines in the cluster for failure. This could cause confusion over which machine is primary and which is secondary. The combo of Zookeeper and Cluster Control solves both issues.

Additionally, Rsync has been replaced by a File Store process. The advantage of File Store is near automatic replication of folders onto all File Store nodes associated with the Data Engine. The issue of secondary vs primary doesn’t matter since all File Store nodes are essentially the same within seconds across the Tableau Server cluster. When failure does happen there’s already a copy of the data ready to for the Data Engine to use. Data Engine isn’t slowed down by File Store like it was with the comparatively sluggish Rsync.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A basic Tableau Server is a ‘black box’ installation. What does this mean? What are some advantages and disadvantages?

A

A black box installation means that all the components to run a Tableau Server are included within a Tableau Server. Advantages are the installation of Tableau Server is really easy. Extra components like an external load balancer can be added later for performance, but are not necessary. Some disadvantages are some traditional BI customers want to know specifically what’s in a Server or customize their server. Tableau is transparent about the components and the degree of customization for a Tableau Server. Refer them to resources on Tableau.com.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What Run-As User should I use on a local vs. distributed environment?

A

On a single local installation on a computer for testing/demo use the default NT-Authority Run-As User.

On a distributed (like HA) local installation use a specific Run-As User account. A specified Run-As User account is needed for the cluster to connect to the same unique network account instead of different NT-Authority accounts of the different machines in the cluster.

In a more robust environment with Active Directory (AD), add an account into the AD specifically for Tableau Server. This new Tableau Server user can be given the correct permissions within AD to connect to protected data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between scaling up a Tableau Server vs. scaling out a Tableau Server?

A

Scaling up means adding more hardware resources to a single machine. This might mean adding more hardware to support more process instances, storing or processing larger extracts, or handling more users.

Scaling out means adding Tableau worker machines distribute the workload. These worker machines can be configured to handle one specific process. A common example is adding a worker machine to handle extract refreshes. The point is all the hardware on that worker is now dedicated to processing those extracts instead of being in hardware resource contention with the other processes within Tableau Server.

To create a highly available Tableau Server that can handle all the requests made of its hardware, it’s pretty typical to scale up and scale out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Single Sign On?

A

A user signs in to some other application which contains Tableau content. They should be able to see the Tableau Server content without having to log in again.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are my options for Single Sign On? Which should I use?

A

Active Directory + SSPI, SAML, Kerberos, Trusted Authentication

The rule of thumb on AD, SAML, Kerberos is that you should use them if you already have them installed for other purposes. It is almost never worth the effort to start from scratch with those. If you do not already have any of those than you should use Trusted Authentication, which is a feature specific to Tableau but which can integrate with any other authentication mechanism.
If you are going to use AD or SAML then that is system-wide and all of your users must have accounts with those systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the options for searching a workbook or view in Tableau Server?

A

In the search box at the top of a Tableau Server web page run a global search for views, workbooks, projects, data sources, and users. Run a search in the context of the view pane on left hand side of the server web page. In these search boxes the search can be refined to a specific owner, project, modification date, favorite, recently viewed or has an alert. Not every column is searchable, but there are additional fields.

Bonus tech information:

Tableau Server search uses Apache SOLR.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Tableau Server uses sites to create a multi-tenant server (separate spaces within a server for a collection of related users, workbooks and data that will not leak into another collection). As a user it’s possible to belong to multiple sites on one Tableau Server. As a user how do you know which site you’re in? What’s one way to switch from one site to another?

A

If a user has access to multiple sites upon signing into Tableau Server they will have to choose one specific site to enter from a pop up list. Once signed into a Tableau Server the top of the web page lists the site in orange letters.

In 9.0 change sites by clicking on the twisty arrow. A list of the other sites the user belongs to appears. Click on one site to navigate to its contents.

Bonus tech information:

When a user is added to a Tableau Server they must be given a site role by an administrator. These roles are unrelated to user licenses and include roles like server administrator, site administrator, publisher, interactor, viewer and unlicensed. http://onlinehelp.tableau.com/current/server/en-us/users_site_roles.htm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Is it possible to customize the name of a Tableau Server in the browser tab? Is it possible to alter the logo of a Tableau Server on the sign-page and in the server to be for another company?

A

Yes! These customization are possible. To change the logo or the name use the Tab Admin command prompt. http://onlinehelp.tableau.com/current/server/en-us/help.htm#customize_namelogo.htm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do I assign users to multiple groups?

A

In a Tableau Server site create group for users who will have the same permissions or perform the same function (permission vs role based groups). Once the group is created click on it, at the top of the page will be an orange plus button to add users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How are permissions assigned to a group or user?

A

Permissions are assigned on a workbook or project level. To assign a permission, click on the permission link at the top that’s next to the workbook details, views, data sources and subscriptions. A highlight chart will show the permissions for groups and users. Assign permissions by clicking the link at the bottom of this chart to add a user or group rule.

Best practice bonus:

For the easiest server user management a best practice is to assign group permissions only.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the permission model for Tableau Server?

A

Tableau’s uses a cascading permission model. By answering a series of questions per user at the user and group level Tableau Server determines if a user is allowed, denied or inherits a specific capability (i.e. viewing a workbook, connecting to a data source, using web editing, or capability through a role).

If a permission is not specified it’s inherited. In the cascading series of questions if Tableau Server cannot determine if the permission is allowed or denied, it assumes the permission is inherited. At the end of the question list if the user inherits the permission, but is not allowed the permission specifically or by role, Tableau Server will equate inherited with deny.

In 9.0 Tableau makes seeing permissions much easier. At the top of any workbook there is a link to its permissions. Click on the link to see a highlight table of which groups and users are allowed, denied, or not specified to have a permission.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are descriptions for the different roles in a Tableau Server?

A

Project Leader - Set permissions for all workbooks, view, and data sources in a project.

Data Source Connector - Connect to data sources in the Data Server in the site, server or project.

Data Source Editor - User or group can connect to data in a project. They can also publish, edit, download, delete, and schedule data refreshes.

Editor - Allows all permissions to a user or group for a workbook

Publisher - Allows a user or group to publish, but not view workbooks on a Server

Interactor - Allows a user or group all permission for a workbook except Web Saving, deleting, moving or setting permissions for a workbook.

Viewer - User or group can view a workbook. On the view page in server they can view and add comments, view summary data and export an image of the view. All other permissions are by inheritance.

None - User or group permissions are inherited from other permission grants.

Denied - Specific permission is denied for user or group.

A workbook is being published to Tableau Server. One of the data sources is a Tableau Data Extract. How do you schedule the Tableau Data Extract to refresh on a schedule?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How can a server administrator keep tabs on what’s going on in a Tableau Server? (i.e, know top content, track statuses, and get data to allocate server resources better)

A

Pre-packaged in every installation of server are admin views to help server administrators monitor and analyze data from their own Tableau server. One way to think of it is BI-on-BI analytics.

For those familiar with our server product, you’ll know that Admin Views have existed for many versions. In the past, these views were designed to monitor a system by providing a quick report with a specific number of metrics. While maintaining all the metrics of the old Admin Views, the new version’s views also allow more freedom, going beyond monitoring. No longer are the Admin Views static screens; now you can interact and chain questions together to gain new insights

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

How can a user quickly know there location in a Tableau Server and also know what kind of content they are viewing?

A

Once a user is into a project, a breadcrumb trail shows at the top of the page. It will follow the trail of project- workbook-view. A user can click on a part of the breadcrumb trail to return to a higher level of content. Recognize Tableau Server content types from these icons:

21
Q

A customer notices that Tableau doesn’t support gauges, and they are wondering why. What views would you position for them to use instead, and why?

A

I would suggest they try a Bullet chart, or perhaps a Bar in Bar chart, depending on the question they are trying to answer. Generally, bullet charts can show just as much or more information in a much smaller graphic, while also allowing for much easier comparison to the peers.

Specific reasoning for the Bullet View, from Stephen Few’s Perceptual Edge:

‘Specifically, bullet graphs support the comparison of the featured measure to one or more related measures (for example, a target or the same measure at some point in the past, such as a year ago) and relate the featured measure to defined quantitative ranges that declare its qualitative state (for example, good, satisfactory, and poor). Its linear design not only gives it a small footprint, but also supports more efficient reading than radial meters.’

22
Q

Can 3D charts be created in Tableau?

A

3D charts provide very eye catching visuals however, they typically hinder or skew the analysis being presented. For example, when dealing with volume or surface area such as a pie chart, a 3D chart can make certain slices seem bigger than they actually are. Because of these problems and they face that they don’t improve the story of the data, pundits do not consider 3D charts to be a best practice visual.

23
Q

What’s an alternative to 3D charts? *add in comparison to a TWBX

A

Small multiple charts. These are very powerful visuals for comparison because they enable true multi-dimensional analysis. It is basically a cross-tab of charts or visuals. Small multiples are a big improvement over the common 3d bar chart. They allow us to see multiple dimensions simultaneously, without obscuring, skewing or hiding any of the data. Tableau’s direct interaction model makes this even better so users can reorganize the output effortlessly or drill down all the way to the raw data.

Excel 2010 – default 3d chart. Try to compare heights.

24
Q

Does Tableau have to import data to run queries?

A

No, this is a big differentiator between us and some of our competitors.
You have the option to use Tableau’s fast data engine or maintain a live connection to your database, all without any scripting. If your data is already fast, or changes very often, then a direct connection is probably best.

25
Q

Tableau color palettes usually default to 10 colors. Why is 10 the recommended maximum for unique colors for categorical data?

A

Have you ever sat in a meeting and had someone say “look at the blue bars” when there are, say, 3 bars that are varying shades of blue? Our color palette follows visual best practices by allowing for 10 distinct colors for categorical data, all of which can be referenced with a distinct name. The recommended number of categorical colors is usually around 8. Anything more, and the color loses its impact.

26
Q

Is Data Blending within a single database the same as using Multiple Tables in the connection dialog?

A

No, joins (in the multiple tables’ scenario) happen in the database with the raw data as part of one universal query. Blends happen in Tableau with the already summarized (aggregated) data. This means that Data Blending sends a separate query to each database. These queries aggregate the data to the same level of detail – regardless of the level of detail of the raw data. Then the results of the separate queries are blended together by Tableau.

27
Q

When should I use a Data Blend and when should I use a Join?

A

It depends. When trying to determine whether joining data tables or blending the data, consider the following: where the data is coming from, the number of data connections, and the number of records you have in the data. If you have more than one data source, we recommend using a blend. If you have two tables from the same data source, joining could improve performance but may contain too many records for a join to be practical.

28
Q

What is a cube? Does Tableau support them?

A

Cubes (OLAP database) store structured (hierarchies and defined relationships) and aggregated data. This creates a ‘multi-dimensional’ data storage that used provide a performance edge over most relational databases. Tableau does support OLAP databases. The experience between an OLAP and a relational database is mostly the same except the query language is MDX and the cube structure is preserved. *See Cube file on Silver Skill Belt page

29
Q

I’m a Mac user and I need to connect to my company’s SSAS/Essbase/Teradata OLAP cube but I don’t see that as a connection option. Why?

A

Use case: Native data connection options are different depending on the Tableau Desktop platform (Mac vs. PC). Test-ee should know:

The connections that ARE available on PC but NOT on Mac are determined by the database vendors. Specifically whether they have/have not written drivers for Mac.
Options for connecting from a Mac include running Windows inside a VM such as Parallels, using a dual boot system like Bootcamp, or … get a PC.

30
Q

What’s the difference between using the generic ODBC connection and Tableau’s named connections? Are there drawbacks / benefits that I should be aware of?

A

The connections listed in the ‘connect to data’ dialog window are optimized and tuned by Tableau to assure performance and robust functionality of the Tableau feature set.

ODBC connections will rely on the driver (likely provided by your database vendor) and may have limited support for the SQL standard. The result is that performance and functionality in Tableau may be degraded.

31
Q

I’ve connected to a Tableau Data Extract (TDE) and I can’t refresh it from the source database. Why?

A

Use case: Often a user will build an extract, then later connect to that from a 2nd workbook. They should understand the connection information for the data source of origin is contained either in the original workbook, saved data source (.TDS file), or in a Tableau Data Server connection. If you connect directly to a TDE there is no reference to source data, we see a TDE the same way we would an Excel file or CSV.

32
Q

When should ODBC be used and other data base connection types

A

Generic ODBC can be used for any datasource for which a named connector does not exist.

Many of our “named” datasources graduated from “generic” ODBC after testing with hundreds of reports and queries.

Database vendors may create TDC files to tweak the behavior of our query engine when querying their database. Sometimes this is sufficient to avoid queries that might otherwise be too difficult to execute. The guide to customizing TDC files is here:

http://kb.tableau.com/articles/knowledgebase/customizing-odbc-connections

Technical Support usually does not support a “non-named” datasource connection using “generic” ODBC.

33
Q

Who defines the measures and dimensions?

A

For relational – Tableau determines this automatically. By default, all numeric fields (that are not keys in the database) are measures, everything else is a dimension. The names of the fields are simply the names of the columns in the database. To change their placement, just drag and drop.

For Cubes – These are explicitly defined by the author of the cube.

34
Q

There are too many fields to weed through in the data window, it’s confusing to know what I need and to find the fields that I do need. Can Tableau help?

A

Use case: Someone is connected to a data source with LOTS of data fields (50-250+). Looking for a range of options to help make these data fields easier to find data fields.

Search tool
Create folders to group similar dimensions/measures
Hide fields you know won’t/shouldn’t be used

35
Q

Why can’t you create a forecast from a bar chart of Category and Sales?

A

For forecast, you have to include at least one Date and one Measure in the view.

Bonus Tech information:

A line graph using a date will not forecast if there is not enough data points in the view. One solution is to aggregate at a lower time segment. Like month instead of year.

36
Q

Is there a way to change the aggregation function used by Totals directly in the Analytics Pane?

A

No, to change the function, you still have to use the Analysis menu, choose Totals, then choose Total All Using.

37
Q

How Do I find all my customers that have bought a product in ‘Technology’ but not ‘Office Supplies’? *Practice in Silver Study workbook

A

Drag Customer Name to Filter, select the condition tab, and write: MAX(IF [Product Category]=”Technology” then 1 else 0 END)+MAX(IF [Product Category]=”Office Supplies” then 999 else 0 END)=1

38
Q

Are my calculations written back to the database?

A

No. Tableau is a very safe environment – it never changes your data. The calculations are computed on the fly based on the current values in the database

39
Q

Are calculations performed locally in Tableau?

A

Tableau pushes most calculations back to the database. All aggregations are performed on the database (which includes the data engine). Table Calculations, most reference lines, and some types of filters are performed locally.

40
Q

How do I find the number of days/weeks/months/seconds/minutes/ etc. between two different date dimensions in my data set (e.g. Hire Date and Term Date)? *Practice in Silver Study workbook

A

Use case: Business users often want to find out how long something or the other takes, whether it’s time from order to ship, ship to receive, number of seconds elapsed. Tableau makes finding this easy. The format is:

DATEDIFF(‘Year’,[Date1],[Date2])
If you’re looking specifically for the number of days simply, subtract the earlier date from the later date (e.g. [Term Date]-[Hire Date])

41
Q

How can I determine if my profit margin is greater than, less than or equal to my company’s pre-determined threshold? *Practice in Silver Study workbook

A

Looking for easiest path here, boolean calcs.

[Value 1]>NN
[Value 1]>[Parameter 1]
[Value 1]>[Value 2]

42
Q

How Can I categorize Profit margin into “Good Margin”, “High Margin”, “Poor Margin” and “Underwater”? *Practice in Silver Study workbook

A

Next level KPIs, solvable with IF or CASE statements. Bonus if they understand potential performance implications of lengthy string calculations:

IF [Margin]> .15 THEN “High Margin”

ELSEIF [Margin] >.05 THEN “Acceptable Margin”

ELSEIF [Margin] >=0 THEN “Poor Margin”

ELSE “Underwater”

END

43
Q

How do I determine if my customers from the first year I was in business are sticking with me? What % of my total current business do they make up? How do the other years compare? Testing their base level of understanding of LoD expressions, in this question the object is to group customers by their year Cohort. *Practice in Silver Study workbook

A

{FIXED [Customer ID]: MIN([Order Date])}

44
Q

Explain how LOD expressions differ from table calculations.

A

TC:Table calculations are generated from the query results.

WINDOW_AVG(SUM([Sales]))

LOD: LOD expressions are generated as part of the query to the underlying data source. They are expressed as a nested select so are dependent on DBMS performance:

SELECT T1.[State],SUM(T2.[Sales per County, State])

FROM [DB Table] T1 INNER JOIN

(SELECT [State], [County],

AVG([Sales]) AS [Sales per County, State]

FROM [DB Table] GROUP BY [State],[County]) T2

ON T1.[State] = T2.[State]

GROUP BY T1.[State]

TC: Table calculations can only produce results that are equal to or less granular than the viz LOD.

LOD: LOD expressions can produce results that are more granular than, less granular than or totally independent of the viz LOD.

TC: Dimensions that control the operation of a table calculation are separate from the calculation syntax and specified in the “compute using” or “running along” menus.

LOD: Dimensions that control the calculation for a LOD expression are embedded in the calculation syntax. For INCLUDE/EXCLUDE expressions these are relative to the viz LOD, for FIXED expressions these are absolute.

TC: Table calculations are always aggregated measures.

LOD: LOD expressions can be used as measures, aggregated measures or dimensions. The data type is determined by the formula. INCLUDE/EXCLUDE expressions are used as measures or aggregated measures. FIXED expressions are used as either dimensions, measures or aggregated measures.

TC: Table calculations cannot be used in other constructs such as bins, groups, etc.

LOD: LOD expressions can be used in other constructs.

TC: Filters on table calculations act as a hide – they do not remove records from the result set.

LOD: Filters on LOD expressions act as an exclude – they remove records from the result set. Depending on whether the expression is FIXED or INCLUDE/EXCLUDE it is evaluated at different stages of the visualisation pipeline.

45
Q

What is the order that filters are applied to the Tableau query pipeline? When are filters on LOD expressions applied?

A
46
Q

How do Story Points help show multiple insights from one view?

A

Story Points have the ability to create a delta from the original sheet or dashboard in the workbook. Drag a sheet or dashboard onto a story point. Filter or add an annotation to the Story Point. Go back to the original sheet or dashboard; the original will remain unchanged. Sheets, dashboards and story points as workbook components create a flexible interface. The story point can be used to focus in a data insight, while the original sheets and dashboards can still be used to run day to day analytics.

47
Q

Can questions be asked in story point mode on the fly?

A

Yes! Each story point is still interactive. Filter, zoom in, or annotate to answer other questions then reverse all of that or save it with the click of a button.

48
Q

Describe a couple of ways to add dynamic interactivity to a dashboard.

A

Use relevant filters. Often dashboards involve cascading filters. Hone into only the relevant options to select from in non-primary filters to make the options list shorter and easier to navigate. Example: In Superstore sales data there is a category field with 3 options (domain members). Imagine once one of those categories is selected you only want to see the corresponding sub-categories in a sub-category filter. Making the sub-category filter relevant to the category filter would be the solution.
Dynamic titles. Double click into a title. Input data fields to contextualize specific data selected.
Dynamic action filter, URL, highlight titles.