Gold Study Flashcards

1
Q

When I add many fields to my rows shelf, some rows start going vertical. As I add more, the labels start getting concatenated. Can I fix this?

A

You can adjust Tableau from taking these actions by setting default table options. (Analysis  Table Layout  Advanced, and increase the number of row and horizontal row labels.)ALSO: You can perform some of the following actions:• Resize the label rows (click-drag) to provide more room• Choose to “Rotate” the labels (right-click a label)• Edit the aliases of the label values to make them shorter.• Format the font for the labels to make them smaller• Un-Choose “Show Header” for some field labels• Choose “Hide Field Labels for Rows”• Don’t place so many rows onto your sheet – it’s not best-practice for visual analysis!!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does unchecking ‘AnalysisAggregate Measures’ differ from converting a measure to a dimension?

A

A dimension returns all distinct values of the field. “Dis-aggregate” means that you want to return each row from the underlying database. Example: 1,1,1,1,2,3. When set to a dimension there would be 3 marks, as a disaggregate measure, 6 marks.This becomes very important when using reference lines, trend lines or other secondary calcs. An ‘average’ ref line on the dimension would yield a value of 2, but on the disaggregate data the value would be 1.5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an ‘Attribute’ (ATTR)? How does it differ from dimensions and measures?

A

An attribute is an aggregation that can be applied to Dimensions. This enables the dimensions to be removed from the level of detail, but still be displayed. The functionality is useful for excluding a dimension from a table calc or allowing dimensions from secondary data sources. Returns the value of the dimension if it only has a single value for all rows in the group, otherwise it displays an asterisk () character. Null values are ignored.An attribute is equivalent to the formula: IF MAX([Field])=MIN([Field]) THEN MAX([Field]) ELSE“” END

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Can I connect to a stored procedure?

A

Sometimes, but this is not often needed. Due to Tableau’s advanced calculation, filtering and security capabilities, most customers find that they can accomplish more with tableau connecting to raw data than by executing a stored procedure.The stored procedure or User defined function (UDF) must return a table. The reason for this is that all subsequent actions in Tableau will be sent to the database as queries in a ‘sub select’. For Example:Select [Region] From ([my stored procedure result]). Most databases don’t support this. When they do, this effectively means that the stored procedure is executed every time an action is taken in Tableau. This typically defeats the benefits of a stored procedure. A much better approach is to first execute the stored procedure into a table or file and then connect Tableau to the results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Does Tableau have any API’s or scripting?

A

“What is it that you are trying to accomplish?” We have embedding and URL actions that solve a lot of these needs. Other items are often solved with existing functionality such as table calcs or actions that will help satisfy their requirements. We also have a Javascript API.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Your website mentions the VizQL language. How do I program in this or modify the VizQL?

A

Currently the VizQL is generated only by using the Tableau Desktop interface. It can be manipulated through the Tableau Server web interface by Interactors. It is not possible to see the VizQL or create it directly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do I group thousands of items together?

A

Best chance is to use a calculated field with a programmatic condition in these “thousands of items” such that the cardinality is low (less than 100). For measures, rounding using a calc field is useful. For dimensions, rounding using or LEFT or RIGHT functions, etc…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Can I save a filter for reuse? For example I manage 12 of the 5000 products we have.

A

Yes. Create the filter, and then choose “Create Set” from the filter’s context menu

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If I have a calculated field that includes members from two blended data sources, will the results be included when I create an extract?

A

Since extracts are specific to each data source, the results from the calculated field will not be included in the data extract (i.e. it will not be optimized). However, the calculation will still work fine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do I move a sheet from one Tableau workbook to another?

A

Bookmarks. Save one from the source workbook. Open the destination book, choose that bookmark, save the file. (Note: you cannot bookmark a dashboard)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Can I move dashboards from one workbook to another?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the polygon mark type and how do I use it?

A

Polygon mark type is advanced and used to describe geographic areas and other areas of measurement. They typically require coordinate data in the underlying database. Armed with the right set of coordinate data, polygon marks – in conjunction with the PATH shelf – can describe two-dimensional areas. Any time you have all of the coordinate data to describe ANY area (state, election regions, etc), you can accurately use the polygon mark type to show these boundaries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the two main types of Joins?

A

Joins require a condition to evaluate how a row in one table matches a row or set of rows in another table. The fields used in this condition are called ‘key’ fields.Inner join: a join of two or more tables that keeps only that rows where the key is common to both tablesOuter join: is further defined as left outer join or right outer join. These includes all rows from the table on the left (or right) side of the join condition regardless of whether a match was found on the right (or left) side.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

I created a join and most of my data is missing. How do I fix this?

A

An outer join will always include the records of the primary table (i.e. left outer join will always include data from the ‘left’ table, even if no match is found in the right table.) An example is a transaction header table which is related to a transaction line-item table. For dimensional analysis purposes, you would want all header rows to be returned, even if there was not always a specific line-item related to it.Select * from TransactionHeader LEFT OUTER JOIN TransactionLineItemsOn TransactionHeader.LineItemID = TransactionLineItems.LineItemID

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Join Culling?

A

Because joins cost time and resources to process on the database server, we don’t really want to use every join that we declared in our data source at all times. Join Culling allows you to skip the join when the query does not require it and provides increased performance. To do this, you must have relationships setup in your database between the tables you’ll be joining in Tableau. Inner Joins will work best for Join Culling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do I do Sparklines?

A

Create a line chart.Edit the axis to not include zero.Hide the axisMake the rows really tiny

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How do I create a chart the shows the budget compared to actual?

A

Try a bullet graph first. IF that doesn’t satisfy, then:Color Answer: Steps: add “budgeted” to ROW, add a date measure to column. Then, add second measure (e.g. “actual”) to Y-axis (i.e. the row axis where “budgeted” was already placed). Tableau intelligently determined that COLOR is the best-practice in this case.Side by Side Answer: Follow the steps for Color Answer, and drag Measure Names to the Column Shelf next to your Date. This allows different measures to appear side by side.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How do I show the rank of my products?

A
  1. Create a calculated field: index()2. Add this to your sheet and change it to discrete.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Can I create a chart that shows the trailing 12 months revenue?

A

Yes, this is easy to do in Tableau. You can use a table calculation for this, as well, you can use the built-in totals feature. If you have “month” as a dimension you can turn on grand totals against this dimension.Answer one: use relative date filters to filter a single sales bar down to the last 12 months (don’t add date to the columns). Answer two: Use a Table calc to use a Moving Calculation for the sum of the last 12 Months. This one is even better as this is simple for Tableau, and hard for other products.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Can I create Control Charts?

A

Yes. Control charts are often used in SPC (Statistical Process Control) and six sigma (6σ) efforts or other quality control. It is basically a time series analysis with trends and reference lines. Tableau makes them very robust with one exception: We can exclude the outliers completely, but cannot exclude them from the reference line calculations while still displaying them on the chart.See the knowledge base for more details.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Can I do a Pareto Analysis?

A

Yes (this is the quick way, a more complete answer is in the knowledge base): Place a dimension on the column shelf. Sort it by descending.Place a measure on the rows shelf. Duplicate it so there are two measures.Set the 2nd one to be a Running Total with a secondary calculation of Percent of Total.Dual Axis these two measures.Set the first measure to be a Bar and the second measure to be a Line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

I have a view with MY(Order Date) on Columns, SUM(Sales) on Rows, with three different colored lines – one for each Product Category. I want to animate the lines over time, and show the history so I can get a view like the following where the lines will appear for each month:

A

Drag Date to the Pages shelf and to columns. In order to get the lines to produce over time, the Mark type needs to be changed from Line to some other option, such as Shape, Circle, even text. Then check Show History on the Pages card, access the drop-down menu, Show History for All and select Trails.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Can I use dimensions from the secondary datasource to roll-up my primary data? How can I create a primary group from a secondary data source?

A

Yes. Kind of.An example is if I have sales and people in my primary and people and their territory assignments in the secondary. How do I roll-up my sales (primary) by territory (secondary)?Once your data blended view is complete you can right click on your field you wish to group from your secondary source and select “Create Primary Group.” This will populate your primary data source with a grouped field based on values shared in both sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Setup a view that displays only a list customers whom purchased from BOTH Furniture and Office Supplies in the past.

A

Since no row of record contains information on about a customer whom purchased on multiple product categories, this is a filter based on aggregated results – this suggests using the “conditions” tab in filter on [Customer Name]Use the formula: (MAX(IIF([Product Category]=”Furniture”,1,0))+MAX(IIF([Product Category]=”Office Supplies”,1,0)))=2How this works: • MAX(IIF([Product Category]=”Furniture”,1,0))Show me the value “1” if Customer ever had bought something from Furniture• MAX(IIF([Product Category]=”Office Supplies”,1,0))Show me the value “1” if Customer ever had bought something from Office Supplies• (MAX(IIF([Product Category]=”Furniture”,1,0))+MAX(IIF([Product Category]=”Office Supplies”,1,0)))=2Only show me if both of the above formulas returned “1” or another words 1+1=2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Can I use Tableau to discover which products were frequently purchased together?

A

Yes. This is commonly referred to as a Market Basket Analysis. First, identify the unique identifier (Customer name, ID number, Business, etc.) and the category in question(Product category, Business segment, Academic Course, etc.). Self join tables approach: Go to Edit Tables on the data connection and choose to Add a Table. Select the table with the data we will be needing and choose to Join the Table. Add two join clauses: Join the identifier to itself, and join the category in question to itself where the relationship is (not equal).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How about the top 5 products that I sold this year?

A

Yes. Filter on current year, then add this filter to the “context”. Reason: context is created prior to filtering to the top 5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

How about the top 5 in each region?

A

Yes. Create an index() calculation and add it to the sheet. Edit it to restart for each region.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What about how the top 10 products and regions are performing?

A

Yes. Make a set of product and region, place this new set on row, followed by product and region. Sort on the set. Then hide headers:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How do I make a box plot?

A

Many customers prefer our box plots to the standard definition because we will show you all outliers and plot density throughout the range, not just the whiskers (though you can get that as well).See the knowledge base for examples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Can I find the purchase history of customers that purchased this month?

A
  1. Manually filtering2. Dynamic sets from dimensions3. Manual sets from marks/members4. See also: cohort analysis.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Every day, I want to open Tableau and see how yesterday’s top 3 sales people have performed over the last 6 months.

A

Similar to above, use dynamic sets. Once created, this set can be a filter and be used against any date range. You need to peg it using “add to context”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

How do I apply weights when working with my survey data?

A

If you have a weight field in your data set you can apply this weight with a simple calculated field. [Weight]*[Measure].

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

How do I find the average sale per order?

A

Calculated field. Given a dimension “Order” and a measure “line item sale amount”Sum([line item sale amount])/Countd(Order)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Tableau only created equal sized bins, but I want mine to be 0-10, 10-50, 50-100, 100-1000, and greater than 1000.

A

Instead of using built in binning, use a calculated field with an IF statement. Something like:IF [Sales] > 10000 THEN “LARGE” ELSEIF [Sales] > 5000 THEN “MEDIUM” ELSE “SMALL” END

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Your bins are great! But I have just a couple of things that are bigger than 1000, how do just create a bin that is >1000 and not have to define all the intermediary bins?

A
  1. IF [Sales] > 5000 THEN 5000 ELSE [Sales] END2. Then, bin this calculated field3. Place on shelf4. Edit alias for that last group (e.g. ‘>= 5000’)
36
Q

Is there any way that we can dynamically change the size of our bins?

A

Yes, Tableau makes this very easy to do by allowing you to create parameters directly in the bin menu.

37
Q

I have a date field that looks like 20070326. Tableau doesn’t recognize it as a number. How do I make it into a date field? Bonus, I have 200703?

A

Calculated field. Examples assume the source is of type “string” (otherwise, wrap str() around these):DATE(MID([DividendDate],5,2) + “/” + RIGHT([DividendDate], 2) + “/” + LEFT([DividendDate],4))Bonus question:DATE(MID([DividendDate],5,2) + “/01/” + LEFT([DividendDate],4))

38
Q

Do you have a Modulus function in excel it is Mod()?

A

Yes we do have a modulus function. Instead of MOD(), Tableau has a modulo operator, in calculated fields this is the “%” character, e.g.: [Dividend] % INT[Divisor]The modulo operator finds the remainder when dividing by integers: 5%2 = 1

39
Q

Can I use calculations from Excel?

A

Yes. Once connected to the excel sheet, calculated fields are represented inside of Tableau. A change to the calculation in excel will result in updated information in the Tableau worksheet. Note: you cannot use VB or other excel functions inside of Tableau, only the output of those functions/calculations.

40
Q

Can I use functions in my database that Tableau doesn’t have?

A

Yes. You can use the various Tableau “RAWSQL” functions to return results from functions in the underlying database. Care must be taken to present the results of these functions in meaningful ways (i.e. a db function that returns a list might need to be ‘counted’ in tableau) (stored procedures may also work, as long as they return a single value for each row – aka single-valued functions)

41
Q

I have financial data. How do I calculate the profit ratio?

A

Tricky question. Their table will look like this:Account AmountGross profit 100Expenses 1049Revenue 2302COGS 8239Etc.Best way is a couple of calculated fields, e.g Profit: IF [Account] = “Gross Profit” THEN [Amount] ELSE NULL ENDRevenue: IF [Account] = “Revenue” THEN [Amount] ELSE NULL ENDEtc, then, perform your math on these calc fields: sum(Profit)/sum(Revenue)

42
Q

How do I calculate year-over-year growth without using table calcs?

A

Hint: similar to the ‘financial data’ question above.

43
Q

How do I sort on a particular dimensions’ member (i.e. sort on 2008 of “order date”)

A

I.E Do this, but without single-click sort:

44
Q

Can Tableau calculate the result of compounding interest?

A

Yes. Tableau can certainly do this with some table calculation functions:PREVIOUS_VALUE(1)*(1+AVG([Discount]))

45
Q

Can Tableau do z-tests?

A

Yes, the calculation is: (Sum(x) – WINDOW_AVG(Sum(x))) / STDEVP(x) where x is a field.Z represents the number of standard deviations between x and its population average. As a general benchmark, any Z scores above 2 are considered statistically significant.

46
Q

The Tableau Server maintenance views do not fit my needs. Can I customize them?

A

Yes. See the admin guide or knowledge base for details.

47
Q

How can I embed a view without the toolbar?

A

Add to the end of the url: ?:embed=yes&?:toolbar=no

48
Q

Can I do a multiple regression?

A

No. But Tableau does iterative regressions with ANOVA very well.

49
Q

What does the Anova table tell me?

A

analysis of variance: 1. Sources of variation (AKA “Field”) 2. Degrees of freedom 3. Sums of squares 4. Mean squares 5. F-statistic 6. P-Valuefor further information, see also: the internet.

50
Q

What is a p-Value?

A

A p-value shows the probability that sample data will adequately represent the population from which it was drawn. The larger the value, the less likely that the sample data accurately represents the population. The max. value for a P-Value is 1.0, min is zero, and a value of .05 means that there’s a 5% chance that the model does not represent the data.

51
Q

Can I use a cumulative distribution function?

A

Most likely. It depends on exactly what they are asking, but start with a histogram and set the measure to be ‘running total’. If they like that, then set it to percent of total as well (secondary table calc).

52
Q

Can I find the correlation?

A

Yes – compares two sets of data against each other and contains two separate sets of measures.

53
Q

Can I do significance testing?

A

Significance is built-in to the ‘describe trend model’ command. A possible advanced approach is to calculate your significance manually, however this requires a detailed understanding of your dataset and what significance you are testing for.

54
Q

What is scalability?

A

The ability to support multiple simultaneous actions, instead of making a single action faster.

55
Q

What is availability?

A

The ability of a solution (system) to resist component failures. Increasing availability increases cost.

56
Q

What does ‘scaling up’ mean?

A

Adding more resources (CPU, RAM, etc.) to a single server.

57
Q

What does ‘scaling out’ mean?

A

Adding more resources (CPU, RAM, etc.) by adding more servers in a “cluster.”

58
Q

What does ‘multi-process’ mean?

A

Running multiple instances of a process or service on a single server or distributed across multiple servers. This increases throughput.

59
Q

What does ‘multi-threaded’ mean?

A

Within a process, the ability to perform multiple tasks simultaneously across multiple CPUs on one machine.

60
Q

What does ‘single point of failure’ mean?

A

Within a solution (system), a component that, if it fails, will cause the solution (system) to fail as a whole or be completely inaccessible.

61
Q

How do context filters work in 9.0?

A

In 9.0, context filters create sub-queries by default. Temp tables are only created in the case of generic ODBC data sources or customized data sources (when the database isn’t set up to use sub-queries, it reverts to using temp tables).

62
Q

What are the advantages of using a subquery-based approach over using temp tables?

A

It enables join culling, considers only columns required for the viz, and applies all query optimization rewrites.

63
Q

What is the Cache Server?

A

The new Server process for the external query cache. It’s a query cache distributed and shared across the server cluster. This in-memory cache speeds user experience across many scenarios. VizQL server, backgrounder, and data server (and API server and application server to a lesser extent) make cache requests to the cache server on behalf of users or jobs. The cache is single-threaded, so if you need better performance you should run additional instances of cache server.

64
Q

Which processes use the Cache Server?

A

VizQL, backgrounder, and data server. Install the cache server on every machine running one of these processes.

65
Q

What is the default cache capacity?

A

512 MB per cache server instance. The cache is single-threaded, so if you need better performance you should run additional instances of cache server.

66
Q

What is the Repository?

A

The repository is a database that stores server data. It’s based on PostgresSQL. The server data it stores includes information on Tableau Server users, groups and group assignments, projects, permissions, datasources, extract metadata, and refresh information

67
Q

Data Server

A

The component that provides central management of Tableau Data Extracts and database connections. Allows you to upload and share data extracts, preserve database connections, and reuse calculations and field metadata.

68
Q

Why buy Server when Reader is free? What problems does Server solve?

A

Server solves problems around automation (there doesn’t have to be a person sending updates every time the data updates), creates a single version of the truth (we don’t have tons of workbooks and excel spreadsheets out there, we don’t have all these people writing their own definitions), has security features (who are you? are you allowed to see what you want to see? by having a place where the data is centrally locked down, everything’s going through Tableau Server to the customer), and gives mobile access to views (Server is browser agnostic and mobile-aware. You publish once and you can view it anywhere through any devices you want. You don’t have to have a laptop to look at your reports)

69
Q

Why is it a bad idea to take Server apart and put gateway in the DMZ and the rest of the Server behind the firewall?

A

That creates security risks. Every Tableau component ahs its own port. When you put gateway in the DMZ, you’re opening 10+ ports to the outside world on your firewall. A better practice would be to have your Proxy in front of server and keep Server all together

70
Q

What are the benefits of having a black box installation?

A

Ease of use! We have a frequent upgrade schedule, and black box installations make it easy for customers to always have the latest version of Server by taking away repetitive tasks. Our mission is to help anybody see and understand data, and our black box installation helps us do that by making it easier for people with less IT skills be able to implement Tableau, because the installation of Tableau Server is really easy!

71
Q

What’s the difference between published data sources and embedded data sources?

A

Published data sources can be used in multiple workbooks. Embedded data sources are associated with one workbook and describe the attributes required for connecting to a data source, like server name, database name, and authentication credentials. If three workbooks are connected to the same database, then you’ll have three embedded data sources listed on the Data Sources page.

72
Q

What is a black box installation?

A

A black box installation means that all the components needed to run a Tableau Server are included within a Tableau Server. Extra components (like an external load balancer) can be added later for performance, but aren’t necessary.

73
Q

What are disadvantages of a black box installation?

A

Some customers want to know specifically what’s in a Server or want to customize their Server. We’re transparent about the components and the degree of customization for a Tableau Server.

74
Q

What is Data Server?

A

The Data Server servers as a proxy between requests for data and individual data sources.The component that provides central management of Tableau Data Extracts and database connections. Allows you to upload and share data extracts, preserve database connections, and reuse calculations and field metadata.

75
Q

What are the advantages of using Data Server?

A

Data Server allows for central management of Tableau Data Extracts and database connections. You can write complex calculations and make changes to the metadata once. You only have to install drivers once on one machine. You have one source of truth and prevent data creep, since multiple workbooks can be based on the same data source. You can set up data models once. Central permission management for sensitive data sources.

76
Q

Describe the Extract API in one sentence.

A

The Extract API is a programmatic way to generate Tableau Data Extracts without manual intervention in Tableau Desktop.

77
Q

Name 4 use cases for the Extract API.

A
  • Times when you want to pre-process or access and assemble data from other applications before working with it in Tableau
    • To connect to data sources that Tableau doesn’t have a native connector for
    • Useful for technology partners who’d like to have an “export to Tableau” button
    • Some ‘portal’ scenarios when there are workflows in which Extracts would be created on the fly instead of waiting for someone to set them up in Desktop
78
Q

What languages can you write Extract API scripts in?

A

C++, Java, and Python

79
Q

What can be done with the generated TDE?

A

Connect to it as a data source in Tableau Desktop or publish it up to Tableau Server as a published data source and go from there. In other words, the TDE can be used natively in Tableau Desktop or published to Tableau Server using the same API.

80
Q

Does the Extract API work in 32-bit or 64-bit?

A

Works in both

81
Q

What platforms is the Extract API available on?

A

It’s available to developers on Windows and Linux platforms.

82
Q

Explain what addressing and partitioning are.

A

These have to do with table calculations. Addressing indicates the direction we want our table calculation to go in, or what part of the table you are computing along. Partitioning indicates the buckets we want our calculations done on, in other words, how to group the calculation.

83
Q

Describe the Server’s process when it’s looking for something in the query cache.

A

External query cache: we save the results from previous queries for fast access by future users. Each process has an in-memory cache called the query cache. The server process first tries to look for what it needs in the query cache. If it doesn’t find it in the in-memory cache, tries to find what it needs in the Cache Server. If the result is in the Cache Server, it’s copied to the in-memory cache and returned. If it’s not in either place, the query is run on the database and the results are cached in a Cache Server as well as the in-memory cache of the process that needed it. Caches in each Cache Server are accessible by all server processes and nodes in the whole cluster.

84
Q

What’s concurrency in terms of Tableau Server?

A

For Server, concurrency is defined as the number of end users that are actively loading and interacting with visualizations at a specified response time and throughput goal. Server can typically support 10% concurrency. In our testing, we observed that Tableau Server 9.0 can support up to 927 totoal users on a 16-core single machine deployment, so 92.75 concurrent users. Scales linearly. 2 node 32 cores – 1380 total 138.04 concurrent. 3 node 48 cores 2809 total 280.93 concurrent

85
Q

What happens when you create an extract?

A

§ Tableau takes each column, creates two buffers, each 8mg a piece
§ Converts the row return data into columns
§ Takes each column, buffers it, sends it to disk
§ Then goes internal processing local to tableua to create column store, memory maps, sorting
§ When you see the update screen when creating an extract, you can see what’s going on
§ As it’s writing to the disk to create that extract, the longer that takes, the more latency you have in extract creation. That’s not usually the problem
§ Typically, problem is how long it takes for DB to spit the data out
§ When you get to higher quality databases, he’s seen 2 to 3 million rows a minute
§ Try to set the bar low w/tableau so they’re thrilled when Tableau beats it

86
Q

Why multi-node?

A
• Why multi-node?
		○ Availability
		○ Performance
		○ Scaling
		○ People in different time zones
87
Q

Which backup processes are always active?

A

• w/exception of primary every backup is up

○ Gateway is manual failover. The only passive machine in the whole cluster