Gold Study Guide Word Doc Flashcards
API Server
A single threaded process that handles REST API calls
Application Server
Handles the web application, supports browsing and searching.
App server handles authentication
This process is multi-threaded
Backgrounder
A single threaded process that executes server tasks, including extract refreshes, ‘Run Now’ tasks, and tasks initiated from tabcmd
Cache Server
Query cache shared across all Tableau processes in a cluster. Used by Backgrounder, Data Server, and VizQL
Cluster Controller
Responsible for monitoring various components, detecting failures, and executing failover when needed
Coordination Service
In distributed installations, responsible for ensuring there is a quorum for making decisions during failover
Data Engine
Loads data extracts into queries and memory
Data Server
Manages connections to Tableau Server data sources
File Store
Automatically replicates extracts across data engine nodes
Repository
Tableau Server database, stores workbook and user metadata.
Stores Application Data, File Data, TWBs and TDSs
Based on PostgresSQL
Meta-Data and Application Data can be queried and reported on in Tableau
Search & Browse
Handles fast search, filter, retrieval , and display of content metadata on the server. Queries meta-data from TWBs and TDSs. Based on Apache SolR
VizQL Server
Loads and renders views, computes and executes queries
Explain the following: Count, Sum, Average, Minimum, Maximum
Count - the number of values. Sum - the sum of values. Average- the average of values. Minimum - the minimum value. Maximum- max value
First Quartile, Median, Third Quartile
The values observed at the 25th, 50th, and 75th percentiles
Standard Deviation
Measures the amount of dispersion or variation in a set of values
What % of the data are within 1, 2, and 3 standard deviations?
1 - 68% of the data. 2 - 95% of the data. 3 - 99.7% of the data
Skewness
Measures the asymmetry of the distribution of values around the mean. Think of this as whether the curve is weighted to the left (positive skew) or the right (negative skew)
Excess kurtosis
Kurtosis is a descriptor of the shape of a curve. It’s tall and skinny (positive kurtosis) or flat and wide (negative kurtosis)
What are the fields in the “describe trend model” dialog?
Model formula Number of modeled observations Number of filtered observations Model degrees of freedom Residual degrees of freedom (DF) SSE (sum squared error) MSE (mean squared error) R-squared Standard error p-value (significance) Analysis of variance (ANOVA) table Individual trend lines table Terms Value P-value
Model formula
the formula for the full trend line model. The formula reflects whether you have specified to exclude factors from the model.
Number of modeled observations
the number of rows used in the view
Number of filtered observations
the number of observations excluded from the model
Model degrees of freedom
the number of parameters needed to completely specify the model. Linear, logarithmic, and exponential trends have model degrees of freedom of 2. Polynomial trends have model degrees of freedom of 1 plus the degree of the polynomial. For example a cubic trend has model degrees of freedom of 4, since we need parameters for the cubed, squared, linear and constant terms.
Residual degrees of freedom (DF)
for a fixed model, this value is defined as the number of observations minus the number of parameters estimated in the model.
SSE (sum squared error)
the difference between the observed value and the value predicted by the model. In the Analysis of Variance table, this column is actually the difference between the SSE of the simpler model in that particular row and the full model, which uses all the factors. This SSE also corresponds to the sum of the differences squared of the predicted values from the smaller model and the full model.
MSE (mean squared error)
the SSE quantity divided by its corresponding degrees of freedom. The difference between the observed value and the value estimated by the model, divided by its corresponding degrees of freedom
R-Squared
R-squared is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination, or the coefficient of multiple determination for multiple regression.
The definition of R-squared is fairly straight-forward; it is the percentage of the response variable variation that is explained by a linear model.
Standard error
The difference between the observed value and the value estimated by the model.
An estimate of the standard deviation (variability) of the “random errors” in the model formula.
p-value (significance)
the probability that an F random variable with the above degrees of freedom exceeds the observed F in this row of the Analysis of Variance table. A measure of statistical significance. If significant, there is a low
Analysis of Variance table
also known as the ANOVA table, lists information for each factor in the trend line model. The values are a comparison of the model without the factor in question to the entire model, which includes all factors.
Individual trend lines table
provides information about each trend line in the view. Looking at the list you can see which, if any, are the most statistically significant. This table also lists coefficient statistics for each trend line. A row describes each coefficient in each trend line model. For example, a linear model with an intercept requires two rows for each trend line. In the Line column, the p-value and the DF for each line span all the coefficient rows. The DF column under the shows the residual degrees of freedom available during the estimation of each line.
Terms
the name of the independent term.
Value
the estimated value of the coefficient for the independent term.
StdErr
A measure of the spread of the sampling distribution of the coefficient estimate. This error shrinks as the quality and quantity of the information used in the estimate grows.
T-value
the statistic used to test the null hypothesis that the true value of the coefficient is zero.
P-value
the probability of observing a t-value that large or larger in magnitude if the true value of the coefficient is zero. So, a p-value of .05 gives us 95% confidence that the true value is not zero.
Can I deploy a single Tableau Server distributed across the different server zones in my IT infrastructure?
No. You must deploy Tableau Server in the same zone, and subnet for that matter, whether the Tableau Server is a single node or multiple nodes.
I am not sure if Browser Rendering is having any effect on my clients. Is there a way to inhibit it?
Yes. Add to the end of the url: ?:render=false
How do I embed a view without the toolbar?
Add to the end of the url: ?:embed=yes&:toolbar=no
Can you configure Tableau Server to support non-english speaking users
Yes. Each user can set the Language and Locale of the Tableau Server interface. This will change all of the non-content language on Tableau Server
Explain how the Resulting Permissions panel works for a Workbook. What information does it convey?
First, to see the resulting permissions you must select a group of user in the Permission Rules section of the Permissions pop up
Then you will see the resulting permissions of all the users in that group. You will see:
1. If you select a group you will see the number of users in the group.
2. For each user in the group you will see their resulting permissions acquired due to them being in that group, or another group, or assigned to them explicitly.