High Availability Flashcards
What is the minimum supported node configuration for high availability?
three-node system
Can you have more than one Gateway process running on a single machine? On a cluster?
On a single machine: no, and there would be no point.
On a cluster: yes, and all of them are active.
What is the goal of high availability (HA) within Tableau Server (TS)?
The goal is to minimize UNPLANNED downtime.
What does TS attempt to do in the event of a component failure?
It tries to restart (re-spawn) the failed component automatically.
What components need to be redundant in order to achieve system HA?
Every single component.
Redundancy across nodes is possible for all processes except _________ service.
Why?
Licensing service
It can only run on the dedicated Primary node.
Gateway process can run on any node of the TS cluster.
True or false?
True
We can only run the Gateway process on one node at a time.
True or false?
False
Gateway process can run on any and all nodes of the TS cluster. They are all active.
What does Tableau recommend in order to make your Gateway process HA?
Have more than one node in the Tableau Server cluster, and configure more than one of those nodes to run the Gateway process.
They recommend that we run an instance of the Gateway on each node.
What happens when the Gateway fails?
If no Gateway processes are running, the entire TS cluster will be unavailable. If other Gateway processes remain running, requests made to those working Gateways will be processed normally. However, any requests received by the failed Gateway will not be redirected and will continue to fail, despite the presence of other functioning Gateways.
How to make your system robust to Gateway failures?
Run multiple Gateways across the cluster and configure an external load balancer to route traffic accordingly.
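The behaviour a load balancer adds can be sketched as a small model: it sends traffic only to Gateways that pass a health check, so a failed Gateway stops receiving requests. This is an illustrative simulation, not a Tableau or load-balancer API; the function name, the node names, and the health dictionary are all hypothetical.

```python
from itertools import cycle


def route_requests(gateways, health, n_requests):
    """Round-robin over gateways, skipping any whose health check fails.

    `health` maps a gateway name to True (up) or False (failed).
    Hypothetical model of a load balancer, not a real API.
    """
    healthy = [g for g in gateways if health[g]]
    if not healthy:
        # No Gateway is running, so the whole cluster is unreachable.
        raise RuntimeError("no healthy Gateway: cluster unavailable")
    picker = cycle(healthy)
    return [next(picker) for _ in range(n_requests)]


gateways = ["node1", "node2", "node3"]  # hypothetical node names
health = {"node1": True, "node2": False, "node3": True}  # node2's Gateway failed
print(route_requests(gateways, health, 4))
# ['node1', 'node3', 'node1', 'node3']
```

Without the health check, every third request would still land on node2 and fail, which is the situation described in the previous card.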
How can you achieve HA with the Application Server?
Configure instances of the App Server on each node in the cluster.
What happens when an Application Server process fails?
Requests being handled by that instance of the App Server will fail, but subsequent requests will be routed to other running Application Servers.
If the node containing the failed App Server is still running, the failed process should automatically restart itself within seconds.
How many Coordination Service processes will be installed if I am running 4 nodes in a cluster? How about 3? 5?
3, 3, 5
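The answer above can be written as a lookup. The values for three, four, and five nodes come from this card; the value for one or two nodes and the cap at five for larger clusters are assumptions added here for completeness, and the function name is hypothetical.

```python
def coordination_service_count(nodes: int) -> int:
    """Number of Coordination Service processes for a cluster of `nodes` nodes."""
    if nodes < 1:
        raise ValueError("a cluster needs at least one node")
    if nodes <= 2:
        return 1  # assumption: not stated in the cards
    if nodes <= 4:
        return 3  # from the card: 3 nodes -> 3, 4 nodes -> 3
    return 5      # from the card for 5 nodes; assumed cap for larger clusters


print([coordination_service_count(n) for n in (4, 3, 5)])
# [3, 3, 5]
```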
What is a quorum?
Explain how this is important in the context of TS?
A quorum is just another way of saying an absolute majority.
A complete Tableau Server outage will occur if the number of running Coordination Service processes does not constitute a quorum, which is based on the total number of configured Coordination Service processes.
How many Coordination Service processes are installed on a three node cluster? How many node failures can be tolerated without crashing the entire system?
A cluster with either three or four computers is able to tolerate the loss of, at most, one node (one instance of the Coordination Service).
i.e.
Total CS processes: Three
Quorum equals: 2 (to have a majority)
Tolerates: 1 CS failure
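The arithmetic in this card generalizes: a quorum is a strict majority of the configured Coordination Service processes, and the tolerated failures are whatever is left over. A minimal sketch, with hypothetical function names:

```python
def quorum(total: int) -> int:
    """Smallest number of processes that is a strict majority of `total`."""
    return total // 2 + 1


def tolerated_failures(total: int) -> int:
    """How many processes can fail while a quorum is still running."""
    return total - quorum(total)


for total in (1, 3, 5):
    print(total, quorum(total), tolerated_failures(total))
# 1 1 0
# 3 2 1
# 5 3 2
```

A single Coordination Service process has a quorum of one and tolerates no failures, which is why a smaller deployment cannot be highly available.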
Why do you need three nodes to achieve HA? Why not two?
A cluster with only two nodes cannot tolerate the loss of even a single Coordination Service process.
What happens when a Coordination Service process fails?
Nothing, as long as the number of remaining Coordination Service processes still constitutes a quorum.
If the number of Coordination Service processes still running falls below a quorum, the entire Tableau Server cluster becomes unavailable in order to protect the referential integrity of the underlying Postgres database.
How is the Cluster Controller process installed on a cluster?
One instance is installed on each node of the cluster. No explicit config is necessary.
What happens when a Cluster Controller process fails?
All other TS components on that same node will become unavailable and display as “unavailable” on the TS status page. Any Repository process running on that node will also be unavailable.
How do you protect against a Cluster Controller process failure?
Ensure that each unique server component has redundancy and is running on at least two different nodes in the cluster.
When a Cluster Controller restarts, it also restarts any _________ process configured on that node.
Repository
What happens if there is no fully-functioning Repository process running on the cluster?
The entire TS cluster will be unavailable.
How can you improve Repository availability?
Configure an additional “passive” Repository on a different node of the cluster.
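The cards name three conditions that each take the whole cluster down: no running Gateway, no Coordination Service quorum, and no functioning Repository. As a study aid they can be combined into one check. This is a simplified model with hypothetical names, not how TS itself evaluates its status.

```python
def cluster_available(gateways_up: int, cs_up: int, cs_total: int,
                      repositories_up: int) -> bool:
    """True if none of the three cluster-wide outage conditions applies."""
    has_gateway = gateways_up >= 1          # at least one Gateway running
    has_quorum = cs_up > cs_total // 2      # strict majority of Coordination Services
    has_repository = repositories_up >= 1   # at least one functioning Repository
    return has_gateway and has_quorum and has_repository


print(cluster_available(3, 2, 3, 1))  # True: one Coordination Service lost, quorum holds
print(cluster_available(0, 3, 3, 1))  # False: no Gateway
print(cluster_available(2, 1, 3, 2))  # False: quorum lost
print(cluster_available(2, 3, 3, 0))  # False: no Repository
```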