Section 5.1 Flashcards
What does the Indexing Layer do?
Allows you to clean up data.
Allows you to refine data.
Allows you to store data.
What is Index clustering?
When multiple indexers are connected in order to replicate copies of the indexers buckets (data).
Where is data stored?
In indexes on the indexer that have buckets.
What is automatic failover?
Basically backing up data. If one indexer fails, the others will pickup the slack and maintain continuity.
High availability means…
Data is highly available for searching.
Index Clustering in summary means
Data is protected from sudden loss
More copies are available for users who are actively searching
Indexer activities will continue in the event an indexer goes down
Replication Factor determines
How many copies are maintained within an indexer cluster.
Deafult RF is 3
Maximum RF is determined by the number of indexers you have or nodes.
Search Factor determines
How many of these copies are immediately searchable.
Default SF is 2
In a clustering environment you need a minimum of ____ Indexers
3
Most important fact about a Search Factor (SF)
The Search Factor can never be more than the Replication Factor.
Explain RF & SF
RF factor tells us how many times we want the data to be copied over. Two of those copies are highly available and just incase something happens to the first copy. If both copies go down, the third copy is usually stored at an offsite location.
When does the Cluster Master come in?
The Cluster Master comes into play when we start copying our data (when the environment becomes clustered).
Cluster Master Manages what layer?
It manages the indexing layer.
What is the Cluster Master?
A centralized configuration Manager who’s job is to manage the indexer cluster.
Once the environment becomes clustered, the Deployment Server….
Only manages the forwarders.
What does a Cluster Master do?
Manages cluster activities (adding peers, distributing configurations, determines the number of copies to maintain).
Maintains memory of peers, their buckets, and configs
Tells search head where to request data.
What are Peers (Cluster Peer)?
Peers are Indexers
What do Peer Nodes do?
Peers receive and index incoming data typically from forwarders)
Replicate data to other peers
Respond to incoming searches by supplying search results
A clustered architecture is called ..
A distributed search
Clustering is Smart because it provides….
Data Availability
Data Fidelity
Data Resiliency
Disaster Recovery
Search Affinity
Multi-site clustering =
Storing copies of your data at a different site
Data fidelity =
The act of not losing data; reliability
Benefits of Clustering =
1.Data Availability & fast recovery
2.Easier overall administration
3.Scalability of indexing
4.No additional cost for data replication
Cons of clustering =
1.Increased storage requirements
2.Increased processing load
3.Requires additional Splunk instances
4.Indexers require the same OS and versions
When you enable a search head in cluster environment you must specify what?
Cluster settings (i.e. Master Node) and the port on which it receives data.
Transforms.conf=
specify transformations and lookups that can then be applied to any event
What is the filepath of the CM that sends apps to its peers ?
splunkhome/etc/master-apps
Where do bundles reside for cluster peer?
splunkhome/etc/slave-apps