Managing StorageGRID Objects Flashcards
What is the life cycle of an object?
Ingest (client application creates object)
Manage (ILM manages the object over time and protects from data loss)
Retrieve (Client application reads object from storage node)
Delete (Client deletes the object using removal command)
What is an ILM Policy?
A set of prioritised ILM rules that determines how the Grid manages the data over time.
This is applied when the object is first ingested.
Example rule: Place one copy in DC 1 disk, Place one copy in DC 2 disk, keep a copy on DC2 disk for one year before moving to cold storage in the cloud
What is the default ILM Policy Rule?
The default rule is to make one copy of the object at every site.
What is a storage pool
A logical grouping of storage nodes
What is storage grade?
Defines the The type of storage that is using in a storage pool
What is a site?
Is the data centre where storage pools are located.
What is a region?
A location where ILM rules can be placed based on the regions where S3 buckets are created.
Regions can be used to keep data close to a customer location if required to reduce latency, address any regulatory requriements and minimise cost (if available).
What is the Cloud Storage Pool?
This is a pool that allows you to move objects to an external storage (cloud) outside your Grid system.
What are ILM rules?
They define
The location of the data
The type of storage that is used
The type of loss protection that is applied to an object’s data
The number of copies made
The life space of the object (and changes to its location, Renton and loss protection over time).
What are the two mechanisms ILM rules provide to protect from object data loss?
Replication
Erasure coding
What is object ingest?
When the client creates the object and sends it to the grid system (though a put request) matches the object against the ILM rule.
The system will make a temporary copy else where if it is unable to initially complete the rules of the ILM
What is replication used for
To protect the object by making a copy at another site that can be updated by the GRID
What is Erasure coding used for?
Data loss, where an ILM rule create object parity items which are spread across the GRID to ensure protection (like raid).
There are factors to consider
Erasure code (data + parity).
Storage overhead (%)
Storage Node redundancy — the number of nodes that can be lost without impacting data.
Site redundancy — The number of storage grid sites that can be lost without impacting data retrieval.
What is the Object retrieve?
The client application retrieving data from the GRID system using a http get request.
Usually from the local storage node. The storage node returns the data via the gateway node.
What is object delete?
This is when a client sends a delete request Http Delete API to the grid system.
All copies of an object are removed from the grid system.
It does not happen immedately. The key is removed first, and later the system will purge the data and meta data.
What is the different between a storage pool and a cloud storage pool
A storage pool will reside within one or more sites.
Whereas a cloud storage pool will be one location (azure or AWS).
An object will be moved from the storage pool to the cloud but the meta data is stored in the Grid.
What is cross-grid replication?
This allows fro the use of 3-2 backup stragegy (keep 3 copies, two in the grid, and one in the cloud).
All data is replicated between the grids.
Replication can be singe or bi-directional.
What is object protection
Object Versioning (S3 bucketing)
Object Encryption (using data at rest)
Object lifecycle management (when objects are expired in certain S3 buckets)
Object Locking (this allows for client applications to set retention settings - i.e. for secure worm objects)
What are Platform Services
They can help implement a hybrid cloud strategy (for your environment).
E.G.
CloudMirror Replication (ingested objects immediately replicated to an external S3 bucket).
Event notification (send an SNS notification when specific object event types occur).
Search Integration (Send object metadata to an Elasticsearch cluster when an object is ingested or deleted or when its
metadata changes).
ASSESSMENT QUESTION - Which grid node thyme manages an object’s data and metadata storage, including protection?
Storage Node
ASSESSMENT: Which grid node type is responsible for sending out AutoSupport information to NetAPP Support?
Admin
ASSESSMENT: what is the minimum number of storage nodes that is supported at each site?
Three
ASSESSMENT: A StorageGRID SG1000 appliance functions as which two types of grid node?
Admin
Gateway
ASSESSMENT: Which management interface do you use to create a tenant?
Grid Manager interface