Amazon OpenSearch Flashcards
What is Amazon OpenSearch Service?
A fully managed service that makes it easy to deploy, operate, and scale OpenSearch clusters in the AWS Cloud. OpenSearch is a powerful open-source search and analytics engine.
What are some common use cases for OpenSearch?
- Log analytics
- Full-text search
- Security analytics
- Application monitoring
- Clickstream analytics
Name the core components of OpenSearch.
- Documents: The data you index and search, represented in JSON format.
- Types: (Deprecated) Formerly used to define schemas for documents within an index; current versions use a single implicit type per index.
- Indices: Collections of documents that share a mapping (schema); an index is the unit you query.
- Shards: Subdivisions of an index for distributing data across nodes.
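To make the relationship between documents, indices, and shards concrete, here is a minimal sketch of how a document could be routed to a primary shard. It is a simplification: the real engine uses its own hash function and routing details, and CRC32 is used here only as a deterministic stand-in. The index settings, document fields, and the `pick_shard` helper are hypothetical.

```python
import zlib


def pick_shard(routing_key: str, num_primary_shards: int) -> int:
    """Map a routing key (by default the document ID) to a primary shard number."""
    # Stand-in hash (assumption): chosen only to keep the sketch
    # deterministic and dependency-free, not what the engine uses.
    return zlib.crc32(routing_key.encode("utf-8")) % num_primary_shards


# Hypothetical index definition: 3 primary shards, 1 replica of each.
index_settings = {
    "settings": {"index": {"number_of_shards": 3, "number_of_replicas": 1}}
}

# A document is a JSON object; in Python it is naturally a dict.
doc = {"timestamp": "2024-01-01T00:00:00Z", "message": "login ok"}

num_shards = index_settings["settings"]["index"]["number_of_shards"]
shard = pick_shard("doc-1", num_shards)
```

Because the shard is derived from the routing key and the number of primary shards, the same document ID always lands on the same shard, which is also why the primary shard count is normally fixed when the index is created.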
How does OpenSearch provide redundancy and high availability?
Through replication. Each primary shard can have one or more replicas, which are placed on different nodes. If the node holding a primary shard fails, a replica is promoted to primary.
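The failover behavior can be illustrated with a toy simulation. This is a conceptual sketch only, not how the cluster is implemented; the `ShardCopy` class, node names, and helper functions are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ShardCopy:
    shard: int      # shard number within the index
    node: str       # node currently holding this copy
    primary: bool   # True for the primary, False for a replica


def total_shard_copies(primaries: int, replicas: int) -> int:
    """Each primary shard plus its replicas."""
    return primaries * (1 + replicas)


def fail_node(copies, node):
    """Drop copies on the failed node; promote a replica where a primary was lost."""
    survivors = [c for c in copies if c.node != node]
    for shard in {c.shard for c in survivors}:
        group = [c for c in survivors if c.shard == shard]
        if not any(c.primary for c in group):
            group[0].primary = True  # promote a surviving replica
    return survivors


# Two primary shards with one replica each, spread over two nodes.
copies = [
    ShardCopy(0, "node-a", True), ShardCopy(0, "node-b", False),
    ShardCopy(1, "node-b", True), ShardCopy(1, "node-a", False),
]
after_failure = fail_node(copies, "node-a")
```

After `node-a` fails, both shards still have a primary on `node-b`, so the index stays available for reads and writes.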
What are the different storage tiers available in Amazon OpenSearch Service?
- Hot storage: Fastest, best for frequently accessed data. Backed by instance storage or EBS volumes.
- UltraWarm storage: Cost-effective for less frequently accessed data, uses S3 with caching.
- Cold storage: Cheapest, for infrequent access and long-term retention, uses S3.
What is the purpose of Index State Management (ISM)?
Automates common index management tasks:
* Deleting old indices
* Changing replica count
* Moving indices to different storage tiers
* Taking snapshots
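As an illustration, an ISM policy is a JSON document that describes states, the actions run in each state, and the conditions for moving between them. The sketch below builds one as a Python dict. The index pattern, ages, and description are hypothetical values, and the policy would typically be sent with `PUT _plugins/_ism/policies/<policy_id>`.

```python
import json

# Hypothetical lifecycle: keep 1 replica while hot, move to UltraWarm
# after 7 days, delete after 90 days.
ism_policy = {
    "policy": {
        "description": "Hypothetical lifecycle for log indices",
        "default_state": "hot",
        "states": [
            {
                "name": "hot",
                "actions": [{"replica_count": {"number_of_replicas": 1}}],
                "transitions": [
                    {"state_name": "warm", "conditions": {"min_index_age": "7d"}}
                ],
            },
            {
                "name": "warm",
                "actions": [{"warm_migration": {}}],
                "transitions": [
                    {"state_name": "delete", "conditions": {"min_index_age": "90d"}}
                ],
            },
            {"name": "delete", "actions": [{"delete": {}}], "transitions": []},
        ],
        "ism_template": {"index_patterns": ["logs-*"], "priority": 100},
    }
}


def undefined_transition_targets(policy: dict) -> set:
    """Return transition targets that do not name a defined state."""
    states = policy["policy"]["states"]
    names = {s["name"] for s in states}
    targets = {t["state_name"] for s in states for t in s["transitions"]}
    return targets - names


request_body = json.dumps(ism_policy, indent=2)
```

The small `undefined_transition_targets` check is a local sanity test for typos in state names before the policy is submitted.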
What are index rollups?
A way to summarize and reduce the size of older data by creating new, smaller indices with aggregated information.
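The idea behind a rollup can be sketched in plain Python: many raw documents are grouped into time buckets and replaced by one summary document per bucket. This illustrates the concept only and is not the rollup job API; the field names and sample data are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def roll_up_hourly(docs):
    """Aggregate raw documents into one summary document per (hour, status)."""
    buckets = defaultdict(lambda: {"count": 0, "bytes_sum": 0})
    for d in docs:
        # Truncate the timestamp to the start of its hour.
        hour = datetime.fromisoformat(d["timestamp"]).strftime("%Y-%m-%dT%H:00:00")
        key = (hour, d["status"])
        buckets[key]["count"] += 1
        buckets[key]["bytes_sum"] += d["bytes"]
    return [
        {"timestamp": hour, "status": status, **agg}
        for (hour, status), agg in sorted(buckets.items())
    ]


raw = [
    {"timestamp": "2024-01-01T10:05:00", "status": 200, "bytes": 100},
    {"timestamp": "2024-01-01T10:45:00", "status": 200, "bytes": 300},
    {"timestamp": "2024-01-01T11:10:00", "status": 500, "bytes": 50},
]
summary = roll_up_hourly(raw)
```

Three raw documents collapse into two summary documents. The trade-off is that per-document detail is lost, which is why rollups are usually applied only to older data.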
What is cross-cluster replication used for?
Replicating indices across OpenSearch domains for:
* High availability in case of outages
* Geographic redundancy
* Lower latency for users in different regions
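As a rough sketch, replication is started on the follower domain by naming the leader connection and the index to replicate. The helper below only builds the request; the index names, connection alias, and role names are hypothetical placeholders, and the request would still need to be signed and sent by an HTTP client of your choice.

```python
def start_replication_request(follower_index: str, leader_alias: str, leader_index: str):
    """Build (method, path, body) for starting replication on the follower domain."""
    path = f"/_plugins/_replication/{follower_index}/_start"
    body = {
        "leader_alias": leader_alias,   # name of the cross-cluster connection
        "leader_index": leader_index,   # index on the leader domain
        "use_roles": {
            # Placeholder roles (assumption): use roles that exist in your domains.
            "leader_cluster_role": "all_access",
            "follower_cluster_role": "all_access",
        },
    }
    return "PUT", path, body


method, path, body = start_replication_request(
    follower_index="logs-replica",
    leader_alias="primary-region-connection",
    leader_index="logs",
)
```

Replication is pull-based: the follower index is read-only and fetches changes from the leader index.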
How can you improve the stability of your OpenSearch cluster?
- Use three dedicated master nodes so that a majority (quorum) can always be formed, which avoids split-brain situations.
- Monitor disk space usage closely to prevent running out of storage.
- Choose the appropriate number of shards based on data volume and desired shard size.
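Two of these points reduce to simple arithmetic, sketched below. The default target shard size (30 GiB) and indexing overhead (10%) are assumptions used as starting points, not fixed rules; adjust them for your workload.

```python
import math


def primary_shard_count(source_gib: float,
                        target_shard_gib: float = 30.0,
                        indexing_overhead: float = 0.10) -> int:
    """Estimate primary shards so that each stays near the target size."""
    index_size = source_gib * (1 + indexing_overhead)
    return max(1, math.ceil(index_size / target_shard_gib))


def quorum(master_eligible_nodes: int) -> int:
    """Votes needed for a majority among master-eligible nodes."""
    return master_eligible_nodes // 2 + 1


shards_for_500_gib = primary_shard_count(500)   # 550 GiB / 30 GiB, rounded up
votes_needed = quorum(3)                        # majority of 3 nodes
```

With three dedicated master nodes the quorum is two, so the cluster can lose one master node and still elect a leader, while two isolated halves can never both reach a majority.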
What are some security features offered by Amazon OpenSearch Service?
- Resource-based policies
- Identity-based policies (IAM integration)
- IP-based access control
- Encryption at rest and in transit
- VPC support
- Integration with Cognito for user authentication
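For example, a resource-based policy with IP-based access control can be written as the following document. The account ID, Region, domain name, and CIDR range are placeholders. The `ip_allowed` helper is a hypothetical local check that mirrors only the `IpAddress` condition; it is not an IAM policy evaluator.

```python
import ipaddress
import json

# Placeholder ARN and CIDR range (assumptions for illustration).
access_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {"AWS": "*"},
            "Action": "es:ESHttp*",
            "Resource": "arn:aws:es:us-east-1:123456789012:domain/example-domain/*",
            "Condition": {"IpAddress": {"aws:SourceIp": ["192.0.2.0/24"]}},
        }
    ],
}


def ip_allowed(policy: dict, source_ip: str) -> bool:
    """Rough local check: is source_ip inside any allowed CIDR range?"""
    addr = ipaddress.ip_address(source_ip)
    for stmt in policy["Statement"]:
        cidrs = stmt.get("Condition", {}).get("IpAddress", {}).get("aws:SourceIp", [])
        if stmt["Effect"] == "Allow" and any(
            addr in ipaddress.ip_network(cidr) for cidr in cidrs
        ):
            return True
    return False


policy_json = json.dumps(access_policy)
```

A request from `192.0.2.10` would match the condition, while one from `198.51.100.1` would not.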
What are some anti-patterns to avoid when using OpenSearch?
- Don’t use OpenSearch for OLTP workloads (it’s not a transactional database).
- Avoid using it for ad-hoc data querying (tools like Athena are better suited).
- Keep workloads aligned with what OpenSearch is optimized for: search and analytics.
What is Amazon OpenSearch Serverless?
A serverless option for OpenSearch that provides:
* On-demand autoscaling
* Encryption with your own KMS keys
* Simplified data management with collections
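As a sketch of how collections and customer-managed keys fit together, the snippet below builds an encryption policy that covers collections matching a name pattern, plus the arguments for creating a collection. The names and key ARN are placeholders. The argument dicts are shaped for the `create_security_policy` and `create_collection` operations of the AWS SDK's OpenSearch Serverless client, but nothing is sent here, so verify the shapes against the SDK version you use.

```python
import json


def encryption_policy(collection_pattern: str, kms_key_arn: str) -> dict:
    """Encryption policy applying a customer-managed KMS key to matching collections."""
    return {
        "Rules": [
            {
                "ResourceType": "collection",
                "Resource": [f"collection/{collection_pattern}"],
            }
        ],
        "AWSOwnedKey": False,
        "KmsARN": kms_key_arn,
    }


# Placeholder key ARN and names (assumptions for illustration).
policy = encryption_policy(
    "logs-*", "arn:aws:kms:us-east-1:123456789012:key/EXAMPLE-KEY-ID"
)

create_policy_args = {
    "name": "logs-encryption",
    "type": "encryption",
    "policy": json.dumps(policy),
}
create_collection_args = {"name": "logs-app", "type": "TIMESERIES"}
```

The encryption policy has to exist before a matching collection is created, because the encryption key for a collection is chosen at creation time.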
What are the key benefits of using OpenSearch Serverless?
- Eliminates the need to manage infrastructure.
- Scales automatically based on workload demands.
- Offers pay-as-you-go pricing.
- Enhances security with automatic encryption.
What are OpenSearch Compute Units (OCUs)?
The unit of compute capacity in OpenSearch Serverless, consumed separately for indexing and for search. You can set maximum OCU limits to cap the capacity used by your collections.
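A small sketch of how OCU limits might be expressed and reasoned about follows. The `capacity_limits` helper is hypothetical. Its output is shaped like the capacity-limit settings accepted by the AWS SDK's OpenSearch Serverless `update_account_settings` operation, which is an assumption to verify against your SDK version. The price per OCU-hour is supplied by the caller rather than hard-coded, since pricing varies by Region.

```python
def capacity_limits(max_indexing_ocu: int, max_search_ocu: int) -> dict:
    """Build a capacity-limit settings payload (sanity-checked, not sent)."""
    if max_indexing_ocu < 1 or max_search_ocu < 1:
        raise ValueError("OCU limits must be positive integers")
    return {
        "capacityLimits": {
            "maxIndexingCapacityInOCU": max_indexing_ocu,
            "maxSearchCapacityInOCU": max_search_ocu,
        }
    }


def max_hourly_compute_cost(limits: dict, price_per_ocu_hour: float) -> float:
    """Upper bound on hourly compute cost if both limits are fully used."""
    caps = limits["capacityLimits"]
    total_ocus = caps["maxIndexingCapacityInOCU"] + caps["maxSearchCapacityInOCU"]
    return total_ocus * price_per_ocu_hour


limits = capacity_limits(max_indexing_ocu=4, max_search_ocu=6)
# 0.25 is a made-up price used only to show the arithmetic.
worst_case = max_hourly_compute_cost(limits, price_per_ocu_hour=0.25)
```

Setting the limits bounds the worst-case compute bill: autoscaling can add OCUs under load, but never beyond the configured maximums.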