Amazon OpenSearch Flashcards

1
Q

What is Amazon OpenSearch Service?

A

A fully managed service that makes it easy to deploy, operate, and scale OpenSearch clusters in the AWS Cloud. OpenSearch is a powerful open-source search and analytics engine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some common use cases for OpenSearch?

A
  • Log analytics
  • Full-text search
  • Security analytics
  • Application monitoring
  • Clickstream analytics
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name the core components of OpenSearch.

A
  • Documents: The data you index and search, represented in JSON format.
  • Types: (Deprecated in later versions) Used to define schemas for documents.
  • Indices: Collections of documents and their associated types.
  • Shards: Subdivisions of an index for distributing data across nodes.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does OpenSearch provide redundancy and high availability?

A

Through replication. Each shard can have one or more replicas. If a shard fails, a replica takes over.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different storage tiers available in Amazon OpenSearch Service?

A
  • Hot storage: Fastest, best for frequently accessed data. Backed by instance storage or EBS volumes.
  • UltraWarm storage: Cost-effective for less frequently accessed data, uses S3 with caching.
  • Cold storage: Cheapest, for infrequent access and long-term retention, uses S3.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of Index State Management (ISM)?

A

Automates common index management tasks:
* Deleting old indices
* Changing replica count
* Moving indices to different storage tiers
* Taking snapshots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are index rollups?

A

A way to summarize and reduce the size of older data by creating new, smaller indices with aggregated information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is cross-cluster replication used for?

A

Replicating indices across OpenSearch domains for:
* High availability in case of outages
* Geographic redundancy
* Lower latency for users in different regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can you improve the stability of your OpenSearch cluster?

A
  • Use 3 dedicated master nodes to avoid split-brain situations.
  • Monitor disk space usage closely to prevent running out of storage.
  • Choose the appropriate number of shards based on data volume and desired shard size.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some security features offered by Amazon OpenSearch Service?

A
  • Resource-based policies
  • Identity-based policies (IAM integration)
  • IP-based access control
  • Encryption at rest and in transit
  • VPC support
  • Integration with Cognito for user authentication
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some anti-patterns to avoid when using OpenSearch?

A
  • Don’t use OpenSearch for OLTP workloads (it’s not a database).
  • Avoid using it for ad-hoc data querying (tools like Athena are better suited).
  • Remember that OpenSearch is optimized for search and analytics.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Amazon OpenSearch Serverless?

A

A serverless option for OpenSearch that provides:
* On-demand autoscaling
* Encryption with your own KMS keys
* Simplified data management with collections

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the key benefits of using OpenSearch Serverless?

A
  • Eliminates the need to manage infrastructure.
  • Scales automatically based on workload demands.
  • Offers pay-as-you-go pricing.
  • Enhances security with automatic encryption.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are OpenSearch Compute Units (OCUS)?

A

The unit of measure for capacity in OpenSearch Serverless. You can set limits on the OCUs used by your collections.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly