Amazon Glacier | Amazon Glacier Select Flashcards
Can I obtain a real time list of my vaults?
Amazon Glacier Select
Amazon Glacier | Storage
Yes, you can list your vaults stored in Amazon Glacier using either the AWS Management Console or by calling the ListVaults API. As well as a list of vault names, you will also be able to see when the vault’s inventory was last updated and a summary of the vault’s contents at that time, as well as the vault’s creation date and creator.
What is Amazon Glacier Select?
Amazon Glacier Select
Amazon Glacier | Storage
Amazon Glacier Select is a feature that allows you to run queries on your data stored in Amazon Glacier, without the need to restore the entire object to a hotter tier like Amazon S3. With Amazon Glacier Select, you can now perform filtering and basic querying using a subset of SQL directly against your data in Amazon Glacier. You provide a SQL query and list of Amazon Glacier objects, and Amazon Glacier Select will run the query in-place and write the output results to a bucket you specify in Amazon S3.
Why should I use Amazon Glacier Select?
Amazon Glacier Select
Amazon Glacier | Storage
Amazon Glacier Select enables you to perform analysis on your data in Amazon Glacier without first staging it in a hotter storage tier like Amazon S3. This makes it cheaper, faster and easier to gather insights from your cold data in Amazon Glacier. This can unlock exciting business value for your archives, opening up multiple scenarios of using Amazon Glacier for Big Data, IoT, and custom analytics workloads.
How does the Amazon Glacier Select compare to legacy archival solutions?
Amazon Glacier Select
Amazon Glacier | Storage
Legacy archival solutions, like on-premises tape libraries, have highly restricted data retrieval throughput and rarely have idle compute capacity nearby. The problem is even worse if tapes have been sent to an off-site storage facility. Running any kind of analysis on these solutions can easily take anywhere from weeks to even months. In contrast, with Amazon Glacier Select it is easy to analyze your Amazon Glacier data in-place quickly and inexpensively at latencies you choose ranging from minutes to hours.
What are some scenarios in which I can use Amazon Glacier Select?
Amazon Glacier Select
Amazon Glacier | Storage
You can use Amazon Glacier Select when you need to perform pattern matching or custom analytics on your archived data stored in Glacier. Some customers occasionally face situations where they need to perform filtering on specific keys in response to an audit where they must respond in a few hours, such as a customer who might need to query all of their usage logs for the past year to respond to a billing dispute. Higher-level Big Data applications, like Amazon Athena, can also use the Amazon Glacier Select APIs to provide Amazon Glacier as an additional data source, so that customers can use their tools and languages against their Glacier data.
What kind of latencies can I expect when querying against Amazon Glacier?
Amazon Glacier Select
Amazon Glacier | Storage
Glacier provides three retrieval options - Expedited, Standard, and Bulk. All of these options provide different retrieval times and costs. Amazon Glacier Select works with each of these retrieval options, allowing you to choose the option best aligned to the speed at which you want your query to return results. For all but the largest archives (250MB+), data accessed using Expedited retrievals are typically made available within 1 – 5 minutes. Standard retrievals complete within 3 – 5 hours. Bulk retrievals complete within 5 – 12 hours. For more details on Glacier retrievals, refer to the FAQs on Glacier data retrievals.