Redshift Flashcards
What is Amazon Redshift?
Amazon Redshift is a fast, powerful, fully-managed, petabyte-scale data warehouse service in the cloud.
What, at a high level, is the financial benefit of using Redshift?
- Customers start at $0.25/hr with no commitments or upfront costs
- and scale to a petabyte or more for $1000/terrabyte/yr, less than one tenth of most other data warehousing solutions
What is the storage available when Amazon Redshift is configured for Single Node?
160GB
What is the setup for Multi-Node AWS Redshift?
- A Leader Node that manages clinet connections and receives queries
- Compute Nodes that store data and perform queries and computations
What is the maximum number of compute nodes you can have in Redshift?
128
What are two built-in ways that Amazon Redshift maximizes performance?
- Advanced Compression based around columns
- Massive Parallel Processing
What does MPP stand for?
Massive Parallel Processing
Are AWS Redshift Backups enabled by default?
Yes, with a 1-day retention period
What is the maximum retention period for AWS Redshift Backups?
35 days
What does AWS Reshift do to help ensure redundancy?
It always tries to maintain at least 3 copies of the data
- Original
- replica on the compute nodes
- a backup in S3
What does AWS Redshift do to help with disaster recovery of your data?
It can asynchronously replicate your snapshots to S3 in another region
What is the pricing model for AWS Redshift?
- Backups
- Data Transfer (w/in VPC Only, not outside)
- Compute Node Hours = total number of hours run across all compute nodes in the given billing period (you are NOT charged for leader node hours)
How does AWS Redshift account for security considerations?
- Communications with Redshift are encrypted in transit using SSL
- encrypted at rest using AES-256
- By default, Redshift takes care of Key Management
- but, you CAN manage your own keys through a hardware security module and KMS
What does HSM stand for?
Hardware Security Module
How does Redshift handle availability concerns?
- Currently, Redshift is only available in 1 AZ
- You can restore snapshots to new AZs in the event of an outage