Redshift Flashcards
Redshift
Fast, powerful, fully managed, petabyte-scale data warehouse service in the cloud.
Customers can start at $0.25 per hour with no commitments or upfront costs and scale to a petabyte or more for $1,000 per terabyte per year, less than a tenth of the cost of most other data warehousing solutions
Redshift configurations
Can be configured as follows:
Single node (160 GB)
Multi-node: a leader node (manages client connections and receives queries) plus compute nodes (store data and perform queries and computations), up to 128 compute nodes
Backups
Enabled by default with a 1-day retention period
Maximum retention period is 35 days
Redshift always maintains three copies of your data (the original and a replica on the compute nodes, and a backup in Amazon S3)
Redshift can also asynchronously replicate snapshots to another region for disaster recovery
Redshift pricing
Compute node hours: billed at 1 unit per compute node per hour; you are not charged for leader node hours
A 3-node compute cluster running for 1 month = 3 x 24 x 30 = 2,160 node hours
Backup
Data transfer (only within a VPC, not outside it)
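The node-hour arithmetic on this card can be checked with a minimal Python sketch. The function name `compute_node_hours` is hypothetical (it is not part of any AWS SDK), and the sketch assumes a 30-day month and counts compute nodes only.

```python
HOURS_PER_DAY = 24


def compute_node_hours(compute_nodes: int, days: int) -> int:
    """Return node hours: one unit per compute node per hour of running time."""
    if compute_nodes < 1:
        raise ValueError("a cluster needs at least one compute node")
    if days < 0:
        raise ValueError("days must not be negative")
    return compute_nodes * HOURS_PER_DAY * days


# 3 compute nodes running for a 30-day month
print(compute_node_hours(3, 30))  # 2160
```

Multiplying the result by the hourly rate of the chosen node type gives a rough compute charge; backup storage and data transfer are billed separately.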
Security
Encryption in transit using SSL
Encryption at rest using AES-256
Key Management
Manage your own keys through a hardware security module (HSM)
AWS Key Management Service
Redshift Availability
Currently available in only 1 AZ
You can restore snapshots to a different AZ