S3 Flashcards
Which service is a petabyte-scale data moving service (as a fleet) in or out of AWS with computing capabilities?
- Snowcone
- Snowball Edge
- Snowmobile
- Snowball Edge - 80TB usable, up to petabytes offline. Can cluster up to 15 nodes.
- Snowcone - 8TB usable, up to 24TB, online and offline. No clustering.
- Snowmobile - More than 100PB. Up to exabytes, offline. No clustering.
What are objects stored in Amazon S3?
-Buckets - Basically directories or folders.
A company with low budget needs to regularly transfer petabytes of data to the cloud and have access to local computing capacity. Which service can help with this task?
-Snowball Edge - Storage Optimized.
Which service is a decentralized blockchain on AWS?
-Managed Blockchain - Works with Ethereum and Hyperledger Fabric. Blockchain makes it possible to build apps where multiple parties can execute transactions without the need for a trusted, central authority.
What AWS service is always server less and has SQL capabilities?
-Athena - Serverless query service to perform analytics against S3 objects.
What relational database is proprietary technology from AWS and is cloud-optimized?
-Aurora - Cost 20% more than RDS, but is cloud optimized and 5 times faster than MySQL on RDS and 3 times faster than Postgres on RDS.
You want to create Hadoop clusters to analyze and process vast amounts of data. Which service do you use?
EMR - Elastic Map Reduce - Helps create hadoop clusters (Big Data) to analyze and process huge amounts of data.
- Cluster can be hundreds of EC2 instances.
- Can auto scale with Spot instances.
- Used for data processing, machine learning, web indexing and big data.
What is the name of a central repository to store structural and operational metadata for data assets in AWS Glue?
Glue Data Catalog
You would like to set up a NoSql database that can scale with no downtime and can handle millions of requests per second. Which ASW database service should you use?
DynamoDB - Scales to huge workloads, distrilbuted “serverless” database. Single digit latency. Low cost and auto-scaling.
Which AWS serverless service can use machine learning BI to create interactive dashboards and charts?
QuickSight - It’s basically a competitor to Tableau.
Name and exclusive DynamoDB feature that is an in-memory cache that can improve your performance up to 10x.
DynamoDB Accelerator - Also called DAX. Elasticache is a more general caching so it’s not exclusive to DynamoDB. You could use Elasticache with DynamoDB, but you’d have milisecond latency instead of micro-second latency.