Domain 3 Flashcards
Give an example of Object, Block and File storage
Object (S3), Block (EBS) and File storage (EFS)
Key words for block storage
DAS, SAN, EC2
True or False: EFS scales automatically
True
What is Multi Part
AWS multipart upload is a feature of Amazon S3 (Simple Storage Service) that allows you to upload a single object as a set of parts. Each part is a contiguous portion of the object’s data and is uploaded independently. Once all parts are uploaded, they are automatically reassembled into a single object.
Types of EBS volumes
General, Provisioned, Throughput, Cold
What is S3 accelerator
Amazon S3 Transfer Acceleration is a feature of Amazon S3 that enables fast, easy, and secure transfers of files over long distances between your client and an S3 bucket. It leverages Amazon CloudFront’s globally distributed edge locations to accelerate your uploads to S3. When data arrives at an edge location, it is routed to Amazon S3 over optimized network paths.
How does caching with cloud front work
Amazon CloudFront is a content delivery network (CDN) service that speeds up the delivery of your content to users around the world. It does this by caching copies of your content in strategically placed edge locations across the globe.
A cache check checks to see if the content already exists. If it does, it uses that and if not, it requests from the origin server
EC2 and Fargate summary
Fargate is serverless compute for containers
If you want more control over infrastructure, use EC2
How can metrics help with auto scaling?
In AWS, Auto Scaling is a service that helps you automatically adjust the number of compute resources, such as EC2 instances, to maintain application performance and optimize costs. Metrics are a critical component of Auto Scaling, as they provide the information needed to make decisions about scaling actions.
What can assist with auto scaling
Cloud watch metrics
Scaling policies
Custom metrics
Alarms
Predictive scaling
Monitoring and Analysis
True or False: With AWS databases, don’t need to worry about database management tasks such as server provisioning, patching, setup, configuration, backups, or recovery
True
Read Replicas vs. Multi AZ Deploy
Read Replicas
Improve performance by offloading read traffic from the primary database instance.
Facilitate scaling of read-intensive database workloads.
Enable data distribution for read-heavy applications
Multi AZ
Enhance availability and reliability by automatically replicating data across different Availability Zones (AZs).
Provide automated failover support for database instances.
Use Read Replicas if your primary goal is to scale read operations and distribute load among multiple instances, especially for read-heavy workloads.
Use Multi-AZ Deployments if your primary goal is to enhance database availability and reliability, with automated failover to a standby instance in the event of failure. Multi-AZ is crucial for production workloads where high availability and automatic recovery from failures are critical.
Aurora can span multiple regions
True
What is Redis?
Redis (Remote Dictionary Server) is an open-source, in-memory data structure store used as a database, cache, and message broker.
Mem Cache
In memory data store. Memcached is a high-performance, distributed memory caching system that is used to improve the speed and performance of dynamic web applications by alleviating database load. It stores key-value pairs in memory, providing fast data retrieval for frequently accessed information.
RDS Auto Scaling
Vertical
Horizontal
Aurora Auto Scaling
Elastic Load Balancing
How to build out a VPC
Plan your VPC
-IP Address Range
-Define Subnets
-Decide Availability Zones
Create VPC
-Create VPC
-Create Subnets
Configure Networking
-IG
-Route table
-NAT if needed
Security
-Security Groups
-ACLS
Additional Consideration
-VPC Peering
-Load Balancers
Launch Resources
What is Transit Gateway
AWS Transit Gateway is a network transit hub that you can use to interconnect your Amazon Virtual Private Clouds (VPCs) and on-premises networks. It acts as a central hub that simplifies network architecture, allowing you to manage and route traffic efficiently across multiple networks.
What is cloud hub
AWS CloudHub is a feature of AWS that facilitates the connection of multiple Amazon Virtual Private Clouds (VPCs) and on-premises networks over a secure, scalable, and highly available network infrastructure. It is particularly useful for organizations that have branch offices, remote locations, or multiple on-premises data centers that need to connect securely to AWS and to each other
Data Sync
AWS DataSync is a managed data transfer service that simplifies, automates, and accelerates moving data between on-premises storage systems and AWS storage services. It is designed to handle large-scale data transfers in a fast and reliable manner, supporting a variety of use cases such as data migration, data processing, and backup and recovery.
Describe Homogeneous
Move data to destination in same format or same storage engine as the source. All about speed of transfer
EMR
Athena
Amazon RDS
S3
Describe Heterogenous
Transformed as ingested to the destination storage. Meet the requirements for the correct format.
Redshift
Kinesis
Glue
RDS
S3
What is API gateway
AWS API Gateway is a fully managed service that allows developers to create, publish, maintain, monitor, and secure APIs at any scale. It serves as a “front door” for applications to access data, business logic, or functionality from your backend services such as AWS Lambda functions, Amazon EC2 instances, or other web services.
Cognito
Amazon Cognito is a service provided by AWS that enables you to add authentication, authorization, and user management to your web and mobile applications. It simplifies the process of handling user sign-up, sign-in, and access control, allowing developers to focus on building their applications without dealing with complex authentication code.
HSM
A Hardware Security Module (HSM) is a physical device that provides cryptographic functions to securely generate, store, and manage cryptographic keys. HSMs are designed to protect sensitive data and cryptographic keys from theft and misuse, making them critical components in security-sensitive environments.
Batch
AWS Batch is a fully managed service that enables developers, scientists, and engineers to efficiently run hundreds of thousands of batch computing jobs on AWS. AWS Batch automatically provisions the optimal quantity and type of compute resources based on the volume and specific resource requirements of the batch jobs submitted.
Step Functions
AWS Step Functions is a fully managed service that enables you to coordinate distributed applications and microservices using visual workflows. It is particularly useful for creating complex workflows that involve multiple AWS services and automated tasks, providing a way to define and execute tasks in a sequence, parallel, or conditionally based on logical rules.
App Sync
AWS AppSync is a fully managed service that simplifies application development by enabling developers to create scalable GraphQL APIs. It integrates seamlessly with AWS services, making it easy to build engaging real-time and offline mobile and web applications.
Graph QL
GraphQL is a query language for APIs and a runtime environment for executing those queries by using a type system you define for your data.
Lazy Loading
Lazy loading is a design pattern used to defer the loading of resources until they are actually needed. In the context of AWS, lazy loading can be implemented in various services to optimize performance, reduce costs, and improve user experience by minimizing the initial load time and resource usage.
Data Sync vs. Glue vs. DMS
“Data Sync” is primarily used for transferring large data sets between different storage locations like on-premise servers and S3 buckets, while “Glue” is a managed ETL service for complex data transformations and integration, and “DMS” is focused on database migration and replication, particularly for moving data between different database systems