Amazon Web Services - CP Flashcards
SQS
Simple Queue Services
S3
Simple Storage Service
EC2
Elastic Compute Cloud
CSP
Cloud Service Provider
X’tics of CSP
Multiple Cloud Services…
Which can be chained together to form cloud architectures
Accessible via Single Unified API
Utilized metered billing based on usage
Have rich monitoring built in (AWS CloudTrail)
Have an infrastructure as a Service (IaaS)
Offers automation via infrastructure as code (IaC)
Tier-1 CSPs
AWS, Microsoft Azure, Google Platform (GCP), Alibaba Cloud
Tier-2 (specialization)
IBM Cloud, Oracle Cloud, Rackspace (Openstack)
Tier-3 (VPS) offer IaaS
Vultr, Digital Ocean, Linode
Magic Quadrant (MQ)
A series of market research reports published by IT consulting firm Gartner that rely on proprietary qualitative data analysis methods to demonstrate market trends, such as direction, maturity and participants.
Cloud services for IaaS
Compute
Network
Storage
Databases
Example of 4 core cloud services
Compute - EC2 Virtual machines
Storage - EBS Virtual hard drives
Database - RDS SQL databases
Networking and content delivery - VPC Private Cloud Network
Dedicated
A physical server wholly utilized by a single customer
Guess your capacity
Overpay for an underutilized server
Can’t vertical scale, you need a manual migration
Replacing a server is very difficult
Limited by host’s OS
Results in conflicts in resource sharing
Guaranteed security, privacy and full utility of underlying resources
VMs
Multiple Virtual machines on one machine.
Hypervisor is software layer that lets you run the VMs.
Physical server shared by multiple customers.
Overpay for an underutilized Virtual Machine.
Limited by your Guest OS.
Multiple apps on a single VM can result in conflicts in resource sharing.
Easy to export or import images for migration.
Easy to vertical or horizontal scale.
Containers
VM running multiple containers.
Docker Deamon is the name of the software layer that runs multiple containers
Maximize available capacity, cost-effective
Containers share same underlying OS, thus more efficient than multiple VMs
Multiple apps run side-by-side w/o being limited to the same OS requirements and will not cause conflict during resource sharing.
Functions
Managed VMs running managed containers.
Serverless Compute
Upload a piece of code, choose the amount of memory and duration.
Responsible for only code and data.
Very cost-effective, only pay for time code is running, VMs only run when there is code to be executed
Cold Starts is a side-effect of this setup
Types of Cloud Computing
Software as a Service
For customers
Product that is run and managed by the service provider. eg. salesforce, Office365, gmail
SaaS
Types of Cloud Computing
Platform as a Service
For Developers
Focus on the deployment and management of your apps. (Heroku, google app engine)
PaaS
Don’t worry about, provisioning, confinguring or understanding the hardw
Infrastructure as a Service
For administrators
Basic building blocks for cloud IT. Provides access to networking features, computers and data storage space. Eg. Microsoft Azure, AWS, Oracle
IaaS
Don’t worry about IT staff, data centers and hardware.
Cloud Computing Deployment Models
Public Cloud
Everything (the workload or project) is build on the CSP. Eg. Dropbox, Startups, SaaS offerings
Cloud-Native or Cloud First
Cloud Computing Deployment Models
Private Cloud
Everything built on company’s data centers. Eg. public sector like government, hospitals, insurance companies.
On-Premise or Openstack
Cloud Computing Deployment Models
Hybrid
Using both On-Promise and a Cloud Service Provider
Eg. Banks, FinTech, Investment management, Large professional Service providers like Deloitte, Legacy on-premise.
Both public and private cloud
Cloud Computing Deployment Models
Cross-Cloud
Using multiple Cloud Providers. Eg. Anthos
Multi-cloud
What is computing Power?
Types of computing
The throughput measured at which a computer can complete computational task.
- General computing (xeon CPU Processor; EC2)
- GPU Computing (50x faster than traditional CPUs; AWS Inferentiare (Inf1))
- Quantum computing (100 million times faster; AWS Bracket via CalTech)
Benefits of the cloud
- Agility
- Pay-as-you go pricing
- Economy of scale
- Global Reach
- Security
- Reliability
- High availability
- Scalabilty
** Elasticity*
What is the AWS Global Infrastructure?
Name the resources
Gobally distributed hardware and datacenters physically networked together.
- 25 Launched Regions
- 81 Availability zones
- 108 Direct Connection Locations
- 275+ Points of presence
- 11 Local Zone
- 17 Wavelength Zones
Global Infrastrure - Regions
Factors to consider in choosing a region
Geographically distinct locations consisting of 3 Availability Zones (except US-West, 2AZ). New services are avaible first in US-East. All billing information appears in US-EAST-1 (North Virginia). AWS cost varies by region.
- Regulatory compliance of the region
- AWS cost of services in the region
- Availability of AWS services in the region
- Distance or latency to end-users
Availability Zone
Physical location made up of one or more more datacenters. Datacenter is a secured building that contains hundreds of thousands of computers.
- Datacenters within a region are close enough to provide low-latency (<10ms)
- Common practice to run workloads in at least 3 AZs to ensure services remain available in case of one or two datacenters fail. (High Availability)
- AZs are shown as a Region Code, followed by a letter identifier eg. US-East-1a
Each region generally contain 3 Availability Zones
Subnet is associated with an AZ
Never choose the AZ when launching resources. You choose the Subnet which is associated to the AZ.
The US-EAST-1 region has 6AZs (the most AZs of any region)
Structure of AWS Region
- Region has multiple AZs
- An AZ is made up of one or more datacenters
- All Azs in AWS Region are interconnected with high-bandwith, low-latency networking, over fully redundant, dedicated metro fiber providing high-throughput, low-latency networking between
- All traffic between AZs is encrypted
- AZs are within 100 km (60 miles) of each other.
Discuss Fault Tolerance with regard to AWS
Scope of fault domain
Fault domain is a section of a network that is vulnerable to damage if a critical device or system fails. The purpose of a fault domain is that if a failure occurs, it will not cascade outside that domain, limiting the damage possible. Fault domains can be nested inside fault domains.
Fault leve is a collection of fault domains.
- specific servers in a rack
- an entire rack in a datacenter
- an entire room in a dataceter
- an entire data center building.
AWS Region would be fault level
AZ would be a fault domain
Scope of fault domain is chosen/defined by Cloud Service Provider like AWS
Failure Zone (AZs)
Each Amazon region is designed to be completely isolated from the other Amazon Regions to achieve the greatest possible fault tolerance and stability.
AWS version of a Fault domain
1. AZ are physically separated within a typical metropolitan region and are located in lower risk flood plains.
2. Discrete uninterrupted power supply(UPS) and onsite backup generation facilities
3. Data centers located in different AZ are designed to be supplied by independent substations to reduce the risk of an event on the power grid impacting more than one AZ
4. AZ are all redundantly connected to multiple tier-1-transit providers.
Multi-AZ for High Availablitiy: If an application is partitioned across
Each AZ is designated as an independent failure zone
What is the backbone of AWS?
AWS Global Network
Represent the interconnections between AWS Global Infrastructure. Permits rapid mobility between datacenters.
1.Edge locations
2.Amazon CloudFront (CDN)
3.AWS Global accelerator, AWS S3 Transfer Acceleration
4.VPC Endpoints
Points of Presence (PoP) Resources
Is an intermediate location between an AWS Region and the end user, and this location could be a datacenter or collection of hardware owned by AWS or a trusted partner for content delivery or expediated upload.
Edge locations
Regional Edge Caches
Edge locations are datacenters that hold cached (copy) on the most popular files (eg. web pages, images and videos) so that the delivery of distances to the end users are reduced
Regional Edge Locations are datacenters that hold much larger caches of less-popular files to reduce a full round trip and also to reduce the cost of transfer fees.
Global Infrastructure - Point of Presence (PoP)
Tier 1 Network
Is a network that can reach every other network on the internet without purchasing IP transit or paying for peering.
AWS AWs are all redudantly connected to multiple tier-1 transit providers
Global Infrasture - PoP
AWS services that use PoPs for content delivery or expediated upload
- Amazon CloudFront is a Content Delivery Network (CDN) service
- Amazon S3 Tansfer Acceleration
- AWS Global Accelerator
Global Infrastructure - PoP
Amazon CloudFront
- You point your website to CloudFront so that it will route requests to nearest Edge Location cache
- Allows you to choose an origin(such as web-server or storage) that will be source of cached
- Caches the contents of what origin would return to various Edge Locations around the world.
Global Infrastructure - PoP
Amazon S3 Transfer Acceleration
Allows you to generate a special URL that can be used by end users to upload files to a nearby Edge Location.
Once a file is uploaded to an Edge Location, it can move much faster within the AWS Network to reach S3.
Global Infrastructure - PoP
AWS Global Accelerator
- Can find the optimal path from the end user to your web-servers
- Deployed within Edge Locations so you send user traffic to an Edge Location instead of directly to your web-application.
AWS Direct Connect
AWS Direct Connect
Is a private/dedicated connection between your datacenter, office, co-location and AWS.
Two very-fast network options:
1. Lower Bandwith 50MBps-500MBps
2. Higher Bandwith 1GBps or 10GBps
Helps reduce network costs and increase bandwith throughput(great for high traffic networks).
Provides a more consistent network experiece than a typical internet-based connection. (reliable and secure)
Co-located (aka carrier-hotel) is a data center where equipment, space, and bandwidth are available for rental to retail customers.
Direct Connect Locations
Trusted partnered datacenters that you can establish a dedicated high speed, low-latency connection from your on-premise to AWS.
Global Infrastructure - Local Zones
Local Zones
Are datacenters located very close to a densely populated area to provide single-digit millisecond low latency performance (eg. 7ms) for that area.
- LA, California was first Local Zone to be deployed as a logisticla extension of US-West Region identified by us-west-2-lax-1a
- Limited AWS services such as EC2 instance types, EBS, Amanzon FSx, Application Load Balancer, Amazon VPC
Purpose of local zone is the support highly-demanding applications sensitive to latencies: media & entertainment, electronic design automation, ad-Tech, machine learning
Global Infrastructure - Wavelength Zones
AWS Wavelenth Zones
Allow for edge-computing on 5G Networks. Applications will have ultra-low latency being as close as possible to the users
You create a subnet tied to a Wavelength Zone and then you can launch Virtual Machines (VMs) to the edge of the targeted 5G Networks.
AWS has partnered with various Telecom companies to utilitze their 5G networks.
Global Infrastructure - Data Residency
Data Residency
Is the physical or geographic location of where an organization or cloud resources reside.
Compliance Boundaries is a regulatory compliance (legal requirement) by a government that describes where data and cloud resources are allowed to reside.
Data sovereignty is the jurisdictional control or legal authority that can be asserted over data because it’s physical location is within jurisdiction boundaries.
- AWS Config (Policy as Code service)
- IAM Policies (Service Control Policy)
- AWS Outposts(physical rack of servers)
Global Infrastructure - AWS For Government
Public sector
Include public goods and governmental services
- Military
- law enforcement
- infrastructure
- public transit
- public education
- health care
- the government itself
AWS achieves this by meeting regulatory compliance programs along with specific governance and security controls.
AWS has special regions for US regulation called GovCloud
Gloud Infrastructure - GovCloud(US)
AWS GovCloud Regions
A CSP generally will offer an isolated region to run FedRAMP workloads.
Federal Risk and Authorization Management Program (FedRAMP) is a US government-wide program that provides a standardized approach to security assessment, authorization, and continuous monitoring for cloud products and services.
- Allow customers to host sensitive Controlled Unclassified Information and other types of regulated workloads.
- Only operated by employees who are US citizens, on US soil.
- Only accessible to US entities and root account holders who pass screening process.
Customers can architect secure cloud solutions that comply with: FedRAMP High baseline, DOJ’s Criminal Justice Information Systems (CJIS)Security Policy, US international traffic in Arm Regulations (ITAR), Export Administration Regulation (EAR), Department of Defense (DoD) Cloud Computing Security Requirements Guide.
Global Infrastructure - AWS in China
AWS China
Completely isolated intentionally from AWS Global due to regulatory and compliance requirement in Mainland China
Has its own domain at amazonaws.cn
- Chinese Business Licence (ICP license) is required to operate in AWS China Region
- Not all services are available in China eg. Route53
- Consists of two region namely Ningxia CN-NorthWest-1 operated by NSWCF and Beijing CN-North-1 operated by SINNET
- AWS label is banned in China. Thus, it is not shown on the chinese website.
Global Infrastructure - Sustainability
AWS Cloud Sustainability Goals
Amazon co-founded the Climate Pledge to achieve Net-Zero Carbon Emissions by 2040 across all of Amazon’s business (this includes AWS)
- Powered by100% Renewable Energy by 2025.
- Cloud Efficiency: AWS’ infrastructure is 3.6 times more energy efficient than the median of US enterprise data centers surveyed.
- Water stewardship: Direct evaporative technology/ non-potable (recycled) water for cooling, on-site water treatment to remove scale-forming minerals and reuse water for more cycles, water efficiency metrics to determine and monitor optimal water use for each AWS Region.
AWS purchases and retires environmental attributes to cover the non-renewable energy for AWS Global Infastructure:
- Renewable Energy Credits (RECs)
- Guarantees of Origin (GOs)
Global Infrastructure - AWS Ground Station
AWS Ground Station
Fully managed service that lets you control satelite communications, process data, and scale your operations without having to worry about building or managing your own ground station.
Use cases for Ground station:
* Weather forecasting
* Surface imaging
* Communications
* Video broadcasts
To use Ground Station:
- You schedule a contact (select satelite, start and end time, and the ground location)
- Use the AWS Ground Station EC2 AMI to launch EC2 instances that will uplink and downlink data during the contact or receive downlinked data in an Amazon s3 bucket.
Global Infrastructure - AWS Outposts
AWS Outposts
(rack of servers running AWS Infrastructure on your physical location)
Fully managed service that offers the same AWS infrastructure, AWS services, APIs, and tools to virtually any datacenter, co-location space, or on-premises facility for a truly consistent hyprid experience.
- Server Rack: Frame design to hold and organize IT equipment.
- Rack Heights: U stands for “rack units” or “U spaces” which equals to 1.75 inches. Industry standard rack size is 48U (7 Foot Rack). AWS highest full-size rack cage is 42U.
Global Infrastructure: AWS Outposts
AWS Outposts comes in 3 form factors
- 42U: Delivered by AWS to preferred physical site fully assembles. Installed by AWS. Rack needs to be plugged into power and network.
- 1U: Server that is placed into rack. Suitable for 19” wide x 24” deep cabinets. AWS Graviton2 (up to 64 vCPUs), 128 GiB memory, 4 TB of local NVMe storage
- 2U: Server that is placed into rack. Suitable for 19” wide x 36” deep cabinets, Intel processor (up to 128 vCPUs), 256 GiB memory
Solution Architect vs Cloud Architect
- Solutions architect is a role in a technical organization that architects a technical solution using multiple systems via research, documentation and experimentation
- Cloud architect is a solutions architect focused on architecting technical solutions using cloud services.
Terms for cloud architect
- Availablility - ability to ensure a service remains available eg. Highly Available (HA)
- Scalability - ability to grow rapidly or unimpeded
- Elasticity - ability to shrink and grow to meet the demand
- Fault Tolerance - ability to prevent a failure
- Disaster Recovery - ability to recover from a failure eg. Highly durable
Solutions Architect is guided by the following prompts:
- How secure is this solution? (security)
- How much is this going to cost? (cost)
High Availability
Elastic Load Balancer in high availability
High availability is the sbility for service to remain available by ensuring there is no single point of failure and/or ensure a certain level of perfomance
A load balancer allows you to evenly distribute traffic to multiple servers in one or more datacenter. If a datacenter or server becomes unavailable(unhealthy) the load balancer will route the traffic to only available datacenters with servers.
Running your workload across multiple Availability Zones ensures that if 1 or 2 AZs become unavailable your service/ applications remains available.
Types of scalability
High Scalability is ability to increase capacity based on the increasing demand of traffic, memory and computing power.
- Vertical Scaling: Scaling Up. Upgrade to a bigger server
- Horizontal Scaling: Scaling out. Add more servers of the same size
High Elasticity
Horizontal Scaling
High elasticity is ability to automatically increase or decrease capacity based on current demand of traffic, memory and computing power.
- Scaling out - Add more servers of the same size
- Scaling in -Removing underutilized servers of the same size
-Auto Scaling Groups (ASG) is an AWS feature that will automatically add or remove servers based on the scaling rules you define based on metrics.
Vertical scaling is generally hard for traditionsl architecture so you’ll usually only see horizontal scaling described with elasticity.
Highly Fault Tolerant
Fail-overs
Highly Faulat Tolerant: Ability for service to ensure there is no single point of failure preventing the chance of failure.
- Fail-overs is when you have a plan to shift traffic to a redundant system in case the primary system fails.
- A common example is having a copy (secondary) of your database where all ongoing changes are synced. The secondary systen is not in-use until a fail occurs and it becomes the primary database.
- RDS Multi-AZ is when you run a duplicate standby database in another Availability Zone in case your primary database fails.
High Durability
Disaster Recovery Concerns
High durability is ability to recover from a disaster and to prevent the loss of data. Solutions that recover from a disaster is known as Disaster Recovery (DR)
- Do you have a backup?
- How fast can you restore that backup?
- Does your backup still work?
- How do you ensure current live data is not corrupt?
CloudEndure Disaster Recovery continuously replicates your machines into a low-cost staging area in your target AWS account and preferred Region enabling fast and reliable recovery in case of IT data center failures.
Business Continuity Plan
A business continuity plan (BCP) is a document that outlines how a business will continue operating during an unplanned disruption in services
- Recovery Point Objective (RPO): How much data are you willing to lose?
- Recovery Time Objective: How much time are you willing to go down?
RPO: The maximum acceptable amount of data loss after an unplanned data-loss incident, experessed as an amount time
RTO: The maximum amount of downtime your business can tolerate without incurring a significant loss.
Disaster Recovery Options
Mutiple options for recovery that trade cost vs time to recover
- Backup & Restore
- Pilot Light
- Warm Standby
- Multi-site Active/active
Distract Recovery Options
Backup & Restore
- RPO/RTO: Hours
- Backup data is restored to new infrastructure.
-Lower priority use cases
-Restore data after event
-Deploy resources after event
- Cost $$
Disaster Recovery Options:
Pilot Light
- RPO/RTO: 10 mins
- Data is replicated to another region with the minimal services running
-Less stringent RTO & RPO
-Core Services
-Start & and scale resources after event
-Cost $$
Disaster Recovery Options
Warm Standby
- RPO/RTO : Minutes
- Scaled down copy of your infrastructure running ready to scale up
-Business Critical Services
-Scale resources after event
- Expensive
Disaster Recovery Options
Multi-site Active/active
- RPO/RTO: Real-time
- Scaled up copy of your infrastructure in another region.
- Zero downtime
- Net zero loss
- Mission critical services
- Expensive
AWS Application Progrmming Interface (API)
API
- An API is software that allows two applications/services to talk to each other.
- Most common type of API is via HTTP/S requests.
- AWS API is an HTTP API and you can interact by sending HTTPS requests, using an application interacting with APIs like Postman
- Each AWS service has its own Service Endpoint which you send requests
- To authorize you will need to generate a signed request. You make a separate request with your AWS credentials and get back a token.
- You need to also provide ACTION and accompanying parameters as the payroad.
AWS Management Console
Web-based unified console to build, manage, and monitor everythig from simple web apps to complex cloud deployments. Clickops refers to performing all system operations via clicks.
Service Console
AWS Services have customized console that can be accessed by searching the service name.
Some AWS Service Console acts as an umbrella console containing many AWS services:
* VPC console
* EC2 console
* Systems Manager console
* SageMaker console
* CloudWatch console
AWS Account ID
-Found by dropping down the current user in the global navigation
-Composed of 12 digits
-Used for login by non-root users, cross-account roles, support cases
-Can also be found under IAM console
-Used when creating policies for users and cross-acount roles
PowerShell
-A task automation and configuration management framework. Consits of command-line shell and a scripting language.
-Unlike most shells, which accept and return text, PowerShell is built on top of the .NET Common Language Runtime (CLR), and accepts and returns .NET objects.
-AWS Tools for PowerShell lets you interact with the AWS API via PowerShell Cmdlets
-Cmdlets is a special type of command in PowerShell in the form of capitialized verb-and-noun e.g. New-s3Bucket
Amazon Resource Names (ARNs)
-Uniquely identify AWS resources
-Require to specify a resource unambigously across all of AWS
-Format is arn:partition:service:region:account-id:resource-type/resource-id