Snowball, Storage Gateway, FSx Flashcards
Snowball
a huge box and that allows you to basically physically transport data in and out of AWS.
an alternative to moving data over the network
offline transfer and storage
a service that provides secure, rugged devices, so you can bring AWS computing and storage capabilities to your edge environments, and transfer data into and out of AWS. Those rugged devices are commonly referred to as AWS Snowball or AWS Snowball Edge devices.
Snowball what for?
if you need to run computing in rugged, austere, mobile, or disconnected (or intermittently connected) environments.
Also for large-scale data transfers and migrations when bandwidth is not available for use of a high-speed online transfer service, such as AWS DataSync.
it’s quite useful if you want to pre-process the data while the thing is moving.
Snowball you are going to pay
per data transfer job.
Snowball Edge
an edge computing and data transfer device provided by the AWS Snowball service. It has on-board storage and compute power that provides select AWS services for use in edge locations.
How does Snowball Edge work?
You request one or more devices in the AWS Management Console .
The buckets, data, Amazon EC2 AMIs, and Lambda functions you select are automatically configured, encrypted, and preinstalled on your devices before they are shipped to you.
Once a device arrives, you connect it to your local network and set the IP address either manually or automatically with DHCP.
Then use the Snowball Edge client software, job manifest, and unlock code to verify the integrity of the Snowball Edge device or cluster, and unlock it for use. The manifest and unlock code are uniquely generated and crypto-logically bound to your account and the Snowball Edge shipped to you, and cannot be used with any other devices. Data copied to Snowball Edge is automatically encrypted and stored in the buckets you specify.
All logistics and shipping is done by Amazon, so when copying is complete and the device is ready to be returned, the E Ink shipping label will automatically update the return address, ensuring that the Snowball Edge device is delivered to the correct AWS facility. Once the device ships, you can receive tracking status via messages sent by Amazon Simple Notification Service (Amazon SNS), generated texts and emails, or directly from the console.
Snowmobile
a truck to transfer exabytes of data
one exabyte equal 1000 petabytes, equals 1 million terabytes,
Snowball and Glacier
Snowball can not import data into Amazon Glacier directly. Ypu have to use Amazon S3 first, and then you’re going to use an S3 lifecycle policy to transition that data directly and immediately into glacier.
Hybrid Cloud
part of your infrastructure will be on the cloud on AWS
part of your infrastructure will also be on-premise.
This can be due to many reasons, maybe you have a long cloud migration, or security requirements or compliance requirements
idea behind Storage Gateway
S3, for example, is a proprietary storage technology. It’s not like NFS, which is standardized. So how do we expose the S3 data when we are with on-premise servers or on-premise computers?
Storage Gateway is going to give us access to S3
through a gateway which will expose standard API’s.
how the storage works today in AWS in the Cloud-Native way
we have
- Block Storage which is EBS or EC2 Instance Store,
that’s basically our volumes. - Then we have file storage. That’s when we dealt with EFS and we’re storing files on a network file system.
- Then we have object when we were storing files and objects directly on S3 and Glacier.
Storage Gateway will bring a bridge to these solutions.
three types of Storage Gateway
- File Gateway
- Volume Gateway
- Tape Gateway
Storage Gateway use cases
where we wanna maybe bring the on-premise data into S3 or bridge it, is to do disaster recovery, back up
and restore, or maybe tiered storage.
when you have S3 buckets and you want them to be accessible using maybe the NFS (Network File System) protocol or the SMB protocol
File Gateway
allows us to view files on our local file system on-premise, but it will be backed by S3, Glacier
it stands between Application Server and S3/S3IA/Glacier and talks to the Server using NFS, but to S3/S3IA/Glacier over HTTPS
we have to setup a File Gateway on-premise.
So from our applications perspective, it seems like we’re talking to a local network file system, but the File Gateway actually does some magic behind the scenes and talks to S3 or Glacier.
Volume Gateway
when you want to have Block storage using iSCSI protocol backed by S3
we have to setup a Volume Gateway on-premise.
it stands between Application Server and S3 bucket with EBS snapshots and talks to the Server using iSCSI, but to S3 over HTTPS
the idea is that the EBS snapshots will be made from time to time and they will be in S3. This will help us restore on-premise volumes if we wanted to.
our application server is going to mount a volume from the Volume Gateway and for on-premise, it will look like it’s just the local volume, but the Volume Gateway will store this as Amazon EBS snapshots backed by S3.
Tape Gateway
Some companies still have processes to use physical tapes.
For this, you build a VTL, or Virtual Tape Library and it will be backed by Amazon S3 and Glacier. Tape Gateway is for a backup reason
Backup software will be connecting directly using iSCSI to the Tape Gateway will create a Virtual Tape library stored in S3 or Glacier. So it will talk to S3 or Glacier over HTTPS