Storage Extras Flashcards

1
Q

What is the Snow Family?

What are some of the challenges that the Snow Family helps overcome?

What is the rule of thumb when it comes to using a Snow device?

Where does the data go when it is uploaded to AWS?

A

Highly secure, portable devices to collect and process data at the edge and to migrate data into and out of AWS. They’re offline devices that allow us to perform data migrations.

Time it takes to upload/download
Limited connectivity/Connection stability
Limited bandwidth
Shared bandwidth (Can't maximize line)
High network cost

If it would take more than a week to transfer your data over the network, use a Snow device.

To S3 buckets.
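
The one-week rule of thumb can be checked with simple arithmetic. The sketch below is only an illustration: the function names, the decimal units (1 TB = 10^12 bytes), and the `utilization` parameter for a shared line are assumptions, not anything defined by AWS.

```python
def transfer_days(data_tb: float, bandwidth_mbps: float, utilization: float = 1.0) -> float:
    """Estimate how many days it takes to move data_tb terabytes over a link.

    bandwidth_mbps is the line speed in megabits per second; utilization is
    the fraction of the line you can actually use (hypothetical parameter).
    """
    if data_tb < 0 or bandwidth_mbps <= 0 or not 0 < utilization <= 1:
        raise ValueError("invalid size, bandwidth, or utilization")
    bits = data_tb * 1e12 * 8                      # terabytes -> bits
    seconds = bits / (bandwidth_mbps * 1e6 * utilization)
    return seconds / 86_400                        # seconds -> days


def should_use_snow(data_tb: float, bandwidth_mbps: float,
                    utilization: float = 1.0, threshold_days: float = 7.0) -> bool:
    """Apply the card's rule of thumb: more than a week means use a Snow device."""
    return transfer_days(data_tb, bandwidth_mbps, utilization) > threshold_days


if __name__ == "__main__":
    print(round(transfer_days(100, 1000), 2))      # 100 TB over a 1 Gbps line
    print(should_use_snow(100, 1000))
```

For example, 100 TB over a fully available 1 Gbps line works out to a little over nine days, so the rule of thumb points to a Snow device.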

2
Q

What is Snowball Edge?

What are the two flavors of Snowball Edge?

What are the use cases for Snowball Edge?

A

Physical data transport solution to move TBs or PBs of data in or out of AWS.

Snowball Edge Storage Optimized - 80 TB of HDD capacity for block volume and S3-compatible object storage

Snowball Edge Compute Optimized - 42 TB of HDD capacity for block volume and S3-compatible object storage

Large cloud data migrations, data center decommissioning, and disaster recovery, up to 10 PB

3
Q

What is AWS Snowcone?

How much storage do you get with Snowcone?

When should you use Snowcone?

What are the two ways that you can get data to sync from your Snowcone to AWS?

A

A small, portable computing device that works anywhere. It is rugged, secure, and built to withstand harsh environments.

8 TB of storage

Use Snowcone where a Snowball does not fit (space-constrained environments), for transfers of up to about 24 TB.

You can ship it back to AWS, or you can connect it to a network and use AWS DataSync.

4
Q

What is AWS Snowmobile?

What is the storage capacity of AWS Snowmobile?

What is a good use case for this?

A

It is a physical truck that comes to pick your data up. The truck is temperature controlled, has GPS and 24/7 security.

100 PB

When you need to transfer more than 10 PB of data
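
The size thresholds from the Snowcone, Snowball Edge, and Snowmobile cards can be put side by side in a small helper. This is a study aid under stated assumptions: the function name is made up, the cutoffs are the ones quoted on these cards (24 TB and 10 PB), and 1 PB is taken as 1,000 TB. It is not official sizing guidance.

```python
def pick_snow_device(data_tb: float) -> str:
    """Map a data size in TB to the device these flashcards suggest.

    Thresholds come from the cards, not from an AWS API:
    up to 24 TB -> Snowcone, up to 10 PB -> Snowball Edge, above -> Snowmobile.
    """
    if data_tb <= 0:
        raise ValueError("data size must be positive")
    if data_tb <= 24:
        return "Snowcone"
    if data_tb <= 10_000:          # 10 PB expressed in TB (decimal units)
        return "Snowball Edge"
    return "Snowmobile"


if __name__ == "__main__":
    for size in (5, 500, 20_000):
        print(size, "TB ->", pick_snow_device(size))
```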

5
Q

What is edge computing?

What are the Snow Family devices that can be used for edge computing?

What are the use cases of edge computing?

A

Edge computing allows you to process data being created at an edge location that has limited/no internet access or limited/no access to computing power.

Snowball Edge and Snowcone

Preprocessing data
Machine learning at the edge
Transcoding media streams

6
Q

What are the specs of a Snowcone?

Of a Snowball Edge Compute Optimized?

Of a Snowball Edge Storage Optimized?

What type of compute runs on these devices?

How can you get discounted pricing on these devices?

A

2 CPUs, 4 GB of RAM. Wired or wireless access. USB-C power using a cord or an optional battery.

52 vCPUs, 208 GB of RAM, an optional GPU (useful for video processing), and 42 TB of storage

Up to 40 vCPUs and 80 GB of RAM. Object storage clustering is available.

EC2 instances and Lambda functions (using AWS IoT Greengrass)

By using long-term deployment options of 1 to 3 years.

7
Q

What is AWS OpsHub?

A

AWS OpsHub is a graphical application that you install on your computer to manage Snow Family devices. You can use it to:
Unlock and configure single or clustered devices
Transfer files
Launch and manage instances running on snow family devices
Monitor device metrics (storage capacity, active instances etc)
Launch compatible AWS services on your devices

8
Q

How can you migrate data from the Snow Family to Glacier?

A

Snow devices cannot import into Glacier directly. The data lands in an S3 bucket first, and a lifecycle policy on that bucket then moves it from S3 to Glacier.
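
A lifecycle policy of this kind might look like the sketch below. The rule ID, the `imports/` prefix, and the helper names are hypothetical. `put_bucket_lifecycle_configuration` is the boto3 S3 client call for setting a bucket's lifecycle rules; actually running `apply_lifecycle` assumes boto3 is installed and AWS credentials are configured.

```python
def glacier_lifecycle_config(prefix: str = "", days: int = 1) -> dict:
    """Build a lifecycle configuration that transitions objects to Glacier.

    prefix limits the rule to keys under that prefix ("" matches everything);
    days is the object age at which the transition happens.
    """
    if days < 0:
        raise ValueError("days must not be negative")
    return {
        "Rules": [
            {
                "ID": "snow-import-to-glacier",   # hypothetical rule name
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [{"Days": days, "StorageClass": "GLACIER"}],
            }
        ]
    }


def apply_lifecycle(bucket: str, config: dict) -> None:
    """Attach the configuration to a bucket (needs boto3 and credentials)."""
    import boto3  # third-party; imported here so the builder works without it

    boto3.client("s3").put_bucket_lifecycle_configuration(
        Bucket=bucket, LifecycleConfiguration=config
    )


if __name__ == "__main__":
    print(glacier_lifecycle_config(prefix="imports/", days=1))
```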

9
Q

What problem is FSx for Windows solving?

So what is FSx?

A

Elastic File System (EFS), the elastically scalable network-attached file system, is a POSIX file system, which means it can only be mounted on Linux. FSx for Windows solves that problem by providing an “EFS” that Windows can mount.

FSx for Windows is a fully managed Windows file system share drive that supports the SMB protocol and Windows NTFS. Think “shared storage for Windows”.

10
Q

Tell me about the directory service for FSx.

How does FSx Scale?

How can it be accessed?

How do HA and backups work for FSx?

A

It integrates with Microsoft Active Directory and supports ACLs and user quotas.

It’s built on SSD and scales up to 10s of GB/s, millions of IOPS, and 100s of PB of data.

From the cloud and from on-premises infrastructure.

FSx can be configured to be Multi-AZ, and the data is backed up daily to S3.

11
Q

What is Amazon FSx for Lustre?

What are some of the use cases for FSx for Lustre?

How does scaling work for FSx for Lustre?

How does FSx for Lustre integrate with S3?

How can it be accessed?

A

It’s a high-performance shared file system for Linux compute clusters (the name “Lustre” comes from “Linux” and “cluster”). It offers high IOPS, high throughput, low latency, and integration with S3 as a backend.

High-performance computing, machine learning
Ex: Video processing, financial modeling, electronic design automation

Scales up to 100s of GB/s, millions of IOPS, sub-ms latencies

It can read S3 as a file system and write the output of computations back to S3.

From the cloud and from on-premises infrastructure.

12
Q

What are the two file system deployment options for FSx for Lustre?

What is a scratch file system?

What is a persistent file system?

A

Scratch and Persistent.

Temporary storage
Data is not replicated (it doesn’t persist if the file server fails)
High burst (6x faster, 200 MB/s per TiB)
Used for short-term processing, to optimize costs

Long term storage
Data replicated within same AZ
Replace failed files within minutes
Used for long-term processing, sensitive data

13
Q

What is the AWS Transfer Family?

How are you charged for the Transfer Family?

How does authentication work for the Transfer Family?

What is the use case for the Transfer Family?

A

The AWS Transfer Family gives you a way to add data to an S3 bucket or EFS through FTP (within a VPC), SFTP, or FTPS. It is scalable, reliable, and highly available (Multi-AZ).

You pay per provisioned endpoint per hour, plus data transfer in GB.

You can store and manage users’ credentials within the service, or integrate with an existing authentication system such as Microsoft Active Directory, LDAP, Okta, Amazon Cognito, or a custom one.

When you just want to upload using FTP/SFTP. Sharing files, public data sets, CRM, ERP.
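
The pricing model above (endpoint hours plus data transferred) is a two-term sum. In the sketch below both rates are required parameters because actual prices vary by region and are not given on this card; the function name and the numbers in the example are placeholders.

```python
def transfer_family_cost(endpoint_hours: float, data_gb: float,
                         hourly_rate: float, per_gb_rate: float) -> float:
    """Estimate cost as endpoint hours * hourly rate + GB moved * per-GB rate.

    hourly_rate and per_gb_rate must be looked up on the current AWS price
    list; nothing here encodes real prices.
    """
    if min(endpoint_hours, data_gb, hourly_rate, per_gb_rate) < 0:
        raise ValueError("inputs must not be negative")
    return endpoint_hours * hourly_rate + data_gb * per_gb_rate


if __name__ == "__main__":
    # Placeholder rates purely for illustration, not real AWS prices.
    print(transfer_family_cost(endpoint_hours=720, data_gb=100,
                               hourly_rate=0.5, per_gb_rate=0.25))
```

Note that the endpoint term accrues for every hour the endpoint is provisioned, whether or not any files are transferred.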

14
Q

What is AWS Storage Gateway?

What are the use-cases of AWS Storage Gateway?

What are the 3 types of storage gateway?

A

Storage Gateway bridges your on-premises data and cloud storage in S3.

Disaster recovery, backup and restore, tiered storage. It gives you a way to expand your on-premises NFS by leveraging Amazon S3.

File Gateway
Volume Gateway
Tape Gateway

15
Q

How is your data stored using the file gateway?

What S3 types are supported?

How is access provided between the file gateway and buckets?

How does caching work for a file gateway?

How can you authenticate users who want to access files via the file gateway?

A

Configured S3 buckets are exposed via the NFS and SMB protocols.

S3 Standard, S3 Standard-IA, S3 One Zone-IA

Through IAM roles.

The most recently used data is cached in the File Gateway.

Through Active Directory authentication.
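
The caching answer above ("most recently used data is cached") describes the general idea behind a least-recently-used (LRU) cache. The sketch below is a generic LRU illustration with made-up names. It is not the gateway's actual implementation, and the `fetch` callback merely stands in for a read from the bucket.

```python
from collections import OrderedDict
from typing import Callable


class RecentFileCache:
    """Hypothetical LRU cache standing in for a gateway's local cache."""

    def __init__(self, capacity: int, fetch: Callable[[str], bytes]) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self._fetch = fetch                      # called on a cache miss
        self._items: "OrderedDict[str, bytes]" = OrderedDict()
        self.hits = 0
        self.misses = 0

    def read(self, key: str) -> bytes:
        if key in self._items:
            self._items.move_to_end(key)         # mark as most recently used
            self.hits += 1
            return self._items[key]
        self.misses += 1
        data = self._fetch(key)                  # slow path: go to the backing store
        self._items[key] = data
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)      # evict the least recently used
        return data


if __name__ == "__main__":
    bucket = {"a": b"1", "b": b"2", "c": b"3"}   # stand-in for an S3 bucket
    cache = RecentFileCache(2, bucket.__getitem__)
    for name in ["a", "b", "a", "c", "a", "b"]:
        cache.read(name)
    print(cache.hits, cache.misses)
```

Reads that hit the cache are served locally with low latency; only misses go back to the backing store.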

16
Q

What is the purpose of a volume gateway?

How does backup work with a volume gateway?

What are the two types of volumes?

A

To back up the volumes of your on-premises servers.

The server communicates with the Volume Gateway over the iSCSI protocol to send volume data to S3. Scheduled snapshots can then be taken. In this case the volume data in S3 is backed by EBS snapshots, which can be used to restore on-premises volumes.

Cached volumes - the data is stored in S3, with low-latency access to the most recent data through a local cache

Stored volumes - the entire dataset is on-premises, and there is a scheduled backup to Amazon S3

17
Q

What is a tape gateway?

A

Some companies use tape backups. The AWS Tape Gateway allows these companies to keep the same tape backup process: the backup server talks to the gateway over the iSCSI protocol, and the tape data is then uploaded to Amazon S3 and Glacier. This process works with the leading backup software vendors in the space.

18
Q

What do you do if your system does not have the virtualization software needed to run the gateways?

A

You can order a physical Storage Gateway appliance from AWS that supports all three gateway types: file, volume, and tape.