FSx for Lustre Flashcards
What is the latency it provides
sub millisecond
How much throughput
100s of GBps
How much IOPS
up to millions
What operating system does it support
POSIX which means that it is part of the underlying standard of UNIX systems like Linux and FreeBSD
What deployment options does it support? What are the differences between them?
Scratch and Persistent.
Scratch file systems are ideal for temporary storage and shorter-term processing of data. Data is not replicated and does not persist if a file server fails.
Persistent file systems are ideal for longer-term storage and throughput-focused workloads. In persistent file systems, data is replicated, and file servers are replaced if they fail.
If you want to minimize storage costs for a shorter-term workload (hours/days) AND is OK re-running their job in the case of a file server failure then Scratch is extremely popular and cost effective option.
What hardware based storage options exist and what are the differences between them?
SSD storage options – For low-latency, IOPS-intensive workloads that typically have small, random file operations, choose one of the SSD storage options.
HDD storage options – For throughput-intensive workloads that typically have large, sequential file operations, choose one of the HDD storage options.
How fast does FSx perform its metadata operations, and how is it able to be so fast?
You can optionally provision a read-only SSD cache that is sized to 20 percent of your HDD storage capacity. This provides sub-millisecond latencies and higher IOPS for frequently accessed files. Both SSD-based and HDD-based file systems are provisioned with SSD-based metadata servers. As a result, all metadata operations, which represent the majority of file system operations, are delivered with sub-millisecond latencies.
What AWS compute options are compatible with it?
EC2, EKS, ECS. The Lustre client is included with Amazon Linux 2 and Amazon Linux. For RHEL, CentOS, and Ubuntu, an AWS Lustre client repository provides clients that are compatible with these operating systems.
What encryption does FSx support?
Encryption at rest and in transit. Amazon FSx automatically encrypts file system data at rest using keys managed in AWS Key Management Service (AWS KMS). Data in transit is also automatically encrypted on file systems in certain AWS Regions when accessed from supported Amazon EC2 instances. Encryption of data at rest is automatically enabled when you create an Amazon FSx for Lustre file system, regardless of the deployment type you use.
Explain how an object from S3 is loaded into FSx
When you create your FSx for Lustre file system, you link it to your S3 data repository. At this point, the objects in your S3 bucket are listed as files and directories on your FSx file system. Amazon FSx then automatically copies the file contents from S3 to your Lustre file system when a file is accessed for the first time on the Amazon FSx file system. After your compute workload runs, or at any time, you can use a data repository task to export changes back to S3.
What happens when a scratch file system fails
Files stored on other servers are still accessible. If clients try to access data that is on the unavailable server or disk, clients experience an immediate I/O error.
What is the availability/durability of a scratch file system
Around 99%. Because larger file systems have more file servers and more disks, the probabilities of failure are increased.
Where is data replicated for persistent file systems
data is automatically replicated within the same Availability Zone in which the file system is located
What happens when a persistent file system fails
it’s replaced automatically within minutes of failure. During that time, client requests for data on that server transparently retry and eventually succeed after the file server is replaced.
What is the persistent 1 deployment type
well-suited for use cases that require longer-term storage, and have throughput-focused workloads that aren’t latency-sensitive.
What is the persistent 2 deployment type
best-suited for use cases that require longer-term storage, and have latency-sensitive workloads that require the highest levels of IOPS and throughput. This option is NOT available in GovCloud.
Is data from S3 automatically imported into my FSx file system
You must turn on automatic import or do it manually each time. When you turn on automatic import for a data repository association, your file system automatically imports file metadata as files are created, modified, and/or deleted in the S3 data repository. Alternatively, you can import metadata for new or changed files and directories using an import data repository task.
Can any S3 object be imported
No. FSx for Lustre imports only S3 objects that have POSIX-compliant object keys.
When we say FSx stores metadata, what does that mean
FSx for Lustre stores POSIX metadata, including ownership, permissions, and timestamps for files, directories, and symbolic links, in S3 objects
Can I create custom metadata
No. FSx for Lustre doesn’t retain any user-defined custom metadata on S3 objects.
Can I create a link between FSx and S3 after I create my file system
Yes. You can create the link when creating the file system or at any time after the file system has been created.