FSx for Lustre Flashcards

Question

Can I link an S3 bucket that has encryption enabled

Answer 1

FSx for Lustre supports Amazon S3 buckets that use server-side encryption with S3-managed keys (SSE-S3), and with AWS KMS keys stored in AWS Key Management Service (SSE-KMS)

Answer 2

Automatic export supports cross-Region configurations. The Amazon FSx file system and the linked S3 bucket can be located in the same AWS Region or in different AWS Regions. Automatic import does not support cross-Region configurations. Both the Amazon FSx file system and the linked S3 bucket must be located in the same AWS Region. Both automatic export and automatic import support cross-Account configurations. The Amazon FSx file system and the linked S3 bucket can belong to the same AWS account or to different AWS accounts.

Answer 3

Depends on file system capacity, deployment type, number of storage disks, and availability zone. Typically around 10-20 minutes.

Answer 4

New, Changed, Deleted, Any Combination, No Policy. The first 3 are recommended

Answer 5

It overwrites it, even if file is write locked (assuming import policy is set)

Answer 6

CloudWatch Logs

Answer 7

No. Import data repository tasks don't synchronize deletes in your S3 bucket with your FSx for Lustre file system. If you want to fully synchronize S3 with your file system (including deletes), you must re-create your file system.

Answer 8

Yes, you can preload. If you request the preloading of multiple files simultaneously, Amazon FSx loads your files from your Amazon S3 data repository in parallel

Answer 9

Data repository tasks optimize data and metadata transfers between your FSx for Lustre file system and a data repository on S3. One way that they do this is by tracking changes between your Amazon FSx file system and its linked data repository. They also do this by using parallel transfer techniques to transfer data at speeds up to hundreds of GB/s. There are two types, import and export

Answer 10

Each file server employs a fast, in-memory cache to enhance performance for the most frequently accessed data. When a client accesses data that's stored in the in-memory or SSD cache, the file server doesn't need to read it from disk, which reduces latency and increases the total amount of throughput you can drive.

Answer 11

When you read data that is stored on the file server's in-memory or SSD cache, file system performance is determined by the network throughput. When you write data to your file system, or when you read data that isn't stored on the in-memory cache, file system performance is determined by the lower of the network throughput and disk throughput.

Answer 12

The throughput that an FSx for Lustre file system supports is proportional to its storage capacity. Regardless of file system size, Amazon FSx for Lustre provides consistent, sub-millisecond latencies for file operations.

Answer 13

All file data in Lustre is stored on storage volumes called object storage targets (OSTs). All file metadata (including file names, timestamps, permissions, and more) is stored on storage volumes called metadata targets (MDTs). Amazon FSx for Lustre file systems are composed of a single MDT and multiple OSTs. Each OST is approximately 1 to 2 TiB in size, depending on the file system's deployment type. Amazon FSx for Lustre spreads your file data across the OSTs that make up your file system to balance storage capacity with throughput and IOPS load.

Answer 14

You can stripe at the file level across OST's

Answer 15

Striped layout matters most for large files, especially for use cases where files are routinely hundreds of megabytes or more in size.

Answer 16

200 MB/s per TiB of storage

Answer 17

No. In an encrypted file system, data and metadata are automatically encrypted before being written to the file system. Similarly, as data and metadata are read, they are automatically decrypted before being presented to the application.

Answer 18

The main difference between "Persistent 1" and "Persistent 2" lies in their performance and storage capacity. "Persistent 1" is designed for workloads that require moderate throughput and storage capacity. It provides up to 200 MB/s of throughput per TiB of storage and can scale up to 64 TiB per file system. In contrast, "Persistent 2" is designed for workloads that require higher throughput and storage capacity. It provides up to 1 GB/s of throughput per TiB of storage and can scale up to 120 TiB per file system. "Persistent 2" file systems also offer enhanced durability and data protection features compared to "Persistent 1." In summary, the main difference between "Persistent 1" and "Persistent 2" for FSx for Lustre is their performance and storage capacity. "Persistent 2" provides higher throughput and storage capacity than "Persistent 1" and has additional durability and data protection features.

Answer 19

The keys used to encrypt scratch file systems at-rest are unique per file system and destroyed after the file system is deleted. These keys are managed by AWS.

Answer 20

988 1018-1023

Answer 21

Through an ENI. You access your Amazon FSx file system through its DNS name, which maps to the file system's network interface. Only resources within the associated VPC, or a peered VPC, can access your file system's network interface.

Answer 22

These can be increased, but the default is 100

Answer 23

You can use AWS Backups. Backups created using the AWS Backup console have the same level of file system consistency and performance, and the same number of restore options, as backups created through the Amazon FSx console

Answer 24

FSx Lustre is considered network traffic, not EBS, so you get 25 Gbps for shared storage.

Answer 25

One, but there are DR strategies

Answer 26

Yes, if data compression is already enabled on FSX. But it will not automatically compress existing uncompressed data within FSX

Answer 27

FSx for Lustre file system backups are block-based, incremental backups, whether they are generated using the automatic daily backup or the user-initiated backup feature. This means that when you take a backup, Amazon FSx compares the data on your file system to your previous backup at the block level. The initial backup of a brand new file system with very little data takes minutes to complete. The initial backup of a brand new file system taken after loading TBs of data takes hours to complete. A second backup taken of the file system with TBs of data with minimal changes to the block-level data (relatively few creates/modifications) takes seconds to complete. A third backup of the same file system after a large amount of data has been added and modified takes hours to complete.

Answer 28

Many small files. You are doing two hops, one to the metadata servers, and one to the object servers.