Storage | Amazon Elastic File System (EFS) Flashcards
What is Amazon Elastic File System?
General
Amazon Elastic File System (EFS) | Storage
Amazon EFS is a fully-managed service that makes it easy to set up and scale file storage in the Amazon cloud. With a few clicks in the AWS Management Console, you can create file systems that are accessible to Amazon EC2 instances via a file system interface (using standard operating system file I/O APIs) and that support full file system access semantics (such as strong consistency and file locking).
Amazon EFS file systems can automatically scale from gigabytes to petabytes of data without needing to provision storage. Tens, hundreds, or even thousands of Amazon EC2 instances can access an Amazon EFS file system at the same time, and Amazon EFS provides consistent performance to each Amazon EC2 instance. Amazon EFS is designed to be highly durable and highly available. With Amazon EFS, there is no minimum fee or setup costs, and you pay only for the storage you use.
What use cases is Amazon EFS intended for?
General
Amazon Elastic File System (EFS) | Storage
Amazon EFS is designed to provide performance for a broad spectrum of workloads and applications, including Big Data and analytics, media processing workflows, content management, web serving, and home directories.
When should I use Amazon EFS vs. Amazon Simple Storage Service (S3) vs. Amazon Elastic Block Store (EBS)?
General
Amazon Elastic File System (EFS) | Storage
Amazon Web Services (AWS) offers cloud storage services to support a wide range of storage workloads.
Amazon EFS is a file storage service for use with Amazon EC2. Amazon EFS provides a file system interface, file system access semantics (such as strong consistency and file locking), and concurrently-accessible storage for up to thousands of Amazon EC2 instances.
Amazon EBS is a block level storage service for use with Amazon EC2. Amazon EBS can deliver performance for workloads that require the lowest-latency access to data from a single EC2 instance.
Amazon S3 is an object storage service. Amazon S3 makes data available through an Internet API that can be accessed anywhere.
What regions is Amazon EFS currently available in?
General
Amazon Elastic File System (EFS) | Storage
Please refer to Regional Products and Services for details of Amazon EFS service availability by region.
How do I get started using Amazon EFS?
General
Amazon Elastic File System (EFS) | Storage
To use Amazon EFS, you must have an Amazon Web Services account. If you do not already have an AWS account, you can create one by clicking the “Try the Free Tier” button on the Amazon EFS detail page.
Once you have created an AWS account, please refer to the Amazon EFS Getting Started guide to begin using Amazon EFS. You can create a file system via the AWS Management Console, the AWS Command Line Interface (AWS CLI), and Amazon EFS API (and various language-specific SDKs).
How do I access a file system from an Amazon EC2 instance?
General
Amazon Elastic File System (EFS) | Storage
To access your file system, you mount the file system on an Amazon EC2 Linux-based instance using the standard Linux mount command and the file system’s DNS name. Once you’ve mounted, you can work with the files and directories in your file system just like you would with a local file system.
Amazon EFS uses the NFSv4.1 protocol. For a step-by-step example of how to access a file system from an Amazon EC2 instance, please see the Amazon EFS Getting Started guide.
What Amazon EC2 instance types and AMIs work with Amazon EFS?
General
Amazon Elastic File System (EFS) | Storage
Amazon EFS is compatible with all Linux-based AMIs for Amazon EC2. You can mix and match the instance types connected to a single file system. For a step-by-step example of how to access a file system from an Amazon EC2 instance, please see the Amazon EFS Getting Started guide.
How do I manage a file system?
General
Amazon Elastic File System (EFS) | Storage
Amazon EFS is a fully-managed service, so all of the file storage infrastructure is managed for you. When you use Amazon EFS, you avoid the complexity of deploying and maintaining complex file system infrastructure. An Amazon EFS file system grows and shrinks automatically as you add and remove files, so you do not need to manage storage procurement or provisioning.
You can administer a file system via the AWS Management Console, the AWS command-line interface (CLI), or the Amazon EFS API (and various language-specific SDKs). The Console, API, and SDK provide the ability to create and delete file systems, configure how file systems are accessed, create and edit file system tags, and display detailed information about file systems.
How do I load data into a file system?
Data Protection and Availability
Amazon Elastic File System (EFS) | Storage
There are a number of methods for loading existing file system data into Amazon EFS, whether your existing file system data is located in AWS or in your on-premises servers.
Amazon EFS file systems can be mounted on an Amazon EC2 instance, so any data that is accessible to an Amazon EC2 instance can also be read and written to Amazon EFS. To load file data that is not currently stored in AWS, you can use EFS File Sync to copy data directly to Amazon EFS.
For on-premises file systems, EFS File Sync provides a fast and simple way to securely sync existing file systems into Amazon EFS. EFS File Sync works over any network connection, including with AWS Direct Connect. AWS Direct Connect provides a high bandwidth and lower latency dedicated network connection, over which you can mount your EFS filesystems. You can also use standard Linux copy tools to move data files to Amazon EFS.
For more information about accessing a file system from an on-premises server, please see the On-premises Access section of this FAQ.
For more information about moving data to the Amazon cloud, please see the Cloud Data Migration page.
How is Amazon EFS designed to provide high durability and availability?
Data Protection and Availability
Amazon Elastic File System (EFS) | Storage
Every file system object (i.e. directory, file, and link) is redundantly stored across multiple Availability Zones. In addition, a file system can be accessed concurrently from all Availability Zones in the region where it is located, which means that you can architect your application to failover from one AZ to other AZs in the region in order to ensure the highest level of application availability. Mount targets themselves are designed to be highly available.
How do I back up a file system?
Data Protection and Availability
Amazon Elastic File System (EFS) | Storage
Amazon EFS is designed to be highly durable. Using the EFS-to-EFS Backup solution, you can schedule automatic incremental backups of your Amazon EFS file system. For more information, please see the Amazon EFS Walkthrough: Backup Solutions for Amazon EFS File Systems.
How do I access my file system from outside my VPC?
Scale and Performance
Amazon Elastic File System (EFS) | Storage
Amazon EC2 instances within your VPC can access your file system directly, and Amazon EC2 Classic instances outside your VPC can mount a file system via ClassicLink. On-premises servers can mount your file systems via an AWS Direct Connect connection to your VPC.
How much data can I store?
Scale and Performance
Amazon Elastic File System (EFS) | Storage
Amazon EFS file systems can store petabytes of data. Amazon EFS file systems are elastic, and automatically grow and shrink as you add and remove files. You do not provision file system size or specify a size up front and you pay only for the storage you use.
How many Amazon EC2 instances can connect to a file system?
Scale and Performance
Amazon Elastic File System (EFS) | Storage
Amazon EFS supports one to thousands of Amazon EC2 instances connecting to a file system concurrently.
How many file systems can I create?
Scale and Performance
Amazon Elastic File System (EFS) | Storage
By default, you can create up to 10 file systems per AWS account per region. You can request to increase your file system limit by visiting AWS Service Limits.
How does Amazon EFS performance compare to that of other storage solutions?
Scale and Performance
Amazon Elastic File System (EFS) | Storage
Amazon EFS file systems are distributed across an unconstrained number of storage servers, enabling file systems to grow elastically to petabyte-scale and allowing massively parallel access from Amazon EC2 instances to your data. Amazon EFS’s distributed design avoids the bottlenecks and constraints inherent to traditional file servers.
This distributed data storage design means that multi-threaded applications and applications that concurrently access data from multiple Amazon EC2 instances can drive substantial levels of aggregate throughput and IOPS. Big Data and analytics workloads, media processing workflows, content management and web serving are examples of these applications.
The table below compares high-level performance and storage characteristics for Amazon’s file and block cloud storage offerings.
Amazon EFS
Per-operation latency: Low, consistent
Throughput scale: Multiple GBs per second
Amazon EBS
PIOPS
Per-operation latency: Low, consistent
Throughput scale: Single GB per second
Amazon EFS’s distributed nature enables high levels of availability, durability, and scalability. This distributed architecture results in a small latency overhead for each file operation. Due to this per-operation latency, overall throughput generally increases as the average I/O size increases, since the overhead is amortized over a larger amount of data. Amazon EFS’s support for highly parallelized workloads (i.e. with consistent operations from multiple threads and multiple EC2 instances) enables high levels of aggregate throughput and IOPS.