Blob Storage Flashcards
Objective
Store massive amounts of unstructured data
Types of resources
Blob storage offers three types of resources:
1. The storage account.
2. A container in the storage account.
3. A blob in a container.
Storage account
A storage account provides an unique namespaces for
data.
A container
A container organizes a set of blobs, similar to a directory in a
file system.
A storage account can include an unlimited number of
containers, and a container can store an unlimited number of
blobs.
Types of blobs
Block blobs store text and binary data, up to about 190 TB.
Append blobs are made up of blocks optimized for append
operations. This can be used for what?
Page blobs store random access files up to 8 TB in size. Page
blobs store virtual hard drive (VHD) files and serve as disks for
Azure virtual machines.
Performance tiers and when to use them
Premium
* Optimized for high transaction rates and single-digit consistent
storage latency
Interactive workloads, analytics, AI/ML, data transformation
Standard
* Optimized for high capacity and high throughput
Media contents, backup, bulk data processing
Access tiers
Hot access tier
* Optimized for frequent access of objects. Storage costs are
higher.
New storage accounts are created in the hot tier by default.
Cool access tier
* Optimized for storing large amounts of data that is infrequently
accessed and stored for at least 30 days. Accessing data may be
more expensive than accessing data in the hot tier.
Archive tier
* Optimized for data that can tolerate several hours of retrieval
latency and will remain in the Archive tier for at least 180 days.
Uses of redundancy
Redundancy can be used for:
1. Providing high availability / fault tolerance;
2. Providing faster access to data (due to additional copies
and location of copies).
Locally-redundant storage
Tolerate machine failures in a
region.
Data is replicated synchronously
three times within the primary
region.
Zone-redundant storage
Tolerates data center failures in
a given region.
Data is replicated synchronously
across three Azure availability
zones in the primary region.
Geo-redundant storage
Tolerates complete region failures – with RA-GRS, it is possible to
read from other region – potential better latency.
Data is replicated synchronously three times in the primary region,
then replicated asynchronously to the secondary region.
Geo-zone-redundant storage
- Tolerates both data center failures and complete region failures –
with RA-GZRS, it is possible to read from other region – potential
better latency. - Data is replicated synchronously across three Azure availability zones
in the primary region, then replicated asynchronously to the
secondary region.
Inside a DC, data is replicated in different racks. Why?
In case there is a failure in a rack.
Soft delete
Blob soft delete protects an individual blob (or container)
from accidental deletes or overwrites.
* Maintains the deleted data for a specified period of time,
during which it is possible to “undelete” the blob.
Versioning
- Allow to maintain multiple version for a blob.