Data Storage Flashcards
What are GCP’s 6 main data storage services?
- Cloud Storage
- Cloud SQL
- Cloud Spanner
- Firestore
- Bigtable
- BigQuery
What is Cloud Storage?
Cloud Storage is a storage service for objects.
What is Cloud Storage’s storage capacity?
Unlimited
What is a Cloud Storage object and how large can it be?
An object is an immutable unit of data that can be in any file format. Every object has 2 components: object data and object metadata. Objects are very similar to Avro objects. An object can be up to 5 TB in size.
What is the identifier for a Cloud Storage object?
An object’s sole identifier is the combination of its unique name (which is generated by the owner) and its generation number which is created by Cloud Storage.
How are Cloud Storage objects normally accessed?
Through HTTP GET requests
What is object composition in the context of using Cloud Storage?
Object composition is the process of creating small objects out of chunks of data in parrallel, combining them together to create a single object, and then deleting the small temporary objects.
Are Cloud Storage operations strongly consistent?
Yes
What is object versioning?
Cloud versioning is a Cloud Storage feature that allows a replaced or deleted object to be retained as a noncurrent version by enabling object versioning. This allows you to keep a historical record of object changes and to also restore an object to an older state.
Describe the IAM roles inheritance sequence between projects, buckets, and objects.
Generally speaking, Cloud Storage buckets inherit IAM roles from projects, and Cloud Storage objects inherit IAM roles from buckets. Please keep in mind, there are ways to set IAM Roles at the object level. Look at the docs for details.
What is a Cloud Storage bucket?
A bucket is a container for objects. All buckets are associated with a project.
Are Bucket names required to be globally unique?
Yes. Consider using UUIDs.
Bucket and object names appear in URLs, do not use sensitive information to create names.
What are the 4 different Cloud Storage classes?
- Standard
- Nearline
- Coldline
- Archive
Standard does not have a minimum storage duration and has the lowest operation fees.
Nearline has a 30 day minimum storage duration and has operation fees.
Coldline has a 90 day minimunm storage duration and has operation fees.
Archive has a 365 day minimum storage duration and has the most expensive operation fees.
Wile IAM can be used to manage access to buckets and objects by inheritance, it is not enough if you need granular control over individual objects. If you need to control access to individual objects, you can use object ACLs or Signed URLs (can be shared with anyone). ACLs are specific to Cloud Storage and not any other service ACLs are a legacy feature. Google Recommends using IAM unless there is a dire need to implement ACLs.
What is a bucket-lock?
A bucket-lock is a feature that allows you to lock-in the data retention policy, permanently preventing the policy from being reduced or remove for its duration. You can always, however, increase the retention policy after a bucket-lock.