Storage in the Cloud (advanced) Flashcards
A set of general and more detailed questions around cloud storage options.
List Google’s core storage options?
Cloud Storage, Cloud SQL, Cloud Spanner, Cloud Datastore, Firestore and Cloud Bigtable
What type of storage is Cloud storage?
Object storage (not file or block storage)
Name Google’s object storage service?
Cloud Storage
How do you reference objects stored in Cloud Storage?
By a URL
Why does Cloud Storage interact well with Web technologies?
Because the unique object references can be defined as URLs.
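The URL-based addressing described above can be sketched in Python. This is a minimal illustration, assuming the `https://storage.googleapis.com/<bucket>/<object>` form and the `gs://` URI form; the function names, bucket name and object name are hypothetical.

```python
from urllib.parse import quote


def object_https_url(bucket: str, object_name: str) -> str:
    """Build an HTTPS URL for an object.

    '/' in the object name is kept as-is; other unsafe characters
    (such as spaces) are percent-encoded.
    """
    return f"https://storage.googleapis.com/{bucket}/{quote(object_name)}"


def object_gs_uri(bucket: str, object_name: str) -> str:
    """Build the gs:// URI used by command-line tools such as gsutil."""
    return f"gs://{bucket}/{object_name}"


# Hypothetical bucket and object names, for illustration only.
print(object_https_url("example-bucket", "images/cat photo.png"))
print(object_gs_uri("example-bucket", "images/cat photo.png"))
```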
What is Cloud Storage?
A fully managed, scalable, highly durable and highly available object storage service.
Which storage service would you use for a Website?
Cloud Storage
Which storage service would you use for archival and/or disaster recovery?
Cloud Storage
Which storage service would you use for distributing large data objects to your end users via direct download?
Cloud Storage
How are objects organised in Cloud Storage?
In Buckets
Can you re-use the same bucket name in a different project?
No, by definition the bucket name is globally unique.
What are the 3 attributes you need to specify when creating a bucket?
1. A globally unique name, 2. A geographic location, 3. A default storage class
Can you edit objects stored in Cloud Storage?
No: objects are immutable
How does Google secure Cloud Storage data in transit and at rest?
In transit: HTTPS (TLS). At rest: data is encrypted on the server side before it is written to disk.
How does Google apply modifications to a given object when version control is disabled?
It overwrites the original object.
What life cycle management policies can you define on Cloud Storage?
Delete objects based on age, creation date or number of versions (keep x most recent).
When enabling Cloud Storage version control, what should you do?
Define a life cycle management policy to avoid bloat.
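A life cycle policy of the kind described above can be sketched as a Python dictionary. This is a minimal example, assuming the JSON layout accepted by `gsutil lifecycle set`; the thresholds (365 days, 3 newer versions) are illustrative placeholders, not recommendations.

```python
import json

# Hypothetical policy for a bucket with version control enabled.
lifecycle_policy = {
    "rule": [
        # Delete any object (live or noncurrent) more than 365 days old.
        {"action": {"type": "Delete"}, "condition": {"age": 365}},
        # Delete a noncurrent version once 3 newer versions of it exist.
        {
            "action": {"type": "Delete"},
            "condition": {"isLive": False, "numNewerVersions": 3},
        },
    ]
}

# Serialise to JSON, e.g. to save as a file and apply to a bucket.
policy_json = json.dumps(lifecycle_policy, indent=2)
print(policy_json)
```

The second rule is what keeps versioned buckets from growing without bound: it retains only the most recent versions of each object.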
At what level is version control defined in Cloud Storage?
Bucket level (not object).
What does ACL stand for?
Access Control List
What storage classes can you choose from for Cloud Storage?
Multi‑regional, Regional, Nearline and Coldline.
Which storage classes would you choose from for backup and archival?
Nearline or Coldline
Which storage classes would you choose from for high-performance access?
Multi-regional or Regional
Order storage classes in descending order of storage price?
Multi-regional, Regional, Nearline, Coldline
Order storage classes in descending order of retrieval price?
Coldline, Nearline, Regional, Multi-regional
What is the intended access frequency of nearline storage?
Once a month or less.
What is the intended access frequency of coldline storage?
Less than once a year.
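The access-frequency guidance in these cards can be condensed into a small helper. This is a rough sketch only: the function name and the `serve_multiple_regions` parameter are hypothetical, and the thresholds simply restate the cards (Coldline for less than once a year, Nearline for once a month or less).

```python
def suggest_storage_class(accesses_per_year: float,
                          serve_multiple_regions: bool = False) -> str:
    """Suggest a storage class from the expected access frequency."""
    if accesses_per_year < 1:
        # Accessed less than once a year: archival, disaster recovery.
        return "Coldline"
    if accesses_per_year <= 12:
        # Accessed once a month or less: backups.
        return "Nearline"
    # Frequently accessed data: choose by geographic reach.
    return "Multi-regional" if serve_multiple_regions else "Regional"


print(suggest_storage_class(0.5))
print(suggest_storage_class(6))
print(suggest_storage_class(1000, serve_multiple_regions=True))
```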
What storage class would you use for data archival?
Coldline
What storage class would you use for disaster recovery?
Coldline
What storage class would you use for backups (you might need to access within 30 days)?
Nearline
What is the minimum number of days storage you are committing to when you opt for coldline storage?
90 days
What unit of measure is used to charge for Cloud Storage?
GB of stored data per month
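As a worked example of per-GB, per-month billing, the arithmetic can be sketched as follows. The price used here is a made-up placeholder, not a real Google price, and the function name is hypothetical; egress and retrieval charges are deliberately left out.

```python
def monthly_storage_cost(stored_gb: float, price_per_gb_month: float) -> float:
    """Storage charge for one month: volume stored times the per-GB monthly price."""
    return stored_gb * price_per_gb_month


# Placeholder price in USD per GB per month (an assumption for illustration).
HYPOTHETICAL_PRICE_PER_GB_MONTH = 0.01

cost = monthly_storage_cost(5000, HYPOTHETICAL_PRICE_PER_GB_MONTH)
print(f"5000 GB for one month: {cost:.2f} USD")
```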
Do egress and data transfer charges apply when accessing Cloud Storage?
Yes
How do you transfer small objects into Cloud Storage?
Online, using the gsutil command-line tool or drag-and-drop in the GCP Console.
How do you transfer large volumes (terabytes - TB) of data to Cloud Storage?
(Online) Storage Transfer Service, (Offline) Transfer Appliance.
From which types of source can the Storage Transfer Service schedule and manage batch transfers into Cloud Storage?
Another cloud provider, a different Cloud Storage region, or an HTTPS endpoint.
What is the Transfer Appliance?
A rackable server with high capacity storage that you lease from Google.
How do you use Google’s data Transfer Appliance?
Connect it to your network, load it with data, and then ship it to an upload facility where the data is uploaded to Cloud Storage.
What is the maximum capacity of a single Transfer Appliance?
Up to a petabyte (PB) of data
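To see why an offline appliance can make sense at this scale, it helps to estimate how long an online transfer would take. The sketch below is simple arithmetic under idealised assumptions (decimal units, a fully saturated link, no protocol overhead); the function name and the example bandwidth are hypothetical.

```python
def online_transfer_days(size_tb: float, bandwidth_mbps: float) -> float:
    """Estimate days needed to upload size_tb terabytes at bandwidth_mbps megabits per second."""
    bits = size_tb * 1e12 * 8                  # terabytes -> bits
    seconds = bits / (bandwidth_mbps * 1e6)    # megabits/s -> bits/s
    return seconds / 86_400                    # seconds -> days


# 1 PB (1000 TB) over a dedicated 1 Gbps (1000 Mbps) link.
days = online_transfer_days(1000, 1000)
print(f"{days:.1f} days")
```

Under these assumptions a petabyte takes roughly three months to upload over 1 Gbps, which is the kind of case the Transfer Appliance is aimed at.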
What type of DB is Cloud Bigtable?
NoSQL
Which NoSQL DB options can you choose from?
Cloud Bigtable, Firestore, Datastore, Memorystore and Firebase Realtime Database
Which open source database API does Cloud Bigtable use?
The HBase API. HBase is the native database of the Apache Hadoop project.
What simplifies portability between HBase and Bigtable?
They both use the same open source API.
Which 3 types of application APIs could you use to read/write data to Cloud Bigtable?
A data service layer like managed VMs, the HBase REST Server, a Java server using the HBase client.
Which 3 stream processing frameworks can you use to read/write data to Cloud Bigtable?
Cloud Dataflow streaming, Spark streaming or Storm
Which 3 batch processing frameworks can you use to read/write data to Cloud Bigtable?
Cloud Dataflow, Spark or Hadoop MapReduce
What 3 different ways do you have to read/write to Cloud Bigtable?
Realtime, streaming or batch processes.
What does RDBMS stand for?
Relational Database Management System
Which GCP services allow you to set up a relational database (as opposed to a NoSQL DB)?
Cloud SQL, Cloud Spanner and BigQuery
Which DB engines can you select from in Cloud SQL?
MySQL, PostgreSQL and SQL Server
What is Google Cloud SQL?
A managed service for relational databases
Which cloud storage options support transactions?
Cloud SQL, Spanner, Datastore, Firestore and Bigtable (but only single row)
Does BigQuery support transactions?
No
Which cloud storage options would you use for blob storage?
Cloud Storage
Which cloud services support complex SQL queries?
Cloud SQL, Cloud Spanner and BigQuery
Which cloud storage options support Petabytes+ of data capacity?
Bigtable, Cloud Datastore, Firestore and BigQuery
What is the max unit size for a Cloud Datastore object?
1 MB / unit
What is the max unit size for a Bigtable object?
~10 MB / cell, ~100 MB / row
What is the max unit size for a Cloud Storage object?
5 TB / Object
What determines the max unit size for a Cloud SQL object?
The DB engine you’ve selected (MySQL, PostgreSQL or SQL Server)
What is the max unit size for a Cloud Spanner row?
10,240 MiB (Mebibyte = 2^20 bytes) / row
What is the max unit size for a BigQuery row?
10 MB / row
Excluding Cloud SQL options, list cloud storage options in ascending order of row size they support?
BigQuery (10 MB), Bigtable (100 MB), Cloud Spanner (10,240 MiB - mebibytes)
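The ordering above can be checked by converting each limit to bytes and sorting. This is a small sketch using the figures quoted in these cards, assuming MB means 10^6 bytes and MiB means 2^20 bytes; the variable names are illustrative.

```python
MB = 10**6    # megabyte (decimal)
MIB = 2**20   # mebibyte (binary)

# Maximum row sizes as quoted in the cards, converted to bytes.
max_row_bytes = {
    "Cloud Spanner": 10_240 * MIB,
    "BigQuery": 10 * MB,
    "Bigtable": 100 * MB,
}

# Sort service names by their row-size limit, smallest first.
ascending = sorted(max_row_bytes, key=max_row_bytes.get)
for name in ascending:
    print(f"{name}: {max_row_bytes[name]:,} bytes")
```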