GCP Storage Flashcards
Cloud Storage
Simple. Buckets (like directory) - must be globally unique /Object (file). Reliable. Located in region or multi-region (store close to customers). Can do object versioning. Storage classes: Standard, Nearline (30 days), Coldline (90 days), Archive (365 days).
Cloud BigTable
Big Data. High throughput, low latency. Can store Petabytes) Ideal for single-keyed data. Can scale up and down in seconds. Can stream data into it or batched in.
Cloud Firestore
Good for mobile and web apps & games (integrates well with Firebase). NoSQL - not for relational data. Automatic scaling. APIs for several languages. Replicated across several data centers. SQL-like capabilities.
Cloud Spanner
GLobal SQL. Horizontally scalable. Instance created in 3 zones (in one region or across multiple regions). Can be delay as updates replicate.
Cloud SQL
Fully managed DB Service (backups, replication) (MySQL, Postgres, MSSQL) (standard DB, just managed so you don’t have to)
BigQuery
SQL database. Enterprise data warehouse. Serverless. GB to PB. Real-time insights from streaming data. Built-in ML.
A customer has been a long time Apache Hadoop user and has many tools and an ecosystem built around that, which service should be easiest to adopt in Google Cloud? *
A) Cloud Dataproc
B) BigQuery
C) Cloud Dataflow
D) Cloud Storage
Answer: A - Cloud Dataproc
Dataproc is a fully managed and highly scalable service for running Apache Spark, Apache Flink, Presto, and 30+ open source tools and frameworks. Use Dataproc for data lake modernization, ETL, and secure data science, at planet scale, fully integrated with Google Cloud, at a fraction of the cost.
What command is used to show a list of Cloud Storage buckets?
A) gcloud show buckets
B) gcloud ls
C) gsutil ls
D) gsutil show buckets
Answer: C - gsutil ls
gsutil is a python app that is installed with the Cloud SDK (but can be installed separately as well) and works with both Google Cloud Storage but also with Amazon S3. For historical reasons, it wasn’t incorporated into the gcloud command and continued to exist independently as gsutil.
ls - List providers, buckets, or objects
I need a SQL database with strong consistency for my global application. Which is the correct choice?
A) Choose Cloud SQL
B) Choose Big Query
C) Choose Cloud Spanner
D) Choose Firestore
Answer: C - Cloud Spanner
Cloud Spanner - Fully managed relational database with unlimited scale, strong consistency, and up to 99.999% availability. Cloud Spanner offers transactional consistency at global scale, schemas, SQL (ANSI 2011 with extensions), and automatic, synchronous replication for high availability.
A customer is looking for a fully-managed cost-effective analytics data warehouse solution that lets you run analytics over vast amounts of data in near real time. Which would you recommend?
A) Cloud Storage
B) Looker
C) BigQuery
D) Data Studio
Answer: C - BigQuery
BigQuery - Serverless, highly scalable, and cost-effective multi-cloud data warehouse designed for business agility.
What are the available storage classes for Cloud Storage?
A) Standard, Nearline, Coldline, Archive
B) Active, Hybrid, Archive, Long Term
C) Standard, Regional, Distributed
D) Standard, Nearline, Archive, Tape
Answer: - A - Standard, Nearline, Coldline, Archive
Remember acronym CANS
Transferring data from S3 to GCS I can use…
A) Cloud Tiering Appliance
B) Cloud Shuffler
C) Cloud Storage Transfer Service
D) Cloud Data Mover
Answer: C - Cloud Storage Transfer Service
While you can use gsutil to transfer from another cloud storage provider to Cloud Storage, Storage Transfer Service is recommended for this use case. You can also use gsutil to transfer between Cloud Storage buckets. This can be helpful for moving data between bucket locations.