Road to Google Cloud Architect Certification Flashcards
Building for Builders LLC manufactures equipment used in residential and commercial
building. Each of its 500,000 pieces of equipment in use around the globe has IoT devices
collecting data about the state of equipment. The IoT data is streamed from each device
every 10 seconds. On average, 10 KB of data is sent in each message. The data will be used
for predictive maintenance and product development. The company would like to use a
managed database in Google Cloud. What would you recommend?
A. Apache Cassandra
B. Cloud Bigtable
C. BigQuery
D. CloudSQL
B. Option B is correct. Bigtable is the best option for streaming IoT data, since it supports
low-latency writes and is designed to scale to support petabytes of data. Option A is incorrect because Apache Cassandra is not a managed database in GCP. Option C is incorrect
because BigQuery is an analytics database. While it is a good option for analyzing the data,
Bigtable is a better option for ingesting the data. Option D is incorrect. CloudSQL is a
managed relational database. The use case does not require a relational database, and Bigtable’s scalability is a better fit with the requirements.
You have developed a web application that is becoming widely used. The frontend runs in
Google App Engine and scales automatically. The backend runs on Compute Engine in a
managed instance group. You have set the maximum number of instances in the backend
managed instance group to five. You do not want to increase the maximum size of the managed instance group or change the VM instance type, but there are times the frontend sends
more data than the backend can keep up with and data is lost. What can you do to prevent
the loss of data?
A. Use an unmanaged instance group B. Store ingested data in Cloud Storage C. Have the frontend write data to a Cloud Pub/Sub topic, and have the backend read from that topic D. Store ingested data in BigQuery
C. The correct answer is C. A Cloud Pub/Sub topic would decouple the frontend and
backend, provide a managed and scalable message queue, and store ingested data until the
backend can process it. Option A is incorrect. Switching to an unmanaged instance group
will mean that the instance group cannot autoscale. Option B is incorrect. You could store
ingested data in Cloud Storage, but it would not be as performant as the Cloud Pub/Sub
solution. Option D is incorrect because BigQuery is an analytics database and not designed
for this use case.
You are setting up a cloud project and want to assign members of your team different permissions. What GCP service would you use to do that?
A. Cloud Identity
B. Identity and Access Management (IAM)
C. Cloud Authorizations
D. LDAP
B. The correct answer is B. IAM is used to manage roles and permissions. Option A is
incorrect. Cloud Identity is a service for creating and managing identities. Option C is
incorrect. There is no GCP service with that name at this time. Option D is incorrect.
LDAP is not a GCP service.
You would like to run a custom container in a managed Google Cloud Service. What are
your two options?
A. App Engine Standard and Kubernetes Engine
B. App Engine Flexible and Kubernetes Engine
C. Compute Engine and Kubernetes Engine
D. Cloud Functions and App Engine Flexible
B. The correct answer is B. You can run custom containers in App Engine Flexible and
Kubernetes Engine. Option A is incorrect because App Engine Standard does not support
custom containers. Option C is incorrect because Compute Engine is not a managed service. Option D is incorrect because Cloud Functions does not support custom containers.
PhotosForYouToday prints photographs and ships them to customers. The frontend application uploads photos to Cloud Storage. Currently, the backend runs a cron job that checks
Cloud Storage buckets every 10 minutes for new photos. The product manager would like
to process the photos as soon as they are uploaded. What would you use to cause processing to start when a photo file is saved to Cloud Storage?
A Cloud Function
B. An App Engine Flexible application
C. A Kubernetes pod
D. A cron job that checks the bucket more frequently
A. The correct answer is A. A Cloud Function can respond to a create file event in Cloud
Storage and start processing when the file is created. Option B is incorrect because an
App Engine Flexible application cannot respond to a Cloud Storage write event. Option
C is incorrect. Kubernetes pods are the smallest compute unit in Kubernetes and are not
designed to respond to Cloud Storage events. Option D is incorrect because it does not
guarantee that photos will be processed as soon as they are created.
The chief financial officer of your company believes that you are spending too much money
to run an on-premises data warehouse and wants to migrate to a managed cloud solution.
What GCP service would you recommend for implementing a new data warehouse in GCP?
A. Compute Engine
B. BigQuery
C. Cloud Dataproc
D. Cloud Bigtable
B. The correct answer is B. BigQuery is a managed analytics database designed to support
data warehouses and similar use cases. Option A is incorrect. Compute Engine is not a
managed service. Option C is incorrect. Cloud Dataproc is a managed Hadoop and Spark
service. Option D is incorrect. Bigtable is a NoSQL database well suited for large-volume,
low-latency writes and limited ranges of queries. It is not suitable for the kind of ad hoc
querying commonly done with data warehouses.
A government regulation requires you to keep certain financial data for seven years. You
are not likely to ever retrieve the data, and you are only keeping it to be in compliance.
There are approximately 500 GB of financial data for each year that you are required to
save. What is the most cost-effective way to store this data?
A. Cloud Storage multiregional storage
B. Cloud Storage Nearline storage
C. Cloud Storage Coldline storage
D. Cloud Storage persistent disk storage
C. The correct answer is C. Cloud Storage Coldline is the lowest-cost option, and it is
designed for data that is accessed less than once per year. Option A and Option B are incorrect because they cost more than Coldline storage. Option D is incorrect because there is no
such service.
Global Games Enterprises Inc. is expanding from North America to Europe. Some of the
games offered by the company collect personal information. With what additional regulation will the company need to comply when it expands into the European market?
A. HIPAA
B. PCI-DS
C. GDPR
D. SOX
C. The correct answer is C. The GDPR is a European Union directive protecting the personal information of EU citizens. Option A is incorrect. HIPAA is a U.S. healthcare regulation. Option B is incorrect. PCI-DS is a payment card data security regulation; if Global
Games Enterprises Inc. is accepting payment cards in North America, it is already subject
to that regulation. Option D is a U.S. regulation on some publicly traded companies; the
company may be subject to that regulation already, and expanding to Europe will not
change its status.
Your team is developing a Tier 1 application for your company. The application will depend
on a PostgreSQL database. Team members do not have much experience with PostgreSQL
and want to implement the database in a way that minimizes their administrative responsibilities for the database. What managed service would you recommend?
A. Cloud SQL
B. Cloud Dataproc
C. Cloud Bigtable
D. Cloud PostgreSQL
A. The correct answer is A. Cloud SQL is a managed database service that supports PostgreSQL. Option B is incorrect. Cloud Dataproc is a managed Hadoop and Spark service.
Option C is incorrect. Cloud Bigtable is a NoSQL database. Option D is incorrect. There is
no service called Cloud PostgreSQL in GCP at this time
What is a service-level indicator?
A. A metric collected to indicate how well a service-level objective is being met
B. A type of log
C. A type of notification sent to a sysadmin when an alert is triggered
D. A visualization displayed when a VM instance is down
A. The correct answer is A. A service-level indicator is a metric used to measure how well
a service is meeting its objectives. Options B and C are incorrect. It is not a type of log or a
type of notification. Option D is incorrect. A service-level indicator is not a visualization,
although the same metrics may be used to drive the display of a visualization.
Developers at MakeYouFashionable have adopted agile development methodologies. Which
tool might they use to support CI/CD?
A. Google Docs
B. Jenkins
C. Apache Cassandra
D. Clojure
B. The correct answer is B. Jenkins is a popular CI/CD tool. Option A is incorrect. Google
Docs is a collaboration tool for creating and sharing documents. Option C is incorrect.
Cassandra is a NoSQL database. Option D is incorrect. Clojure is a Lisp-like programming
language that runs on the Java virtual machine (JVM).
You have a backlog of audio files that need to be processed using a custom application.
The files are stored in Cloud Storage. If the files were processed continuously on three
n1-standard-4 instances, the job could complete in two days. You have 30 days to deliver
the processed files, after which they will be sent to a client and deleted from your systems.
You would like to minimize the cost of processing. What might you do to help keep costs
down?
A. Store the files in coldline storage
B. Store the processed files in multiregional storage
C. Store the processed files in Cloud CDN
D. Use preemptible VMs
D. The correct answer is D. Use preemptible VMs, which cost significantly less than
standard VMs. Option A is incorrect. Coldline storage is not appropriate for files that are
actively used. Option B is incorrect. Storing files in multiregional storage will cost more
than regional storage, and there is no indication from the requirements that they should be
stored multiregionally. Option C is incorrect. There is no indication that the processed files
need to be distributed to a global user base.
You have joined a startup selling supplies to visual artists. One element of the company’s
strategy is to foster a social network of artists and art buyers. The company will provide
e-commerce services for artists and earn revenue by charging a fee for each transaction.
You have been asked to collect more detailed business requirements. What might you
expect as an additional business requirement?
A. The ability to ingest streaming data
B. A recommendation system to match buyers to artists
C. Compliance with SOX regulations
D. Natural language processing of large volumes of text
B. The correct answer is B. This is an e-commerce site matching sellers and buyers, so a
system that recommends artists to buyers can help increase sales. Option A is incorrect.
There is no indication of any need for streaming data. Option C is incorrect. This is a
startup, and it is not likely subject to SOX regulations. Option D is incorrect. There is no
indication of a need to process large volumes of text
You work for a manufacturer of specialty die cast parts for the aerospace industry. The
company has built a reputation as the leader in high-quality, specialty die cast parts, but
recently the number of parts returned for poor quality is increasing. Detailed data about the
manufacturing process is collected throughout every stage of manufacturing. To date, the
data has been collected and stored but not analyzed. There are a total of 20 TB of data. The
company has a team of analysts familiar with spreadsheets and SQL. What service might
you recommend for conducting preliminary analysis of the data?
A. Compute Engine
B. Kubernetes Engine
C. BigQuery
D. Cloud Functions
C. The correct answer is C. BigQuery is an analytics database that supports SQL. Options
A and B are incorrect because, although they could be used to run analytics applications,
such as Apache Hadoop or Apache Spark, it would require more administrative overhead.
Also, the team members working on this are analysts, but there is no indication that they
have the skills or desire to manage analytics platforms. Option D is incorrect. Cloud Functions is for running short programs in response to events in GCP.
A client of yours wants to run an application in a highly secure environment. They want to
use instances that will only run boot components verified by digital signatures. What would
you recommend they use in Google Cloud?
A. Preemptible VMs
B. Managed instance groups
C. Cloud Functions
D. Shielded VMs
The correct answer is D.
Shielded VMs include secure boot, which only runs digitally verified boot components. Option A is incorrect. Preemptible VMs are interruptible
instances, but they cost less than standard VMs. Option B is incorrect. Managed instance
groups are sets of identical VMs that are managed as a single entity. Option C is incorrect.
Cloud Functions is a PaaS for running programs in response to events in GCP.
You have installed the Google Cloud SDK. You would now like to work on transferring
files to Cloud Storage. What command-line utility would you use?
A. bq
B. gsutil
C. cbt
D. gcloud
B. The correct answer is B. gsutil is the command-line utility for working with Cloud
Storage. Option A is incorrect. bq is the command-line utility for working with BigQuery.
Option C is incorrect. cbt is the command-line utility for working with Cloud Bigtable.
Option D is incorrect. gcloud is used to work with most GCP services but not Cloud
Storage.
Kubernetes pods sometimes need access to persistent storage. Pods are ephemeral—they
may shut down for reasons not in control of the application running in the pod. What
mechanism does Kubernetes use to decouple pods from persistent storage?
A. PersistentVolumes
B. Deployments
C. ReplicaSets
D. Ingress
A. The correct answer is A. PersistentVolumes is Kubernetes’ way of representing storage
allocated or provisioned for use by a pod. Option B is incorrect. Deployments are a type
of controller consisting of pods running the same version of an application. Option C
is incorrect. A ReplicaSet is a controller that manages the number of pods running in a
deployment. Option D is incorrect. An Ingress is an object that controls external access to
services running in a Kubernetes cluster.
An application that you support has been missing service-level objectives, especially around
database query response times. You have reviewed monitoring data and determined that
a large number of database read operations is putting unexpected load on the system. The
database uses MySQL, and it is running in Compute Engine. You have tuned SQL queries,
and the performance is still not meeting objectives. Of the following options, which would
you try next?
A. Migrate to a NoSQL database.
B. Move the database to Cloud SQL.
C. Use Cloud Memorystore to cache data read from the database to reduce the number of
reads on the database.
D. Move some of the data out of the database to Cloud Storage.
C. The correct answer is C. Use Cloud Memorystore to reduce the number of reads against
the database. Option A is incorrect. The application is designed to work with a relational
database, and there is no indication that a NoSQL database is a better option overall.
Option B is incorrect. Simply moving the database to a managed service will not change the
number of read operations, which is the cause of the poor performance. Option D is incorrect. Moving data to Cloud Storage will not reduce the number of reads
You are running a complicated stream processing operation using Apache Beam. You want
to start using a managed service. What GCP service would you use?
A. Cloud Dataprep
B. Cloud Dataproc
C. Cloud Dataflow
D. Cloud Identity
C. The correct answer is C. Cloud Dataflow is an implementation of the Apache Beam
stream processing framework. Cloud Dataflow is a fully managed service. Option A is
incorrect. Cloud Dataprep is used to prepare data for analysis. Option B is incorrect. Cloud
Dataproc is a managed Hadoop and Spark service. Option D is incorrect. Cloud Identity is
an authentication service.
Your team has had a number of incidents in which Tier 1 and Tier 2 services were down for
more than 1 hour. After conducting a few retrospective analyses of the incidents, you have
determined that you could identify the causes of incidents faster if you had a centralized log
repository. What GCP service could you use for this?
A. Stackdriver Logging
B. Cloud Logging
C. Cloud SQL
D. Cloud Bigtable
A. The correct answer is A. Stackdriver Logging is a centralized logging service. Option
B is incorrect. There is no such service at this time. Option C and Option D are incorrect
because those are databases and not specifically designed to support the logging of the use
case described.
A Global 2000 company has hired you as a consultant to help architect a new logistics system. The system will track the location of parts as they are shipped between company facilities in Europe, Africa, South America, and Australia. Anytime a user queries the database,
they must receive accurate and up-to-date information; specifically, the database must support strong consistency. Users from any facility may query the database using SQL. What
GCP service would you recommend?
A. Cloud SQL
B. BigQuery
C. Cloud Spanner
D. Cloud Dataflow
C. The correct answer is C. Cloud Spanner is a globally scalable, strongly consistent relational database that can be queried using SQL. Option A is incorrect because it will not
scale to the global scale as Cloud Spanner will. Option B is incorrect. The requirements
describe an application that will likely have frequent updates and transactions. BigQuery
is designed for analytics and data warehousing. Option D is incorrect. Cloud Dataflow is a
stream and batch processing service.
A database architect for a game developer has determined that a NoSQL document
database is the best option for storing players’ possessions. What GCP service would you
recommend?
A. Cloud Datastore
B. Cloud Storage
C. Cloud Dataproc
D. Cloud Bigtable
A. The correct answer is A. Cloud Datastore is a managed document NoSQL database
in GCP. Option B is incorrect. Cloud Storage is an object storage system, not a document
NoSQL database. Option C is incorrect. Cloud Dataproc is a managed Hadoop and Spark
service. Option D is incorrect. Cloud Bigtable is a wide-column NoSQL database, not a
document database.
A major news agency is seeing increasing readership across the globe. The CTO is concerned that long page-load times will decrease readership. What might the news agency try
to reduce the page-load time of readers around the globe?
A. Regional Cloud Storage
B. Cloud CDN
C. Fewer firewall rules
D. Virtual private network
B. The correct answer is B. Cloud CDN is GCP’s content delivery network, which distributes static content globally. Option A is incorrect. Reading from regional storage can still
have long latencies for readers outside of the region. Option C is incorrect. Firewall rules do
not impact latency in any discernible way. Option D is incorrect. VPNs are used to link onpremises networks to Google Cloud
What networking mechanism allows different VPC networks to communicate using private
IP address space, as defined in RFC 1918?
A. ReplicaSets
B. Custom subnets
C. VPC network peering
D. Firewall rules
C. The correct answer is C. VPC peering allows different VPCs to communicate using
private networks. Option A is incorrect. ReplicaSets are used in Kubernetes; they are not
related to VPCs. Option B is incorrect. Custom subnets define network address ranges for
regions. Option D is incorrect. Firewall rules control the flow of network traffic.