DB and Storage Flashcards
What is an example of unstructured data that organizations can capture from social media?
A. Post comments
B. Tagging
C. Profile picture
D. Location
A. Post comments
Reviews, Descriptions are free form text with no structure. Suitable option for such data is CloudBigTable
Photograph, Files should be stored on CloudStorage
Regular table type structured data should go to CloudSQL or CloudSpanner
An organization wants a cost-effective relational database.
Which Google Cloud service should the organization use?
A. Cloud Storage
B. BigQuery
C. Cloud SQL
D. Dataflow
C. Cloud SQL
GCP has two options for Relational DB. Cloud SQL and Cloud Spanner
CloudSQL can run MySQL, MS SQL type of DBs. But it can go only upto 10 TB.
If you have a DB running OnPrem you can migrate it to CloudSQL
Another option is CloudSpanner which gives you much better performance and unlimited storage (>10TB)
Your organization is developing an application that will capture a large amount of data from millions of different sensor devices spread all around the world. Your organization needs a database that is suitable for worldwide, high-speed data storage of a large amount of unstructured data.
Which Google Cloud product should your organization choose?
A. Firestore
B. Cloud Data Fusion
C. Cloud SQL
D. Cloud Bigtable
D. Cloud Bigtable
The need is for Unstructured data so CloudSQL or CloudSpanner can not be an option.
The Q is asking for Sensor/IoT data and
Bigtable is special for stream data IOT.
Your organization needs to analyze data in order to gather insights into its daily operations. You only want to pay for the data you store and the queries you perform. Which Google Cloud product should your organization choose for its data analytics warehouse?
A. Cloud SQL
B. Dataproc
C. Cloud Spanner
D. BigQuery
D. BigQuery
BigQuery is used for
- Data Analysis for structured and unstructured data
- As a Data Warehouse
-As a Data Lake
You can also use Looker to make visualization on the BigQuery data
Your organization is developing an application that will manage payments and online bank accounts located around the world. The most critical requirement for your database is that each transaction is handled consistently. Your organization anticipates almost unlimited growth in the amount of data stored.
Which Google Cloud product should your organization choose?
A. Cloud SQL
B. Cloud Storage
C. Firestore
D. Cloud Spanner
D. Cloud Spanner
The Q is aksing to store Bank Account data which is kind of structured data and needs a SQL, Relational DB.
So the options are CloudSQL and CloudSpanner.
CloudSQL has a limitation of 10TB and can not give you unlimited storage.
Hence the answer should be CloudSpanner
Your organization wants an economical solution to store data such as files, graphical images, and videos and to access and share them securely.
Which Google Cloud product or service should your organization use?
A. Cloud Storage
B. Cloud SQL
C. Cloud Spanner
D. BigQuery
A. Cloud Storage
The need is to store pdf, word, images, video etc. The best option for such storage is CloudStorage. This is like google drive.
How would a global organization benefit from managing their data with Cloud Spanner?
A. Cloud Spanner is optimized for cold storage
B. Cloud Spanner replicates data across regions in real time
C. Cloud Spanner is optimized to ingest unstructured data
D. Cloud Spanner visualizes and analyzes data in real time
B. Cloud Spanner replicates data across regions in real time
A: Cold storage is for CloudStorage
C: This is Cloud BigTable
D: This is Looker
our organization wants to migrate its data management solutions to Google Cloud because it needs to dynamically scale up or down and to run transactional
SQL queries against historical data at scale. Which Google Cloud product or service should your organization use?
A. BigQuery
B. Cloud Bigtable
C. Pub/Sub
D. Cloud Spanner
D. Cloud Spanner
We are talking about TRANSACTIONAL SQL queries which means a relational DB like CloudSQL or CloudSpanner
CloudSQL is not elastic and will not dynamically scale up/down. It will run fix number of servers.
BigQuery is for Analysis DB like Data Warehouse or Data Lake. It will also not provide transactional DB.
BigTable is for NoSQL, Unstructured DB
Pub/Sub is a messaging service and not a DB
Your organization wants to predict the behavior of visitors to its public website. To do that, you have decided to build a machine learning model. Your team has database-related skills but only basic machine learning skills, and would like to use those database skills.
Which Google Cloud product or feature should your organization choose?
A. BigQuery ML
B. LookML
C. TensorFlow
D. Cloud SQL
A. BigQuery ML
The imp point is that your team know only DB but not ML.
BigQuery ML lets you create and execute machine learning models in BigQuery using standard SQL queries.
There is no such readymade ML facility for CloudSQL.
Your team would need ML knowledge to use TenserFlow
Which Google Cloud service or feature lets you build machine learning models using Standard SQL and data in a data warehouse?
A. BigQuery ML
B. TensorFlow
C. AutoML Tables
D. Cloud Bigtable ML
D. BigQuery
BigQuery ML lets you create and execute machine learning models in BigQuery using standard SQL queries.
There is no such readymade ML facility for CloudSQL.
Your team would need ML knowledge to use TenserFlow
A global organization is developing an application to manage payments and online bank accounts in multiple regions. Each transaction must be handled consistently in their database, and they anticipate almost unlimited growth in the amount of data stored.
Which Google Cloud product should the organization choose?
A. Cloud SQL
B. Cloud Spanner
C. Cloud Storage
D. BigQuery
B. Cloud Spanner
The keywords are Bank Account kind of data and unlimited size of DB.
So you need a Relational DB means CloudSQL or CloudSpanner
CloudSQL has a 10 TB limit so it can’t be used as you need unlimited storage
An organization wants to use BigQuery data analytics to understand their website performance, but wants to move only some data into the cloud.
Which environment should the organization use?
A. Private cloud
B. On-premises
C. Multi-cloud
D. Hybrid cloud
D. Hybrid cloud
The Q says that you need to move only partial data from OnPrem to Cloud.
It means that the data will remain at OnPrem as well which would need a service that can make a connection between OnPrem + Cloud
Hybrid Cloud allows to establish a “connection” between cloud and on-premises.
Multi-clouds always include more than one public cloud service. It can also include Private Clouds as well.
An organization has created an ecommerce website. What data on this website would be considered structured data?
A) Product ratings score
B) Product reviews
C) Product photographs
D) Product descriptions
A) Product ratings score
Reviews, Descriptions are free form text with no structure. Suitable option for such data is CloudBigTable
Photograph, Files should be stored on CloudStorage
What is the difference between Standard and Coldline storage?
A. Coldline storage is for data for which a slow transfer rate is acceptable.
B. Standard and Coldline storage have different durability guarantees.
C. Standard and Coldline storage use different APIs.
D. Coldline storage is for infrequently accessed data.
D. Coldline storage is for infrequently accessed data.
If you need to store data that is accessed very frequently, in that case Standard storage will be cheaper.
The 4 categories are:
Standard (charged for min 30 days)
NearLine (charged for min 60 days)
ColdLine (charged for min 90 days)
Archive (This wont be accessible online. Think of this as data on tape kept in a safe box)
The cost of storage decreases from Standard to Archive.
Your company’s development team is building an application that will be deployed on Cloud Run. You are designing a CI/CD pipeline so that any new version of the application can be deployed in the fewest number of steps possible using the CI/CD pipeline you are designing. You need to select a storage location for the images of the application after the CI part of your pipeline has built them.
What should you do?
A. Create a Compute Engine image containing the application
B. Store the images in Container Registry
C. Store the images in Cloud Storage
D. Create a Compute Engine disk containing the application
C. Store the images in Cloud Storage
CloudStorage is to store Documents, Images, PDF etc
The 4 categories are:
Standard (charged for min 30 days)
NearLine (charged for min 60 days)
ColdLine (charged for min 90 days)
Archive (This wont be accessible online. Think of this as data on tape kept in a safe box)
The cost of storage decreases from Standard to Archive.