DB and Storage Flashcards
What is an example of unstructured data that organizations can capture from social media?
A. Post comments
B. Tagging
C. Profile picture
D. Location
A. Post comments
Reviews, Descriptions are free form text with no structure. Suitable option for such data is CloudBigTable
Photograph, Files should be stored on CloudStorage
Regular table type structured data should go to CloudSQL or CloudSpanner
An organization wants a cost-effective relational database.
Which Google Cloud service should the organization use?
A. Cloud Storage
B. BigQuery
C. Cloud SQL
D. Dataflow
C. Cloud SQL
GCP has two options for Relational DB. Cloud SQL and Cloud Spanner
CloudSQL can run MySQL, MS SQL type of DBs. But it can go only upto 10 TB.
If you have a DB running OnPrem you can migrate it to CloudSQL
Another option is CloudSpanner which gives you much better performance and unlimited storage (>10TB)
Your organization is developing an application that will capture a large amount of data from millions of different sensor devices spread all around the world. Your organization needs a database that is suitable for worldwide, high-speed data storage of a large amount of unstructured data.
Which Google Cloud product should your organization choose?
A. Firestore
B. Cloud Data Fusion
C. Cloud SQL
D. Cloud Bigtable
D. Cloud Bigtable
The need is for Unstructured data so CloudSQL or CloudSpanner can not be an option.
The Q is asking for Sensor/IoT data and
Bigtable is special for stream data IOT.
Your organization needs to analyze data in order to gather insights into its daily operations. You only want to pay for the data you store and the queries you perform. Which Google Cloud product should your organization choose for its data analytics warehouse?
A. Cloud SQL
B. Dataproc
C. Cloud Spanner
D. BigQuery
D. BigQuery
BigQuery is used for
- Data Analysis for structured and unstructured data
- As a Data Warehouse
-As a Data Lake
You can also use Looker to make visualization on the BigQuery data
Your organization is developing an application that will manage payments and online bank accounts located around the world. The most critical requirement for your database is that each transaction is handled consistently. Your organization anticipates almost unlimited growth in the amount of data stored.
Which Google Cloud product should your organization choose?
A. Cloud SQL
B. Cloud Storage
C. Firestore
D. Cloud Spanner
D. Cloud Spanner
The Q is aksing to store Bank Account data which is kind of structured data and needs a SQL, Relational DB.
So the options are CloudSQL and CloudSpanner.
CloudSQL has a limitation of 10TB and can not give you unlimited storage.
Hence the answer should be CloudSpanner
Your organization wants an economical solution to store data such as files, graphical images, and videos and to access and share them securely.
Which Google Cloud product or service should your organization use?
A. Cloud Storage
B. Cloud SQL
C. Cloud Spanner
D. BigQuery
A. Cloud Storage
The need is to store pdf, word, images, video etc. The best option for such storage is CloudStorage. This is like google drive.
How would a global organization benefit from managing their data with Cloud Spanner?
A. Cloud Spanner is optimized for cold storage
B. Cloud Spanner replicates data across regions in real time
C. Cloud Spanner is optimized to ingest unstructured data
D. Cloud Spanner visualizes and analyzes data in real time
B. Cloud Spanner replicates data across regions in real time
A: Cold storage is for CloudStorage
C: This is Cloud BigTable
D: This is Looker
our organization wants to migrate its data management solutions to Google Cloud because it needs to dynamically scale up or down and to run transactional
SQL queries against historical data at scale. Which Google Cloud product or service should your organization use?
A. BigQuery
B. Cloud Bigtable
C. Pub/Sub
D. Cloud Spanner
D. Cloud Spanner
We are talking about TRANSACTIONAL SQL queries which means a relational DB like CloudSQL or CloudSpanner
CloudSQL is not elastic and will not dynamically scale up/down. It will run fix number of servers.
BigQuery is for Analysis DB like Data Warehouse or Data Lake. It will also not provide transactional DB.
BigTable is for NoSQL, Unstructured DB
Pub/Sub is a messaging service and not a DB
Your organization wants to predict the behavior of visitors to its public website. To do that, you have decided to build a machine learning model. Your team has database-related skills but only basic machine learning skills, and would like to use those database skills.
Which Google Cloud product or feature should your organization choose?
A. BigQuery ML
B. LookML
C. TensorFlow
D. Cloud SQL
A. BigQuery ML
The imp point is that your team know only DB but not ML.
BigQuery ML lets you create and execute machine learning models in BigQuery using standard SQL queries.
There is no such readymade ML facility for CloudSQL.
Your team would need ML knowledge to use TenserFlow
Which Google Cloud service or feature lets you build machine learning models using Standard SQL and data in a data warehouse?
A. BigQuery ML
B. TensorFlow
C. AutoML Tables
D. Cloud Bigtable ML
D. BigQuery
BigQuery ML lets you create and execute machine learning models in BigQuery using standard SQL queries.
There is no such readymade ML facility for CloudSQL.
Your team would need ML knowledge to use TenserFlow
A global organization is developing an application to manage payments and online bank accounts in multiple regions. Each transaction must be handled consistently in their database, and they anticipate almost unlimited growth in the amount of data stored.
Which Google Cloud product should the organization choose?
A. Cloud SQL
B. Cloud Spanner
C. Cloud Storage
D. BigQuery
B. Cloud Spanner
The keywords are Bank Account kind of data and unlimited size of DB.
So you need a Relational DB means CloudSQL or CloudSpanner
CloudSQL has a 10 TB limit so it can’t be used as you need unlimited storage
An organization wants to use BigQuery data analytics to understand their website performance, but wants to move only some data into the cloud.
Which environment should the organization use?
A. Private cloud
B. On-premises
C. Multi-cloud
D. Hybrid cloud
D. Hybrid cloud
The Q says that you need to move only partial data from OnPrem to Cloud.
It means that the data will remain at OnPrem as well which would need a service that can make a connection between OnPrem + Cloud
Hybrid Cloud allows to establish a “connection” between cloud and on-premises.
Multi-clouds always include more than one public cloud service. It can also include Private Clouds as well.
An organization has created an ecommerce website. What data on this website would be considered structured data?
A) Product ratings score
B) Product reviews
C) Product photographs
D) Product descriptions
A) Product ratings score
Reviews, Descriptions are free form text with no structure. Suitable option for such data is CloudBigTable
Photograph, Files should be stored on CloudStorage
What is the difference between Standard and Coldline storage?
A. Coldline storage is for data for which a slow transfer rate is acceptable.
B. Standard and Coldline storage have different durability guarantees.
C. Standard and Coldline storage use different APIs.
D. Coldline storage is for infrequently accessed data.
D. Coldline storage is for infrequently accessed data.
If you need to store data that is accessed very frequently, in that case Standard storage will be cheaper.
The 4 categories are:
Standard (charged for min 30 days)
NearLine (charged for min 60 days)
ColdLine (charged for min 90 days)
Archive (This wont be accessible online. Think of this as data on tape kept in a safe box)
The cost of storage decreases from Standard to Archive.
Your company’s development team is building an application that will be deployed on Cloud Run. You are designing a CI/CD pipeline so that any new version of the application can be deployed in the fewest number of steps possible using the CI/CD pipeline you are designing. You need to select a storage location for the images of the application after the CI part of your pipeline has built them.
What should you do?
A. Create a Compute Engine image containing the application
B. Store the images in Container Registry
C. Store the images in Cloud Storage
D. Create a Compute Engine disk containing the application
C. Store the images in Cloud Storage
CloudStorage is to store Documents, Images, PDF etc
The 4 categories are:
Standard (charged for min 30 days)
NearLine (charged for min 60 days)
ColdLine (charged for min 90 days)
Archive (This wont be accessible online. Think of this as data on tape kept in a safe box)
The cost of storage decreases from Standard to Archive.
Your organization is migrating to Google Cloud. As part of that effort, it needs to move terabytes of data from on-premises file servers to Cloud Storage. Your organization wants the migration process to be automated and to be managed by Google. Your organization has an existing Dedicated Interconnect connection that it wants to use. Which Google Cloud product or feature should your organization use?
A. Storage Transfer Service
B. Migrate for Anthos
C. BigQuery Data Transfer Service
D. Transfer Appliance
D. Transfer Appliance
If you have data that is more than 10 TB or it will take more than a week to transfer, then you can use Transfer Appliance. This is a like a very huge USB drive where you can upload your data. Then google will upload it in your cloud.
If you have less than 10 TB or you think data can transferred in few hours or days, then go with Storage Transfer Service which is like like doing FTP to cloud.
If you need to transfer data between clouds, then also Storage Transfer will be a better option.
Your organization is migrating to Google Cloud. As part of that effort, it needs to move terabytes of data from on-premises file servers to Cloud Storage. Your organization has calculated that with the current internet speed, it will take them 3 weeks to transfer the data but they want to transfer it within a week. Which Google Cloud product or feature should your organization use?
A. Storage Transfer Service
B. Migrate for Anthos
C. BigQuery Data Transfer Service
D. Transfer Appliance
D. Transfer Appliance
f you have data that is more than 10 TB or it will take more than a week to transfer, then you can use Transfer Appliance. This is a like a very huge USB drive where you can upload your data. Then google will upload it in your cloud.
If you have less than 10 TB or you think data can transferred in few hours or days, then go with Storage Transfer Service which is like like doing FTP to cloud.
If you need to transfer data between clouds, then also Storage Transfer will be a better option.
Your organization is migrating to Google Cloud. As part of that effort, it needs to move terabytes of data from amazon S3 which is a cloud storage to Google Cloud Storage. Which Google Cloud product or feature should your organization use?
A. Storage Transfer Service
B. Migrate for Anthos
C. BigQuery Data Transfer Service
D. Transfer Appliance
A. Storage Transfer Service
If you have data that is more than 10 TB or it will take more than a week to transfer, then you can use Transfer Appliance. This is a like a very huge USB drive where you can upload your data. Then google will upload it in your cloud.
If you have less than 10 TB or you think data can transferred in few hours or days, then go with Storage Transfer Service which is like like doing FTP to cloud.
If you need to transfer data between clouds, then also Storage Transfer will be a better option.
Your team is publishing research results and needs to make large amounts of data available to other researchers within the professional community and the public at minimum cost.
How should you host the data?
A. Use a Cloud Storage bucket and enable “Requester Pays”
B. Use a Cloud Storage bucket and provide Signed URLs for the data files.
C. Use a Cloud Storage bucket and set up a Cloud Interconnect connection to allow access to the data.
D. Host the data on-premises, and set up a Cloud Interconnect connection to allow access to the data.
A. Use a Cloud Storage bucket and enable “Requester Pays”
The Q asks whether you will pay for the network cost for the data people will down or not.
Cloud Storage has an option where you can make the people who are downloading your data to pay for it. This is called “Requester Pay” option.
How would an organization benefit from using Looker?
A. Optimal identity and access management
B. Leading serverless warehousing technology
C. Robust data roll-back accuracy
D. Advanced business intelligence and analytics
D. Advanced business intelligence and analytics
Looker is a BI application. You can use it to make reports, graphs on data from BigQuery or any other Database
An organization wants to transform multiple types of structured and unstructured data in the cloud from various sources. The data must be readily accessible for analysis and insights.
Which cloud data storage system should the organization use?
A. Relational database
B. Private data center
C. Data field
D. Data warehouse
D. Data warehouse
Data Warehouse or Data Lake can take data from various sources (DB, File, Storages etc) and analyse them.
You can use Structured as well non-structured Data.
BigQuery is the service you can use for this. You can also use Looker to create graphs, reports on this data.
An organization recently launched a virtual customer support agent, generating vast amounts of text and speech data.
Why should they use a cloud data warehouse to interpret this data?
A. To natively visualize both types of data using a dashboard in real time
B. To ingest and analyze structured and unstructured data at scale, in real time
C. To secure data transmission between cloud and on-premises environments
D. To transform data from structured to unstructured
B. To ingest and analyze structured and unstructured data at scale, in real time
A is Looker, a BI solution
An organization needs to store structured, semi-structured, and unstructured data in its raw, native format in the same repository.
Which cloud data management solution should the organization use?
A. Data field
B. Data lake
C. Database
D. Data warehouse
B. Data lake
You have many terabytes of data residing within BigQuery. Your organization has asked you to analyze the data using machine learning.
Which choice would be ideal for this scenario?
Dataproc
DataFusion
BigQuery ML
Vertex AI
BigQuery ML