Big Data and Machine Learning on Google Cloud Flashcards
Describe Google Cloud infrastructure
Google Cloud infrastructure in terms of three layers:
1) networking and security
2) compute and storage
3) big data and machine learning products
What compute GCP services are available?
- Compute Engine
- Google Kubernetes Engine
- App Engine
- Cloud Functions
- Cloud Run
What is Compute Engine?
- Compute Engine is an IaaS offering, or infrastructure as a service, which provides compute, storage, and network resources virtually that are similar to physical data centers.
- You use the virtual compute and storage resources the same as you manage them locally.
- Compute Engine provides maximum flexibility for those who prefer to manage server instances themselves.
What is Google Kubernetes Engine?
- Google Kubernetes Engine, or GKE GKE runs containerized applications in a cloud environment, as opposed to on an individual virtual machine, like Compute Engine.
- A container represents code packaged up with all its dependencies.
What is App Engine?
- App Engine, a fully managed PaaS offering, or platform as a service.
- PaaS offerings bind code to libraries that provide access to the infrastructure application needs. - This allows more resources to be focused on application logic.
What are Cloud Functions?
- Cloud Functions execute code in response to events, like when a new file is uploaded to Cloud Storage.
- It’s a completely serverless execution environment, which means you don’t need to install any software locally to run the code and you are free from provision and managing servers.
- Cloud Functions is often referred to as functions as a service.
What is Cloud Run?
- It is a fully managed compute platform that enables you to run request or event-driven stateless workloads without having to worry about servers.
- It abstracts away all infrastructure management so you can focus on writing code.
- It automatically scales up and down from zero, so you never have to worry about scale configuration.
- Cloud Run charges you only for the resources you use so you never pay for over provisioned resources.
What is Cloud Run?
- It is a fully managed compute platform that enables you to run request or event-driven stateless workloads without having to worry about servers.
- It abstracts away all infrastructure management so you can focus on writing code.
- It automatically scales up and down from zero, so you never have to worry about scale configuration.
- Cloud Run charges you only for the resources you use so you never pay for over provisioned resources.
What is TPU?
TPUs are Google’s custom-developed application-specific integrated circuits (ASICs) used to accelerate machine learning workloads.
How TPU is different from CPU or GPU?
- TPUs act as domain-specific hardware, as opposed to general-purpose hardware with CPUs and GPUs.
-This allows for higher efficiency by tailoring architecture to meet the computation needs in a domain, such as the matrix multiplication in machine learning
- With TPUs, the computing speed increases more than 200 times.
What are the major differences between cloud computing and desktop computing?
On GCP, for proper scaling capabilities compute and storage are decoupled.
So processing limitations aren’t attached to storage disks.
What fully managed database and storage Services are offered?
- cloud storage
- cloud bigtable
- cloud SQL
- cloud spanner
- firestore
- bigquery
Where is it better to store unstructured data?
cloud storage
What are cloud storage’s primary storage classes?
- standard storage (is considered best for frequently accessed or hot data; it’s also great for data that is stored for only brief periods of time)
- nearline storage (is best for storing infrequently accessed data like reading or modifying data once per month or less on average)
- cold Line storage (is also a low-cost option for storing infrequently accessed data that is meant for reading or modifying data at most once every 90 days)
- archive storage (is the lowest cost option used for data archiving online backup and disaster recovery - once a year)
Describe the road map for storing structural data.