8_Cloud Datalab Flashcards
1
Q
Cloud Datalab Overview
- Interactive tool for exploring and visualizing data:
- Notebook format
- Great for data engineering, machine learning
- Built on Jupyter
- Visual analysis of data in BigQuery, ML Engine, Compute Engine, Cloud Storage and Stackdriver
- Supports Python, SQL and Javascript
- Runs on GCE instance, dedicated VPC and Cloud Source Repository
- Cost: free - only pay for GCE resources Datalab runs on and other Google Cloud services you interact with
A
2
Q
How it works
- Manages Datalab instance lifecycle
- Create Datalab VMs in seconds
- Notebooks stored in GCR (Google Cloud Repository) on persistent disk
- Storage can persist after the instance is destroyed
A
3
Q
Sharing notebook data
- GCE access based on GCE IAM roles
- Must have Compute instance Admin and Service Account Actor roles
- Notebook access per user only
- Sharing data performed via shared Cloud Source Repository
- Sharing at the project level
A