8_Cloud Datalab Flashcards

1
Q

Cloud Datalab Overview

  • Interactive tool for exploring and visualizing data:
    • Notebook format
    • Great for data engineering, machine learning
  • Built on Jupyter
  • Visual analysis of data in BigQuery, ML Engine, Compute Engine, Cloud Storage and Stackdriver
  • Supports Python, SQL and Javascript
  • Runs on GCE instance, dedicated VPC and Cloud Source Repository
  • Cost: free - only pay for GCE resources Datalab runs on and other Google Cloud services you interact with
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How it works

  • Manages Datalab instance lifecycle
  • Create Datalab VMs in seconds
  • Notebooks stored in GCR (Google Cloud Repository) on persistent disk
  • Storage can persist after the instance is destroyed
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sharing notebook data

  • GCE access based on GCE IAM roles
  • Must have Compute instance Admin and Service Account Actor roles
  • Notebook access per user only
  • Sharing data performed via shared Cloud Source Repository
  • Sharing at the project level
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly