Getting Started with Azure Databricks Flashcards
IaaS
Infrastructure as a Service: Virtualized hardware hosted in the cloud; the subscriber manages the operating system and software that run on it.
PaaS
Platform as a Service: Application platforms that require minimal hardware configuration from the subscriber.
SaaS
Software as a Service: Instant access to software such as Office 365
Azure Account
Identifies who you are.
Azure Subscription
Identifies the resources that have been provisioned to you, for billing purposes.
Resource Group
A container (logical grouping) that holds the resources you create.
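The three cards above (account, subscription, resource group) nest inside one another. A minimal sketch of that hierarchy, assuming plain Python dataclasses rather than any Azure SDK; every class and name below is hypothetical and exists only to illustrate the relationship:

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ResourceGroup:
    """Logical container for the resources you create."""
    name: str
    resources: List[str] = field(default_factory=list)


@dataclass
class Subscription:
    """Billing boundary: resources provisioned under it are billed together."""
    name: str
    resource_groups: Dict[str, ResourceGroup] = field(default_factory=dict)

    def add_resource(self, group: str, resource: str) -> None:
        # Create the resource group on first use, then place the resource in it.
        rg = self.resource_groups.setdefault(group, ResourceGroup(group))
        rg.resources.append(resource)


@dataclass
class Account:
    """Identity: who you are. One account can hold several subscriptions."""
    user: str
    subscriptions: List[Subscription] = field(default_factory=list)


# Hypothetical names, for illustration only.
sub = Subscription("pay-as-you-go")
account = Account("learner@example.com", [sub])
sub.add_resource("rg-databricks-demo", "databricks-workspace")
sub.add_resource("rg-databricks-demo", "storage-account")
```

Reading it top down: the account says who you are, the subscription says who pays, and the resource group says which resources belong together.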
What is a workspace?
A workspace can refer to three different things: the Azure Databricks service, the Databricks web interface, or Databricks file storage.
Virtual Machine
A simulated computer: one physical machine is divided into multiple virtual machines.
Containerization
Packages an application and its dependencies into lightweight, isolated containers, so many application instances can be managed automatically to support scaling.
What is a cluster?
A set of compute resources (virtual machines) that runs Spark applications.
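To make "compute resources" concrete, here is a rough sketch of a cluster definition. The field names follow the shape of a Databricks Clusters REST API create request; the runtime version, VM size, and cluster name are illustrative assumptions, and nothing is sent to any service:

```python
import json


def cluster_spec(name: str, num_workers: int) -> dict:
    """Describe a cluster: one driver node plus `num_workers` worker nodes."""
    if num_workers < 0:
        raise ValueError("num_workers must be non-negative")
    return {
        "cluster_name": name,
        "spark_version": "13.3.x-scala2.12",  # assumed runtime version string
        "node_type_id": "Standard_DS3_v2",    # assumed Azure VM size
        "num_workers": num_workers,
        "autotermination_minutes": 30,        # shut down when idle to save cost
    }


# Hypothetical cluster name, for illustration only.
spec = cluster_spec("flashcard-demo", num_workers=2)
print(json.dumps(spec, indent=2))
```

The worker count and VM size together determine how much compute a Spark application can use.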
What are libraries?
A collection of functions that extends the features of a language.
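A small illustration of the idea, using Python's standard `statistics` library; the sample data is made up. In Databricks, third-party libraries are installed on a cluster so that notebooks can import them in the same way:

```python
import statistics  # a library: ready-made functions the core language lacks

readings = [4, 8, 15, 16, 23, 42]

# Both functions come from the library, not from the language itself.
average = statistics.mean(readings)
middle = statistics.median(readings)

print(average, middle)
```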
What are Notebooks?
Databricks programs that can include runnable code and rich annotations.
What are Jobs?
Definitions of work, such as Databricks notebooks, to run automatically, either on demand or on a schedule.
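As a sketch of what a scheduled job looks like, the function below builds a job definition shaped like a Databricks Jobs REST API create request. The job name, notebook path, and cluster ID are hypothetical placeholders, and nothing is submitted to any service:

```python
def nightly_notebook_job(notebook_path: str, cluster_id: str) -> dict:
    """Describe a job that runs one notebook every day at 02:00 UTC."""
    return {
        "name": "nightly-refresh",  # hypothetical job name
        "tasks": [
            {
                "task_key": "run_notebook",
                "existing_cluster_id": cluster_id,
                "notebook_task": {"notebook_path": notebook_path},
            }
        ],
        "schedule": {
            # Quartz cron fields: second minute hour day-of-month month day-of-week
            "quartz_cron_expression": "0 0 2 * * ?",
            "timezone_id": "UTC",
        },
    }


# Placeholder path and cluster ID, for illustration only.
job = nightly_notebook_job("/Users/learner@example.com/etl", "0000-000000-demo")
```

The `tasks` entry says what to run and where; the `schedule` entry says when.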
Cluster Mode Standard
A Standard cluster is recommended for a single user. It supports Python, SQL, R, and Scala workloads.
Cluster Mode High Concurrency
A High Concurrency cluster is shared among multiple users. It does not support Scala.
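The language difference between the two cluster modes can be captured in a small lookup. This is a study aid under the assumption that Standard mode accepts Python, SQL, R, and Scala while High Concurrency accepts everything except Scala; the table and function are hypothetical, not part of any Databricks API:

```python
# Assumed language support per cluster mode (study aid, not an API).
SUPPORTED_LANGUAGES = {
    "Standard": {"python", "sql", "r", "scala"},
    "High Concurrency": {"python", "sql", "r"},
}


def can_run(mode: str, language: str) -> bool:
    """Return True if the given cluster mode accepts the given language."""
    return language.lower() in SUPPORTED_LANGUAGES[mode]


print(can_run("Standard", "Scala"))
print(can_run("High Concurrency", "Scala"))
```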