Getting Started with Azure Databricks Flashcards

1
Q

IaaS

A

Infrastructure as a Service: Hardware (compute, storage, networking) hosted in the cloud and managed by the subscriber.

2
Q

PaaS

A

Platform as a Service: Application platforms that require minimal hardware configuration.

3
Q

SaaS

A

Software as a Service: Instant access to software, such as Office 365.

4
Q

Azure Account

A

Identifies who you are.

5
Q

Azure Subscription

A

Identifies the resources that have been provisioned to you, for billing purposes.

6
Q

Resource Group

A

A container (logical grouping) that holds the resources you create.

7
Q

What is a workspace?

A

A workspace can refer to three different things: the Azure Databricks service, the Databricks web interface, or Databricks file storage.

8
Q

Virtual Machine

A

A physical machine divided into multiple simulated machines.

9
Q

Containerization

A

Packages applications into isolated containers so that many application instances can be managed automatically to support scaling.

10
Q

What is a cluster?

A

Provides the compute resources needed to run Spark applications.

11
Q

What are libraries?

A

Collections of functions that extend the features of a language.

12
Q

What are Notebooks?

A

Databricks programs which can include code and rich annotations.

13
Q

What are Jobs?

A

Schedules that determine when Databricks notebooks run automatically.
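
As a rough illustration of the card above, a scheduled notebook job can be described as a settings payload. The sketch below uses field names from the Databricks Jobs API (`schedule`, `quartz_cron_expression`, `notebook_task`); the job name, notebook path, and cluster ID are hypothetical placeholders, and the payload is only built locally, not submitted.

```python
# Sketch of a job definition that runs a notebook on a schedule.
# Field names follow the Databricks Jobs API; all values are illustrative.
job_settings = {
    "name": "daily-flashcard-demo",  # hypothetical job name
    "schedule": {
        # Quartz cron fields: second minute hour day-of-month month day-of-week
        "quartz_cron_expression": "0 0 6 * * ?",  # every day at 06:00
        "timezone_id": "UTC",
    },
    "tasks": [
        {
            "task_key": "run_notebook",
            "notebook_task": {"notebook_path": "/Shared/example_notebook"},  # hypothetical path
            "existing_cluster_id": "<cluster-id>",  # placeholder, fill in your own
        }
    ],
}

def scheduled_notebooks(settings):
    """Return the notebook paths that a job definition would run."""
    return [
        task["notebook_task"]["notebook_path"]
        for task in settings.get("tasks", [])
        if "notebook_task" in task
    ]

print(scheduled_notebooks(job_settings))
```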

14
Q

Cluster Mode Standard

A

A Standard cluster is intended for use by a single user.

15
Q

Cluster Mode High Concurrency

A

Creates a cluster that is shared among multiple users. Does not support Scala.

16
Q

Pool

A

Speeds up cluster creation by reusing Azure virtual machines that are already available.

17
Q

Databricks Runtime Version

A

Determines what programming languages will be supported.

18
Q

Enable Auto Scaling

A

Automatically resizes the cluster based on the estimated number of nodes required to support the submitted job.
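
One way to picture the idea on this card: the estimated node count is kept within a configured minimum and maximum. The function below is a toy sketch under that assumption, not the algorithm Databricks actually uses; `target_workers` and its parameters are hypothetical names.

```python
def target_workers(estimated_nodes, min_workers, max_workers):
    """Clamp an estimated node count to the configured autoscaling range."""
    if min_workers > max_workers:
        raise ValueError("min_workers must not exceed max_workers")
    return max(min_workers, min(estimated_nodes, max_workers))

# A job estimated to need 12 nodes on a cluster limited to 2..8 workers
print(target_workers(12, min_workers=2, max_workers=8))
```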

19
Q

Terminate after x minutes of inactivity

A

Determines how long an inactive cluster remains online before it is automatically shut down to save on compute resources.
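
The inactivity rule on this card amounts to a simple time comparison. The sketch below assumes the cluster's last activity time is known; `should_terminate` is a hypothetical helper for illustration, not part of any Databricks API.

```python
from datetime import datetime, timedelta

def should_terminate(last_activity, now, idle_minutes):
    """Return True once the cluster has been idle for at least idle_minutes."""
    return now - last_activity >= timedelta(minutes=idle_minutes)

last_activity = datetime(2024, 1, 1, 12, 0)
# 45 minutes idle with a 30-minute limit: the cluster would be shut down
print(should_terminate(last_activity, datetime(2024, 1, 1, 12, 45), idle_minutes=30))
```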

20
Q

Worker type

A

This is set based on the needs of a given workload. GPU-based hardware is available for intensive tasks.

21
Q

Driver Type

A

This is the hardware configuration for the driver.
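
To tie the cluster setting cards together, here is a minimal sketch of how those options might appear in one cluster specification. The field names follow the Databricks Clusters API (`spark_version`, `node_type_id`, `driver_node_type_id`, `autoscale`, `autotermination_minutes`); every value, the cluster name, and the `check_spec` helper are assumptions chosen for illustration.

```python
import json

# Field names follow the Databricks Clusters API; all values are illustrative.
cluster_spec = {
    "cluster_name": "flashcard-demo",                   # hypothetical name
    "spark_version": "13.3.x-scala2.12",                # Databricks Runtime Version
    "node_type_id": "Standard_DS3_v2",                  # Worker type
    "driver_node_type_id": "Standard_DS3_v2",           # Driver type
    "autoscale": {"min_workers": 2, "max_workers": 8},  # Enable Auto Scaling
    "autotermination_minutes": 30,                      # Terminate after x minutes of inactivity
}

def check_spec(spec):
    """Return a list of problems found in a cluster spec (empty if none)."""
    problems = []
    scale = spec.get("autoscale")
    if scale and scale["min_workers"] > scale["max_workers"]:
        problems.append("min_workers exceeds max_workers")
    if spec.get("autotermination_minutes", 0) < 0:
        problems.append("autotermination_minutes must not be negative")
    return problems

print(json.dumps(cluster_spec, indent=2))
print(check_spec(cluster_spec))
```

When a cluster draws its machines from a pool, the API takes an `instance_pool_id` in place of the node type fields; it is left out here to keep the sketch small.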