Snowflake Flashcards

1
Q

What are the key features of Snowflake?

A

Pure SaaS
Relational
Semi-Structured
Elastic
Highly Available
Durable
Cost-efficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does Pure SaaS mean in Snowflake?

A

No maintaining machines, Database Administering or installing / upgrading software

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does Relational mean in Snowflake?

A

It supports SQL and ACID transactions allowing for switch from standard relational DB like PostgreSQL without big adjustments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does Semi-structured mean in Snowflake?

A
  1. There are built in function to flatten, traverse and nest the semi-structured data
  2. The Schemas can be automatically discovered thanks to automatic discovery allowing to make the operations on the data almost as quickly as on relational data without any user effort.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does Elastic mean in Snowflake?

A

Simple shared-nothing structure, where storage and compute are decoupled. Allowing for seamless and independent scaling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does highly available mean in Snowflake?

A

It tolerates node, cluster and even full data center failure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does Durable mean in Snowflake?

A

Extra safeguards like cloning, undrop and cross-region backups are in place.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does Cost-efficient mean in Snowflake?

A

Pay only for what u use. Additionally compressed data and efficient compute allow for even bigger savings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the drawbacks of Snowflake design?

A

Heterogeneous Workload
Membership Changes
Online Upgrade

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is Heterogeneous Workload a drawback in snowflake?

A

Snowflake is configured best for high I/O band-width light compute. Which can be very slow for complex queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is Membership change a drawback in Snowflake?

A

When moving data or changing the number of nodes there is very expensive operation of reshuffling and as the same nodes are in charge of it as to process any incoming requests it may affect performance and elasticity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is Online Upgrade a drawback in Snowflake?

A

Similar as with Membership Change, however it cannot be mitigated by replication as eventually every node will need to receive update and as in theory it is possible to upgrade one node after another in Praxis it’s damn hard as everything is coupled and homogeneous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does the compute and storage look like?

A

Compute is a proprietary shared-nothing engine.
Storage is typically provided by Amazon S3 Bucket but could be any blob storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a big thing about the compute engine in Snowflake?

A

Local data storage - every node has a hot SSD local cache (for temp data only) that after loading with the parts of data it is responsible for can exceed the performance of a pure shared-nothing architecture.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly