General Architecture Flashcards

1
Q

Snowflake’s architecture is a hybrid of which two architectures?

A

Traditional shared-disk and shared-nothing database architectures

2
Q

What does MPP stand for?

A

Massively parallel processing

3
Q

How does Snowflake process queries?

A

With virtual warehouses. Each virtual warehouse is an MPP compute cluster in which each node stores a portion of the entire data set locally.
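
The MPP idea in this answer can be illustrated with a toy simulation. This is only a sketch, not how Snowflake is implemented: the names `partition`, `node_partial_sum`, and `mpp_sum` are hypothetical, threads stand in for compute nodes, and the salary values are made-up sample data.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(rows, node_count):
    """Split rows round-robin so each simulated node holds only a portion."""
    return [rows[i::node_count] for i in range(node_count)]


def node_partial_sum(local_rows):
    """Work done by one node: aggregate only its locally held slice."""
    return sum(local_rows)


def mpp_sum(rows, node_count=4):
    """Run the per-node work in parallel, then combine the partial results."""
    slices = partition(rows, node_count)
    with ThreadPoolExecutor(max_workers=node_count) as pool:
        partials = list(pool.map(node_partial_sum, slices))
    return sum(partials)


# Made-up sample data, spread over three simulated nodes.
salaries = [50_000, 62_000, 71_500, 48_000, 90_000, 55_250]
total = mpp_sum(salaries, node_count=3)
```

Each simulated node sees only its own slice, and the final answer comes from combining the partial results, which is the essence of massively parallel processing.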

4
Q

What are the three key layers of Snowflake’s architecture?

A
  • Database Storage
  • Query Processing
  • Cloud Services
5
Q

How is data stored in cloud storage?

A

In a central data repository, where Snowflake reorganizes the data into its internally optimized, compressed, columnar format. The data is accessible from all compute nodes in the platform.

6
Q

Are data objects stored by Snowflake visible to the customer?

A

No. The data objects stored by Snowflake are neither directly visible nor directly accessible to customers; they are accessible only through SQL query operations run using Snowflake.

7
Q

What does each virtual warehouse consist of?

A

An MPP compute cluster composed of multiple compute nodes that Snowflake allocates from a cloud provider.

8
Q

Which services are managed in the cloud services layer?

A

Authentication
Infrastructure management
Metadata management
Query parsing and optimization
Access control

9
Q

In what ways can you connect to Snowflake?

A

Web-based user interface

Command-line clients (e.g., SnowSQL)

ODBC and JDBC drivers that other applications can use

Native connectors (e.g., Python, Spark)

Third-party connectors that link applications such as ETL tools and BI tools to Snowflake

10
Q

Which cloud platforms does Snowflake support?

A

Azure, AWS, GCP

11
Q

On each cloud platform, differences in unit costs for credits and data storage are calculated by what?

A

Region

12
Q

Snowflake supports loading data from files staged in which locations, regardless of the cloud platform of the Snowflake account?

A

Internal Snowflake stages
Amazon S3
Google Cloud Storage
Microsoft Azure Blob Storage

13
Q

Snowflake supports unloading data into files staged in which locations, regardless of the cloud platform of the Snowflake account?

A

Internal Snowflake stages
Amazon S3
Google Cloud Storage
Microsoft Azure Blob Storage

14
Q

What types of data loading does Snowflake support?

A

Bulk data loading and continuous data loading

15
Q

T/F: Each virtual warehouse is independent and has no impact on the performance of other virtual warehouses

A

True

16
Q

You have created a table in Snowflake as shown below:

CREATE TABLE EMPLOYEE(EMPLOYEE_NAME STRING, SALARY NUMBER);

When you do a DESCRIBE TABLE EMPLOYEE, what will you see as the data type of EMPLOYEE_NAME?

A

VARCHAR(16777216)
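
The reasoning behind this answer is that STRING is a synonym for VARCHAR, and a VARCHAR declared without a length takes the default maximum length the card cites. The sketch below encodes that mapping as a hand-written lookup for study purposes. It is not a Snowflake API: `described_type` and `RESOLVED_TYPES` are hypothetical names, and the NUMBER entry assumes Snowflake's default precision and scale of (38,0).

```python
# Hand-written lookup for illustration only; this does not query Snowflake.
DEFAULT_VARCHAR_LENGTH = 16_777_216  # the length shown in the card's answer

RESOLVED_TYPES = {
    # STRING and TEXT are synonyms for VARCHAR.
    "STRING": f"VARCHAR({DEFAULT_VARCHAR_LENGTH})",
    "TEXT": f"VARCHAR({DEFAULT_VARCHAR_LENGTH})",
    "VARCHAR": f"VARCHAR({DEFAULT_VARCHAR_LENGTH})",
    # Assumption: NUMBER without arguments defaults to precision 38, scale 0.
    "NUMBER": "NUMBER(38,0)",
}


def described_type(declared):
    """Return the type this lookup expects DESCRIBE TABLE to report."""
    key = declared.strip().upper()
    # Types not in the lookup are returned unchanged (upper-cased).
    return RESOLVED_TYPES.get(key, key)


employee_name_type = described_type("STRING")
salary_type = described_type("NUMBER")
```

Under these assumptions, `employee_name_type` matches the card's answer for EMPLOYEE_NAME, and `salary_type` gives the corresponding expectation for the SALARY column.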