General Architecture Flashcards
Snowflake’s architecture is a hybrid of what two architectures
Traditional shared disk and share nothing database
What does MPP stand for
Massively parallel processing
How does snowflake process queries
With virtual warehouses. Each virtual warehouse uses mpp compute clusters where each node in the cluster stores a portion of the entire dataset locally
What are the three key layers of Snowflake’s architecture
- Database Storage
- Query Processing
- Cloud Services
How is data stored in cloud storage
A central data repo where data is internal optimized, compressed, and columnar format. Data is accessible by all compute nodes on the platform.
Are data objects stored by Snowflake visible by the customer
The data objects stored by Snowflake are not directly visible nor accessible by customers; they are only accessible through SQL query operations run using Snowflake
Each warehouse consists of what
It is a MPP compute cluster, composed of multiple compute nodes allocated by Snowflake from a cloud provider.
What services are managed in the cloud services layer
AuthN
Infrastructure mgmt
Metadata mgmt
query parsing and optimizing
Access Control
What ways can you connect to Snowflake
Web based user interface
Command Line Interface
ODBC and JDBC drivers that can be used by other applications
Native Connectors like python and spark
Third Party connectors that can connect apps like ETL tools and BI Tools
Supported cloud platforms
Azure, AWS, GCP
Differences in unit costs for credits and data storage are calculated by what on each cloud platform
Region
Snowflake supports loading data from files staged in what location, regardless of the platform of the snowflake account
Internal snowflake stages
Amazon s3
Google Cloud Storage
Microsoft Azure blob storage
Snowflake supports unloading data from files staged in what location, regardless of the platform of the snowflake account
Internal snowflake stages
Amazon s3
Google Cloud Storage
Microsoft Azure blob storage
What type of loading does snowflake support
Bulk data loading and continuous data loading
T/F Each virtual warehouse is independent and has no impact on other virtual warehouses
True