2025 - General Knowledge Flashcards
What does the Snowflake architecture separate?
Storage and compute
What is a key benefit of Snowflake’s architecture?
Users can scale up or down as needed and pay for only the resources they use
What does Snowflake use to process queries?
Massive parallel processing (MPP) compute clusters
What functionality does Snowflake support for data sharing?
Real-time secure data sharing
What does MPP stand for?
Massive parallel processing
In MPP, what do multiple processors work on?
Different sections of the same user program
What does the leader node in MPP maintain?
Metadata regarding compute nodes
True or False: In Snowflake, users pay for storage and computation together.
False
Fill in the blank: Snowflake enables users to use and pay for storage and _______ independently.
Computation
What is the purpose of the leader node in MPP?
Communicates with compute nodes for the execution of different parts of the query
Which cloud platforms can run Snowflake?
AWS, Azure, GCP
What are the editions of Snowflake available?
Standard, Enterprise, Business Critical, and VPS.
What is the WebUI called?
Snowsight
What is the storage layer in Snowflake?
The lowest layer where data is physically stored (cloud storage)
Snowflake organizes data as compressed micro-partitions, containing between 50 MB and 500 MB of uncompressed data.
How does Snowflake organize data?
Data is organized in a columnar fashion and stored in compressed micro-partitions
Each micro-partition contains metadata about all rows stored.
What is the cost of storing data in Snowflake for capacity storage?
Approximately $23/TB per month
The cost for on-demand storage is about $40/TB per month.
What is the compute layer responsible for in Snowflake?
This layer is where queries are executed
It uses virtual warehouses to process queries.
What are virtual warehouses in Snowflake?
MPP compute clusters with compute nodes allocated by Snowflake
They act as independent compute clusters with CPU, memory, and temporary storage.
What flexibility does Snowflake provide with warehouses?
Warehouses can be started, stopped, or resized at any time
This flexibility accommodates customers’ computing needs based on operations.
What is the billing model used by Snowflake for running a warehouse?
Per-second billing (consumption-based)
Running a warehouse consumes credit, which increases as warehouse size increases.
What feature does Snowflake offer to limit idle time for warehouses?
Auto-suspend and auto-resume
This feature helps manage costs by limiting idle time.
What are multicluster warehouses in Snowflake?
Additional clusters allocated to improve user performance/concurrence of queries
They help in managing concurrent query execution.
What is the role of the cloud services layer in Snowflake?
Manages end-to-end workflow including authentication, access control, and query optimization
It runs on a Snowflake-managed virtual warehouse.
What are the three layers that make up Snowflake Architecture?
Storage, Compute, Cloud Services