snowflake cert pro Flashcards

1
Q

Does Automatic Clustering block DML statements issued against tables while they are being reclustered?

A

FALSE

2
Q

Can you suspend/resume Automatic Clustering for a clustered table?

A

TRUE, using ALTER TABLE … SUSPEND/RESUME RECLUSTER

3
Q

What is Automatic Clustering?

A

Automatic Clustering is the Snowflake service that seamlessly and continually manages all reclustering, as needed, of clustered tables.

4
Q

How can you drop a clustering key?

A

ALTER TABLE <name> DROP CLUSTERING KEY;

5
Q

NOTE - When adding a clustering key to a table already populated with data, not all expressions are allowed in the key. You can check whether a specific function is supported using SHOW FUNCTIONS; the output includes a “valid_for_clustering” column at the end.

6
Q

How can you alter a cluster key?

A

ALTER TABLE <name> CLUSTER BY (<expr1> [, <expr2> ...]);

7
Q

What can a cluster key contain?

A
  • Base columns.
  • Expressions on base columns.
  • Expressions on paths in VARIANT columns.
8
Q

How can you define a clustering key on a table?

A

CREATE TABLE <name> ( … ) CLUSTER BY (<expr1> [, <expr2> ...]);

9
Q

What are the inputs to SYSTEM$CLUSTERING_INFORMATION?

A
  • If a table has an explicit key, the function doesn’t require any input arguments other than the name of the table.
  • If a table doesn’t have an explicit cluster key (or a table has a clustering key, but you want to calculate the ratio on other columns in the table), the function takes the desired column(s) as an additional input argument.
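
For example, both call forms (a minimal sketch; table and column names are hypothetical):

SELECT SYSTEM$CLUSTERING_INFORMATION('mytable');
SELECT SYSTEM$CLUSTERING_INFORMATION('mytable', '(col1, col2)');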
10
Q

When are original micro-partitions purged?

A

After both the Time Travel retention period and subsequent Fail-safe period have passed.

11
Q

Does reclustering result in storage cost?

A

TRUE

12
Q

The number of credits used during reclustering depends on what?

A
  • Size of the table.
  • Amount of data that needs to be clustered.
13
Q

NOTE - In some cases, clustering on columns used in GROUP BY or ORDER BY clauses can be helpful. However, clustering on these columns is usually less helpful than clustering on columns that are heavily used in filter or JOIN operations.

14
Q

NOTE - If you are defining a multi-column clustering key for a table, the order in which columns are specified in the CLUSTER BY is important. As a general rule, Snowflake recommends ordering the columns from lowest cardinality to highest cardinality.

15
Q

NOTE - In general, if a column (or expression) has higher cardinality, then maintaining clustering on that column is more expensive. For example, if a fact table has a TIMESTAMP column containing many discrete values, then a clustering key could be defined on the column by casting the values to dates instead of timestamps. This would reduce cardinality.

16
Q

NOTE - A single clustering key can contain one or more columns or expressions. For most tables, Snowflake recommends a max of 3 or 4 columns (or expressions) per key. Adding more than 3-4 columns tends to increase cost more than benefits.

17
Q

What criteria indicate that a table is a good candidate for a clustering key?

A
  • The table contains a large number of micro-partitions. Typically, this means that the table contains multiple TB of data.
  • The queries can take advantage of clustering. Typically, this means that one or both are true:
    • The queries are selective. In other words, the queries need to read only a small percentage of rows in the table.
    • The queries sort the data.
  • A high percentage of the queries can benefit from the same clustering key(s). In other words, many/most queries select on, or sort on, the same few column(s).
18
Q

What does Snowflake recommend when prioritizing keys?

A
  1. Cluster columns that are most actively used in selective filters. For many fact tables involved in date-based queries, choosing the date columns is a good fit. For event tables, event type might be a good choice, if there are a large number of different event types.
  2. If there is room for additional cluster keys, then consider columns frequently used in JOIN predicates.
19
Q

NOTE - Queries benefit from clustering when the queries filter or sort on the clustering key for the table. Sorting is commonly done for ORDER BY operations, for GROUP BY operations, and for some joins. The more frequently a table is queried, the more benefit clustering provides. However, the more frequently a table changes, the more expensive it will be to keep it clustered. Clustering is generally most cost-effective for tables that are queried frequently and do not change frequently.

20
Q

Does maintaining clustering keys consume credits?

A

The compute resources used to perform clustering consume credits.

21
Q

Can clustering keys be defined on hybrid tables?

A

Clustering keys cannot be defined for hybrid tables. In hybrid tables, data is always ordered by primary key.

22
Q

Which indicators can be used to determine if a table should be clustered?

A
  • Queries on the table are running slower than expected or have noticeably degraded over time.
  • The clustering depth for the table is large.
23
Q

What is a clustering key?

A

A clustering key is a subset of columns in a table (or expressions on a table) that are explicitly designated to co-locate the data in the table in the same micro-partitions.

24
Q

NOTE - Clustering keys are not intended for all tables, due to the costs of initially clustering the data and maintaining the clustering.

25
Q

Which system functions can you use to view/monitor the clustering metadata for a table?

A
  • SYSTEM$CLUSTERING_DEPTH
  • SYSTEM$CLUSTERING_INFORMATION (includes clustering depth)
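
A minimal sketch of each (hypothetical table and column names):

SELECT SYSTEM$CLUSTERING_DEPTH('mytable', '(col1)');
SELECT SYSTEM$CLUSTERING_INFORMATION('mytable');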
26
Q

What are the purposes for clustering depth?

A
  • Monitoring the clustering “health” of a large table, particularly over time as DML is performed on the table.
  • Determining whether a large table would benefit from defining a clustering key.
27
Q

What is clustering depth?

A

The clustering depth for a populated table measures the average depth (1 or greater) of the overlapping micro-partitions for specified columns in a table. The smaller the average depth, the better clustered the table is with regard to the specified columns.

28
Q

What clustering information is maintained for micro-partitions?

A
  • The total number of micro-partitions that comprise the table.
  • The number of micro-partitions containing values that overlap with each other.
  • The depth of the overlapping micro-partitions.
29
Q

What is query pruning?

A

In Snowflake, pruning refers to the process of using micro-partition metadata to skip (prune) micro-partitions, and columns within them, that are not needed to answer a query, narrowing the scope of the data scanned.

30
Q

What are the benefits of micro-partitions?

A
  • In contrast to traditional static partitioning, Snowflake micro-partitions are derived automatically; no maintenance needed by users.
  • Small (50 MB - 500 MB) - enable fast DML commands.
  • Micro-partitions can overlap in their range of values, which combined with their small size, help prevent skew.
  • Columns are stored independently. This enables efficient scanning; only the columns referenced by a query are scanned.
  • Columns are also compressed individually within micro-partitions.
31
Q

When automatically partitioned, what is used to partition?

A

Tables are transparently partitioned using the order of the data as it is inserted/loaded.

32
Q

What metadata is stored about all rows in a micro-partitions?

A
  • The range of values for each of the columns in the micro-partitions.
  • The number of distinct values.
  • Additional properties used for both optimization and efficient query processing.
33
Q

What is a micro-partition?

A

All data in Snowflake tables is automatically divided into micro-partitions. Each contains between 50 MB and 500 MB of uncompressed data. Groups of rows in tables are mapped to individual micro-partitions, organized in a columnar fashion.

34
Q

Which SQL commands are supported by Query Acceleration Service?

A
  • SELECT
  • INSERT
  • CREATE TABLE AS SELECT
  • COPY INTO <table>
34
Q

By default, what does a multi-cluster warehouse consist of?

A

A single cluster.

34
Q

Which are the services managed by the cloud services layer?

A
  • Authentication
  • Infrastructure Management
  • Metadata Management
  • Query Parsing and Optimization
  • Access Control
34
Q

What is the default warehouse for notebooks?

A

A dedicated Snowflake-managed warehouse named SYSTEM$STREAMLIT_NOTEBOOK_WH is automatically provisioned in each account for running Notebooks. This warehouse is owned and managed by Snowflake under the SYSTEM role. You cannot DROP or ALTER it.

34
Q

Which are the scaling policies available when a multi-cluster warehouse is running in Auto-scale mode?

A
  • Standard (default)
  • Economy
34
Q

What is the Standard scaling policy?

A

Prevents/minimizes queuing by favoring starting additional clusters over conserving credits.

34
Q

Can Snowflake be run on private cloud infrastructure (on-premises or hosted)?

A

FALSE

34
Q

What determines the number of queries a warehouse can concurrently process?

A

The size and complexity of each query.

35
Q

A multi-cluster warehouse is defined by specifying which size properties?

A
  • Maximum number of clusters, greater than 1 (up to 10).
  • Minimum number of clusters, equal to or less than max (up to 10).
35
Q

Which properties can be set when defining a multi-cluster warehouse?

A
  • Specifying a warehouse size.
  • Resizing a warehouse at any time.
  • Auto-suspending a running warehouse due to inactivity; this does not apply to individual clusters, but rather to the entire multi-cluster warehouse.
  • Auto-resuming a suspended warehouse when new queries are submitted.
36
Q

What are the modes a multi-cluster warehouse can run in?

A
  • Maximized
  • Auto-scale
37
Q

How is data reorganized when loaded into Snowflake?

A
  • Compressed.
  • Columnar format.
38
Q

What are the object-level parameters that Snowflake provides to help control query processing and concurrency?

A
  • STATEMENT_QUEUED_TIMEOUT_IN_SECONDS
  • STATEMENT_TIMEOUT_IN_SECONDS
39
Q

NOTE - If queries are queuing more than desired, another warehouse can be created and queries can be manually redirected to the new warehouse. In addition, resizing a warehouse can enable limited scaling for query concurrency and queuing; however, warehouse resizing is primarily intended for query performance.

40
Q

What are the properties of a warehouse running in Auto-scale mode?

A

This mode is enabled by specifying different values for the maximum and minimum number of clusters.

41
Q

What is the Economy scaling policy?

A

Conserves credits by favoring keeping running clusters fully-loaded rather than starting additional clusters, which may result in queries being queued and taking longer to complete.

42
Q

How can you identify warehouses that might benefit from the Query Acceleration Service?

A

You can query the QUERY_ACCELERATION_ELIGIBLE view. You can also use the SYSTEM$ESTIMATE_QUERY_ACCELERATION function to assess whether a specific query is eligible for acceleration.
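
A sketch of both approaches; the query ID is a placeholder, and the column list assumes the Account Usage QUERY_ACCELERATION_ELIGIBLE view:

SELECT SYSTEM$ESTIMATE_QUERY_ACCELERATION('<query_id>');

SELECT query_id, eligible_query_acceleration_time
FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
ORDER BY eligible_query_acceleration_time DESC;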

43
Q

What is Snowflake’s architecture a combo of?

A

A hybrid of traditional shared-disk and shared-nothing architectures.

44
Q

How is multi-cluster billing calculated?

A
  • Warehouse size.
  • The number of clusters that run in that period.
45
Q

What is available to control the usage of credits in Auto-scale mode?

A

Snowflake provides a property, SCALING_POLICY, that determines the scaling policy to use when automatically starting or shutting down additional clusters.

46
Q

The amount of compute resources in each cluster is determined by what?

A

Warehouse size.

47
Q

Does increasing the size of a warehouse always improve data loading performance?

A

Increasing the size of a warehouse does not always improve data loading performance. Data loading performance is influenced more by the number of files being loaded (and the size of each file) than the size of the warehouse.

48
Q

NOTE - To enable fully automated scaling for concurrency, Snowflake recommends multi-cluster warehouses.

49
Q

What is a virtual warehouse?

A

A cluster of compute resources.

50
Q

What are the three key layers in Snowflake?

A
  • Cloud Services (Service)
  • Query Processing (Compute)
  • Database Storage (Storage)
51
Q

What are the two types of virtual warehouses?

A
  • Standard
  • Snowpark-optimized
52
Q

NOTE - Unless you are bulk loading a large number of files concurrently (i.e. hundreds or thousands of files), a smaller warehouse (S, M, L) is generally sufficient.

53
Q

Can warehouses be started at any time?

A

TRUE

54
Q

What does Snowflake use for persisted data?

A

A central data repository for persisted data that is accessible from all compute nodes. But, similar to shared-nothing architectures, Snowflake processes queries using MPP (massively parallel processing) compute clusters, where each node in the cluster stores a portion of the entire data set locally.

55
Q

What are the warehouse size options?

A
  • x-small
  • small
  • medium
  • large
  • x-large
  • 2x-large
  • 3x-large
  • 4x-large
  • 5x-large
  • 6x-large
56
Q

What are the keys to using warehouses effectively and efficiently?

A
  • Experiment with different types of queries and different warehouse sizes to determine the combo that best meets your query needs and workload.
  • Don’t focus on warehouse size; Snowflake utilizes per-second billing, so you can run larger warehouses and simply suspend them when not in use.
57
Q

Which SQL command can be used to create a multi-cluster warehouse?

A

Execute a CREATE WAREHOUSE command with:
- MAX_CLUSTER_COUNT
- MIN_CLUSTER_COUNT
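
A minimal sketch (hypothetical warehouse name):

CREATE WAREHOUSE my_mc_wh WITH
WAREHOUSE_SIZE = 'MEDIUM'
MIN_CLUSTER_COUNT = 1
MAX_CLUSTER_COUNT = 3;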

58
Q

Which command can be used to view warehouses including multi-cluster?

A

SHOW WAREHOUSES

59
Q

How are credits charged for warehouses?

A
  • Warehouse size.
  • Number of clusters.
  • The length of time the compute resources in each cluster run.
60
Q

How does warehouse caching impact queries?

A

Each warehouse, when running, maintains a cache of table data accessed as queries are processed by the warehouse. The larger the warehouse the larger the cache. The cache is dropped when the warehouse is suspended, which might result in slower initial performance. Tradeoff between suspending warehouse and speed.

61
Q

How do you select an initial warehouse size?

A
  • For data loading, the warehouse size should match the number of files being loaded and the amount of data in each file.
  • For queries in small-scale testing environments, smaller warehouse sizes may be sufficient.
  • For queries in larger-scale production environments, larger warehouse sizes may be more cost-effective.
62
Q

What are the effects of resizing a suspended warehouse?

A

Resizing a suspended warehouse does not provision any new compute resources for the warehouse.

63
Q

Can Snowflake be packaged and installed?

A

FALSE

64
Q

What is the SNOWFLAKE database?

A

Snowflake provides a system-defined, read-only shared database named SNOWFLAKE that contains metadata and historical usage data about the objects in your organization and accounts.

65
Q

When can a warehouse be resized?

A

They can be resized at any time, even while running, to accommodate the need for more or less compute resources, based on the type of operations being performed by the warehouse.

66
Q

What is the Snowflake data lifecycle?

A

Organizing Data
(DDL) CREATE/ALTER DATABASE
(DDL) CREATE/ALTER SCHEMA
(DDL) CREATE/ALTER TABLE
Storing Data
(DML) INSERT INTO <table>
Querying Data
(DQL) SELECT … FROM <table>
Working with Data
(DML) UPDATE <table>
(DML) MERGE INTO <table>
(DML) DELETE FROM <table>
(DDL) CREATE TABLE/SCHEMA/DATABASE … CLONE
Removing Data
(DDL) TRUNCATE TABLE
(DDL) DROP TABLE
(DDL) DROP SCHEMA
(DDL) DROP DATABASE

67
Q

What are the four sections in Snowsight?

A
  • Navigation menu
  • Search
  • Quick actions
  • Recently viewed
68
Q

What would be the credit consumption if a 3x-Large multi-cluster warehouse runs 1 cluster for one full hour and then runs 2 clusters for the next full hour?

A

The total number of credits would be 192. A 3X-Large cluster consumes 64 credits per hour, so one cluster for the first hour (64) plus two clusters for the second hour (128) totals 192.

69
Q

What are the properties of a warehouse running in Maximized mode?

A

This mode is enabled by specifying the same value for both the maximum and minimum number of clusters. This mode is effective for statically controlling the available compute resources, particularly if you have a large number of concurrent user sessions and/or queries and the numbers don’t fluctuate significantly.

70
Q

What is the cloud services layer in Snowflake?

A

A collection of services that coordinate activities across Snowflake.

71
Q

What does it mean that Snowflake is a truly self-managed service?

A
  • There is no hardware (virtual or physical) to select, install, configure, or manage.
  • There is virtually no software to install, configure, or manage.
  • Ongoing maintenance, management, upgrades and tuning are handled by Snowflake.
72
Q

Can Snowflake clients (SnowSQL, JDBC driver, ODBC driver, Python connector, etc.) have a default warehouse?

A

TRUE

73
Q

How does auto-suspend and auto-resume apply to multi-cluster warehouses?

A
  • Auto-suspend only occurs when the minimum number of clusters is running and there is no activity for the specified period of time.
  • Auto-resume only applies when the entire warehouse is suspended (i.e. no clusters are running).
74
Q

Can the size of a warehouse impact query processing?

A

The size of a warehouse can impact the amount of time required to execute queries.

75
Q

Can a default warehouse be set per user?

A

TRUE

76
Q

What does Snowflake use to process queries?

A

Virtual warehouses.

77
Q

Can a warehouse be automatically suspended and resumed?

A

TRUE

78
Q

Which are the ways to connect to Snowflake?

A
  • Web-based UI.
  • Command-line clients (SnowSQL).
  • ODBC and JDBC drivers.
  • Native connectors.
79
Q

When a session is initiated in Snowflake, does the session have a default warehouse?

A

FALSE

80
Q

What are the schemas in the SNOWFLAKE database?

A
  • ACCOUNT_USAGE
  • ALERT
  • CORE
  • DATA_PRIVACY
  • DATA_SHARING_USAGE
  • INFORMATION_SCHEMA
  • LOCAL
  • ML
  • MONITORING
  • NOTIFICATION
  • ORGANIZATION_USAGE
  • READER_ACCOUNT_USAGE
  • TELEMETRY
81
Q

What type of billing does Snowflake use?

A

Per-second billing.

82
Q

How is query load calculated?

A

Query load is calculated by dividing the execution time (in seconds) of all queries in an interval by the total time (in seconds) for the interval.
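
For example, if the queries running during a 5-minute (300-second) interval accumulate 150 seconds of execution time, the query load for that interval is 150 / 300 = 0.5.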

83
Q

What are the configuration options for a Snowpark-optimized warehouse?

A

Memory (up to) | CPU architecture | Min warehouse size required
16 GB          | Default or x86   | XS
256 GB         | Default or x86   | M
1 TB           | Default or x86   | L

84
Q

When should you use Snowpark-optimized warehouses?

A

Recommended for running code and workloads that have large memory requirements or dependencies on a specific CPU architecture. Example workloads include ML training use cases.

85
Q

What does Snowpark-optimized warehouses let you configure?

A

The available memory resources and CPU architecture on a single-node instance.

86
Q

How can you define a Snowpark-optimized warehouse in SQL?

A

CREATE OR REPLACE WAREHOUSE snowpark_opt_wh WITH
WAREHOUSE_SIZE = 'LARGE'
WAREHOUSE_TYPE = 'SNOWPARK_OPTIMIZED'
RESOURCE_CONSTRAINT = 'MEMORY_16X_X86'; // optional

87
Q

Which view can you use to evaluate warehouse performance?

A

QUERY_HISTORY

88
Q

What is the Query Acceleration Service?

A

QAS can accelerate parts of the query workload in a warehouse. When it is enabled, it can improve overall warehouse performance by reducing the impact of outlier queries, which are queries that use more resources than the typical query. It does this by offloading portions of the query processing work to shared compute resources provided by the service.

89
Q

How can you enable Query Acceleration Service for a warehouse?

A

CREATE WAREHOUSE my_wh WITH
ENABLE_QUERY_ACCELERATION = true;

90
Q

What are the common reasons that queries are ineligible for Query Acceleration Service?

A
  • There are not enough partitions in the scan.
  • Even if a query has a filter, the filter may not be selective enough. Alternatively, if the query has an aggregation with GROUP BY, the cardinality of the GROUP BY might be too high for eligibility.
  • The query includes a LIMIT clause but does not have an ORDER BY clause.
  • The query includes functions that return nondeterministic results (e.g. RANDOM).
91
Q

Which queries are eligible for the Query Acceleration Service?

A

In general, queries are eligible because they have a portion of the query plan that can be run in parallel using QAS compute resources:
- Large scans with an aggregation or selective filter.
- Large scans that insert many new rows.

92
Q

A warehouse provides the required resources to perform what operations?

A
  • Execute SQL SELECT.
  • Perform DML operations, such as:
    • Updating rows
    • Loading data
    • Unloading data
93
Q

NOTE - In most cases, no tasks are required to enable Automatic Clustering for a table. You simply define a clustering key for the table. However, the rule does not apply to tables created by cloning (CREATE TABLE … CLONE …) from a source table that has clustering keys. The new table starts with Automatic Clustering suspended.

94
Q

Which privileges do you need on the schema and database to add clustering to a table?

A
  • USAGE
  • OWNERSHIP
95
Q

How can you determine if Automatic Clustering is enabled for a table?

A
  • SHOW TABLES command.
  • TABLES view (in the Snowflake Information Schema).
  • TABLES view (in the Account Usage shared database).
96
Q

How can you SUSPEND/RESUME reclustering?

A

ALTER TABLE <name> SUSPEND/RESUME RECLUSTER;

97
Q

What can Automatic Clustering costs be broken down into?

A
  • Compute costs
  • Storage costs
98
Q

Does Automatic Clustering require you to provide a virtual warehouse?

A

FALSE - Automatic Clustering is a serverless service; Snowflake manages the compute resources that recluster the table.

99
Q

Which function can you use to help estimate the compute cost of enabling Automatic Clustering for a table and maintaining the table in a well-clustered state?

A

SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS
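
A sketch (hypothetical table and clustering key):

SELECT SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS('mytable', '(date_col)');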

100
Q

What is the AUTOMATIC_CLUSTERING_HISTORY function used for?

A

This table function is used for querying the Automatic Clustering history for given tables within a specified date range. The information returned by the function includes the credits consumed, bytes updated, and rows updated each time a table is reclustered.
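
A sketch querying the last 7 days for one table (hypothetical names):

SELECT *
FROM TABLE(INFORMATION_SCHEMA.AUTOMATIC_CLUSTERING_HISTORY(
DATE_RANGE_START => DATEADD('day', -7, CURRENT_DATE()),
TABLE_NAME => 'MYDB.MYSCHEMA.MYTABLE'));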

101
Q

What is a stage?

A

A stage specifies where data files are stored so that the data in the files can be loaded into a table.

102
Q

Which types of internal stages are supported?

A
  • User
  • Table
  • Named
103
Q

Is each user and table in Snowflake automatically allocated an internal stage?

A

TRUE - by default, each user has a user stage (@~) and each table has a table stage (@%<table>) allocated to it.

104
Q

What steps are required in the data loading process that involve file staging information?

A
  1. You must specify an internal stage in the PUT command when uploading files.
  2. You must specify the same stage in the COPY INTO <table> command when loading data into a table from the stage files.
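
A minimal two-step sketch of the flow above (hypothetical stage, file, and table names):

PUT file:///data/data.csv @my_stage;
COPY INTO mytable FROM @my_stage;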
105
Q

What is a user stage?

A

Each user has a stage allocated to them by default for storing files. This stage is a convenient option if your files will only be accessed by a single user but need to be copied into multiple tables.

106
Q

What are the characteristics and limitations of user stages?

A
  • User stages are referenced by using “@~”
  • Unlike named stages, user stages cannot be altered or dropped.
  • User stages do not support setting file format options. Instead, you must specify file format and copy options as part of the COPY INTO <table> command.
107
Q

When are user stages not appropriate?

A
  • Multiple users require access to the files.
  • The current user does not have INSERT privileges on the tables the data will be loaded into.
108
Q

What is a table stage?

A

By default, each table has a Snowflake stage allocated to it for storing files.

109
Q

When would you use a table stage?

A

You might use a table stage if you only need to copy files into a single table, but want to make the files accessible to multiple users.

110
Q

What are the characteristics and limitations of table stages?

A
  • A table stage has the same name as the table. For example, a table named mytable has a stage referenced as @%mytable.
  • A table stage is an implicit stage tied to a table object. It’s not a separate database object. As a result, a table stage has no grantable privileges of its own. A table stage is also not appropriate if you need to copy file data into multiple tables.
  • To stage files on a table stage, list the files, query the files, or drop them, you must be the table owner (have the role with the OWNERSHIP privilege on the table).
  • Unlike a named stage, you can’t alter or drop a table stage.
  • Table stages don’t support transforming data while loading it.
111
Q

What is a named stage?

A

Named stages are database objects that provide the greatest degree of flexibility for data loading:

  • Users with the appropriate privileges on the stage can load data into any table.
  • Because the stage is a database object, the security/access rules that apply to all objects apply. The privileges to use a stage can be granted or revoked from roles. In addition, ownership of the stage can be transferred to another role.
112
Q

How can you define a stage using SQL?

A

CREATE STAGE my_stage
ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');

113
Q

How do you load data into a user stage?

A

Uploads a file named data.csv in the /data directory on your local machine to your user stage and prefixes the file with a folder named staged

Linux or macOS: PUT file:///data/data.csv @~/staged;
Windows: PUT file://C:\data\data.csv @~/staged;

114
Q

How do you load data into a table stage?

A

Uploads a file named data.csv in the /data directory on your local machine to the stage for a table named mytable

Linux or macOS: PUT file:///data/data.csv @%mytable;
Windows: PUT file://C:\data\data.csv @%mytable;

115
Q

How do you load data into a named stage?

A

Uploads a file named data.csv in the /data directory on your local machine to a named internal stage called my_stage

Linux or macOS: PUT file:///data/data.csv @my_stage;
Windows: PUT file://C:\data\data.csv @my_stage;

116
Q

How can you view the files that have been uploaded to a user stage using SQL?

A

LIST @~;

117
Q

How can you view the files that have been uploaded to a table stage using SQL?

A

LIST @%mytable; // table name is mytable

118
Q

How can you view the files that have been uploaded to a named stage using SQL?

A

LIST @my_stage; // stage is named my_stage

119
Q

Which SQL command is used to load staged data into a target table?

A

COPY INTO <table>

120
Q

How can you validate data loads into a target table?

A

To validate data in an uploaded file, execute COPY INTO <table> in validation mode using the VALIDATION_MODE parameter. The VALIDATION_MODE parameter returns any errors that it encounters in a file.
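
For example (hypothetical stage and table names):

COPY INTO mytable FROM @my_stage VALIDATION_MODE = 'RETURN_ERRORS';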

121
Q

What does the ON_ERROR copy option in COPY INTO <table> do?

A

Indicates what action to perform if errors are encountered in a file during loading.
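
For example (hypothetical names; SKIP_FILE is one of the supported values):

COPY INTO mytable FROM @my_stage ON_ERROR = 'SKIP_FILE';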

122
Q

What metadata is maintained for each file uploaded into a stage?

A
  • File name
  • File size
  • Last modified date
123
Q

Which command can be used to view the status of data files that have been staged?

A

LIST

124
Q

Which function can be used to validate the data files you’ve loaded and retrieve any errors encountered during the load?

A

The VALIDATE table function.

125
Q

Which view can be used to retrieve the history of data loaded into tables using the COPY INTO command?

A

LOAD_HISTORY in the Information schema

126
Q

How can staged files be deleted?

A
  • Files that were loaded successfully can be deleted from the stage during a load by specifying the PURGE copy option in the COPY INTO <table> command.
  • After the load completes, use the REMOVE command to remove the files in the stage.
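
A sketch of both options (hypothetical names):

COPY INTO mytable FROM @my_stage PURGE = TRUE;
REMOVE @my_stage;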
127
Q

Does removing files from a stage improve load performance?

A

TRUE - Removing files ensures they aren’t inadvertently loaded again. It also improves load performance, because it reduces the number of files that COPY commands must scan to verify whether existing files in a stage were loaded already.

128
Q

Can you copy files from one stage to another?

A

TRUE - you use the COPY FILES command and specify a source stage and a target stage.
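
A minimal sketch (hypothetical stage names):

COPY FILES INTO @target_stage FROM @source_stage;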

129
Q

What is the difference between internal and external stages?

A

Internal stages are managed by Snowflake, while external stages reference cloud storage locations that your organization owns and manages.

130
Q

Which cloud providers can host external stages?

A
  • AWS
  • GCP
  • Azure
131
Q

What are the types of external stages?

A

Only named external stages are available.

132
Q

What does bulk loading rely on?

A

On user-provided virtual warehouses, which are specified in the COPY statement. Users are required to size the warehouse appropriately to accommodate expected loads.

133
Q

What is Snowpipe?

A

A feature that is designed to load small volumes of data (i.e. micro-batches) and incrementally make them available for analysis.

134
Q

Which compute resources does Snowpipe use?

A

Snowpipe uses compute resources provided by Snowflake (i.e. a serverless compute model). These resources are automatically resized and scaled up or down as required, and are charged and itemized using per-second billing. Data ingestion is charged based upon the actual workloads.

135
Q

What is the Snowpipe Streaming API?

A

The Snowpipe Streaming API writes rows of data directly to Snowflake tables without the requirement of staging files. This architecture results in lower load latencies, with correspondingly lower costs for loading any volume of data, which makes it a powerful tool for handling near real-time data streams.

136
Q

What can you use to load data from Kafka topics?

A

Snowflake Connector for Kafka

137
Q

Which SQL functions can be used to automatically detect the schema in a set of staged semi-structured data files?

A
  • INFER_SCHEMA
  • GENERATE_COLUMN_DESCRIPTION
138
Q

What semi-structured formats can be used with the COPY INTO <table> command?

A
  • Apache Parquet
  • Apache Avro
  • ORC
  • JSON
  • XML (preview)
139
Q

How can you create a table or external table using a derived schema from INFER_SCHEMA?

A

CREATE TABLE … USING TEMPLATE

CREATE EXTERNAL TABLE … USING TEMPLATE
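
A sketch of the pattern (hypothetical stage and named file format):

CREATE TABLE mytable USING TEMPLATE (
SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
FROM TABLE(INFER_SCHEMA(
LOCATION => '@my_stage',
FILE_FORMAT => 'my_parquet_format')));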

140
Q

Can you query data in cloud storage without loading it into Snowflake tables?

A

TRUE

141
Q

How can you query data in cloud storage without loading it into Snowflake tables?

A

External tables.

142
Q

Which file formats can be used with the COPY INTO <table> command?

A
  • Delimited files
  • Semi-structured
  • Unstructured
143
Q

Does Snowflake support loading data from tar (tape archive) files?

A

FALSE

144
Q

Can you stage uncompressed and already-compressed files?

A

TRUE - compressed files are detected automatically, and uncompressed files are gzip-compressed by default when staged with PUT (AUTO_COMPRESS = TRUE).

145
Q

Are files stored on internal stages for data loading and unloading automatically encrypted?

A

TRUE - encrypted using AES-256

146
Q

How is encryption handled when loading already-encrypted files into a stage?

A

The key used to encrypt the files must be provided to Snowflake.

147
Q

Which keys are supported for unencrypted files?

A

128-bit and 256-bit

148
Q

How can you optimize the number of parallel data loads?

A

Aim to generate data files with a compressed size of approximately 100 to 250 MB (or larger). Aggregate smaller files to minimize the processing overhead for each file.

149
Q

Can the number of load operations that run in parallel exceed the number of data files loaded?

A

FALSE - the number of load operations that run in parallel cannot exceed the number of data files to be loaded.

150
Q

Is loading very large files (e.g. 100 GB or larger) recommended?

A

FALSE - it is not recommended.

151
Q

If you must load a large file, which copy option should you use?

A

ON_ERROR - Aborting or skipping a file due to a small number of errors could result in delays and wasted credits. In addition, if a data loading operation continues beyond the maximum allowed duration of 24 hours, it could be aborted without any portion of the file being committed.

152
Q

Into which column types can you not load objects larger than 16 MB?

A
  • VARCHAR
  • BINARY
  • VARIANT
  • OBJECT
  • ARRAY
  • GEOGRAPHY
  • GEOMETRY
153
Q

How can you load a large JSON file into separate rows?

A

Enable the STRIP_OUTER_ARRAY file format option for the COPY INTO <table> command to remove the outer array structure and load the records into separate table rows.
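
For example (hypothetical names):

COPY INTO mytable FROM @my_stage/data.json
FILE_FORMAT = (TYPE = 'JSON' STRIP_OUTER_ARRAY = TRUE);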

154
Q

How can you load a large JSON object from a Parquet file?

A

Use FLATTEN

CREATE OR REPLACE TABLE mytable AS
SELECT
t1.$1:ID AS id,
t1.$1:CustomerDetails:RegistrationDate::VARCHAR AS RegistrationDate,
t1.$1:CustomerDetails:FirstName::VARCHAR AS First_Name,
t1.$1:CustomerDetails:LastName::VARCHAR AS Last_Name,
t2.value AS Event
FROM @json t1,
TABLE(FLATTEN(INPUT => $1:CustomerDetails:Events)) t2;

155
Q

How should you prepare delimited text files?

A
  • Use the ENCODING file format option to specify the character set.
  • Fields that contain delimiter characters should be enclosed in quotes (single or double).
  • If the data contains single or double quotes, then those must be escaped.
  • Fields that contain carriage returns should also be enclosed in quotes.
  • The number of columns in each row should be consistent.
156
Q

Which elements are not extracted into a column?

A
  • Elements that contain even a single “null” value are not extracted into a column.
  • Elements that contain multiple data types.
157
Q

Should you dedicate separate warehouses to load and query operations?

A

TRUE - Loading large data sets can affect query performance. It is recommended to allocate separate warehouses for loading and querying operations to optimize performance for both tasks.

158
Q

Which options are supported when loading data files from a stage when using the COPY command?

A
  • By path (internal stages) / prefix (S3 bucket).
  • Specify a list of specific files to load.
  • Using pattern matching to identify specific files.
159
Q

Which service should you use if your workload consists of highly concurrent COPY statements loading data into the same table?

A

Snowpipe.

160
Q

When does load metadata expire?

A

After 64 days.

161
Q

How can you load data whose metadata has expired?

A

To load files whose metadata has expired, set the LOAD_UNCERTAIN_FILES copy option to true. The copy option references load metadata, if available, to avoid data duplication, but also attempts to load files with expired load metadata.

Alternatively, set the FORCE option to load all files, ignoring load metadata if it exists. Note that this option reloads files, potentially duplicating data in a table.
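
A sketch of both options (hypothetical names):

COPY INTO mytable FROM @my_stage LOAD_UNCERTAIN_FILES = TRUE;
COPY INTO mytable FROM @my_stage FORCE = TRUE; // may duplicate data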

162
Q

In a VARIANT column, how are NULL values stored?

A

Stored as a string containing the word “null,” not the SQL NULL value.

163
Q

When is it recommended to remove loaded files?

A

When data from staged files is loaded successfully, consider removing the staged files to ensure the data is not inadvertently loaded again (duplicated).

Do not remove the staged files until the data has been loaded successfully. To check if the data has been loaded successfully, use the COPY_HISTORY function. Check the STATUS column to determine if the data from the file has been loaded. Note that if the status is Load in progress, removing the staged file can result in partial loads and data loss.

Cleaning up also improves load performance, because it reduces the number of files that COPY commands must scan to verify whether existing files in a stage were loaded already.

164
Q

How can staged files be deleted from a Snowflake stage (user stage, table stage, or named stage)?

A
  • Files that were loaded successfully can be deleted from the stage during a load by specifying the PURGE copy option in the COPY INTO <table> command.
  • After the load completes, use the REMOVE command to remove the files in the stage.
165
Q

Will Snowflake automatically determine the file and codec compression method for your data files?

A

TRUE - with the default COMPRESSION = AUTO, Snowflake automatically detects the compression method.

166
Q

Which view can be used to review the data loading activity that has occurred over the last 365 days for all tables?

A

The COPY_HISTORY view in the ACCOUNT_USAGE schema of the SNOWFLAKE database.

167
Q

What is the INFORMATION_SCHEMA?

A

Each database created in your account automatically includes a built-in, read-only schema named INFORMATION_SCHEMA. The schema contains the following objects:

  • Views for all the objects contained in the database, as well as views for account-level objects (i.e. non-database objects such as roles, warehouses, and databases)
  • Table functions for historical and usage data across your account.
168
Q

What is a schema?

A

A logical grouping of database objects (tables, views, etc.). Each schema belongs to a single database.

169
Q

What is the default table type?

A

Permanent.

170
Q

What are the available table types?

A
  • Permanent
  • Temporary
  • Transient
  • External
  • Dynamic
  • Hybrid
  • Iceberg
171
Q

What is a temporary table?

A

A table used for storing non-permanent, transitory data (e.g. ETL data, session-specific data).

172
Q

Do temporary tables only exist within the session in which they were created?

A

TRUE - Temporary tables only exist within the session in which they were created and persist only for the remainder of the session.

173
Q

Are temporary tables visible to other users and session?

174
Q

What happens to the data in a temporary table when the session ends?

A

Once the session ends, data stored in the table is purged completely from the system and, therefore, is not recoverable, either by the user who created the table or Snowflake.

175
Q

Can you create temporary and non-temporary tables with the same name within the same schema?

A

TRUE - the temporary table takes precedence in the session over any other table with the same name in the same schema. This can lead to potential conflicts and unexpected behavior, particularly when performing DDL on both temporary and non-temporary tables.

176
Q

How can you define a temporary table using SQL?

A

CREATE TEMPORARY TABLE mytemptable (id NUMBER, creation_date DATE);

177
Q

What is a transient table?

A

A table that persists data until explicitly dropped and is available to all users with the appropriate privileges.

178
Q

What is the key difference between permanent and transient tables?

A

Transient tables do not have a Fail-safe period. As a result, transient tables are specifically designed for transitory data that needs to be maintained beyond each session (in contrast to temporary tables), but does not need the same level of data protection and recovery provided by permanent tables.

179
Q

How is data stored when a transient table is created from a clone of a permanent table?

A

Snowflake creates a zero-copy clone. This means when the transient table is created, it utilizes no data storage because it shares all of the existing micro-partitions of the original permanent table. When rows are added, deleted, or updated in the clone, it results in new micro-partitions that belong exclusively to the clone (in this case, the transient table).

180
Q

Can you create a transient database and schema?

A

TRUE - All tables created in a transient schema, as well as all schemas created in a transient database, are transient by definition.
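
For example (hypothetical names):

CREATE TRANSIENT DATABASE mydb;
CREATE TRANSIENT SCHEMA myschema;
CREATE TRANSIENT TABLE mytranstable (id NUMBER, creation_date DATE);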

181
Q

What is an external table?

A

An external table is a Snowflake feature that allows you to query data stored in an external stage as if the data were inside a table in Snowflake. The external stage is not part of Snowflake, so Snowflake does not store or manage the stage.

182
Q

Are external tables read-only?

A

TRUE - You cannot perform data manipulation language (DML) operations on them.

183
Q

Can you use external tables for query and join operations?

A

TRUE

184
Q

Can you create views against external tables?

A

TRUE

185
Q

How can you improve the query performance of an external table?

A

Use a materialized view based on an external table.

186
Q

Which columns do all external tables include?

A
  • VALUE
  • METADATA$FILENAME
  • METADATA$FILE_ROW_NUMBER
187
Q

Can you partition an external table?

A

TRUE - automatically and manually

188
Q

What is a dynamic table?

A

Dynamic tables in Snowflake simplify data engineering by automating data transformations. Instead of manually managing tasks and schedules, you define the end result with dynamic tables, and Snowflake handles the pipeline. These tables reflect query results directly, eliminating the need for separate target tables or custom code. The content is updated automatically through scheduled refreshes, and cannot be modified via DML operations.

189
Q

When are dynamic tables best used?

A
  • You don’t want to write code to track data dependencies and manage data refresh.
  • You don’t need, or want to avoid, the complexity of streams and tasks.
  • You do need to materialize the results of a query of multiple base tables.
  • You need to build multiple tables to transform data via an ETL pipeline.
  • You don’t need fine-grained refresh schedule control and you just want to specify the target data freshness for your pipelines.
  • You don’t need to use unsupported dynamic query constructs such as stored procedures, non-deterministic functions not listed in Supported non-deterministic functions in full refresh, or external functions, or need to use sources for dynamic tables that are external tables, streams, or materialized views.
190
Q

What is a hybrid table?

A

A hybrid table is a Snowflake table type that is optimized for hybrid transactional and operational workloads that require low latency and high throughput on small random point reads and writes. You can use a hybrid table along with other Snowflake tables and features to power Unistore workloads that bring transactional and analytical data together in a single platform.

191
Q

Which type of queries benefit most from hybrid tables?

A
  • High concurrency random point reads versus large range reads.
  • High concurrency random writes versus large sequential writes (for example, bulk loading).
  • Retrieval of a small number of entire records (for example, customer object) versus narrow projections with analytical functions (for example, aggregations or GROUP BY queries).
192
Q

Can you clone a hybrid table?

A

FALSE - although cloning is not supported for hybrid tables, you can clone databases and schemas that contain hybrid tables by using the IGNORE HYBRID TABLES parameter in the CREATE <object> … CLONE statement.

193
Q

Are clustering keys supported by hybrid tables?

A

FALSE - clustering keys cannot be defined for hybrid tables; in hybrid tables, data is always ordered by primary key.

194
Q

What are Iceberg tables?

A

Apache Iceberg tables for Snowflake combine the performance and query semantics of typical Snowflake tables with external cloud storage that you manage. They are ideal for existing data lakes that you cannot, or choose not to, store in Snowflake.

195
Q

Which features are supported by Iceberg tables?

A
  • ACID (atomicity, consistency, isolation, durability) transactions
  • Schema evolution
  • Hidden partitioning
  • Table snapshots
196
Q

Where does an Iceberg table store its data and metadata?

A

In an external cloud storage location (Amazon S3, Google Cloud Storage, or Azure Storage).

197
Q

What is an external volume?

A

An external volume is a named, account-level Snowflake object that you use to connect Snowflake to your external cloud storage for Iceberg tables. An external volume stores an identity and access management (IAM) entity for your storage location.

198
Q

What types of views are supported?

A
  • Non-materialized
  • Materialized
199
Q

What is a materialized view?

A

A materialized view is a pre-computed data set derived from a query specification (the SELECT in the view definition) and stored for later use. Because the data is pre-computed, querying a materialized view is faster than executing a query against the base table of the view. This performance difference can be significant when a query is run frequently or is sufficiently complex.
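
A minimal sketch (hypothetical names):

CREATE MATERIALIZED VIEW mv1 AS
SELECT col1, col2 FROM mytable WHERE col2 > 0;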

200
Q

Do materialized views incur additional costs?

A

TRUE - this allows faster access, but requires storage space and active maintenance, both of which incur additional costs.

201
Q

What is a recursive view?

A

A view that refers to itself. Recursion only works with non-materialized views.

202
Q

Can a view definition be updated?

A

FALSE - the definition for a view cannot be updated (i.e. you cannot use ALTER VIEW or ALTER MATERIALIZED VIEW to change the definition of a view). To change a view definition, you must recreate the view with the new definition.

203
Q

Are changes to a table automatically propagated to views created on that table?

A

FALSE - for example, if you drop a column in a table, the views on that table might become invalid.

204
Q

Which view types can be defined as secure?

A
  • non-materialized
  • materialized
205
Q

When should you use a secure view?

A

Views should be defined as secure when they are specifically designated for data privacy (i.e. to limit access to sensitive data that should not be exposed to all users of the underlying table(s)).

206
Q

EXAMPLE - Using the following widgets example, consider a user who has access to only the red widgets. Suppose the user wonders if any purple widgets exist and issues the following query:

SELECT *
FROM widgets_view
WHERE 1/IFF(color = 'Purple', 0, 1) = 1;

If any purple widgets exist, then the IFF() expression returns 0. The division operation then fails due to a division-by-zero error, which allows the user to infer that at least one purple widget exists.

207
Q

Who can view the definition of a secure view?

A

The definition of a secure view is only exposed to authorized users (i.e. users who have been granted the role that owns the view).

208
Q

How can you determine if a non-materialized view is secured?

A

The IS_SECURE column in the Information Schema and Account Usage views identifies whether a view is secure.

209
Q

How can you determine if a materialized view is secured?

A

Use the SHOW MATERIALIZED VIEWS command.

210
Q

NOTE - view security can be integrated with Snowflake users and roles using the CURRENT_ROLE and CURRENT_USER context functions.

211
Q

What alternatives can be used to prevent exposing sensitive information if the surrogate key reveals important details about the underlying data distribution?

A
  • Do not expose the sequence-generated column as part of the view.
  • Use randomized identifiers (e.g. generated by UUID_STRING) instead of sequence-generated values.
  • Programmatically obfuscate the identifiers.
212
Q

What is the recommended approach to prevent the exposure of approximate information about the underlying data to users who only have access to a subset of the data when using secure views?

A

It is best to materialize data per user/role instead of exposing views on the base data to users.

213
Q

Which function should be used to authorize users from a specific account to access rows in a base table when using secure views with Secure Data Sharing?

A

CURRENT_ACCOUNT

214
Q

When is a materialized view useful?

A
  • Query results contain a small number of rows and/or columns relative to the base table (the table on which the view is defined).
  • Query results contain results that require significant processing.
  • The query is on an external table, which might have slower performance compared to querying native database tables or Apache Iceberg™ tables.
  • The view’s base table does not change frequently.
215
Q

When should you create a regular view? (if ANY of the following are true)

A
  • The results of the view change often.
  • The results are not used often (relative to the rate at which the results change).
  • The query is not resource intensive so it is not costly to re-run it.
216
Q

When should you create a materialized view? (if ALL of the following are true)

A
  • The query results from the view don’t change often.
  • The results of the view are used often.
  • The query consumes a lot of resources.
217
Q

Do materialized views require Enterprise Edition?

A

TRUE - materialized views require Enterprise Edition (or higher).