D3.2 - DEFINE BULK AS COMPARED TO CONTINUOUS DATA LOADING METHODS Flashcards
1
Q
COPY
A
- This option enables loading batches of data from files already available in cloud storage, or copying (i.e. staging) data files from a local to an internal (i.e. snowflake) cloud storage location before loading the data into tables using the COPY command
- Bulk loading relies on user-provided virtual warehouses, which are specified in the COPY statement. Users are required to size the warehouse appropriately to accommodate expected loads.
2
Q
SNOWPIPE
A
- Option is best designed to load small volumes of data (i.e. micro-batches) and incrementally make them available for analysis. Snowpipe loads data within minutes after files are added to a stage and submitted for ingestion. This ensures users have the largest results, as soon as the raw data is available.
- Snowpipe uses compute resources provided by Snowflake. These snowflake-provided resources are automatically resized and scaled up or down as required, and are charged and itemized using per-second billing. Data ingestion is charged based upon the actual workloads.