snowflake Trial Edition Flashcards

1
Q

What are the main pushbuttons in Snowflake main screen

A

Projects
Data
Data products
AI & ML
Monitoring
Admin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the options under projects

A

Worksheets
Streamlit
Dashboards
App packages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is VPS

A

Virtual private Snowflake

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the main cloud hosting snowflake

A

AWS, Azure and GCP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Each snowflake account is hosted in a single region

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the unique layers of Snowflake

A

Storage layer
Compute layer
Cloud service layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is share disc architecture

A

one data base with several services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is share nothing architecture

A

several dbs in paralel with many services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the characteristics of Data Storage layer

A

Underlying cloud layer
Virtually infinite storage
Compressed & Encrypted
Cloud Data Redundancy
pay only for stored data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the characteristics of Compute & Processing Layer

A

Query engine or virtual warehouse
Underlying virtual machines
Scale up and down as needed
different size to serve different loads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the characteristics of Cloud Service Layer (Brain of snowflake)

A

Authentication and Authorisation
User and session management
Query compilation, Optimization & Data Caching
virtual Warehouse Management
Metadata management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can each layer be scaled independently

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does snowflake do to the loaded data

A

It compresses and store data in columnar form.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What can we do in the compute layer

A

Select queries
join queries
data loading

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is VWH

A

Virtual warehouse and before any query is executed they need to be provisioned.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which layer takes care of the authentication 7 authorisation

A

Cloud service layer for all incoming request from:
Web UI
SnowSQL
JDBC
ODBC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is another name of Query in snowflake

A

Work Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What attributes makes Snowflake a true SAAS Solution

A

No hardware to purchase or configured
No maintenance upgrades or patches
Transparent releases don’t required user intervention.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Which instalment options are available

A

AWS, Azure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Three terms are used to described the same compute layer

A

Compute Layer
Virtual Warehouse Layer
Query processing layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Two terms are used to described the same Cloud Services Layer

A

Cloud Services Layer
Virtual Warehouse Layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Two of these terms are used to described the same data layer

A

Data layer
Storage layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Which tasks are performed by the Cloud Services Layer

A

Metadata management
User authentication
Metadata storage
Data security

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is DML

A

Data Manipulation language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is DDL

A

Data Definition Language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are the method types for loading data

A

Bulk loading and continuous loading

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

what the stages types

A

External and Internal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

produce a syntax of an inner join

A

select A.C_CUSTOMER_SK, A.C_CUSTOMER_ID, A.C_CURRENT_CDEMO_SK, B.CA_STREET_NUMBER, B.CA_STREET_NAME from SNOWFLAKE_SAMPLE_DATA.TPCDS_SF100TCL_OLD.CUSTOMER as A
inner join SNOWFLAKE_SAMPLE_DATA.TPCDS_SF100TCL_OLD.CUSTOMER_ADDRESS as B
on A.C_CUSTOMER_SK = B.CA_ADDRESS_SK;

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

what did you get as soon as Database is created in snowflake

A

Information Schema and Public

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What are the options of the submenu for creating a table

A

Standard,
As select
External
From file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is the qualification when creating tables

A

DB.Schema.Table name

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the acceptable syntax for naming surrogate key

A

_PK, _FK

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What are the benefits of surrogate keys

A

performance improvement
handle dummy values
integrate multiple systems
easier admin and update
useful when there are no natural keys available

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is type 0

A

Retain original

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is SCD type 1

A

The values are overwritten and no history is maintain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is SCD type 2

A

We maintain a complete history of changes. Every time, there is a change a new row will be added to the table. A new column is added to the table to indicate witch record is active. This column(s) could be a flag (true or false) or version number (0,1,2, etc), or time columns to indicate from/to.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is SCD type 3

A

A partial history is maintained and not a complete history. an extra row is added to indicate the previous state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is SCD type 4

A

The dim table changes frequently. There is a need for creating a new table or redesign existing model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is SCD type 6

A

This is a combination of 1,2,3. Along with the addition of a new row, we also update the latest value in all the rows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What are the stages types

A

External and Internal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Where is the data flowing to Snowflake to form External Stage

A

S3, External cloud Provider, Google Cloud, Microsoft Azure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is the copy syntax

A

copy into <table name>
from externalStage
files = (‘file_name1’,’file_name2’)
file_format = <file_format_name>
copyoptions</file_format_name>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Where can I find Snowflake documentation

A

docs.snowflake.com

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is the syntax for creating a schema

A

create or replace schema qualified address + name

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is the syntax for creating a file format

A

create or replace file format qualified address + name

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

How to determine a file definition

A

by using desc file format qualified address + name

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is the syntax for changing a file properties

A

alter file format qualified address + name
for example SKIP_HEADER = 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Alter cannot be used for changing properties that are not inherently to its native format.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is the syntax for setting up csv file properties

A

create or replace file format qualified address + name
TYPE = CSV,
FIELD_DELIMITER = “,”
SKIP_HEADER = 1;

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is the syntax for loading data into a table

A

copy into DB.Schema.Table name
from @qualified name
file_format = (format_name = qualified name + csv file format)
files = (‘file name.csv’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What does the truncate command does

A

Delete the content of a table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

How to get rid of double quotes in Json file - column

A

:: String

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What command is used to select one row from a Json file

A

raw_file:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

How to use raw_file for extracting a Json object

A

raw_file object name.first property, raw_file object name.second property,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is the syntax for creating a role

A

create role <data_scientist>
grant usage on <warehouse> to role <data_scientist></data_scientist></warehouse></data_scientist>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

How can you delete an user

A

drop user <me></me>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

How long does the cash storage last

A

24 hrs or an soon the data is changed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Is clustering for all tables

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What are the most suitable field for clustering

A

Date or those fields used in the where clause or in joins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What is the syntax for creating clusters in tables

A

create table name ….. cluster by (column1, column2 …column) or expression <>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What is the syntax for changing clusters in tables

A

alter < table name> cluster by (expression 1, … expression n)
alter <table name> drop clustering key

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What is the process for building Snowpipe from S3

A

1.Create Stage
2. Test copy command
3. Create pipe
4. S3 notification

63
Q

What is Snowpipe

A

It enables loading once a new data is storing in a third party source bucket.

64
Q

What happens when auto_infest = True

A

Data will be push from source system bucket automatically to Snowflake.

65
Q

What is the syntax for creating a pipe

A

create or replace pipe <schema><pipe>
auto_ingest=true
as
copy into our_first
from @manage_DB_external</pipe></schema>

66
Q

Mentioned the time travel type

A

Standard
Enterprise
Business Critical
Virtual private

67
Q

What is the retention period - default

A

Data can be recovered up to 1 day only

68
Q

Who can perform the fail safe process

A

Restoring only by Snowflake support

69
Q

How many methods are there to recover time travel data

70
Q

Can undrop work schemas and tables

71
Q

What are the table types

A

Permanent
Transient
Temporary

72
Q

what does the field kind contain

A

Table type

73
Q

Can the Transient schema pass on the same properties to created tables under this schema

74
Q

is cloned object independent from original

75
Q

What is the syntax for creating a clone object

A

create table <name>
clone <source></source>
before (timestamp => (timestamp))</name>

76
Q

Is data sharing involve the recipient with the right to update the data in the original table

77
Q

What is the syntax for sharing data on a database

A

grant usage on database <data-s> to share orders_share;</data-s>

78
Q

what objects can you data share

A

Database
Schema
Table

79
Q

What is behind_by

A

it is the time difference between a query and an materialised view.

80
Q

What does masking policy do

A

hire some field content to unauthorised users

81
Q

What is a classic console

A

This is the old UI

82
Q

Where can you get more information of how to do things in Snowflake

A

docs.snowflake.com

83
Q

Where can you get more information about courses

A

learn.snowflake.com

84
Q

What is identity all about

A

identifying the person

85
Q

What is authentication

A

What the user can do

86
Q

what is the role with more power

A

accountadmin

87
Q

What is the role given for all account trial users as default

88
Q

In which direction the inpersonation flows

89
Q

If you had been awarded SYSADMIN directly, you would not be able to impersonate ACCOUNTADMIN.

90
Q

What is DAC

A

Discrecionary Access Control: you created you owned it

91
Q

After creating a database, what are the default schemas provided

A

Information schema and public

92
Q

What are databases created for

A

Databases are used to group datasets (tables) together.

93
Q

Can an information schema be dropped

94
Q

What is the content of information schemas

A

The INFORMATION_SCHEMA schema holds a collection of views.

95
Q

How many menus are included in worksheet

A

Role
Warehouse
Database
Schema

96
Q

what does Running a SHOW DATABASES command do

A

It is just like being at the first level of an Object Picker (but with more details, and the ability to cut and paste the info into a spreadsheet).

97
Q

Tsai points out that every worksheet
has four configuration settings that
are saved with the worksheet. How are they known as

A

Context settings, 2 provide shot cuts for the data storage location. The other are for the user role and warehouse.

98
Q

What is the warehouse

A

The machine that crunches data (computer power).

99
Q

What is the idle time allows for the machine to turn of wait for any code running

A

10 minutes

100
Q

Can Select statement run without role or datawarehouse

101
Q

Is the user role and warehouse mandatory

102
Q

What can you do with Worksheets

A

Write code
Run code
Save code
Share

103
Q

Up to what cluster can a warehouse stretch

104
Q

Team is also known as

105
Q

Team members are also known as

106
Q

Warehouses can be manually scaled UP or DOWN. They can be set up so that they automatically scale OUT. What did the video call the opposite of SCALING OUT?

A

Snapping back.

107
Q

What is the meaning of cluster

A

Group of servers

108
Q

XS not scaling how many servers does it have

109
Q

M not scaling how many servers does it have

110
Q

Is the number of servers different based on XS, S, M, etc

111
Q

A XS warehouse when it is scale out would it has more than 1 cluster

112
Q

A M warehouse when it is scale out would it has more than 1 cluster

113
Q

do clusters hold more than 1 server

114
Q

Has any worksheet include a default role

115
Q

So what is a stage?

A

A stage or staging area
is a place we put things temporarily
with the intention to later move
them to a more stable, longer
term location.

116
Q

What are the Stages type

A

Internal and External (Cloud storage services)

117
Q

What are the requirments for creating an external AWS Cloud services

A

Storage location - S3 bucket
Cloud storage access
AWS IAM user and policy and
In snowflake define stage object with reference to the AWS S3 bucket

118
Q

does snowflake care about capitalization

119
Q

snowflake will type table names in uppercase if the name is not in double quotes

120
Q

What the tools used for loading data into AWS S3 buckets

A

Amazon transfer for AFP
Amazon CLI
Web browser interface
Python
Java
REST API

121
Q

Is there any load wizard for loading data from S3 to Snowflake

122
Q

To run the COPY INTO command, you
need to have three database objects
defined.

A

Table
Stage
File format

123
Q

snowflake tools allow you to import 5 popular semi structure data formats like

A

Json
xml
Parquet
Avro
Orc

124
Q

A Snowflake tool used for ingesting semi structure data format is known as

A

Snowflake variant data type

125
Q

When loading semi structure data, what is the data type of the column holding the upload data

126
Q

What are the data type trend

A

Unstructured
Quasi structured
Semi structured
Structured

127
Q

What are some examples of Unstructured

A

Images, PDF, videos

128
Q

What are some examples of Quasi structured

A

ClikStream

129
Q

What are some examples of Semi structured

A

XML, JSON, Parquet, AVRO, ORC

130
Q

What are some examples of Structured

A

RDBMS AND CSV files

131
Q

What the flatten command do

A

It flatten for example list arrays in a Json file. This is similar to the Russian nested dolls.

132
Q

Usually how are the twit entities called

133
Q

What are the layers of Snowflake architecture

A

Storage
Query Processing
cloud Services

134
Q

What is the query processing

A

It is the muscle of the system and performs massive parallel system

135
Q

Which layer is the brain of the system

A

Cloud Services: access control, security, optimizer, metadata, manage the infrastructure.

136
Q

What is Data Warehouse

A

DB that is used for reporting and data analysis.

137
Q

VALIDATION_MODE = RETURN_ERRORS; What is this

A

When loading the file will be checked for errors.

138
Q

VALIDATION_MODE = RETURN_5_ROWS ; What is this

A

When loading the file; the first 5 lines will be checked for errors.

139
Q

SIZE_LIMIT = 2000; What is this

A

The system will load up to 20000 mb

140
Q

RETURN_FAILED_ONLY = TRUE; What is this

A

The system will indicate the files containing errors.

141
Q

TRUNCATE COLUMNS = TRUE; What is this

A

Strings are automatically truncated to the target column length

142
Q

TRUNCATE COLUMNS = FALSE; What is this

A

Copy pro9duces an error if a loaded string exceed the target column length

143
Q

FORCE=TRUE; What is this

A

Indicates tot he system to load the files(s) regardless of whether the file has been loaded previously and have not change since loaded previously. It has the potential to duplicate the data. Default is FALSE.

144
Q

What is Loading History

A

Enables you to retrieve the history of data loaded into tables using the COPY INTO <Table> command

145
Q

We have 3 files in our external stage with 20MB (~20 000 000 bytes) each.

How many files will be loaded if we set SIZE_LIMIT = 30 000 000 ?

146
Q

how do you change from variant to string

A

SELECT RAW_FILE:first_name::string as first_name FROM OUR_FIRST_DB.PUBLIC.JSON_RAW;

147
Q

How to create a table by using select statement

148
Q

How to format a date column - parquet data to readable date

A

DATE($1:date::int ) as Date

149
Q

What is this statement METADATA$FILENAME AS FILENAME do in an sql query

A

Indicate the file name loaded to a stage or table

150
Q

What is this statement TO_TIMESTAMP_NTZ(CURRENT_TIMESTAMP) AS LOAD_DATE,

152
Q

What measures can be taken to improve performance

A
  1. Create virtual warehouses
  2. Scale up during season of high demand
  3. Scale out to unknow patters of workload
  4. Maximise cached usage
  5. Cluster keys - large tables
153
Q

What is a cluster

A

Subset of rows t olocate the data in micro partitions. For large tables improves the table scan.