DP-203 Flashcards

1
Q

Which roles does the user account used to sign into Azure need to be a member of to create a Data Factory instance?

A

Contributor or Owner role, or an administrator of the Azure subscription

2
Q

To create and manage Data Factory objects including datasets, linked services, pipelines, triggers, and integration runtimes, which requirements must be met?

A

To create and manage child resources in the Azure portal, you must belong to the Data Factory Contributor role at the resource group level or above.

To create and manage child resources with PowerShell or the SDK, the Contributor role at the resource level or above is sufficient.

3
Q

What are the 3 types of Integration runtime?

A

Azure
Self-hosted
Azure-SSIS

4
Q

Which Azure Data Factory component orchestrates a transformation job or runs a data movement command?

A

Activities

5
Q

You are moving data from an Azure Data Lake Gen2 store to Azure Synapse Analytics. Which Azure Data Factory integration runtime would be used in a data copy activity?

A

Azure

6
Q

What is the Azure IR used for?

A

The Azure integration runtime is used when copying data between two Azure data platform technologies.

7
Q

What is the self-hosted IR used for?

A

Self-hosted IR is used when working with data movement from private networks to the cloud and vice versa.

8
Q

What is the Azure-SSIS IR used for?

A

Azure-SSIS IR is used when you lift and shift existing SSIS workloads.

9
Q

What is an integration runtime?

A

An integration runtime provides the bridge between the activity and linked services.

10
Q

In Azure Data Factory authoring tool, where would you find the Copy data activity?

A

In the Move & Transform section, which contains the activities specific to Azure Data Factory for copying data and defining data flows.

11
Q

You want to ingest data from a SQL Server database hosted on an on-premises Windows Server. What integration runtime is required for Azure Data Factory to ingest data from the on-premises server?

A

A self-hosted integration runtime can run copy activities between a cloud data store and a data store in a private network. It also can dispatch transform activities against compute resources in an on-premises network or an Azure virtual network.

12
Q

By default, how long are the Azure Data Factory diagnostic logs retained for?

A

45 days

13
Q

Which transformation in the Mapping Data Flow routes data rows to different streams based on matching conditions?

A

A Conditional Split transformation routes data rows to different streams based on matching conditions. The conditional split transformation is similar to a CASE decision structure in a programming language.

14
Q

Which transformation is used to load data into a data store or compute resource?

A

A Sink transformation allows you to choose a dataset definition for the destination output data. You can have as many sink transformations as your data flow requires.

15
Q

How long does Databricks retain cluster configuration information?

A

Up to 70 all-purpose clusters terminated in the last 30 days and up to 30 job clusters recently terminated by the job scheduler.

16
Q

How do you keep an all-purpose cluster configuration in Databricks for more than 30 days after it has been terminated?

A

An administrator can pin a cluster to the cluster list.

17
Q

What are the 2 cluster types?

A

All-purpose cluster or a job cluster

18
Q

What is an All-purpose cluster?

A

All-purpose clusters can be shared by multiple users and are best for performing ad-hoc analysis, data exploration, or development. Once you’ve completed implementing your processing and are ready to operationalize your code, switch to running it on a job cluster.

19
Q

What is a job cluster?

A

Job clusters terminate when your job ends, reducing resource usage and cost.

20
Q

What are the 3 cluster modes?

A

Standard, High Concurrency, and Single Node. Most regular users use Standard or Single Node clusters.

21
Q

What is a standard cluster used for?

A

Standard clusters are ideal for processing large amounts of data with Apache Spark.

22
Q

What is a single node cluster used for?

A

Single Node clusters are intended for jobs that use small amounts of data or non-distributed workloads such as single-node machine learning libraries.

23
Q

What is a high concurrency cluster used for?

A

High Concurrency clusters are ideal for groups of users who need to share resources or run ad-hoc jobs. Administrators usually create High Concurrency clusters. Databricks recommends enabling autoscaling for High Concurrency clusters.

24
Q

Sharding pattern: What is hash?

A

A hash-distributed table can deliver the highest query performance for joins and aggregations on large tables.

25
Q

Sharding pattern: What is replicate?

A

A replicated table provides the fastest query performance for small tables.

26
Q

Sharding pattern: What is round-robin?

A

A round-robin table is the most straightforward table to create and delivers fast performance when used as a staging table for loads.

27
Q

Which sharding pattern is best for small dimension tables with less than 2GB of storage after compression?

A

Replicated

28
Q

Which sharding pattern is best for temporary/staging tables or tables with no obvious joining key or good candidate column?

A

Round-robin (Default)

29
Q

Which sharding pattern is best for large dimension tables and fact tables?

A

Hash

30
Q

Name the 5 types of window functions in Stream Analytics

A

Tumbling
Hopping
Sliding
Session
Snapshot

31
Q

When should a ‘Tumbling’ window be used?

A

Used to segment a data stream into distinct, non-overlapping time segments, e.g. the count of tweets per time zone every 10 seconds.
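
A minimal Stream Analytics query sketch for this scenario (the input name TwitterStream and its fields are illustrative assumptions):

SELECT TimeZone, COUNT(*) AS TweetCount
FROM TwitterStream TIMESTAMP BY CreatedAt
GROUP BY TimeZone, TumblingWindow(second, 10)   -- distinct, non-overlapping 10-second windows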

32
Q

When should a ‘Hopping’ window be used?

A

Windows that hop forward in time by a fixed period; they are like tumbling windows, but they can overlap, so an event can belong to more than one window. E.g. every 5 seconds, give the count of tweets over the last 10 seconds.
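
A sketch of that scenario with a hopping window (TwitterStream is an assumed input name; the arguments are time unit, window size, hop size):

SELECT Topic, COUNT(*) AS TweetCount
FROM TwitterStream TIMESTAMP BY CreatedAt
GROUP BY Topic, HoppingWindow(second, 10, 5)   -- 10-second windows that hop forward every 5 seconds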

33
Q

When should a ‘Sliding’ window be used?

A

Outputs events only for points in time when the content of the window changes. E.g. give the count of tweets for all topics that are tweeted more than 10 times in the last 10 seconds.
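
A sketch using a sliding window (assumed input TwitterStream; output is produced only when the window content changes):

SELECT Topic, COUNT(*) AS TweetCount
FROM TwitterStream TIMESTAMP BY CreatedAt
GROUP BY Topic, SlidingWindow(second, 10)
HAVING COUNT(*) > 10   -- only topics tweeted more than 10 times in the last 10 seconds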

34
Q

When should a ‘Session’ window be used?

A

Groups events that arrive at similar times, filtering out periods of time where there is no data. E.g. the count of tweets that occur within 5 minutes of each other.
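
A sketch using a session window (assumed input TwitterStream; the arguments are time unit, timeout, and maximum duration):

SELECT Topic, COUNT(*) AS TweetCount
FROM TwitterStream TIMESTAMP BY CreatedAt
GROUP BY Topic, SessionWindow(minute, 5, 10)   -- close the window after 5 idle minutes, cap it at 10 minutes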

35
Q

When should a ‘Snapshot’ window be used?

A

Groups events that have the same timestamp. E.g. give the count of tweets with the same topic type that occur at exactly the same time.
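
A sketch of a snapshot window (assumed input TwitterStream); no window function is needed, grouping by System.Timestamp() is enough:

SELECT Topic, COUNT(*) AS TweetCount
FROM TwitterStream TIMESTAMP BY CreatedAt
GROUP BY Topic, System.Timestamp()   -- groups events that share the same timestamp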

36
Q

How do you set up ‘Dynamic Data Masking’?

A

You set up a dynamic data masking policy in the Azure portal by selecting the Dynamic Data Masking blade under Security in your SQL Database configuration pane. This feature cannot be set up by using the portal for SQL Managed Instance.
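
A masking rule can also be defined in T-SQL; a minimal sketch with illustrative table, column, and role names:

ALTER TABLE dbo.Customers
ALTER COLUMN Email ADD MASKED WITH (FUNCTION = 'email()');

-- Users without the UNMASK permission see masked data; grant it only to trusted principals.
GRANT UNMASK TO DataAnalystRole;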

37
Q

In DDM what is ‘Default’?

A
  • Use XXXX (or fewer Xs if the size of the field is less than 4 characters) for string data types (nchar, ntext, nvarchar).
  • Use a zero value for numeric data types (bigint, bit, decimal, int, money, numeric, smallint, smallmoney, tinyint, float, real).
  • Use 01-01-1900 for date/time data types (date, datetime2, datetime, datetimeoffset, smalldatetime, time).
  • For SQL variant, the default value of the current type is used.
  • For XML, the document <masked/> is used.
  • Use an empty value for special data types (timestamp, table, hierarchyid, GUID, binary, image, varbinary, spatial types).
38
Q

In DDM what is ‘Credit Card’?

A

Masking method, which exposes the last four digits of the designated fields and adds a constant string as a prefix in the form of a credit card.

XXXX-XXXX-XXXX-1234

39
Q

In DDM what is ‘Email’?

A

Masking method, which exposes the first letter and replaces the domain with XXX.com using a constant string prefix in the form of an email address.

aXX@XXXX.com

40
Q

In DDM what is ‘Random number’?

A

Masking method, which generates a random number according to the selected boundaries and actual data types. If the designated boundaries are equal, then the masking function is a constant number.

41
Q

In DDM what is ‘Custom text’?

A

Masking method, which exposes the first and last characters and adds a custom padding string in the middle. If the original string is shorter than the exposed prefix and suffix, only the padding string is used.
prefix[padding]suffix
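
In T-SQL this corresponds to the partial() masking function; a sketch with an illustrative column:

ALTER TABLE dbo.Customers
ALTER COLUMN PhoneNumber ADD MASKED WITH (FUNCTION = 'partial(2,"XXX-XXX-",4)');
-- exposes the first 2 and last 4 characters and pads the middle with "XXX-XXX-"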

42
Q

What is ‘Always Encrypted’?

A

Always Encrypted is a feature designed to protect sensitive data, such as credit card numbers or national identification numbers (for example, U.S. social security numbers), stored in Azure SQL Database or SQL Server databases.

43
Q

What is ‘Deterministic’ encryption?

A

Deterministic encryption always generates the same encrypted value for any given plain text value. Using deterministic encryption allows point lookups, equality joins, grouping and indexing on encrypted columns. However, it may also allow unauthorized users to guess information about encrypted values by examining patterns in the encrypted column, especially if there’s a small set of possible encrypted values, such as True/False, or North/South/East/West region. Deterministic encryption must use a column collation with a binary2 sort order for character columns.

44
Q

What is ‘Randomized’ encryption?

A

Randomized encryption uses a method that encrypts data in a less predictable manner. Randomized encryption is more secure, but prevents searching, grouping, indexing, and joining on encrypted columns.
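
A T-SQL sketch showing both encryption types in a column definition (the table, columns, and the column encryption key name CEK1 are illustrative assumptions):

CREATE TABLE dbo.Patients
(
    PatientId int IDENTITY PRIMARY KEY,
    SSN char(11) COLLATE Latin1_General_BIN2          -- deterministic requires a BIN2 collation
        ENCRYPTED WITH (COLUMN_ENCRYPTION_KEY = CEK1,
                        ENCRYPTION_TYPE = DETERMINISTIC,
                        ALGORITHM = 'AEAD_AES_256_CBC_HMAC_SHA_256'),
    Salary money                                      -- randomized: more secure, but no lookups or joins
        ENCRYPTED WITH (COLUMN_ENCRYPTION_KEY = CEK1,
                        ENCRYPTION_TYPE = RANDOMIZED,
                        ALGORITHM = 'AEAD_AES_256_CBC_HMAC_SHA_256')
);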

45
Q

How would you configure ‘Always Encrypted’?

A

Using SSMS and PowerShell:

1) Provision column master keys, column encryption keys, and encrypted column encryption keys with their corresponding column master keys.
2) Create key metadata in the database.
3) Create new tables with encrypted columns.
4) Encrypt existing data in selected database columns.

46
Q

What is ‘Transparent Data Encryption (TDE)’?

A

Transparent Data Encryption (TDE) encrypts SQL Server, Azure SQL Database, and Azure Synapse Analytics data files. This encryption is known as encrypting data at rest.

47
Q

What steps would you take to enable TDE?

A

Create a master key.

Create or obtain a certificate protected by the master key.

Create a database encryption key and protect it by using the certificate.

Set the database to use encryption.
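
A hedged T-SQL sketch of these steps for SQL Server (certificate and database names are illustrative; Azure SQL Database normally has TDE enabled by default with a service-managed key):

USE master;
CREATE MASTER KEY ENCRYPTION BY PASSWORD = '<strong password>';
CREATE CERTIFICATE TdeCert WITH SUBJECT = 'TDE certificate';

USE SalesDb;
CREATE DATABASE ENCRYPTION KEY
    WITH ALGORITHM = AES_256
    ENCRYPTION BY SERVER CERTIFICATE TdeCert;

ALTER DATABASE SalesDb SET ENCRYPTION ON;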

48
Q

In Data Lake Storage Gen2, when would you use the premium tier?

A

For scenarios that require low latency.

49
Q

Mike is creating an Azure Data Lake Storage Gen2 account. He must configure this account to be able to process analytical data workloads for best performance. Which option should he configure when creating the storage account?

A

On the Advanced tab, set the Hierarchical Namespace to Enabled.

If you want the best performance for analytical workloads in Data Lake Storage Gen2, set the Hierarchical Namespace to Enabled on the Advanced tab when creating the storage account.

50
Q

In which phase of big data processing is Azure Data Lake Storage located?

A

Store

51
Q

What are the key differences when creating tables in Synapse Analytics?

A

In Synapse, you do not have foreign keys and unique value constraints like you do in SQL Server. Since these rules are not enforced at the database layer, the jobs used to load data have more responsibility to maintain data integrity.

52
Q

In Synapse, how would you specify distribution type?

A

DISTRIBUTION = REPLICATE

DISTRIBUTION = HASH([SalesOrderNumber])
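
For example, in a Synapse dedicated SQL pool CREATE TABLE statement (the table and columns are illustrative; CLUSTERED COLUMNSTORE INDEX is the typical index choice for large tables):

CREATE TABLE dbo.FactSales
(
    SalesOrderNumber nvarchar(20) NOT NULL,
    SalesAmount money NOT NULL
)
WITH
(
    DISTRIBUTION = HASH([SalesOrderNumber]),
    CLUSTERED COLUMNSTORE INDEX
);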

53
Q

What distribution option would you use for a product dimension table that will contain 1,000 records in Synapse Analytics?

A

DISTRIBUTION = REPLICATE.

Replicate will result in a copy of the table on each compute node, which performs well with joins to the distributed fact table.

54
Q

What distribution option would be best for a sales fact table that will contain billions of records?

A

DISTRIBUTION = HASH([SalesOrderNumber]).

Hash distribution provides good read performance for a large table by distributing records across compute nodes based on the hash key.

55
Q

What is the difference between a star schema and a snowflake schema?

A

All dimensions in a star schema join directly to the fact table (denormalized) while some dimension tables in a snowflake schema are normalized.

A star schema is highly denormalized so that the fact table joins directly to dimension; a snowflake schema normalizes some dimensions into multiple tables such as DimProduct, DimProductSubcategory, and DimProductCategory.