DP-200 - Monitor and optimise data solution Flashcards

Question

What is bounded staleness?

Answer 1

the data is consistent beyond the user-defined time or operations threshold. The performance of bounded staleness is better than the strong consistency however the availability is still low due to inherent lag for the replication. This level is used for apps that don't need to fetch data in real-time, however still in the order, it was written.

Answer 2

session consistency provides strong consistency for the session, ensuring the data stays up to date for any active read-write session. The availability of the data is relatively high with lower latency and higher throughput than the bounded staleness. The possible candidate for this kind of model could be a typical e-commerce application, social media app, and other similar services with persistent user connection.

Answer 3

The consistent prefix model is similar to bounded staleness except, the operational or time lag guarantee. The replicas guarantee the consistency and order of the writes however the data is not always current. This model ensures that the user never sees an out-of-order write. For example, if data is written in the order A, B, and C, the user may either see A, A,B or A,B,C, but never out-of-order entry like A,C or B,A,C. This model provides high availability and very low latency which is best for certain applications that can afford the lag and still function as expected.

Answer 4

This model offers high availability and low latency along with the highest throughput of all. This model suits the application that does not require any ordering guarantee. The best usage of this type of model would be the count of retweets, likes, non-threaded comments where the count is more important than any other information.

Answer 5

Azure Self-hosted Azure-SSIS

Answer 6

You use a self-hosted integration runtime when you: - Copy data between cloud and on-premises stores - Copy data between on-premises stores - Execute activities using on-premises stores and services

Answer 7

Execute SSIS Packages through Azure Data Factory

Answer 8

You use an Azure integration runtime when you: - Copy data between cloud stores - Transform data between cloud stores using data flows - Execute activities using cloud stores and services

Answer 9

``` Tumbling window Hopping window Sliding window Session window Snapshot window ```

Answer 10

Tumbling window functions are used to segment a data stream into distinct time segments and perform a function against them, such as the example below. The key differentiators of a Tumbling window are that they repeat, do not overlap, and an event cannot belong to more than one tumbling window.

Answer 11

Hopping window functions hop forward in time by a fixed period. It may be easy to think of them as Tumbling windows that can overlap and be emitted more often than the window size. Events can belong to more than one Hopping window result set. To make a Hopping window the same as a Tumbling window, specify the hop size to be the same as the window size.

Answer 12

Sliding windows, unlike Tumbling or Hopping windows, output events only for points in time when the content of the window actually changes. In other words, when an event enters or exits the window. So, every window has at least one event. Similar to Hopping windows, events can belong to more than one sliding window.

Answer 13

Session window functions group events that arrive at similar times, filtering out periods of time where there is no data. It has three main parameters: timeout, maximum duration, and partitioning key (optional). A session window begins when the first event occurs. If another event occurs within the specified timeout from the last ingested event, then the window extends to include the new event. Otherwise if no events occur within the timeout, then the window is closed at the timeout. If events keep occurring within the specified timeout, the session window will keep extending until maximum duration is reached. The maximum duration checking intervals are set to be the same size as the specified max duration. For example, if the max duration is 10, then the checks on if the window exceed maximum duration will happen at t = 0, 10, 20, 30, etc. When a partition key is provided, the events are grouped together by the key and session window is applied to each group independently. This partitioning is useful for cases where you need different session windows for different users or devices.

Answer 14

Snapshot windows groups events that have the same timestamp. Unlike other windowing types, which require a specific window function (such as SessionWindow(), you can apply a snapshot window by adding System.Timestamp() to the GROUP BY clause.

Answer 15

Conditional access policies - this can also help with: - -Blocking sign ins - -Blocking or granting access from specific locations - -Blocking risky sign-in behaviours - - Requiring organisation managed devices for specific applications

Answer 16

Transparent data encryption

Answer 17

Default value, which displays the default value for that data type instead.

Answer 18

Credit card value, which only shows the last four digits of the number, converting all other numbers to lower case x’s.

Answer 19

which hides the domain name and all but the first character of the email account name.

Answer 20

which specifies a random number between a range of values. For example, on the credit card expiry month and year, you could select random months from 1 to 12 and set the year range from 2018 to 3000.

Answer 21

which allows you to set the number of characters exposed from the start of the data, the number of characters exposed from the end of the data, and the characters to repeat for the remainder of the data.

Answer 22

No, but this can be enabled by adding specific SQL users to an exclusion list.

Answer 23

Azure SQL Database auditing

Answer 24

Append blobs in a designated Azure Blob storage account

Answer 25

Provides capabilities built into Azure SQL Database for discovering, classifying, labeling & protecting the sensitive data in your databases. It can be used to provide visibility into your database classification state, and to track the access to sensitive data within the database and beyond its borders.

Answer 26

Is an easy to configure service that can discover, track, and help you remediate potential database vulnerabilities. It provides visibility into your security state, and includes actionable steps to resolve security issues, and enhance your database fortifications.

Answer 27

Detects anomalous activities indicating unusual and potentially harmful attempts to access or exploit your database. It continuously monitors your database for suspicious activities, and provides immediate security alerts on potential vulnerabilities, SQL injection attacks, and anomalous database access patterns. Advanced Threat Protection alerts provide details of the suspicious activity and recommend action on how to investigate and mitigate the threat.

Answer 28

SQL injection reports where SQL injection attacks have occurred. SQL injection vulnerability reports where the possibility of a SQL injection is likely. Anomalous client login looks at logins that are irregular and could be cause for concern, such as a potential attacker gaining access.

Answer 29

Sends the threats to the service administrators.

Answer 30

A server-level virtual network rule will allow you to allow connectivity from specific Azure VNet subnets, and will block access from the internet. This is the most efficient manner to secure this configuration.

Answer 31

laura@contoso.com When database administrator accounts access data that have a mask applied, the mask is removed, and the original data is visible.

Answer 32

Database files, log files, and backup files Transparent Data Encryption encrypts all database, log, and backup files. When new Azure SQL databases are created, Transparent Data Encryption will be enabled by default.

Answer 33

Yes | Azure SQL Database enforces encryption (SSL/TLS) at all times for all connections

Answer 34

always generates the same encrypted value for any given plain text value. Using deterministic encryption allows point lookups, equality joins, grouping and indexing on encrypted columns. However, it may also allow unauthorized users to guess information about encrypted values by examining patterns in the encrypted column, especially if there's a small set of possible encrypted values, such as True/False, or

Answer 35

uses a method that encrypts data in a less predictable manner. Randomized encryption is more secure, but prevents searching, grouping, indexing, and joining on encrypted columns.

Answer 36

Managed Identity Authentication

Answer 37

10GB storage

Answer 38

- Wide range of values and access patterns that are evenly spread across logical partitions. - That spreads the workload evenly and over time. Good candidates are properties that appear frequently as a filter.

Answer 39

Hash distributed

Answer 40

Replicated for smaller tables, if tables are too large to store on each compute node, use hash.

Answer 41

Round robin.

Answer 42

SQL Data warehouse as this is for the consumption of this particular resource.

Answer 43

High concurrency - They can be shared by multiple users but only support python, SQL and R.

Answer 44

One and only supports Python, SQL and Scala.

Answer 45

'%', followed by the language. You will not need to prefix the cell with anything if you are using your primary language.

Answer 46

- Create a database scoped credential | - Create an external data source using 'abfs' as the file location

Answer 47

Storage Account: Save your diagnostic logs to a storage account for auditing or manual inspection. You can use the diagnostic settings to specify the retention time in days. Event Hub: Stream the logs to Azure Event Hubs. The logs become input to a partner service/custom analytics solution like Power BI. Log Analytics: Analyze the logs with Log Analytics. The Data Factory integration with Azure Monitor is useful in the following scenarios:

Answer 48

(Lookup table) is a finite data set that is static or slowly changing in nature, used to perform a lookup.

Answer 49

Azure blob storage and Azure SQL Datawarehouse

Answer 50

Export to BACPAC using SSMS, save to storage account. Export to a BACPAC file using Powershell and save locally Export to BACPAC using SqlPackage utility.

Answer 51

Create a Virtual Private Network Connection from the on-prem network to Azure. Create new ADF resource Create self-hosted integration runtime

Answer 52

Watermark delay

Answer 53

Use Azure SQL Database managed instance

Answer 54

Azure SQL database single database

Answer 55

Used to perform block creation, deletion and replication upon instruction from the NameNode

Answer 56

Executes file system namespace operations like opening, closing and renaming files and directories. Also determines mapping of blocks to datanodes

Answer 57

If 'Delete data' is not specified then they are retained 'indefinitely'

DP-200 - Monitor and optimise data solution Flashcards

(84 cards)