ExamTopics Flashcards
what occurred in the past
No, Yes, Yes
c: data that is fully processed before being loaded to the target data store
c: latency is expected
a: cognitive
Diagnostic, Predictive, Cognitive
Descriptive Analytics tells you what happened in the past.
Diagnostic Analytics helps you understand why something happened in the past.
Predictive Analytics predicts what is most likely to happen in the future.
Prescriptive Analytics recommends actions you can take to affect those outcomes.
Customer is a root object
Address is a nested object
Social media is a nested array
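A minimal sketch of the three terms above, assuming a hypothetical customer document (every field name and value here is invented for illustration, not taken from the exam exhibit):

```python
import json

# Hypothetical customer document; field names and values are invented.
doc = """
{
  "customer": {
    "name": "Example Customer",
    "address": {"city": "Seattle", "country": "USA"},
    "socialMedia": [
      {"service": "twitter", "handle": "@example"},
      {"service": "facebook", "handle": "example.page"}
    ]
  }
}
"""

data = json.loads(doc)
customer = data["customer"]              # root object
address = customer["address"]            # nested object: an object inside an object
social_media = customer["socialMedia"]   # nested array: an array inside an object

print(type(address).__name__)        # dict
print(type(social_media).__name__)   # list
```

In Python, a JSON object parses to a dict and a JSON array parses to a list, which is one way to tell the two nested shapes apart.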
star schema
dimension table
a: distributes processing across compute nodes
a: A clustered index
c: transactional writes
Yes
Yes
Yes
Extract: The CRM system
Load: DWH
Transform: DWH
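The answer above loads before it transforms, which reads like an ELT pattern. A rough sketch, assuming an in-memory SQLite database stands in for the DWH and a Python list stands in for the CRM export (table names, column names, and rows are all invented):

```python
import sqlite3

# Extract: rows as they might come out of a CRM export (invented data).
crm_rows = [("  ana ", "2024-01-05"), ("BEN", "2024-02-11")]

# SQLite is only a stand-in for the data warehouse in this sketch.
dwh = sqlite3.connect(":memory:")

# Load: copy the raw rows into a staging table without changing them.
dwh.execute("CREATE TABLE staging_customers (raw_name TEXT, signup_date TEXT)")
dwh.executemany("INSERT INTO staging_customers VALUES (?, ?)", crm_rows)

# Transform: clean the data inside the warehouse, using its own SQL engine.
dwh.execute(
    """
    CREATE TABLE dim_customer AS
    SELECT UPPER(TRIM(raw_name)) AS name, signup_date
    FROM staging_customers
    """
)

cleaned = dwh.execute(
    "SELECT name, signup_date FROM dim_customer ORDER BY name"
).fetchall()
print(cleaned)  # [('ANA', '2024-01-05'), ('BEN', '2024-02-11')]
```

The point of the sketch is only the order of the steps: the raw data lands in the target first, and the cleanup runs there afterwards.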
descriptive
a: Treemap
b: Key influencer
c: Scatter
You need to create an Azure Storage account.
Data in the account must replicate outside the Azure region automatically.
Which two types of replication can you use for the storage account? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.
A. zone-redundant storage (ZRS)
B. read-access geo-redundant storage (RA-GRS)
C. locally-redundant storage (LRS)
D. geo-redundant storage (GRS)
B. read-access geo-redundant storage (RA-GRS)
D. geo-redundant storage (GRS)
Yes
No
No
Which statement is an example of Data Manipulation Language (DML)?
A. REVOKE
B. DISABLE
C. INSERT
D. GRANT
c: INSERT
SELECT, INSERT, UPDATE, and DELETE are DML statements.
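A short sketch of the four DML statements, assuming SQLite via Python's `sqlite3` module as a stand-in database (the `customers` table and its rows are invented). GRANT and REVOKE are Data Control Language (DCL), so they do not appear here:

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL, not DML: this defines the table structure.
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")

# DML: these statements work with the rows stored in the table.
conn.execute("INSERT INTO customers (id, name) VALUES (1, 'Ana')")
conn.execute("INSERT INTO customers (id, name) VALUES (2, 'Ben')")
conn.execute("UPDATE customers SET name = 'Anna' WHERE id = 1")
conn.execute("DELETE FROM customers WHERE id = 2")

rows = conn.execute("SELECT id, name FROM customers").fetchall()
print(rows)  # [(1, 'Anna')]
```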
You have a SQL query that combines customer data and order data. The query includes calculated columns.
You need to persist the SQL query so that other users can use the query.
What should you create?
A. an index
B. a view
C. a scalar function
D. a table
b: a view
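A minimal sketch of why a view fits, assuming SQLite via Python's `sqlite3` module and invented `customers` and `orders` tables. The view stores the query, including its calculated column, so other users can select from it by name:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        customer_id INTEGER,
        quantity INTEGER,
        unit_price REAL
    );
    INSERT INTO customers VALUES (1, 'Ana');
    INSERT INTO orders VALUES (10, 1, 3, 2.5);

    -- The view saves the query definition, not a copy of the data.
    CREATE VIEW customer_orders AS
    SELECT c.name,
           o.id AS order_id,
           o.quantity * o.unit_price AS order_total  -- calculated column
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.id;
    """
)

# Other users query the view like a table.
view_rows = conn.execute(
    "SELECT name, order_id, order_total FROM customer_orders"
).fetchall()
print(view_rows)  # [('Ana', 10, 7.5)]
```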
simple lookups
Image files = Azure Blob storage
Relationship between employees = Azure Cosmos DB Gremlin API
Key/value pairs = Azure Table Storage
Resource Group -> Storage Account -> Container -> Folders -> Files
What are two characteristics of real-time data processing? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.
A. Data is processed periodically
B. Low latency is expected
C. High latency is acceptable
D. Data is processed as it is created
B. Low latency is expected
D. Data is processed as it is created
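A toy sketch of the contrast, assuming a short list of invented sensor readings in place of a real event stream. The stream-style loop produces a result after every event, while the batch-style step waits until all events are collected:

```python
events = [3, 5, 2, 8]  # invented sensor readings

# Stream style: update the result as each event is created.
running_totals = []
total = 0
for value in events:
    total += value
    running_totals.append(total)  # a result is available right after each event

# Batch style: wait until all events are collected, then process once.
batch_total = sum(events)

print(running_totals)  # [3, 8, 10, 18]
print(batch_total)     # 18
```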
Dataset
Linked service
Pipeline