Design & Implement Data Storage Flashcards
What is a Temporal table?
In Azure SQL DB, it allows you to track and analyze the full history of changes to the data, without custom coding.
What is PolyBase?
A SQL Server feature that allows you to join data with external data using T-SQL.
What is Azure Monitor?
A centralized monitoring service for all Azure resources.
What is Cosmos DB?
Fully managed NoSQL DB for app development. Easy to be globally distributed.
What is Azure Stream Analytics?
As opposed to batch processing, for real-time analytics; fast-moving streams of data for reports and triggering alerts. Can lookup against Reference Data.
What are the 5 types of data collected in Azure Monitor?
Application Guest OS Azure Resource Azure Subscription Azure Tenant
What are the 5 modes for Cosmos DB?
Core (SQL) API Cassandra API Gremlin API Table API MongoDB API
What is Azure Synapse Analytics?
A single pane of glass for EDW & Big Data analytics.
What is Azure HDInsight?
Helps to ingest, process and analyze big data. Support batch, data warehousing, IoT and data science.
Hadoop: Includes Hive, HBase, Spark, Kafka.
HBase
Storm
Kafka
What is Apache Spark & Spark Pool?
Spark: Parallel processing framework that support in-memory processing for fast big data analytic apps.
Spark Instances: Created when you connect to a Pool.
Spark Pool: A set of metadata that defines compute resource req’s and associated behavior characteristics when a Spark instance is instantiated.
What is Azure Databricks?
Data & AI service for data engineering.
Databricks SQL
Databricks Data Science & Engineering
Databricks Machine Learning
What is a Lakehouse?
The merging of data warehouse & data lake architectures.
What should you use to provision throughput for a Cosmos DB Container?
A logical key partition.
What are the 5 consistency levels in Cosmos DB from strongest consistency to weakest?
Strong Bounded Staleness Session Consistent Prefix Eventual
What are the 5 consistency levels in Cosmos DB, FROM highest availability, lowest latency and highest throughput
TO lowest availability, highest latency and lowest throughput?
Eventual, Consistent Prefix, Session, Bounded Staleness Strong