DP-900 Core Data Concepts Flashcards
DP-900 Azure Data Fundamentals
What are the three ways you can categorize data?
Structured
Semi-structured
Unstructured
What is tabular data?
data that is stored as rows and columns, in one or more tables.
A row represents an entity and a column represents an attribute of that entity.
What makes data ‘structured’?
it is tabular and adheres to a fixed schema.
What makes data ‘semi-structured’?
it contains entities that have some regularly occurring attributes, but with variation: sometimes an attribute is missing, or there are multiple values for a given attribute, etc.
What is an example of a format that is useful for ‘semi-structured’ data?
JSON, because it allows you to define fields for an entity but does not need to adhere to a predefined schema.
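As an illustration (a sketch, not part of the original card): two JSON documents for the same entity type can differ in their fields, which is exactly the variation that makes the data semi-structured.

```python
import json

# Two "customer" documents: same entity type, but the second has an
# extra field and the first has multiple values for "phone".
customers = [
    {"id": 1, "name": "Ana", "phone": ["555-0100", "555-0101"]},
    {"id": 2, "name": "Ben", "email": "ben@example.com"},
]

# Each document is valid JSON even though no fixed schema is shared.
for doc in customers:
    print(json.dumps(doc))
```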
What are some examples of ‘unstructured’ data?
audio, video, and images
What are the two broad categories of data stores?
File stores and Databases
What are some common ways to store files?
BLOB, CSV, XML, JSON, and optimized file formats like: Avro, ORC, and Parquet
What is XML?
a human-readable, semi-structured format that stores data in tags.
what is replacing XML?
JSON
What is the best format for storing large objects like videos, audio, and images?
BLOB
Binary Large Object
How is file storage different from a database?
A database deals with records (structured data entities) rather than whole files.
What is NoSQL?
databases that are not relational
What are the 4 common types of non-relational databases?
Key-value
Document
Column Family
Graph
In a key-value database, what format does the value have to be in?
In this type of database, it doesn’t matter what format the value is in. It can be numerical, text, etc.
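A minimal in-memory sketch of the key-value idea (a plain Python dict, not an actual key-value database): the store keeps opaque values of any type and never inspects them.

```python
# The store maps string keys to values of any type.
store = {}
store["user:1"] = "Ana"              # text value
store["count:logins"] = 42           # numeric value
store["img:logo"] = b"\x89PNG..."    # binary value

# Lookups are by key only; the store knows nothing about value structure.
print(store["count:logins"])
```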
In a document database, what format does the value have to be in?
JSON
What are the two types of data processing?
Transactional and Analytical
What is OLTP?
Online Transaction Processing
What does OLTP track?
Transactions, which are often CRUD operations
What does a transaction ensure?
ACIDity
What does ACID stand for?
Atomicity
Consistency
Isolation
Durability
What is atomicity?
All sub-components of a transaction must succeed in order for the transaction to take place. It is binary: either all of it completes or none of it does.
How do you know a transaction is consistent?
When a transaction takes the database from one valid state to another valid state. If you were to transfer funds from one account to another, the total amount of funds remains the same, because it is subtracted from one account and added to the other.
How do you know your transactions are ‘isolated’?
When the transaction does not interfere with another transaction. If I run a transaction to transfer funds from one account to another, and I also run a transaction to total the funds across all accounts, that second transaction should not read one account's balance from before the transfer and the other account's balance from after it.
What proves that a transaction was durable?
Once the transaction is committed, it persists. If the database is turned off and on, the change remains.
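The atomicity and consistency cards above can be demonstrated with SQLite (a local sketch, not an Azure product): a failed transfer rolls back entirely, so the total across accounts never changes.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER)")
conn.executemany("INSERT INTO accounts VALUES (?, ?)",
                 [("alice", 100), ("bob", 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move funds atomically: both updates succeed or neither does."""
    try:
        with conn:  # opens a transaction; rolls back on exception
            conn.execute("UPDATE accounts SET balance = balance - ? WHERE name = ?",
                         (amount, src))
            conn.execute("UPDATE accounts SET balance = balance + ? WHERE name = ?",
                         (amount, dst))
            (bal,) = conn.execute(
                "SELECT balance FROM accounts WHERE name = ?", (src,)).fetchone()
            if bal < 0:
                raise ValueError("insufficient funds")
    except ValueError:
        pass  # the rollback already restored both rows

transfer(conn, "alice", "bob", 30)    # succeeds: alice 70, bob 80
transfer(conn, "alice", "bob", 500)   # fails and rolls back entirely

total = conn.execute("SELECT SUM(balance) FROM accounts").fetchone()[0]
print(total)  # total is preserved: 150
```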
What is a data lake?
It is used to process large volumes of file based data
What is an OLAP model?
Online Analytical Processing model
ETL takes the data from where to where?
From operational data to the data lake, warehouse, or lakehouse
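A toy ETL sketch in plain Python (lists standing in for the operational store and the warehouse — not an Azure service): extract rows, transform them for analytics, then load them.

```python
# "Operational" source: amounts stored as strings, as an OLTP app might
operational = [{"id": 1, "amount": "19.99"}, {"id": 2, "amount": "5.00"}]

def extract(source):
    """Pull rows out of the operational store."""
    return list(source)

def transform(rows):
    """Cast string amounts to floats so they can be aggregated."""
    return [{"id": r["id"], "amount": float(r["amount"])} for r in rows]

def load(rows, warehouse):
    """Write the transformed rows into the analytical store."""
    warehouse.extend(rows)

warehouse = []
load(transform(extract(operational)), warehouse)
print(warehouse)
```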
What is a data warehouse?
A database optimized for analytics queries (read operations)
What does CRUD stand for?
Create
Retrieve
Update
Delete
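The four CRUD operations map directly onto SQL statements; a minimal sketch using SQLite (any relational database would look similar):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT)")

# Create
conn.execute("INSERT INTO products (id, name) VALUES (1, 'widget')")
# Retrieve
row = conn.execute("SELECT name FROM products WHERE id = 1").fetchone()
# Update
conn.execute("UPDATE products SET name = 'gadget' WHERE id = 1")
# Delete
conn.execute("DELETE FROM products WHERE id = 1")

print(row[0])  # 'widget' — the value retrieved before the update
```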
What is a lakehouse?
combines the flexible, scalable storage of a data lake with the relational querying semantics of a data warehouse.
What kind of denormalization takes place when OLTP data is transferred to a lakehouse?
Relational data is denormalized, so duplicate values appear across rows.
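A sketch of that denormalization in plain Python: joining normalized customers and orders into one flat analytical table repeats each customer's name on every one of their order rows.

```python
# Normalized OLTP tables: customer names stored once, referenced by key
customers = {1: "Ana", 2: "Ben"}
orders = [(101, 1, "pen"), (102, 1, "pad"), (103, 2, "ink")]

# Denormalized analytical table: the customer name is duplicated
# across every order row instead of being looked up by key.
flat = [(order_id, customers[cust_id], item)
        for order_id, cust_id, item in orders]

for row in flat:
    print(row)  # "Ana" appears twice, once per order
```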
What are the 3 main roles in Data?
Database Administrator
Data Engineer
Data Analyst
What does a database administrator do?
They are responsible for the design, implementation, and maintenance of databases. They do things like update the databases and manage permissions, and they are responsible for the performance and reliability of the databases.
What does a data engineer do?
They are responsible for building data workloads for databases and file stores that take transactional data and make them available for analytics. They own the management and monitoring of data pipelines to ensure that data loads perform as expected.
What does a data analyst do?
They investigate and transform data into reports and visualizations that provide insights and answer valuable business questions
What is Azure SQL?
A group of relational database solutions built on the SQL Server engine
What is Azure?
Azure is a collection of cloud based IT solutions
What is Azure SQL Database?
A fully managed platform-as-a-service (PaaS) product; of the Azure SQL options, it provides the least configuration flexibility
What is Azure SQL Managed Instance?
A hosted instance of SQL Server which provides automated maintenance, allowing more configuration flexibility than SQL DB
What is Azure SQL VM?
A virtual machine with SQL Server installed, allowing the maximum amount of configuration, but also placing the most responsibility on the DBA
What Azure products are offered for open-source relational databases?
Azure Database for MySQL
Azure Database for MariaDB
Azure Database for PostgreSQL
What is Azure Cosmos DB?
A global-scale non-relational (NoSQL) database which supports storing documents as JSON, key-value pairs, column-family tables, and graphs.
Sometimes DBAs have to manage this, but usually the software engineers do.
Often Data Engineers will need to extract data from here for a data lakehouse
What is Azure Storage?
A cloud service that allows you to store data in BLOB containers, file shares, and tables
What would a data engineer do with Azure Storage?
They would likely use it as a data lake
What is Azure Data Factory?
A service to define and schedule data pipelines to transfer and transform data. It can be integrated with other Azure products
What would a data engineer do with Azure Data Factory?
They would use it to build ETL pipelines that take operational data and populate data warehouses for analytics solutions
What is Azure Synapse Analytics?
Comprehensive PaaS for analytics
What does Synapse Analytics include?
Pipelines
SQL
Apache Spark
Synapse Analytics Data Explorer
What are Synapse Analytics pipelines?
Same technology as Azure Data Factory
What is Synapse SQL?
a highly scalable SQL database engine, optimized for data warehouse workloads (read queries)
What is Apache Spark?
An open-source distributed data processing system that provides APIs in Python, SQL, Java, and Scala
What is Synapse Analytics Data Explorer?
Uses the Kusto Query Language to provide extremely fast analytics processing optimized for real-time telemetry and log data
what can data engineers use Azure Synapse Analytics for?
They will use it to build comprehensive data analytics solutions for ingest pipelines, lake storage, and warehouse storage
what can data analysts use Azure Synapse Analytics for?
They can use SQL and Spark through interactive notebooks and integrate with Azure Machine Learning and Power BI to create models
What is Azure Databricks?
An Azure integrated version of a popular platform which combines Apache Spark and SQL database semantics for large scale analytics
what can data analysts use Azure Databricks for?
They can use the native notebook support to provide browser friendly data analysis
what can data engineers use Azure Databricks for?
They’ll use it to create analytical data stores
What is Azure HDInsight?
This provides Azure-hosted clusters for Apache technologies
What is Apache Hadoop?
An open-source framework for processing large volumes of data with MapReduce jobs written in Java or queries defined in Apache Hive
What is Apache HBase?
An open-source NoSQL database for storing and querying data at large scale
What is Apache Kafka?
a message broker for data stream processing
Data engineers can use Azure HDInsight for what?
They can use this to support big data processing jobs that use multiple Apache technologies
What is Azure Stream Analytics?
Captures a stream of data, applies queries/transformations to it, writes the results for analytics or further processing
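As a conceptual sketch only (plain Python with a made-up sensor stream — not the Stream Analytics service or its query language): capture a stream of events, apply a transformation, and write out the results.

```python
# Hypothetical sensor stream: (sensor_id, temperature) events
events = [("s1", 20.5), ("s2", 31.0), ("s1", 22.0), ("s2", 29.5)]

def over_threshold(stream, limit):
    """Transformation step: keep only readings above the limit."""
    for sensor, temp in stream:
        if temp > limit:
            yield {"sensor": sensor, "temp": temp}

# "Write the results" — here, simply collect them for further processing
results = list(over_threshold(events, 25.0))
print(results)  # only the two s2 readings exceed 25.0
```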
What can data engineers do with Azure Stream Analytics?
They can use this to write ETL pipelines for analytical data stores
What is Azure Data Explorer?
A standalone version of the Synapse Data Explorer engine, for fast querying of log and telemetry data
Data analysts can use Azure Data Explorer for what?
They can easily analyze timestamped log data
What is Microsoft Purview?
Enterprise solution for governance and discoverability, helping people find the data they need
What is Microsoft Fabric?
SaaS lakehouse platform that includes:
ETL
lakehouse analytics
warehouse analytics
data science and machine learning
real-time analytics
data visualization
data governance and management
Data engineers can use Microsoft Purview for what?
They can use it to enforce data governance and ensure the integrity of data