Roles and services Flashcards

1
Q

Database Administrator

A

responsible for the design, implementation, maintenance, and operational aspects of on-premises and cloud-based database systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Engineer

A

A data engineer collaborates with stakeholders to design and implement data-related workloads, including data ingestion pipelines, cleansing and transformation activities, and data stores for analytical workloads.

They use a wide range of data platform technologies, including relational and non-relational databases, file stores, and data streams.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data Analyst

A

A data analyst enables businesses to maximize the value of their data assets.

They’re responsible for exploring data to identify trends and relationships, designing and building analytical models, and enabling advanced analytics capabilities through reports and visualizations.

A data analyst processes raw data into relevant insights based on identified business requirements to deliver relevant insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Azure SQL Database

A

a fully managed platform-as-a-service (PaaS) database hosted in Azure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Azure SQL Managed Instance

A

–a hosted instance of SQL Server with automated maintenance, which allows more flexible configuration than Azure SQL DB but with more administrative responsibility for the owner.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Azure SQL VM

A

a virtual machine with an installation of SQL Server, allowing maximum configurability with full management responsibility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Azure Database for MySQL

A

a simple-to-use open-source database management system that is commonly used in Linux, Apache, MySQL, and PHP (LAMP) stack apps.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Azure Database for MariaDB

A

a newer database management system, created by the original developers of MySQL. The database engine has since been rewritten and optimized to improve performance. MariaDB offers compatibility with Oracle Database (another popular commercial database management system).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Azure Database for PostgreSQL

A

a hybrid relational-object database. You can store data in relational tables, but a PostgreSQL database also enables you to store custom data types, with their own non-relational properties.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Azure Cosmos DB

A

is a global-scale non-relational (NoSQL) database system that supports multiple application programming interfaces (APIs), enabling you to store and manage data as JSON documents, key-value pairs, column-families, and graphs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Azure Storage

A

Blob containers - scalable, cost-effective storage for binary files.
File shares - network file shares such as you typically find in corporate networks.
Tables - key-value storage for applications that need to read and write data values quickly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Azure Data Factory

A

Azure service that enables you to define and schedule data pipelines (build extract, transform, and load (ETL) solutions that populate analytical data stores)

You can integrate your pipelines with other Azure services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Azure Synapse Analytics

A

Azure Synapse Analytics is a comprehensive, unified data analytics solution that provides a single service interface for multiple analytical capabilities, including:

Pipelines - based on the same technology as Azure Data Factory.

SQL - a highly scalable SQL database engine, optimized for data warehouse workloads.

Apache Spark - an open-source distributed data processing system that supports multiple programming languages and APIs, including Java, Scala, Python, and SQL.

Azure Synapse Data Explorer - a high-performance data analytics solution that is optimized for real-time querying of log and telemetry data using Kusto Query Language (KQL).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Azure Databricks

A

is an Azure-integrated version of the popular Databricks platform, which combines the Apache Spark data processing platform with SQL database semantics and an integrated management interface to enable large-scale data analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Azure HDInsight

A

is an Azure service that provides Azure-hosted clusters for popular Apache open-source big data processing technologies, including:

Apache Spark
Apache Hadoop
Apache HBase
Apache Kafka
Apache Storm
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Azure Stream Analytics

A

is a real-time stream processing engine that captures a stream of data from an input, applies a query to extract and manipulate data from the input stream, and writes the results to an output for analysis or further processing.

17
Q

Azure Data Explorer

A

Data analysts can use Azure Data Explorer to query and analyze data that includes a timestamp attribute, such as is typically found in log files and Internet-of-things (IoT) telemetry data.

18
Q

Microsoft Purview

A

provides a solution for enterprise-wide data governance and discoverability. You can use Microsoft Purview to create a map of your data and track data lineage across multiple data sources and systems, enabling you to find trustworthy data for analysis and reporting.

19
Q

Microsoft Power BI

A

is a platform for analytical data modeling and reporting that data analysts can use to create and share interactive data visualizations. Power BI reports can be created by using the Power BI Desktop application, and then published and delivered through web-based reports and apps in the Power BI service, as well as in the Power BI mobile app.