Core Data Related Azure Services Flashcards

1
Q

what is Azure Storage Account

A

Is Secure and scalable cloud-based storage solution provided by Microsoft.

It acts as a central container to manage different types of data, such as documents, images, videos, and backups. It enables easy access to data from anywhere due to unique namespace for storage account and accessible by HTTP/S, sharing with controlled permissions, and integration with other Azure services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is Azure Blob storage

A

Azure Blob Storage is cloud-based Storage service offered by Microsoft Azure that allows you to store large amount of unstructured data like images, videos, document, and text files, in a scalable and cost-effective manner.

“Blob” Stands for “Binary Large Objects”, refers to any type of data that doesn’t fit neatly into a structured database format.

Azure Blob Storage offers Hot, Cool, Cold and Archive tiers and as given sequence storage cost is decreased, and increased for access cost.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Azure Table ?

A

Azure Table Storage is a cloud-based service for storing large amounts of structured NoSQL data i.e. structured non-relational data

it is Schema-less

It Stores data in key/Value store format, making it highly flexible and adaptable to the changing needs of applications

working: Data is stored in tables as entities (like rows), where each entity has properties (like columns). You can quickly query the data using the key-value pairs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is azure file

A

Azure File Storage is a feature that allows you to create file shares in the cloud by handling the infrastructure, maintenance, and updates & can be accessed via industry-standard protocols like SMB (Server Message Block, accessible from windows, Linux, Mac OS client) and NFS (Network File System, linux client).

SMB Azure file shares can be cached on Windows servers with Azure File Sync for fast access near where the data is being used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is azure file sync

A

Azure File Sync is a service that allows you to extend Azure File Shares to on-premises environments. It creates a bridge between your on-premises Windows Server and Azure File Share, enabling hybrid cloud scenarios.

Azure File Sync works with Azure File Share as the backend storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is azure storage explorer?

A

Azure Storage Explorer is a standalone, user-friendly, desktop app that allows you to easily manage and explore your Azure Storage resources from your desktop visually.

You can upload, download, and manage Azure Storage blobs, files, queues, tables, and Data Lake Storage entities and move between accounts.

It uses Microsoft Entra ID (formerly Azure Active Directory) for secure access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is Azure Synapse Analytics?

A

Azure Synapse Analytics is a cloud-based data ware-housing & data analytics platform allowing you to store, manage analyze large volumes of structured and unstructured data from various sources (Data warehouse, Data lake, or external database) in a single, unified environment for Querying Data, using either SQL queries for structured data or Apache Spark for more complex big data manipulations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Azure Synapse Studio?

A

Is Web-Based user interface designed for interaction with Azure Synapse Analytics, where you can manage your data pipelines, Write and execute queries, visualize data , Monitor your resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is Cosmo DB

A

Azure cosmos DB is multi-model DB service (Doc, Key-value, graph, and etc.) and it deals with NoSQL DB and can Host various NoSQL engine (NoSQL, MongoDB, Cassandra, and Gremlin).
Azure cosmos DB is famously known for its global Distribution as it replicates data in multiple region, hence providing high availability, performance and scalability and low latency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Azure data lake store (Gen2)

A

Azure Data Lake Storage Gen2 is a scalable and secure cloud-based solution designed for big data analytics.

It combines the capabilities of Azure Blob Storage with hierarchical file system features, making it ideal for storing and analyzing massive amounts of structured and unstructured data.

or can be said as these….

A centralized data repository for big data Blob storage designed for vast amount of data. Essentially it is the place where the data resides.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Azure data analytics

A

In general, data analytics refers to the process of examining and analyzing raw data to extract meaningful insights, patterns, and trends from large volume of data (struc. & unstruc.) . This process often involves various tools and techniques, like statistical analysis, machine learning, and visualization, to make informed decisions or solve problems.

In the context of Azure, data analytics is about using Azure’s cloud services (like Synapse, Data Lake, ML models) to perform these processes efficiently at scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is Azure SQL Server on Azure VM?

A

Azure SQL for VM refers to the option to run a full SQL Server instance on an Azure VM, allowing you to install and manage your own SQL Server software within a cloud-based VM, giving you complete control over the OS and database engine compared to other Azure SQL offerings.
Is used for lift and shift migration strategy to mirror your on-premises environment to azure.
Or to have more control on IaaS.
SQL Server on Azure Virtual Machines performs up to 22% faster and is up to 23% less expensive than AWS EC2.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Azure SQL managed instance ?

A

Azure SQL Managed Instance is a scalable cloud database service that’s always running on the latest stable version of the Microsoft SQL Server database engine and attached OS with 99.99% built-in high availability, offering close to 100% feature compatibility (Similarity) with on-premises SQL Server Instance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is Azure SQL database? (PaaS)

A

Azure SQL Database is fully managed Relational Database as a Service (DBaaS), which is running on the latest stable version of the Microsoft SQL Server Database Engine. Azure SQL Database allows you to store and manage large amount of data and also offering features like schema management, rich query capabilities, and support for structured data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Azure Database for <open></open>

A

managed relational database on azure
e.g: MySQL, Postgre SQL, maria DB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Azure Cache for REDIS

A

Azure Cache for Redis is a fully managed, in-memory data (Redis stores data in the computer’s memory (RAM), which is much faster than accessing data from a traditional hard disk or database.) store service provided by Microsoft Azure. Essentially, it’s a way to significantly speed up your applications by storing frequently accessed data in a fast, readily available cache (meaning it stores copies of frequently used data).

17
Q

Azure databrick

A

Azure Databricks is a cloud-based data analytics platform that makes it easy to process and analyze massive amounts of data using Apache Spark. Think of it as a powerful, collaborative workspace for data scientists, engineers, and analysts to work with big data.
Apache spark provides very fast ELT jobs, as well as ML and streaming.

18
Q

Microsoft Power BI

A

A Business intelligence tool used to create dashboard and interactive reports to empower business decision

19
Q

what is azure HDInsight ?

A

Azure HDInsight is a cloud-based service for businesses that allows you to use popular open-source big data tools. It supports popular open-source frameworks like Apache Spark, Apache Hive, LLAP, Apache Kafka, Hadoop, and more, allowing you to use them within your Azure environment.

20
Q

what is Azure Data Studio

A

An IDE that look very much like VS CODE, but designed around data related tasks. Cross platform similar to SSIS but broader

21
Q

what is Azure Data Factory

A

Azure Data Factory (ADF) allows you to easily move and transform large volumes of data from various sources and also allows you to create data-driven workflows for orchestrating data movement and transformation at scale.
helps you build pipelines to extract data from various sources, transform it as needed, and load it into target systems for analysis. 2
ELT and ETL

22
Q

what is ELT

A

This traditional method first extracts data from various sources, then transforms it into a usable format, and finally loads it into a target system like a data warehouse. Essentially, the “T” happens before the “L”.

23
Q

what is ETL?

A

This newer approach extracts data and immediately loads it into the target system, such as a data lake or cloud data warehouse. Transformations occur within that system afterward. So, the “L” happens before the “T”.

24
Q

what is SQL Server Integration Service ?

A

SQL Server Integration Services (SSIS) is a platform for building enterprise-level data integration and data transformation solutions. It’s a component of the Microsoft SQL Server database software that allows users to extract, transform, and load (ETL) data from various sources into a single destination.
Essentially, SSIS automates the process of moving and transforming data, making it easier to manage and utilize information from diverse sources.

25
Q

what is Azure Synapse Studio

A

Is Web-Based user interface for Azure Synapse Analytics, where you can manage your data pipelines, Write and execute queries, visualize data , Monitor your resources.

26
Q

what is data

A

unit of information

27
Q

what is database ?

A

A collection of organized data stored in a system (SQL, NoSQL). Example: SQL Server, MySQL.

28
Q

What is Dataset?

A

A collection of data, often used for AI & ML (Machine Learning). Example: Excel dataset, CSV file.

29
Q

what is data warehouse?

A

A large centralized database storing data from multiple sources for analysis. Example: Azure Synapse Analytics.

30
Q

What is data lake?

A

A storage repository that holds raw data (structured, semi-structured, unstructured) for future analysis. Example: Azure Data Lake Storage.

31
Q

What is Big Data?

A

Extremely large datasets processed using Hadoop, Spark (used in Cloud computing).

32
Q

what is Data Source

A

is place where the data originates from. it can be Data lake, warehouse, data store, database, from API

33
Q

what is Datastore?

A

A Datastore can be repository of persistently storing and managing collections of unstructured or Semi-structured. But generally a data store indicates working unstructured or semi-structured.

34
Q

what is database ?

A

A database is a data-store that stores semi-structured and Structured data. database is more complex data stores because it requires using formal design and modelling technique.
Relational: tabular (rows and cloumn)
Non-relational: semi structured may not be tabular.

35
Q

What is Data warehouse ?

A

A data warehouse is a centralized system designed for storing and analyzing large volumes of structured data.companies will have TB & millions of rows of data.

36
Q

What is Data mart ?

A

A data mart is a subset of a data warehouse. It is designed to serve a specific department or business function, such as sales, finance, or marketing. A Data mart will store under 100 GB and has a single business focus.

37
Q

what is Data lake?

A

A data lake is a large storage repository that holds raw data (big data), unstructured, semi-structured, and structured data from multiple sources.

38
Q

what is data lakehouse?

A

A Data Lakehouse is a data management architecture that combines the best elements of data lakes and data warehouses. It aims to provide the flexibility and scalability of a data lake with the data management and performance capabilities of a data warehouse