End-to-End Analytics (Fabric) Flashcards
What is OneLake?
Fabric’s lake-centric architecture that provides a single, integrated environment for data professionals and the business to collaborate on data projects.
What is OneCopy?
A key component of OneLake that allows you to read data from a single copy, without moving or duplicating data.
How does OneLake facilitate collaboration?
By eliminating the need to move and copy data between different systems and teams.
What is the analogy used to describe OneLake?
Think of it like OneDrive for data.
What does OneLake combine?
Storage locations across different regions and clouds into a single logical lake.
What do Fabric’s data warehousing, data engineering, data integration, real-time intelligence, and Power BI experiences use as their store?
They all use OneLake as their native store without needing any extra configuration.
On what is OneLake built?
Azure Data Lake Storage (ADLS) Gen2.
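Because OneLake is built on ADLS, its contents can be addressed with ADLS-style URIs. The following is a minimal sketch, assuming the OneLake URI shape `abfss://<workspace>@onelake.dfs.fabric.microsoft.com/<item>.<itemtype>/<path>`. The helper name `onelake_abfss_path` and the workspace, lakehouse, and table names are hypothetical and exist only for illustration.

```python
# Sketch only: builds a OneLake path string; it does not connect to any service.
ONELAKE_HOST = "onelake.dfs.fabric.microsoft.com"


def onelake_abfss_path(workspace: str, item: str, item_type: str, relative_path: str) -> str:
    """Build an ADLS-style (abfss) URI for a path inside a OneLake item."""
    # Strip surrounding slashes so the joined URI has no doubled separators.
    relative_path = relative_path.strip("/")
    return f"abfss://{workspace}@{ONELAKE_HOST}/{item}.{item_type}/{relative_path}"


# Hypothetical workspace, lakehouse, and table names.
path = onelake_abfss_path("SalesWorkspace", "SalesLakehouse", "Lakehouse", "/Tables/orders")
print(path)
# abfss://SalesWorkspace@onelake.dfs.fabric.microsoft.com/SalesLakehouse.Lakehouse/Tables/orders
```

A path of this shape is what a Spark notebook or another ADLS-compatible tool would typically be given to read a lakehouse table in place.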
In what formats can data be stored in OneLake?
Delta, Parquet, CSV, JSON, and more.
What is the benefit of storing data in OneLake?
Data stored in OneLake is directly accessible by all compute engines without needing to be moved or copied.
What format do analytical engines in Fabric write for tabular data?
Delta-parquet format.
What is a key feature of OneLake regarding shortcuts?
Shortcuts are embedded references within OneLake that point to other files or storage locations.
What does the Synapse Data Engineering experience focus on?
Data engineering with a Spark platform for data transformation at scale.
What is the purpose of the Synapse Data Warehouse experience?
Data warehousing with industry-leading SQL performance and scale to support data use.
What does the Synapse Data Science experience utilize?
Azure Machine Learning and Spark for model training and execution tracking in a scalable environment.
What is the focus of Synapse Real-Time Intelligence?
Querying and analyzing large volumes of data in real time.
What does the Data Factory experience combine?
Power Query with the scale of Azure Data Factory to move and transform data.
What is the role of Power BI in Fabric?
Business intelligence for translating data to decisions through interactive reports.
What is the function of workspaces in Microsoft Fabric?
They serve as logical containers that help you organize and manage your data, reports, and other assets.
How do workspaces support collaboration?
By providing a clear separation of resources, making it easier to control access and maintain security.
What can be customized within workspaces?
How items are separated and how access to them is controlled.
What features do workspaces support for managing compute resources?
Settings to manage compute resources and integrate with Git for version control.
What does Git integration in workspaces allow?
It lets you track changes, collaborate on code, and maintain a history of your work.
What is the significance of data lineage and impact analysis in workspaces?
They provide a comprehensive view of data flow and dependencies, enhancing transparency and decision-making.
How is security managed in OneLake?
Data is secured and governed in one place, allowing users to easily find and access the data they need.
What can be managed in the Fabric admin center?
Groups and permissions, data sources and gateways, and usage and performance monitoring.
What can you access in the Fabric admin center for automation?
Fabric admin APIs and SDKs.
How does Microsoft Fabric transform the analytics development process?
It unifies the tools into a single SaaS platform.
What flexibility does Microsoft Fabric provide for different roles?
It lets each role apply the skills it needs without duplicating effort.
What can data engineers do with large amounts of data in OneLake?
Ingest, transform, and load data
How are data loading patterns simplified in Microsoft Fabric?
Using pipelines and easily configurable architectures
What architecture is mentioned that can be configured using workspaces?
Medallion architecture
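The medallion architecture is commonly described as three layers: bronze (raw), silver (validated), and gold (aggregated for reporting). The sketch below illustrates that flow in plain Python. It is not Fabric code; the sample rows and the helper names `to_silver` and `to_gold` are hypothetical.

```python
from collections import defaultdict

# Bronze: raw records as ingested, including a bad row (hypothetical sample data).
bronze = [
    {"order_id": "1", "region": "east", "amount": "10.5"},
    {"order_id": "2", "region": "west", "amount": "n/a"},
    {"order_id": "3", "region": "east", "amount": "4.5"},
]


def to_silver(rows):
    """Validate and type the raw rows, dropping any row that fails to parse."""
    clean = []
    for row in rows:
        try:
            clean.append(
                {
                    "order_id": int(row["order_id"]),
                    "region": row["region"],
                    "amount": float(row["amount"]),
                }
            )
        except ValueError:
            continue  # skip rows with unparseable values
    return clean


def to_gold(rows):
    """Aggregate validated rows into per-region totals for reporting."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)  # {'east': 15.0}
```

In Fabric, each layer would typically live in its own lakehouse or workspace rather than in an in-memory list.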
What advantage do data analysts gain with Microsoft Fabric?
Greater context and streamlined processes
How can data analysts use Data Factory in Microsoft Fabric?
To transform data upstream using dataflows.
What mode allows data analysts to connect with data more directly?
Direct Lake mode
How do data scientists use Microsoft Fabric?
Integrate native data science techniques and use Power BI for insights
What role do analytics engineers play in Microsoft Fabric?
Bridge gap between data engineering and data analysis
What responsibilities do analytics engineers have?
Curate data store assets, ensure data quality, enable self-service analytics
Who can discover curated data through the OneLake data hub?
Low-to-no-code users and citizen developers
What can low-to-no-code users do with curated data?
Further process and analyze it without dependency on data engineers
True or False: Microsoft Fabric requires data engineers to duplicate data and effort across tools.
False
What must be done before exploring Microsoft Fabric capabilities?
Microsoft Fabric must be enabled for your organization.
Who might you need to work with to enable Microsoft Fabric?
You might need to work with your IT department.
What are the permissions required to enable Microsoft Fabric?
Admin privileges are required to enable Fabric.
How can an admin enable Microsoft Fabric?
Admins can access the Admin center from the Settings menu in Power BI and enable Fabric in the Tenant settings.
Can admins make Fabric available to specific groups?
Yes, admins can make Fabric available to the entire organization or specific groups of users.
What is a workspace in Microsoft Fabric?
Workspaces are collaborative environments for creating and managing items like lakehouses, warehouses, and reports.
Where is all data stored in Microsoft Fabric?
All data is stored in OneLake.
What can you configure in the Workspace settings?
You can configure the license type, OneDrive access, Azure Data Lake Storage Gen2 connection, Git integration, and Spark workload settings.
What roles are available for granting access to a workspace?
There are four roles (Admin, Member, Contributor, and Viewer), and each applies to all items in a workspace.
What is the OneLake data hub?
The OneLake data hub helps users find and access various data sources within their organization.
What can users do with the OneLake data hub?
Users can explore and connect to data sources, ensuring they have the right data for their needs.
How can users narrow results in the OneLake data hub?
Users can narrow results by domain, filter by workspace, explore default groups, and filter by keyword or item type.
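The filtering described above can be pictured as successive predicates applied to a catalog of items. This is only an illustrative sketch in plain Python, not the data hub's implementation or API. The `HubItem` class, the `filter_items` function, and the sample catalog are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class HubItem:
    """Hypothetical stand-in for an item listed in a data catalog."""
    name: str
    item_type: str
    workspace: str
    domain: str


def filter_items(
    items: List[HubItem],
    domain: Optional[str] = None,
    workspace: Optional[str] = None,
    keyword: Optional[str] = None,
    item_type: Optional[str] = None,
) -> List[HubItem]:
    """Keep only the items that match every filter that was supplied."""
    result = []
    for item in items:
        if domain is not None and item.domain != domain:
            continue
        if workspace is not None and item.workspace != workspace:
            continue
        if item_type is not None and item.item_type != item_type:
            continue
        # Keyword matching is a case-insensitive substring test on the name.
        if keyword is not None and keyword.lower() not in item.name.lower():
            continue
        result.append(item)
    return result


catalog = [
    HubItem("Sales Lakehouse", "Lakehouse", "Sales", "Commercial"),
    HubItem("Sales Report", "Report", "Sales", "Commercial"),
    HubItem("HR Warehouse", "Warehouse", "People", "Corporate"),
]

matches = filter_items(catalog, domain="Commercial", item_type="Lakehouse")
print([m.name for m in matches])  # ['Sales Lakehouse']
```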
How do you create items in Fabric?
You can create items using the Create menu in the upper left corner of the Power BI service.
What are Fabric workloads?
Fabric workloads refer to the different capabilities included in Fabric.
How can you switch between Fabric workloads?
You can switch between workloads using the workload switcher in the bottom left corner of the navigation pane.
What makes Fabric unique compared to other Microsoft data offerings?
Fabric brings together capabilities from various services in a single, integrated SaaS experience without needing access to Azure resources.