Choose a Power BI model framework Flashcards
What is a Power BI data model?
A query-able data resource optimized for analytics, used in Power BI for creating reports and dashboards.
What is a Power BI dataset?
A Power BI artifact published to a workspace in the Power BI service, used as a source of data for visualizations.
What are the three phases of an analytic query?
Filter, group, and summarize.
What is a tabular model in Power BI?
A model comprising tables of columns, relationships, hierarchies, and calculations.
What is the star schema design?
A modeling approach that classifies tables as either dimension or fact, optimizing for analytics and ease of use.
What are the table storage modes in Power BI?
Import, DirectQuery, and Dual.
What determines the model framework in Power BI?
The table storage mode settings, leading to import, DirectQuery, or composite models.
What is an import model in Power BI?
A model with tables set to Import storage mode, including calculated tables defined with DAX formulas.
What are the benefits of import models?
They support all Power BI data source types, DAX and Power Query functionality, calculated tables, and deliver the best query performance due to in-memory storage optimization.
What are the size limitations of import models in Power BI?
Shared capacity models have a 1-GB limit per dataset, while dedicated capacity models can exceed 10-GB with the Large dataset storage format setting enabled.
What strategies can reduce the size of data in import models?
Removing unnecessary columns and rows, summarizing data, optimizing column data types, and using DirectQuery for specific tables.
How does data refresh work in import models?
Data must be periodically refreshed, with up to eight times per day in shared capacity and up to 48 times per day in dedicated capacity, depending on data velocity and urgency.
What is incremental refresh in Power BI?
A feature that automates time-period partition creation and management, updating only necessary partitions to reduce the burden on source systems and improve refresh performance.
What is a DirectQuery model in Power BI?
A model where tables have their storage mode set to DirectQuery, allowing real-time query results by querying the underlying data source directly.
When should you use a DirectQuery model?
When dealing with large or fast-changing data sources, enforcing source row-level security (RLS), adhering to data sovereignty restrictions, or creating specialized datasets.
What are the benefits of DirectQuery models?
They support large data volumes, enable near real-time data updates, enforce source RLS, and comply with data sovereignty restrictions.
What are the limitations of DirectQuery models?
Not all data sources are supported, limited Power Query transformations, potential slow query performance, and possible impact on source system performance.
How can you boost the performance of DirectQuery models?
Optimize the source database with indexes and materialized views, and use user-defined aggregation tables to handle higher-grain analytic queries.
What is a composite model in Power BI?
A model that combines more than one source group, typically including both Import and DirectQuery storage modes.
What are the benefits of using a composite model in Power BI?
Design flexibility, performance boosts by caching data, and the ability to extend models with new calculated columns and tables.
What is a limitation of composite models regarding data refresh?
Imported data may become out of sync with DirectQuery data, requiring periodic refreshes to maintain consistency.
How can DirectQuery model performance be improved using composite models?
By adding import aggregation tables and using dual storage mode for dimension tables.
What is a dual storage mode table?
A table that can use both import and DirectQuery storage modes, with Power BI selecting the most efficient mode at query time.
How does the real-time data option work with incremental refresh in composite models?
It creates a hybrid table with import partitions for older data and a DirectQuery partition for current data, enabling real-time updates.