Fabric Flashcards
If you have a local file and you want to upload it to a lakehouse, what do you do?
Use the local file upload option in Lakehouse explorer.
You need to recommend which Fabric architecture to create and the process for integrating the accounting data into Fabric. The solution must minimize administrative effort and costs.
Developers must be able to build reports from the lakehouse or warehouse based on the sales data. The developers must also be able to do ad-hoc analysis of the legacy data at the end of each year.
Because the legacy accounting data is large and infrequently accessed, there is no need to move it into a Fabric workspace. Use a shortcut: shortcuts enable the querying of remote data without having to move the data. Ingest the sales data into a Fabric lakehouse.
What does a shortcut do/when to use it?
Shortcuts enable the querying of remote data without having to move the data.
When to use the Copy Data Activity?
The Copy data activity provides the best performance when copying data from large datasets.
What’s a Type 1 Slowly changing dimension (SCD)?
Type 1 SCD: Overwrites existing data, doesn’t keep history.
What’s a Type 2 Slowly changing dimension (SCD)?
Type 2 SCD: Adds new records for changes, keeps full history for a given natural key.
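The difference between the two SCD types can be sketched in plain Python. This is only an illustration, not Fabric code: the row layout (`customer_id`, `city`, `valid_from`, `valid_to`, `is_current`) and the function names are assumptions made up for the example.

```python
from datetime import date

def apply_scd1(dim_rows, key, new_city):
    """Type 1: overwrite the attribute in place; no history is kept."""
    for row in dim_rows:
        if row["customer_id"] == key:
            row["city"] = new_city
    return dim_rows

def apply_scd2(dim_rows, key, new_city, change_date):
    """Type 2: close the current row, then append a new current row."""
    for row in dim_rows:
        if row["customer_id"] == key and row["is_current"]:
            row["valid_to"] = change_date
            row["is_current"] = False
    dim_rows.append({
        "customer_id": key, "city": new_city,
        "valid_from": change_date, "valid_to": None, "is_current": True,
    })
    return dim_rows

# Hypothetical customer 1 moves from Oslo to Bergen.
scd1 = apply_scd1([{"customer_id": 1, "city": "Oslo"}], 1, "Bergen")
scd2 = apply_scd2(
    [{"customer_id": 1, "city": "Oslo", "valid_from": date(2020, 1, 1),
      "valid_to": None, "is_current": True}],
    1, "Bergen", date(2024, 6, 1),
)
print(len(scd1), len(scd2))  # Type 1 keeps one row, Type 2 now has two
```

With Type 1 the old city is gone; with Type 2 the old row survives with an end date, and the natural key `customer_id` now appears twice.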
If a question says to join from table X to table Y (keeping every row in X), what join is it?
A left outer join from X to Y.
What’s a Fabric Tenant?
A tenant is the most fundamental part of the structure of Fabric. It is a single instance of Fabric for an organization and is aligned with a Microsoft Entra ID tenant. One tenant per organization.
Should semantic models be in separate workspaces?
They can be in the same workspace, but for good data governance, they probably should be in separate workspaces.
When do you use WHERE and when do you use HAVING?
You use WHERE in order to filter rows before any groupings are made. HAVING filters after the GROUP BY clause is applied. That’s the context clue. “Groups where total” = HAVING
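A small runnable sketch of the difference, using Python's built-in `sqlite3` module rather than a Fabric warehouse. The `sales` table and its values are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("East", 100), ("East", 50), ("West", 30), ("West", 20), ("North", 500)],
)

# WHERE removes individual rows before grouping;
# HAVING removes whole groups after the aggregate is computed.
query = """
    SELECT region, SUM(amount) AS total
    FROM sales
    WHERE amount >= 30          -- row filter: drops the 20 in West
    GROUP BY region
    HAVING SUM(amount) > 100    -- group filter: drops West (total 30)
    ORDER BY region
"""
rows = conn.execute(query).fetchall()
print(rows)
```

Only East (150) and North (500) come back: West loses one row to WHERE and then the whole group to HAVING.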
If you see TREATAS and USERELATIONSHIP, what are your context clues? What do they do?
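Use TREATAS if there’s no pre-existing relationship. It applies the values of a table expression as filters on columns of another table, which creates a virtual relationship on the fly. USERELATIONSHIP relies on an existing defined relationship, typically an inactive one, and activates it for the duration of a calculation.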
Power BI capacity size is what?
Set by the admin
What does RELATED mean in DAX and how is it used?
The RELATED function returns a related value from another table. It is typically used on the many side of a relationship to fetch a value from the one side. It acts like a VLOOKUP: it gets a related value from a secondary table to go into the first, provided there is an active relationship.
Calculated column in TABLE_FACT: NEW_COLUMN = RELATED(TABLE_DIM[DESIRED_COLUMN])
What does RELATEDTABLE do in DAX?
The RELATEDTABLE function retrieves a table of rows, given an established and active relationship. Useful in aggregations.
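A rough Python analogy for the two functions, not DAX itself. The `dim_product` and `fact_sales` structures are hypothetical stand-ins for a dimension table and a fact table joined on `product_id`.

```python
# One side: one entry per product.
dim_product = {1: {"category": "Bikes"}, 2: {"category": "Helmets"}}

# Many side: several sales rows can point at the same product.
fact_sales = [
    {"product_id": 1, "amount": 500},
    {"product_id": 1, "amount": 450},
    {"product_id": 2, "amount": 40},
]

# RELATED analogy: each fact row looks up a single value on the one side.
with_category = [
    {**row, "category": dim_product[row["product_id"]]["category"]}
    for row in fact_sales
]

# RELATEDTABLE analogy: each dimension row collects its matching fact rows,
# which can then be aggregated (here, summed).
sales_per_product = {
    pid: sum(r["amount"] for r in fact_sales if r["product_id"] == pid)
    for pid in dim_product
}

print(with_category[0]["category"], sales_per_product)
```

The first lookup goes from many to one and returns a single value; the second goes from one to many and returns a set of rows to aggregate.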
One entity in a table can correspond to many in another. One customer can have many orders.
One to many relationship
each customer has one account or one badge number
one to one relationship
products can belong to multiple categories and categories can contain multiple products
many to many relationship
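One way to reason about cardinality is to check whether the key is unique on each side. The helper below is a hypothetical sketch of that check in Python, not a description of how Power BI detects relationships.

```python
def relationship_cardinality(left_keys, right_keys):
    """Classify a relationship from the key values found on each side."""
    left_unique = len(set(left_keys)) == len(left_keys)
    right_unique = len(set(right_keys)) == len(right_keys)
    if left_unique and right_unique:
        return "one-to-one"
    if left_unique:
        return "one-to-many"
    if right_unique:
        return "many-to-one"
    return "many-to-many"

# Keys are unique on the left but repeat on the right.
print(relationship_cardinality([1, 2, 3], [1, 1, 2, 3, 3]))
# Keys are unique on both sides.
print(relationship_cardinality([1, 2], [1, 2]))
# Keys repeat on both sides.
print(relationship_cardinality([1, 1, 2], [1, 2, 2]))
```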
When to use performance analyzer?
measuring report performance
When to use DAX Studio?
optimize DAX queries
When to use Tabular Editor?
model structure optimization
You need to compare the sales amount and ensure that the statement displays the value from the previous year in the PreviousSalesAmount column. What function do you use?
The LAG() function accesses the data from a previous row in the same result set by using a given physical offset.
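A minimal sketch of LAG() run through Python's built-in `sqlite3` module (assuming the bundled SQLite is 3.25 or later, which is when window functions were added). The `yearly_sales` table and its values are invented; only the `PreviousSalesAmount` column name comes from the question.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE yearly_sales (sales_year INTEGER, sales_amount INTEGER)"
)
conn.executemany(
    "INSERT INTO yearly_sales VALUES (?, ?)",
    [(2021, 100), (2022, 150), (2023, 120)],
)

# LAG(sales_amount, 1) reads the value one row back in sales_year order.
lag_rows = conn.execute("""
    SELECT sales_year,
           sales_amount,
           LAG(sales_amount, 1) OVER (ORDER BY sales_year) AS PreviousSalesAmount
    FROM yearly_sales
    ORDER BY sales_year
""").fetchall()

for row in lag_rows:
    print(row)
```

The first year has no previous row, so its PreviousSalesAmount is NULL (`None` in Python).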
While a DataFrame can be saved to the Files section of a lakehouse, it will not be considered a table.
What’s an anti-join?
It means you’re excluding data: an anti-join returns only the rows from one table that have no match in the other table.
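The exclusion can be sketched as a left join that keeps only the unmatched rows. The example uses Python's built-in `sqlite3` module; the `customers` and `orders` tables are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (customer_id INTEGER, name TEXT)")
conn.execute("CREATE TABLE orders (order_id INTEGER, customer_id INTEGER)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)", [(1, "Ana"), (2, "Ben"), (3, "Cy")]
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)", [(10, 1), (11, 1), (12, 3)]
)

# Left anti-join: customers with no matching order.
# The LEFT JOIN keeps every customer; the IS NULL filter keeps only
# those for which no order row was found.
no_orders = conn.execute("""
    SELECT c.customer_id, c.name
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.customer_id
    WHERE o.customer_id IS NULL
    ORDER BY c.customer_id
""").fetchall()
print(no_orders)
```

Only Ben is returned, because Ana and Cy each have at least one order.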
Languages and examples
DAX for Power BI, T-SQL for SSMS, PySpark for notebooks, M for Dataflow Gen2 and Power Query.
Query Folding?
Query folding is the ability of Power Query to combine, or “fold”, the transformation steps of a query into a single statement (for example, one SQL query) that is executed by the data source. Pushing the work to the source is what improves query performance.
You are planning a Fabric analytics solution.
You need to recommend a licensing strategy to support 10 Microsoft Power BI report authors and 600 report consumers.
The solution must use Dataflow Gen2 for data ingestion and minimize costs.
Which Fabric license type should you recommend?
F64. Starting with the F64 license, report consumers can use a free per-user license. PPU is incorrect, because you cannot create non-Power BI items (in this case Dataflow Gen2) with PPU.
XMLA endpoint
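A way to connect, like an API. The XMLA endpoint lets external client tools such as SSMS, DAX Studio, and Tabular Editor connect to the semantic models in a workspace.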
What does OPTIMIZE do?
The OPTIMIZE command compacts the many small Parquet files of a Delta table, including those written by later changes, into fewer, bigger, consolidated Parquet files.