Manage Dataset - Module 1,2,3,4 Flashcards
What is Workspace
Containers for dashboards, reports, apps, workbooks, datasets, and dataflows in Power BI. There are two types of workspaces: My workspace and workspaces. My workspace is the personal workspace for any Power BI customer to work with your own content. Workspaces are used to collaborate and share content with colleagues.
Difference between Workspace and Apps
Workspace acts like a developer environment, and the Power BI app serves as an end-user environment.
What is App
An app is a published, read-only window into your data for mass distribution and viewing. When ready to share apps with your users, you can publish the app. This process requires a Power BI Pro license. Consuming and viewing an app also requires a Pro license, or the workspace must be hosted in a Premium capacity.
How to create a workspace
Go to Power BI Service -> Select Workspace from left navigation -> Select Create Workspace button -> Enter workspace name and Description (Can also upload image). In the Advanced drop-down menu, you can create a Contact list of users who will receive notifications if issues with the workspace occur. By default, these users are the workspace admins
Roles in Workspace
Admin, Member, Contributor, Viewer
Difference between Admin, member and Contributor
Admin and Member have same access just that Member cannot remove people. Contributor can only publish reports to workspace. Create, edit and delete reports in workspace. Can sedule refreshes.
Usage metric reports
Usage metric reports are available for Power BI Pro users and can only be accessed by users with the role types of Admin, Member, or Contributor. To view usage metric reports, go to the workspace. Find the report or dashboard that you want to see usage metrics for. For example, if you want to see the usage metrics report for Sales Data, select the ellipsis (…), and then select View usage metrics report from the drop-down menu.
development life cycle.
The development process is iterative; it typically requires building an initial solution, testing the solution in a different environment, returning to make necessary revisions, and eventually releasing a final product.
deployment pipeline feature
manages content in dashboards, reports, and datasets between different environments in the development life cycle. With this feature, you can develop and test Power BI content in one centralized location and streamline the process before deploying the final content to your users. This Power BI Premium feature requires you to be a Capacity admin.
Development Enviornment
Development - The location in which dashboard developers or data modelers can build new content with other developers. This stage is first in the deployment pipeline. Test - Where a small group of users and user acceptance testers can see and review new reports, provide feedback, and test the reports with larger datasets for bugs and data inconsistencies before it goes into production. Production - Where an expansive user audience can use tested reports that are reliable and accurate. This stage is the final one of the deployment pipeline.
Configuration of deployment pipelines
https://learn.microsoft.com/en-us/training/modules/create-manage-workspaces-power-bi/4-development-lifecycle-strategy
Lineage view
Data lineage refers to the path that data takes from the data source to the destination. The Lineage view is only accessible to Admin, Contributor, and Member roles. Additionally, it requires a Power BI Pro license and is only available for app workspaces. Simplifies the troubleshooting process because you can see the path that the data takes from source to destination and determine pain points and bottlenecks. Allows you to manage your workspaces and observe the impact of a single change in one dataset to reports and dashboards. Saves time by simplifying your task of identifying reports and dashboards that haven’t been refreshed.
Impact analysis
On the Impact analysis window, you can see how many workspaces, reports, and dashboards that this dataset is a part of and how many views that this dataset has gathered
Sensitivity labels
Sensitivity labels specify which data can be exported. These labels are configured externally to Power BI, and Power BI allows you to quickly use them in your reports and dashboards. These labels allow you to define and protect content, even outside of Power BI. Datasets, dataflows, reports, and dashboards can use this mechanism, and all users in your corporation can use this feature unless exceptions have been defined. You can now apply them to the data: None, Personal, General, Confidential, and Highly confidential.
Two types of on-premises gateways
- Organization - Allows multiple users to connect to multiple on-premises data sources and is suitable for complex scenarios. 2. Personal - Allows one user to connect to data sources. This type of gateway can be used only with Power BI and it can’t be shared with other users, so it is suitable in situations where you’re the only one in your organization who creates reports. You will install the gateway on your local computer, which needs to stay online for the gateway to work.
How to schedule refresh
https://learn.microsoft.com/en-us/training/modules/manage-datasets-power-bi/5-dataset-refresh
Explain interaction of Cloud and on premises using gateway
The cloud service creates a query and the encrypted credentials for the on-premises data source. The query and credentials are sent to the gateway queue for processing. The gateway cloud service analyzes the query and pushes the request to Microsoft Azure Service Bus. Service Bus sends the pending requests to the gateway. The gateway gets the query, decrypts the credentials, and then connects to one or more data sources with those credentials. The gateway sends the query to the data source to be run. The results are sent from the data source back to the gateway and then to the cloud service. The service then uses the results.
How to configure Incremental refresh and what is important note to take care of
The Incremental refresh feature in Power BI is a popular feature because it allows you to refresh large datasets quickly and as often as you need, without having to reload historical data each time.
Incremental refresh should only be used on data sources and queries that support query folding. If query folding isn’t supported, incremental refresh could lead to a bad user experience because, while it will still issue the queries for the relevant partitions, it will pull all data, potentially multiple times. - https://learn.microsoft.com/en-us/training/modules/manage-datasets-power-bi/6-incremental-refresh
Two ways to endorse your datasets
Promotion - Promote your datasets when they’re ready for broad usage. Certification - Request certification for a promoted dataset. Certification can be a highly selective process, so only the truly reliable and authoritative datasets are used across the organization.
Query Caching
Query Caching is a local caching feature that maintains results on a user and report basis. This service is only available to users with Power BI Premium or Power BI Embedded. Improvement of the performance of reports, dashboards, and dashboard tiles by reducing loading time and increasing query speed. It respects bookmarks and default filters, so even if you enable query caching, any bookmarks that you have created still exist. The default option is that query caching is turned off; however, you can also select Off, which turns off query caching for the specific dataset in question. If you select On, query caching will be turned on for this specific dataset only.
Important information of Query caching
If many datasets have query caching enabled, and a refresh occurs, a reduction in performance might occur because a large number of queries are being processed at once. Switching from On to Off will clear all previously saved query results. When turning off query caching (either through the default or the Off option), a small delay will occur in query loading because the report queries are running against the dataset and it does not have saved queries to fall back on.
Data alerts
data alerts is a simple process to complete for a dashboard in Power BI. Data alerts can be used to notify you or a user that a specific data point is above, below, or at a specific threshold that you can set. These alerts are features that are only available on Power BI service and they are available on report elements such as KPI visuals, gauges, and cards.
Q&A feature
The Q&A feature is a tool within Power BI Desktop that allows you to ask natural-language questions about the data. Q&A visual consists of three main elements: Question Box, Pre-Populated suggestion tiles and Pin Visual
Quick Insights
Quick insights feature in Power BI uses machine learning algorithms to go over your entire dataset and produce insights (results) for you quickly. This feature is a great way to build dashboards when you don’t know where to start. This feature is available in the Power BI service only. Also, this feature doesn’t work with Direct Query; it only works with data that is imported to Power BI. The Quick Insights page contains up to 32 separate insight cards, and each card has a chart or graph plus a short description.
Stream in Power BI
Streaming data can come from a variety of sources, including from social media, factory sensors, service-usage metrics, and other sources that contain a constant stream of data points.
What is RLS and it’s type
Row-level security (RLS) with Power BI can be used to restrict data access for given users. Filters restrict data access at the row level, and you can define filters within roles. There are two type - Static and Dynamic
Steps for Static RLs
Create a report (normal steps) -> Create RLS roles in PBI Desktop using Dax, Test the roles in View -> Deploy the report to PBI Service -> Add members to roles -> Test the role in Service
Steps for Dynamic RLs
You can configure row-level security exactly the way you configured it previously, with only a single change. Instead of creating four roles, you only need to create one role. The DAX filter for that role using userprincipalname() function. The userprincipalname() function will compare the email address from the Employees table with the email that the user entered when signing in to Power BI service