Power BI - Prepare the Data (Get Data From different data sources) Flashcards

1
Q

How do you connect a data source

A

using the “Get Data” feature option in Power BI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you change data source settings

A
  1. In Power Query Editor, select the gear icon on a step (if you choose this option, you will have to do this for each query).
  2. In Power Query Editor on the Home tab select Data source settings
  3. In power BI Desktop on the home tab, select transform data, then select data source settings
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a shared data set?

A

A dataset that has already been prepared and publised to Power BI Service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the benefits of a shared dataset?

A
  1. Ensures consistent data across multiple reports
  2. You aren’t copying data needlessly
  3. You can create a copy of an existing report and modify it (this takes less effort than starting from scratch)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 3 types of storage modes?

A

Import, DirectQuery, and Dual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is import mode?

A

The Import mode allows you to create a local Power BI copy of your datasets from your data source. You can use all Power BI service features with this storage mode, including Q&A and Quick Insights. However, data refreshes must be done manually. Import mode is the default for creating new Power BI reports.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does DirectQuery allow for with respect to caching data and meeting security requirements?

A

The DirectQuery option is useful when you do not want to save local copies of your data because your data will not be cached. Instead, you can query the specific tables that you will need by using native Power BI queries, and the required data will be retrieved from the underlying data source. Using this model ensures that you are always viewing the most up-to-date data, and that all security requirements are satisfied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does DirectQuery help with large datasets?

A

This mode is suited for when you have large datasets to pull data from. Instead of slowing down performance by having to load large amounts of data into Power BI, you can use DirectQuery to create a connection to the source, solving data latency issues as well.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does Dual Mode import allow you to do?

A

In Dual mode, you can identify some data to be directly imported and other data that must be queried. Any table that is brought into your report is a product of both Import and DirectQuery modes. Using the Dual mode allows Power BI to choose the most efficient form of data retrieval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Connect Live?

A

Connect live is a new option in Azure Analysis Services. Azure Analysis Services uses the tabular model and DAX to build calculations, similar to Power BI. These models are compatible with one another. Using the Connect live option helps you keep the data and DAX calculations in their original location, without having to import them all into Power BI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Azure Analysis Services?

A

Azure Analysis Services is an Azure product that allows you to ingest data from multiple data sources, build relationships between the data, and creates calculations on the data. The calculations are built using data analysis expressions (DAX). Azure Analysis Services is similar to the data modeling and storage technology in Power BI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How getting data from Azure Analysis Services cubes similar to getting data from SQL Server?

A

Getting data from Azure Analysis Services cubes is similar to getting data from SQL Server, in that you can:

  • Authenticate to the server.
  • Pick the cube you want to use.
  • Select which tables you need.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are notable differences between Azure Analysis Services cubes and SQL Server?

A

Notable differences between Azure Analysis Services cubes and SQL Server are:

  • Analysis Services cubes have calculations already in the cube, which will be discussed in more detail later.
  • If you don’t need an entire table, you can query the data directly. Instead of using Transact-SQL (T-SQL) to query the data, like you would in SQL Server, you can use multi-dimensional expressions (MDX) or data analysis expressions (DAX).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you connect to data in Azure Analysis Services?

A

You use the Get data feature in Power BI Desktop. When you select Analysis Services, you are prompted for the server address and the database name with two options: Import and Connect live.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does Azure Analysis Services allow you to do with respect to refresh schedules?

A

Azure Analysis Services can have a fast refresh schedule, which means that when data is refreshed in the service, Power BI reports will immediately be updated, without the need to initiate a Power BI refresh schedule. This process can improve the timeliness of the data in your report.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How is connect live similar to a relational database?

A

Similar to a relational database, you can choose the tables that you want to use. If you want to directly query the Azure Analysis Services model, you can use DAX or MDX.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is an acceptable alternative with respect to importing the data that you want (from Excel, SQL Server, and so on) into the Azure Analysis Services model and using a live connection. What are the advantages of this alternative?

A

You will likely import the data directly into Power BI. An acceptable alternative is to import all other data that you want (from Excel, SQL Server, and so on) into the Azure Analysis Services model and then use a live connection. Using this approach, the data modeling and DAX measures are all performed in one place, and it’s a much simpler and easier way to maintain your solution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are your options in the Navigator Window?

A
  1. You now have the option to select the Load button to automatically load your data into the Power BI model
  2. Transform Data button to launch the Power Query Editor, where you can review and
    clean your data before loading it into the Power BI model.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How do you connect Power BI to a relational database?

A

You can use the Get data feature in Power BI Desktop and select the applicable option for your relational database. You select the SQL Server option.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the first steps when connecting your Power BI desktop tool to a SQL Server database?

A

The first step is to enter your database server name and a database name in the SQL Server database window. The two options in data connectivity mode are: Import (selected by default, recommended) and DirectQuery. Mostly, you select Import. Other advanced options are also available in the SQL Server database window

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

(SQL Database) After you have added your server and database names, you will be prompted to sign in with a username and password. You will have three sign-in options. What are they?

A
  • Windows: Use your Windows account (Azure Active Directory credentials).
  • Database: Use your database credentials. For instance, SQL Server has its own sign-in and authentication system that is sometimes used. If the database administrator gave you a unique sign-in to the database, you might need to enter those credentials on the Database tab.
  • Microsoft account: Use your Microsoft account credentials. This option is often used for Azure services.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What do the load and transform tools allow you to do?

A
  • Load: Automatically load your data into a Power BI model in its current state.
  • Transform Data: Open your data in Microsoft Power Query, where you can perform actions such as deleting unnecessary rows or columns, grouping your data, removing errors, and many other data quality tasks.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How can you use SQL query to import data?

A

Another way you can import data is to write an SQL query to specify only the tables and columns that you need.
To write your SQL query, on the SQL Server database window, enter your server and database names, and then select the arrow next to Advanced options to expand this section and view your options. In the SQL statement box, write your query statement, and then select OK.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are additional features of using the SQL Query?

A

You can also join different tables, run specific calculations, create logical statements, and filter data in your SQL query.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

When using an SQL query to import data, what approach should you take with respect to the wildcard character (*) in your query.

A

When using an SQL query to import data, try to avoid using the wildcard character () in your query. If you use the wildcard character () in your SELECT statement, you import all columns that you don’t need from the specified table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Why is using the wildcard discouraged?

A

The wildcard is not recommended because it will lead to redundant data in your data model, which will cause performance issues and require additional steps to normalize your data for reporting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is a view?

A

A view is an object in a relational database, similar to a table. Views have rows and columns, and can contain almost every operator in
the SQL language.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What happens when Power BI uses a view?

A

If Power BI uses a view, when it retrieves data, it participates in query folding, a feature of Power Query.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is a NoSQL database?

A

A NoSQL database (also referred to as non-SQL, not only SQL or non-relational) is a flexible type of database that does not use tables to store data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

How do you connect a NoSQL (Azure Cosmos DB) to Power BI?

A

You will use the Get data feature in Power BI Desktop. However, this time you will select the More… option to locate and connect to the type of database that you use. In this example, you will select the Azure category, select Azure Cosmos DB, and then select Connect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What must you do to import a JSON type record?

A

JSON type records must be extracted and normalized before you can report on them, so you need to transform the data before loading it into Power BI Desktop.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

How do you connect data in an application?

A

Select the “Get data” feature in Power BI Desktop. Then, select the option that you need from the Online Services category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

How to you connect your URL? (Sharepoint)

A

After you have selected Connect, you’ll be asked for your SharePoint URL. This URL is the one that you use to sign into your SharePoint site through a web browser. You do not need to enter your full URL file
path; you only need to load your site URL because, when you are connected, you can select the specific list that you want to load.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What do you need to after connecting your URL? (Shaepoint)

A

After you have entered your URL, select OK. Power BI needs to authorize the connection to SharePoint, so sign in with your Microsoft account and then select Connect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

How do you import datasets when they are too large or when you have security requirements?

A

Sometimes there may be security requirements around your data that make it impossible to directly import a copy. Or your datasets may simply be too large and would take too long to load into Power BI, and you want to avoid creating a performance bottleneck. Power BI solves these problems by using the DirectQuery storage mode, which allows you to query the data in the data source directly and not import a copy into Power BI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Why is using DirectQuery useful?

A

DirectQuery is useful because it ensures you are always viewing the most recent version of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What data model limitations are there with DirectQuery

A
  1. No built-in date tables
  2. Calculated columns are limited to two ways (can only use the current row of the table or related row in many-one relationships which rules out all aggregation functions. Can only use some functions that return scalar values
  3. Calculated tables aren’t supported
    Parent-child functions such as PATH aren’t supported
    Building clusters which rely on DAX is not supported
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

How do you identify and fix problems in Power BI?

A

Power BI provides the Performance Analyzer tool to help fix problems and streamline the process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What does performance in Power Query depend on?

A

The performance in Power Query depends on the performance at the data source level.

40
Q

What do good SQL Server performance tuning techniques include?

A

Good SQL Server performance tuning techniques includes index creation, hardware upgrades, execution plan tuning, and data compression.

41
Q

What is query folding and what does it help?

A

The query folding within Power Query Editor helps you increase the performance of your Power BI reports. Query folding is the process by which the transformations and edits that you make in Power Query Editor are simultaneously tracked as native queries, or simple Select SQL statements, while you are actively making transformations.

42
Q

How can you use Power Query to load and transform data in Power BI?

A

You can use Power Query to load data into Power BI. Using Power Query Editor you can then make further transformations to your data, such as renaming or deleting columns, appending, parsing, filtering, or grouping your data.

43
Q

What are the benefits to query folding?

A

The benefits to query folding include:

  • More efficiency in data refreshes and incremental refreshes. When you import data tables by using query folding, Power BI is better able to allocate resources and refresh the data faster because Power BI does not have to run through each transformation locally.
  • Automatic compatibility with DirectQuery and Dual storage modes. All DirectQuery and Dual storage mode data sources must have the back-end server processing abilities to create a direct connection, which means that query folding is an automatic capability that you can use. If all transformations can be reduced to a single Select statement, then query folding can occur.
44
Q

What do you do if the View Native Query option is not available?

A

If the View Native Query option is not available (not displayed in bold type), that query folding is not possible for this step, and you will have to work backward in the Applied Steps area until you reach the step in which View Native Query is available (displays in bold type). This process will reveal the native query that is used to transform the dataset.

45
Q

Native queries are not possible for which transformations?

A

Native queries are not possible for the following transformations: -Adding an index column

  • Merging and appending columns of different tables with two different sources
  • Changing the data type of a column -Running complex DAX functions
46
Q

What is the guideline to know if you can use query folding?

A

A good guideline to remember is that if you can translate a transformation into a Select SQL statement, which includes operators and clauses such as GROUP BY, SORT BY, WHERE, UNION ALL, and JOIN, you can use query folding.

47
Q

While query folding is one option to optimize performance when retrieving,
importing, and preparing data, what is
another option?

A

Another option is query diagnostics.

48
Q

What does query diagnostics allow for?

A

This feature allows you to determine what bottlenecks (if any) exist while loading and transforming your data, refreshing your data in Power Query, running SQL statements in Query Editor, and so on.

49
Q

How do you access query diagnostics?

A

Selecting Diagnose Step shows you the length of time that it takes to run that step. This selection can tell you if a step takes longer to complete than others, which then serves as a starting point for further investigation.

50
Q

What are other ways to optimize query performance?

A
  • Process as much data as possible in the original data source. Power Query and Power Query Editor allow you to process the data; however, the processing power that is required to complete this task might lower performance in other areas of your reports. Generally, a good practice is to process, as much as possible, in the native data source.
  • Use native SQL queries. When using DirectQuery for SQL databases, such as the case for our scenario, make sure that you are not pulling data from stored procedures or common table expressions (CTEs).
  • Separate date and time, if bound together. If any of your tables have columns that combine date and time, make sure that you separate them into distinct columns before importing them into Power BI. This approach will increase compression abilities.
51
Q

What does incremental refresh do?

A

When working with large datasets, it may be a good idea to implement incremental refresh which keeps the bulk of your data static and only refreshes the most recent data that has changed/.

52
Q

What is Microsoft Dataverse?

A

A cloud-based storage space that organizations can use to store application data.

53
Q

What does Diagnose Step help you to see?

A

Selecting Diagnose Step shows you the length of time that it takes to run that step. This selection can tell you if a step takes longer to complete than others, which then serves as a starting point for further investigation.

54
Q

How is data in dataverse stored?

A

Stored in tables

55
Q

What is a PBIDS file?

A

A file that is designed with the “get data” process for future report developers. It allows users to skip the “get data” process (queries are already loaded in the file). The file does not contain credentials.

56
Q

What’s the purpose of parameters?

A

Serves as a way to easily store and manage a value that can be reused

57
Q

What are some examples of when you might use parameters?

A
  1. Switching between dev and production environments
  2. Configuring incremental refreshes
  3. Using Report templates
  4. Creating custom functions
58
Q

What are dataflows?

A

Dataflows are stored in a centralized location (Power BI Service) and are useful when you want your power query queries to be used across your organizations. They can be used in multiple files.

59
Q

What does XMLA endpoint do?

A

Is a power BI Premium feature that allows you to get data from outside of Power BI.

60
Q

How does Power BI determine data anomalies?

A

Power Query Editor determines data anomalies by using the Column Distribution feature.

61
Q

How many rows does Power Query examine by default? How do you increase?

A

By default, Power Query examines the first 1000 rows of your data set. To change this, select the profiling status in the status bar and select Column profiling based on entire data set.

62
Q

What does Column Distribution show?

A
# of distinct and unique values
- distinct # referes to how many different values are in the column

-unique # referes to the values that occur only once

Distinct and unique values will be the same if all values are unique

63
Q

What does Column Profile show?

A

Column statistics - Count, # of errors, # of empty, distinct, unique, min, max, average, even, odd

64
Q

When would you want to null values?

A

Occasionally, you might find that your data sources contain null values. For example, a freight amount on a sales order might have a null value if it’s synonymous with zero. If the value stays null, the averages will not calculate correctly. One solution would be to change the nulls to zero, which will produce the more accurate freight average. In this instance, using the same steps that you followed previously will help you replace the null values with zero.

65
Q

Why would you need to replace values?

A

If you have undesirable values

If you want to remove errors

66
Q

What two advanced options are there with replace values?

A

Match entire cell contents - if this option is checked, Power Query won’t’ replace values where the “replace with” value is only part of the value to find value

  1. IReplace using special characters - this is used to insert special character codes, such as carriage return or nonbreaking space, in the replace with or value to find box
67
Q

What are null values considered as in Power Query?

A

Special characters

68
Q

Why would you want to remove rows and how do you remove rows?

A

If you have errors in a column you can remove the rows entirely. (right-click the column header and select remove errors). This removes rows for the selected column.

69
Q

How do you promote headers?

A
  1. By selecting the Use First Row as Headers option on the Home tab
  2. By selecting the drop-down button next to Column1 and then selecting Use First Row as Headers.
70
Q

What will incorrect data types prevent you from doing what?

A

Incorrect data types will prevent you from creating certain calculations, deriving hierarchies, or creating proper relationships with other tables

71
Q

What data types does Power Query Support?

A
Decimal Number
Fixed Decimal Number
Whole Number
Percentage
Date/Time
Date
Time
Date/Time/Timezone
Duration
Text
True/False
Binary
72
Q

Why is the ability to combine queries is powerful?

A

The ability to combine queries is powerful because it allows you to append or merge different tables or queries together.

73
Q

What does Power BI do to detect data type?

A

When you import a table from any data source, Power BI Desktop automatically starts scanning the first 1,000 rows (default setting) and tries to detect the type of data in the columns.

74
Q

Under what circumstances can you combine tables into a single table?

A

You can combine tables into a single table in the following circumstances:

  • Too many tables exist, making it difficult to navigate an overly- complicated data model.
  • Several tables have a similar role.
  • A table has only a column or two that can fit into a different table. -You want to use several columns from different tables in a custom column
75
Q

How do you combine queries?

A

After you have finished reformatting, you can combine the queries. On the Home tab on the Power Query Editor ribbon, select the drop-down list for Append Queries. You can select Append Queries as New, which means that the output of appending will result in a new query or table, or you can select Append Queries, which will add the rows from an existing table into another.

76
Q

How is merging queries similar to with respect to SQL?

A

When you merge queries, you are combining the data from multiple tables into one based on a column that is common between the tables. This process is similar to the JOIN clause in SQL.

77
Q

What does it mean when a table is on the one side of a relationship?

A

the key in the table has a unique value for every row

78
Q

What does it mean when a table is on the many side of a relationship?

A

The values in the key column are not necessarily unique for every row & may repeat (also known as Foreign Key)

79
Q

What are the steps to merge tables?

A

Go to Home on the Power Query Editor ribbon and select the Merge Queries drop-down menu, where you can select Merge Queries as New. This selection will open a new window, where you can choose the tables that you want to merge from the drop-down list, and then select the column that is matching between the tables.

80
Q

What are the “join kinds” you can choose from when merging?

A

You can also choose how to join the two tables together, a process that is also similar to JOIN statements in SQL. These join options include:

  • Left Outer - Displays all rows from the first table (left) and only the matching rows from the second.
  • Full Outer - Displays all rows from both tables.
  • Inner - Displays the matched rows between the two tables.
81
Q

When would you use unpivoting?

A

Unpivoting is a useful feature of Power BI. You can use this feature with data from any data source, but you would most often use it when importing data from Excel. The following example shows a sample Excel document with sales data.
Change columns into rows

82
Q

What does unpivoting do with respect to creating DAX measures?

A

Unpivoting streamlines the process of creating DAX measures on the data later.

83
Q

How do you use the Pivot Column feature?

A

Change rows into columns

If the data that you are shaping is flat (in other words, it has lot of detail but is not organized or grouped in any way), the lack of structure can complicate your ability to identify patterns in the data.

You can use the Pivot Column feature to convert your flat data into a table that contains an aggregate value for each unique value in a column. For example, you might want to use this feature to summarize data by using different math functions such as Count, Minimum, Maximum, Median, Average, or Sum.

84
Q

When would you use the delete until end option?

A

Useful when you have more than 2 steps you want to remove

85
Q

What does the transpose feature do?

A

Changes columns to rows and rows to columns

86
Q

What does the group by feature do?

A

Summarize data and group data based on a selected column

87
Q

What does append do?

A

Adds rows when you have the same columns

88
Q

What does merge do?

A

Adds columns when you have the same rows

89
Q

How can you leverage the Advanced Editor to modify Power Query M Code?

A
  • you can do more by writing M queries yourself
  • M is case sensitive language
  • you can view & edit full M queries in the advanced editor
  • the query starts with “let” & ends with “in”
90
Q

Where can you see the steps in a query?

A
  1. Advanced editor

2. Formula bar

91
Q

How do you stop a query from loading?

A

Right-click on the query & select/clear “enable load”

92
Q

While importing data into Power BI, you may encounter errors resulting from factors such as?

A

While importing data into Power BI, you may encounter errors resulting from factors such as:

  • Power BI imports from numerous data sources.
  • Each data source might have dozens (and sometimes hundreds) of different error messages.
  • Other components can cause errors, such as hard drives, networks, software services, and operating systems.
  • Data can often not comply with any specific schema.
93
Q

What is the purpose of the query timeout?

A

Relational source systems often have many people who are concurrently using the same data in the same database. Some relational systems and their administrators seek to limit a user from monopolizing all hardware resources by setting a query timeout. These timeouts can be configured for any timespan, from as little as five seconds to as much as 30 minutes or more.

94
Q

What is a cell-level error?

A

An error when Power BI incorrectly assigns a data type

95
Q

What storage mode options are available for when connecting to Azure Analysis Services?

A

Import and Connect Live