Power BI - Clean, Transform, and Load Data Flashcards

1
Q

How do you promote headers?

A

You can promote headers in two ways:

  1. By selecting the Use First Row as Headers option on the Home tab
  2. By selecting the drop-down button next to Column1 and then selecting Use First Row as Headers.
2
Q

How do you rename column headers?

A

You can rename column headers in two ways.

  1. Right-click the header, select Rename, edit the name, and then press Enter.
  2. Double-click the column header and overwrite the name with the correct name.
3
Q

What do the load and transform tools allow you to do?

A
  • Load: Automatically load your data into a Power BI model in its current state.
  • Transform Data: Open your data in Microsoft Power Query, where you can perform actions such as deleting unnecessary rows or columns, grouping your data, removing errors, and many other data quality tasks.
4
Q

What do you need to do to correct column names that are not interpreted as headers?

A

When a table is created in Power BI Desktop, Power Query Editor assumes that all data belongs in table rows. However, a data source might have a first row that contains column names. To correct this inaccuracy, you need to promote the first table row into column headers.
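As a rough illustration of what promoting headers does, here is a plain-Python analogue. It is not Power Query code; the sample rows and the helper name `promote_headers` are hypothetical.

```python
# Illustrative only: a plain-Python analogue of promoting the first row to headers.
# The sample rows and the helper name are hypothetical.

def promote_headers(rows):
    """Use the first row as column names and return the remaining rows as dicts."""
    if not rows:
        return []
    header, *data = rows
    return [dict(zip(header, row)) for row in data]

raw_rows = [
    ["Country", "Year", "Sales"],  # first row actually holds the column names
    ["Norway", 2023, 120],
    ["Chile", 2023, 95],
]

table = promote_headers(raw_rows)
print(table[0])
```

After promotion, the first row no longer counts as data, so the table has one row fewer than the raw input.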

5
Q

How early do you want to remove unnecessary columns in the data shaping process?

A

A key step in the data shaping process is to remove unnecessary columns, and it is much better to remove them as early as possible. One way to do this is to limit the columns when you get data from the data source. For instance, if you are extracting data from a relational database by using SQL, you would limit the columns that you extract by using a column list in the SELECT statement.
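A minimal sketch of limiting columns at the source, using Python's built-in sqlite3 module with an in-memory database. The table `sales_orders` and its columns are made up for the example.

```python
import sqlite3

# Hypothetical table with one column the report does not need.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales_orders "
    "(order_id INTEGER, customer TEXT, freight REAL, internal_notes TEXT)"
)
conn.executemany(
    "INSERT INTO sales_orders VALUES (?, ?, ?, ?)",
    [(1, "Contoso", 12.5, "n/a"), (2, "Fabrikam", 8.0, "rush")],
)

# Name only the columns you need instead of using SELECT *.
cursor = conn.execute(
    "SELECT order_id, freight FROM sales_orders ORDER BY order_id"
)
column_names = [description[0] for description in cursor.description]
rows = cursor.fetchall()
conn.close()

print(column_names)
print(rows)
```

Because the unneeded columns never leave the database, later shaping steps have less data to move and process.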

6
Q

What does Unpivot Columns do?

A

Turns columns into rows. Each selected column becomes a row with two fields: the column name (Attribute) and its value (Value).
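The idea can be sketched in plain Python. This is an analogue of the transformation, not Power Query itself; the helper `unpivot` and the sample data are hypothetical.

```python
# Illustrative only: turn every non-identifier column into (Attribute, Value) rows.

def unpivot(rows, id_columns, attribute_name="Attribute", value_name="Value"):
    long_rows = []
    for row in rows:
        for column, value in row.items():
            if column in id_columns:
                continue  # identifier columns are repeated on every output row
            new_row = {key: row[key] for key in id_columns}
            new_row[attribute_name] = column
            new_row[value_name] = value
            long_rows.append(new_row)
    return long_rows

wide = [
    {"Product": "Bike", "2022": 10, "2023": 14},
    {"Product": "Helmet", "2022": 30, "2023": 25},
]

long = unpivot(wide, id_columns=["Product"])
for row in long:
    print(row)
```

Two wide rows with two year columns each become four long rows, one per product and year.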

7
Q

What does Pivot Columns do?

A

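Turns name/value rows into columns. Each distinct value in the name column becomes a new column, filled with the matching entries from the value column.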
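A plain-Python analogue of the same idea follows. The helper `pivot` and the sample rows are hypothetical, and the sketch does no aggregation: if a name repeats for the same key, the last value wins.

```python
# Illustrative only: spread (name, value) rows into one column per distinct name.

def pivot(rows, key_column, name_column, value_column):
    wide = {}
    for row in rows:
        # One output record per key; insertion order of keys is preserved.
        record = wide.setdefault(row[key_column], {key_column: row[key_column]})
        record[row[name_column]] = row[value_column]
    return list(wide.values())

long_rows = [
    {"Product": "Bike", "Attribute": "2022", "Value": 10},
    {"Product": "Bike", "Attribute": "2023", "Value": 14},
    {"Product": "Helmet", "Attribute": "2022", "Value": 30},
    {"Product": "Helmet", "Attribute": "2023", "Value": 25},
]

wide_rows = pivot(long_rows, "Product", "Attribute", "Value")
for row in wide_rows:
    print(row)
```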

8
Q

What does unpivoting do with respect to creating DAX measures?

A

Unpivoting streamlines the process of creating DAX measures on the data later. Once the data is unpivoted, you also have a simpler way of slicing it.

9
Q

What are your options in the Navigator window?

A

You now have the option to select the Load button to automatically load your data into the Power BI model or select the Transform Data button to launch the Power Query Editor, where you can review and clean your data before loading it into the Power BI model.

10
Q

What is Group by columns?

A

The process of shaping data by converting your flat data into a table that contains an aggregation value for each unique value in a column.
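As a small sketch of the concept in plain Python (not Power Query), the following sums a value column for each unique value in a grouping column. The helper name and sample orders are hypothetical.

```python
from collections import defaultdict

# Illustrative only: one aggregated value (a sum) per unique value in a column.

def group_by_sum(rows, group_column, value_column):
    totals = defaultdict(float)
    for row in rows:
        totals[row[group_column]] += row[value_column]
    return dict(totals)

orders = [
    {"Country": "Norway", "Sales": 120.0},
    {"Country": "Chile", "Sales": 95.0},
    {"Country": "Norway", "Sales": 30.0},
]

sales_by_country = group_by_sum(orders, "Country", "Sales")
print(sales_by_country)
```

Three flat rows collapse into two grouped rows, one per country.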

11
Q

What is the purpose of data profiling?

A

Profiling data is about studying the nuances of the data: determining anomalies, examining and developing the underlying data structures, and querying data statistics such as row counts, value distributions, minimum and maximum values, averages, and so on.
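To make the listed statistics concrete, here is a minimal plain-Python sketch that computes them for one column. The helper `profile_column` and the sample values are hypothetical, and the sketch assumes the column has at least one non-null value.

```python
from collections import Counter
from statistics import mean

# Illustrative only: basic profiling statistics for a single column.
# Assumes at least one non-null value is present.

def profile_column(values):
    present = [value for value in values if value is not None]
    return {
        "row_count": len(values),
        "null_count": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "average": mean(present),
        "value_distribution": dict(Counter(present)),
    }

freight = [5, 10, 10, None, 15]
stats = profile_column(freight)
print(stats)
```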

12
Q

Why is data profiling important?

A

Data profiling is important because it lets you shape and organize the data so that interacting with it and identifying its distribution is uncomplicated. That, in turn, makes the front-end work of developing report elements far easier.

13
Q

How do you examine the data structures?

A

Before you begin examining the data in Power Query Editor, you should first learn about the underlying data structures that data is organized in. You can view the current data model under the Model tab on Power BI Desktop.

14
Q

How does Power BI determine data anomalies?

A

Power Query Editor determines data anomalies by using the Column Distribution feature.

15
Q

How many rows does Power Query examine by default? How do you increase this?

A

By default, Power Query examines the first 1000 rows of your data set. To change this, select the profiling status in the status bar and select Column profiling based on entire data set.

16
Q

What does column distribution show you?

A

Column distribution shows you the distribution of the data within the column, along with the counts of distinct and unique values. Distinct counts the different values in the column; unique counts the values that appear only once.
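The two counts are easy to confuse. The sketch below, in plain Python with a hypothetical helper and sample data, assumes that "distinct" means the number of different values and "unique" means the number of values that occur exactly once.

```python
from collections import Counter

# Illustrative only: distinct = number of different values,
# unique = number of values that occur exactly once.

def distinct_and_unique(values):
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for occurrences in counts.values() if occurrences == 1)
    return distinct, unique

cities = ["Oslo", "Oslo", "Lima", "Quito", "Lima"]
distinct, unique = distinct_and_unique(cities)
print(distinct, unique)
```

Here there are three different cities, but only "Quito" appears a single time.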

17
Q

What happens in the background each time you create a step in Power Query?

A

Each time you shape data in Power Query, you create a step in the Power Query process. Those steps can be reordered, deleted, and modified where it makes sense. Each cleaning step that you made was likely created by using the graphical interface, but Power Query uses the M language behind the scenes.

18
Q

Why do you need to be careful about reordering steps in M?

A

Be careful about reordering these steps because each step usually refers to an earlier step, so moving one can break the statement dependencies.

19
Q

How do you write a query formula?

A

A query formula is written as a series of named steps inside a “let” expression, and the “in” statement specifies which step the query returns. Generally, the last query step is used as the “in” result, that is, the final data set.

20
Q

What does Column profile show you?

A

Column profile shows column statistics and a value distribution for the selected column. The statistics provide several different values, including the count of rows, which is important when verifying whether the import of your data was successful.

21
Q

What does the Value distribution graph show?

A

Tells you the counts for each unique value in that specific column.

22
Q

Why would you want to replace null values?

A

Occasionally, you might find that your data sources contain null values. For example, a freight amount on a sales order might be null when it is really synonymous with zero. If the value stays null, the averages will not calculate correctly. One solution is to change the nulls to zero, which produces a more accurate freight average. In this instance, you can use Replace Values to replace the null values with zero.
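A small arithmetic sketch in plain Python shows how much the result can shift. It assumes the averaging step skips nulls, which is a common convention but is stated here as an assumption; the freight values are made up.

```python
# Illustrative only: the same column averaged two ways.
# Assumption: an average over nulls skips the null rows entirely.

def average_ignoring_nulls(values):
    present = [value for value in values if value is not None]
    return sum(present) / len(present)

def average_with_nulls_as_zero(values):
    filled = [0 if value is None else value for value in values]
    return sum(filled) / len(filled)

freight = [10.0, None, 20.0, None]

skipped = average_ignoring_nulls(freight)     # (10 + 20) / 2
zeroed = average_with_nulls_as_zero(freight)  # (10 + 0 + 20 + 0) / 4
print(skipped, zeroed)
```

If a null freight really means "no freight charged", the second figure is the one that reflects all four orders.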

23
Q

What does Power BI do to detect data type?

A

When you import a table from any data source, Power BI Desktop automatically starts scanning the first 1,000 rows (default setting) and tries to detect the type of data in the columns.
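The sketch below shows why detection based on a sample can be misled. It is a hypothetical plain-Python heuristic, not Power BI's actual detection logic; the function name, the type labels, and the sample column are all assumptions made for illustration.

```python
# Illustrative only: guess a column type from the first `sample_size` values.
# This is NOT Power BI's algorithm, just a sketch of sample-based detection.

def detect_type(values, sample_size=1000):
    sample = [v for v in values[:sample_size] if v not in ("", None)]
    if not sample:
        return "text"
    for type_name, parser in (("whole number", int), ("decimal number", float)):
        try:
            for value in sample:
                parser(value)  # raises ValueError if the text does not parse
            return type_name
        except ValueError:
            continue  # try the next, more permissive type
    return "text"

# 1,000 numeric-looking rows followed by one that is not numeric.
column = ["1"] * 1000 + ["N/A"]

from_sample = detect_type(column)                          # sees only rows 1-1000
from_all_rows = detect_type(column, sample_size=len(column))
print(from_sample, from_all_rows)
```

A value that first appears after the sampled rows never influences the guess, which is one reason to review detected types before loading.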

24
Q

What will incorrect data types result in?

A

Incorrect data types will prevent you from creating certain calculations, deriving hierarchies, or creating proper relationships with other tables.