Clean, transform, and load data in Power BI Flashcards

1
Q

Why is data cleaning important in Power BI?

A

To ensure the accuracy and reliability of data, making it ready for analysis and reporting.

2
Q

What is the first step in the data cleaning process in Power BI?

A

Profiling the data to understand its structure and quality.

3
Q

What tool is commonly used for data cleaning in Power BI?

A

Power Query Editor.

4
Q

What are some common data cleaning tasks in Power BI?

A

Removing duplicates, handling missing values, and correcting data types.
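
As an illustration only, the three tasks can be sketched in plain Python. This is not Power Query code; the column names (`customer`, `amount`) and the rule "missing amounts become 0" are assumptions made for the example.

```python
def clean_rows(rows):
    """Fill missing amounts with 0, cast amounts to int, and drop duplicate rows."""
    seen = set()
    cleaned = []
    for row in rows:
        # Handle missing values: treat None or "" as 0 (assumed replacement rule).
        amount = row["amount"] if row["amount"] not in (None, "") else 0
        # Correct the data type: amounts arrive as text in this hypothetical source.
        fixed = {"customer": row["customer"], "amount": int(amount)}
        # Remove duplicates: keep only the first occurrence of each row.
        key = (fixed["customer"], fixed["amount"])
        if key not in seen:
            seen.add(key)
            cleaned.append(fixed)
    return cleaned


raw = [
    {"customer": "Ana", "amount": "10"},
    {"customer": "Ana", "amount": "10"},   # exact duplicate
    {"customer": "Ben", "amount": None},   # missing value
]
cleaned = clean_rows(raw)
```

In Power Query Editor the same steps are normally applied through the ribbon rather than written by hand.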

5
Q

What is the primary purpose of shaping data in Power BI?

A

To transform and refine raw data into a structured format suitable for analysis.

6
Q

What are some common data shaping tasks in Power BI?

A

Filtering rows, removing columns, grouping data, and creating calculated columns.

7
Q

Which tool in Power BI is used for shaping data?

A

Power Query Editor.

8
Q

How can you ensure data quality during the shaping process?

A

By performing data validation checks and applying consistent data transformation rules.

9
Q

What is the process of grouping data in Power BI?

A

Grouping data involves aggregating data based on specific criteria to summarize and analyze it more effectively.
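
A minimal plain-Python sketch of the idea, assuming a hypothetical sales table with `region` and `units` columns and a sum as the aggregation. It only illustrates what grouping does; it is not how Power BI implements it.

```python
from collections import defaultdict


def group_sum(rows, key, value):
    """Group rows by `key` and sum `value` within each group."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


sales = [
    {"region": "East", "units": 5},
    {"region": "West", "units": 3},
    {"region": "East", "units": 2},
]
totals = group_sum(sales, "region", "units")
```

Three detail rows collapse into one summary row per region, which is the effect the Group By step has in Power Query Editor.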

10
Q

How can you pivot data in Power BI?

A

Use the “Pivot Column” feature in Power Query Editor to transform rows into columns based on a key field.

11
Q

What is the role of the “Unpivot Columns” feature?

A

It transforms columns into rows, making it easier to analyze data that was previously spread across multiple columns.
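
To make the relationship between unpivoting and pivoting concrete, here is a plain-Python sketch under assumed data: a hypothetical table with a `Product` column and one column per month. The output column names `Attribute` and `Value` are chosen for the example.

```python
def unpivot(rows, id_col, value_cols, attr_name="Attribute", value_name="Value"):
    """Turn each listed column into its own row (wide to long)."""
    out = []
    for row in rows:
        for col in value_cols:
            out.append({id_col: row[id_col], attr_name: col, value_name: row[col]})
    return out


def pivot(rows, id_col, attr_name="Attribute", value_name="Value"):
    """Turn attribute/value rows back into one column per attribute (long to wide)."""
    wide = {}
    for row in rows:
        record = wide.setdefault(row[id_col], {id_col: row[id_col]})
        record[row[attr_name]] = row[value_name]
    return list(wide.values())


wide_rows = [
    {"Product": "A", "Jan": 10, "Feb": 12},
    {"Product": "B", "Jan": 7, "Feb": 9},
]
long_rows = unpivot(wide_rows, "Product", ["Jan", "Feb"])
round_trip = pivot(long_rows, "Product")
```

Unpivoting the two month columns yields four rows, and pivoting those rows restores the original two-row table.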

12
Q

What is a calculated column in Power BI?

A

A column created using DAX expressions to perform calculations on existing data within the model.

13
Q

Why might you want to rename queries in Power BI?

A

To make query names more obvious and user-friendly, improving clarity and usability.

14
Q

How can you replace null values in Power Query Editor?

A

By selecting the column, clicking Replace Values, entering null in the Value To Find box, and entering the replacement value (e.g., zero) in the Replace With box.

15
Q

Why is it important to remove duplicate values from columns in Power BI?

A

To ensure data accuracy and uniqueness, which can improve data analysis and reporting.

16
Q

Why is setting the correct data type important in Power BI?

A

It ensures data accuracy and optimizes performance in calculations and visualizations.

17
Q

What are common data types you might set in Power BI?

A

Text, Date, Whole Number, Decimal Number, and Boolean.

18
Q

How can data type mismatches affect Power BI reports?

A

They can lead to errors in calculations, incorrect aggregations, and issues in data visualizations.

19
Q

Why would you want to combine tables in Power BI?

A

To simplify the model by reducing the number of tables, handle tables with similar roles, merge tables with few columns into another, or use columns from different tables in a custom column.

20
Q

What are the two methods to combine tables in Power BI?

A

Merging and appending.

21
Q

What is appending queries in Power BI?

A

Adding rows from one table to another, resulting in a combined table with more rows.

22
Q

How do you append queries in Power BI?

A

In Power Query Editor, select “Append Queries” to add rows to the currently selected query, or “Append Queries as New” to create a new query that holds the combined table.

23
Q

What must be consistent when appending tables in Power BI?

A

The column headers and data types should be the same across all tables being appended.
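
A plain-Python sketch of appending, assuming two hypothetical yearly sales tables with identical headers. The sketch rejects mismatched headers outright; that strictness is an assumption of the example, not a description of how Power Query itself behaves.

```python
def append_tables(*tables):
    """Stack the rows of several tables that share the same column headers."""
    headers = None
    combined = []
    for table in tables:
        for row in table:
            if headers is None:
                headers = set(row)  # the first row seen defines the expected headers
            elif set(row) != headers:
                # Assumption of this sketch: refuse tables whose headers differ.
                raise ValueError(f"unexpected columns: {sorted(row)}")
            combined.append(dict(row))
    return combined


sales_2023 = [{"Region": "East", "Units": 5}]
sales_2024 = [{"Region": "West", "Units": 3}, {"Region": "East", "Units": 8}]
all_sales = append_tables(sales_2023, sales_2024)
```

The result has the same columns as the inputs and as many rows as both tables together.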

24
Q

What is merging queries in Power BI?

A

Combining columns from multiple tables into one based on a common column, similar to SQL JOIN operations.

25
Q

What is a key requirement for merging tables?

A

A common column, or key, that exists in both tables to align the data correctly.

26
Q

What are the join types available in Power BI when merging tables?

A

Left Outer, Right Outer, Full Outer, Inner, Left Anti, and Right Anti. The cards that follow cover Left Outer, Full Outer, and Inner.

27
Q

Describe a Left Outer join in Power BI.

A

It displays all rows from the first table and only the matching rows from the second table.

28
Q

Describe a Full Outer join in Power BI.

A

It displays all rows from both tables, combining data where possible.

29
Q

Describe an Inner join in Power BI.

A

It displays only the rows that have matching values in both tables.

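
The three join types described above can be compared side by side in a plain-Python sketch. The tables (`customers`, `cities`) and the key column `id` are hypothetical, and unmatched rows simply lack the other table's columns here, whereas Power Query shows null in those columns.

```python
def merge(left, right, key, how="inner"):
    """Join two lists of dicts on `key`; `how` is 'inner', 'left', or 'full'."""
    if how not in ("inner", "left", "full"):
        raise ValueError(f"unsupported join type: {how}")
    right_by_key = {}
    for r in right:
        right_by_key.setdefault(r[key], []).append(r)
    result = []
    matched = set()
    for l in left:
        matches = right_by_key.get(l[key], [])
        if matches:
            matched.add(l[key])
            result.extend({**l, **r} for r in matches)
        elif how in ("left", "full"):
            result.append(dict(l))  # keep the unmatched row from the first table
    if how == "full":
        for k, rows in right_by_key.items():
            if k not in matched:
                result.extend(dict(r) for r in rows)  # unmatched rows from the second table
    return result


customers = [{"id": 1, "name": "Ana"}, {"id": 2, "name": "Ben"}]
cities = [{"id": 2, "city": "Oslo"}, {"id": 3, "city": "Rome"}]

inner = merge(customers, cities, "id", how="inner")
left_outer = merge(customers, cities, "id", how="left")
full_outer = merge(customers, cities, "id", how="full")
```

With this data the inner join returns one row, the left outer join two, and the full outer join three.
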
30
Q

What should you do before combining tables in Power BI?

A

Remove any unnecessary columns to streamline the resulting combined table.

31
Q

What is the purpose of profiling data in Power BI?

A

To study the nuances of data, determine anomalies, examine underlying data structures, and query data statistics like row counts and value distributions.

32
Q

What should you do before examining data in Power Query Editor?

A

Learn about the underlying data structures, viewable under the Model tab in Power BI Desktop.

33
Q

What can you do under the Model tab in Power BI Desktop?

A

Edit column and table properties, transform data using the Transform Data button, and manage relationships between tables.

34
Q

What feature in Power Query Editor helps identify data anomalies?

A

The Column Distribution feature under the View tab in the ribbon.

35
Q

What does the Column Quality feature show?

A

Percentages of data that is valid, in error, and empty.

36
Q

What is the significance of Column Distribution in Power Query Editor?

A

It shows the distribution of data within a column and the counts of distinct and unique values.

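
The difference between the two counts is easy to mix up, so here is a small plain-Python sketch. It assumes "distinct" means the number of different values in the column and "unique" means the number of values that appear exactly once; the sample values are made up.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) counts for a column of values."""
    counts = Counter(values)
    distinct = len(counts)                              # how many different values exist
    unique = sum(1 for n in counts.values() if n == 1)  # values that occur only once
    return distinct, unique


colors = ["red", "blue", "red", "green"]
distinct, unique = distinct_and_unique(colors)
```

Here there are three distinct values (red, blue, green) but only two unique ones, because red appears twice.
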
37
Q

What detailed statistics does the Column Profile provide?

A

Row counts, outliers, empty rows, minimum and maximum values, and value distribution.

38
Q

How can you change the default behavior of profiling only the first 1000 rows?

A

Select the profiling status in the status bar and choose "Column profiling based on entire data set."

39
Q

What additional statistics are provided for numeric columns in Column Statistics?

A

Count of zeroes, null values, average value, standard deviation, and counts of even and odd values.

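
As a rough illustration of what these statistics measure, the plain-Python sketch below computes them for a made-up whole-number column in which `None` stands for null. The sample standard deviation is assumed; the sketch does not claim to match Power Query's exact calculation.

```python
import statistics


def numeric_column_stats(values):
    """Summarize a whole-number column; None stands for a null value."""
    present = [v for v in values if v is not None]
    return {
        "zeros": sum(1 for v in present if v == 0),
        "nulls": len(values) - len(present),
        "average": statistics.mean(present),
        "std_dev": statistics.stdev(present),  # sample standard deviation (assumed)
        "even": sum(1 for v in present if v % 2 == 0),
        "odd": sum(1 for v in present if v % 2 != 0),
    }


stats = numeric_column_stats([0, 1, 2, None, 5])
```

The function needs at least two non-null values, because a sample standard deviation is undefined for fewer.
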
40
Q

What is the purpose of the Advanced Editor in Power BI?

A

To view and directly modify the M code that Power Query uses to shape data.

41
Q

How do you access the Advanced Editor in Power Query?

A

By selecting the View ribbon and then clicking on Advanced Editor.

42
Q

What does each step in Power Query roughly correspond to in the Advanced Editor?

A

Each step corresponds to one or two lines of M code.

43
Q

Is it necessary to write M code to use Power Query?

A

No, it is not required, as most steps can be created using the graphical interface.

44
Q

How is M code written and structured?

A

M code is written top-down, where later steps can refer to previous steps by the variable name to the left of the equal sign.

45
Q

What should you be careful about when reordering steps in M code?

A

Reordering steps can break the dependencies between statements, which can cause the data transformation to fail.

46
Q

What is the 'in' statement used for in M code?

A

It defines the final result that the query returns, typically by referring to the last query step.

47
Q

How can the Advanced Editor be useful for customization?

A

It allows for modifications such as changing the name of a database directly in the code.

48
Q

What should you understand about each Power Query step created via the UI?

A

Each step is converted into M code, which can be read and modified in the Advanced Editor.