Data transformation in Tableau Prep Flashcards

1
Q

The _ tab allows you to select the connection to your data, which can be stored in a file (e.g., Excel), on a cloud server (e.g., BigQuery), or a different type of data storage solution.

A

Connections

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How many ways are there to connect to your data?

A
  1. “Connection” Tab
  2. “Connect to Data” button

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can you select the connection to your data?

A

With the Connections tab at the upper left corner

e.g. Connections > To a File > Text File

OR

Select “Connect to Data” at the top of screen, above “Recent Flows”

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The connection to your data can be stored [in a file/ on a cloud server/both]

A

both

also a different type of data storage solution

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to edit an existing data flow

A

Select “Open a Flow” (Above “Recent Flows”)

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to create a new data flow

A

Select “Connect to Data” (Above “Recent Flows”)

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an easy way to familiarize yourself with the functionalities available in Tableau Prep Builder?

A
  1. Open the 2 “Sample Flows” at the bottom of the screen and view how the data flows were set up and manipulate them
  2. View “Training” and “Resources” on the “Discover” tab

(Image 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

T/F

After Connecting to the data, you need to update the parameter settings

A

True

After connecting, you click the table on the left side and change the settings as needed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What do the settings in Image 2, where will the column header and data values start?

A

Column header will be the first row and the data values will start from the second row

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

T/F

Tableau Prep Builder cannot detect various data types (e.g., dates, numeric values, strings) and relationships between data fields.

A

False

Tableau Prep Builder can detect various data types (e.g., dates, numeric values, strings) and relationships between data fields.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

T/F

Tableau Prep is helpful for loading data from multiple data sources that have different data types

A

False

Tableau Prep is helpful for loading data from multiple data sources that have the same data types

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Tableau Prep is helpful for loading data from multiple data sources that have the same data types bc ___

A

Tableau Prep Builder detects various data types (e.g., dates, numeric values, strings) and relationships between data fields.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

After opening the dataset, the second step is to _

A

clean the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you clean a dataset?

A
  1. click on the + icon next to the dataset in the top workspace
  2. select “Clean Step”

(Image 3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Change data types, grouping similar values together, and splitting column values is an example of _ in Tableau Prep

A

Cleaning data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can you replace the negative values with null values in a column (let’s call it “Founded” column)?

What are the steps?

Image 6 shows what you have to what you want the chart to look like

A

1- Get to “Clean Step”:

  • (click + icon next to the dataset in the top workspace
  • “Clean Step”) (Image 3)

2- Create a calculated field:

  • Select the “…” button in the Founded column
  • “Create Calculated Field”
  • “Custom Calculation” (Image 4)
  • Fill in “Field Name” with “Replace Negative with Nulls” &
  • write “IF [Founded] < 0 THEN NULL ELSE [Founded] END”)
  • “Save” (Image 5)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

T/F

Aggregating data allows you to adjust to the level of detail of your data table.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Aggregating data allows you to _

A

adjust to the level of detail of your data table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

T/F

Aggregating data allows you to combine data from multiple tables together that have the same level of detail.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

T/F

Aggregating data allows you to combine data from multiple tables together that have different levels of detail.

A

False

Aggregating data allows you to combine data from multiple tables together that have the same level of detail.

21
Q

How can you aggregate data in Tableau Prep?

A

1- Add aggregate step to workflow canvas:

  • (click + icon next to the dataset in the top workspace
  • “Aggregate”) (Image 7) Results in Image 8

2- Add the appropriate fields:

  • drag and drop any additional fields on the left pane onto the fields on the right** Image 9**
  • Change to the appropriate aggretation (e.g. change from SUM to AVG)

See Image 10 for result

22
Q

How can you create a pivot in Tableau Prep?

A

1- Add “Pivot”:

  • (click + icon next to the dataset in the top workspace
  • “Pivot”) (Image 11)

2- Edit “Pivoted Fields” pane:

  • Select appropriate option: Columns to Rows VS Rows to Columns (Image 12)
  • Drag & Drop the appropriate fields to “Pivoted Fields” panel
23
Q

How can you create a join in Tableau Prep?

(Assume that “Table 1” is already in the Flow and “Table 2” is already connected)

A

Create a Join interface:

Way 1: Drag “Table 1” to the right of “Table 2”
* Hover “Table 2” onto the RIGHT of “Table 1”
* drop it in the “Join” popup. (See Image 15 & 16 to see that the table can be original or cleaned)

Way 2 ** Add a Join Step**
1. Click + icon next to the dataset in the top workspace and select “Join”) (Image 3)
2. Hover Table 2 over to the LEFT of the Join step
3. Drop Table 2 into the “Add” popup. (Note that if you hover to the RIGHT/BELOW the Join step (using the Join/Union popups), it will create another Join/Union step. Not what we want. We want to add a Table to Join step we just created in step 1 of Way 2.)

After either Way 1 or 2, edit join type & the join clause

Results: Image 13

24
Q

How can you create a union in Tableau Prep?

(Assume that “Table 1” is already in the Flow and “Table 2” is already connected)

A

Create a Union interface:

1a- (Way 1) Drag “Table 1” below “Table 2”
* Hover “Table 2” UNDER “Table 1” and drop it in the “Union” popup. (See Image 15 & 16 to see that the table can be original or cleaned.)

1b- (Way 2)** Add a Union Step**
1. Click + icon next to the dataset in the top workspace and select “Union”) (Image 3)
2. Hover the 2nd table over to the LEFT of the Union step
3. Drop into the “Add” popup. (Image 17) (Note that if you hover to the RIGHT or BELOW of the Union step (using the Join/Union popups), it will create another Join/Union step. Not what we want. We want to add a Table to the existing Union step.)

Results: Image 14

25
Q

T/F

The Union interface has info on resulting and mismatched fields.

26
Q

T/F

To make your data run faster, rename steps to reflect their purpose or the transformation they perform.

A

False

To make your flow easier to understand and maintain, rename steps to reflect their purpose or the transformation they perform.

27
Q

T/F

Using multi-field operations where possible helps to minimize steps, avoiding unnecessary complexity.

28
Q

Which of these will simplify your workflow:
* cleaning and validating data
* limiting initial rows
* using meaningful step names
* using sampling
* using calculated fields wisely
* minimizing steps
* filtering early

A
  • Use meaningful step names: Rename steps to reflect their purpose or the transformation they perform. This practice makes your flow easier to understand and maintain.
  • Minimize steps: While it’s important to break down your workflow into logical steps, avoid unnecessary complexity. Use multi-field operations where possible and consolidate steps that perform similar transformations.
29
Q

Which of these will optimize preformance:
* cleaning and validating data
* limiting initial rows
* using sampling
* using calculated fields wisely
* minimizing steps
* filtering early

A
  • limiting initial rows
  • using sampling
  • filtering early
30
Q

Apply filters as [early/late/centric] as possible in your flow to reduce the volume of data processed in subsequent steps.

31
Q

When exploring and building your flow, limit the number of [columns/rows] initially loaded to speed up performance.

32
Q

Where can you limit the number of rows initially loaded?

A

In the input step setting

33
Q

Try to keep your columns below _

34
Q

The more columns that are in your input data, the [larger/smaller] the number of rows that Prep will allow

35
Q

T/F

Prep will somtimes apply sampling if the number of rows in your dataset is larger than the maximum number of rows.

A

False

Prep will only apply sampling if the number of rows in your dataset is larger than the maximum number of rows.

https://www.tableau.com/blog/how-to-use-sampling-in-tableau-prep

36
Q

Prep will only apply sampling if the number of rows in your dataset is larger than the maximum number of rows.

A

True

https://www.tableau.com/blog/how-to-use-sampling-in-tableau-prep

37
Q

Which of these will Ensure Data Quality:
* cleaning and validating data
* limiting initial rows
* using sampling
* using calculated fields wisely
* minimizing steps
* filtering early

A
  • Clean and validate data: Utilize the Clean step to address data quality issues such as duplicates, nulls, and outliers. Validate your data at various stages of the flow to ensure accuracy.
  • Use calculated fields wisely: Calculated fields are powerful for data transformation, but complex calculations can impact performance and readability.
38
Q

T/F

The Clean step is used to address data quality issues such as duplicates, nulls, and outliers.

39
Q

T/F

Validate your data at early stages of the flow to ensure accuracy.

A

False

Validate your data at various stages of the flow to ensure accuracy.

40
Q

T/F

Tableau Desktop focuses on data preperation & cleaning.

A

False

Tableau Prep Builder focuses on data preperation & cleaning.

41
Q

Tableau Desktop VS Tableau Prep Builder

What’s their focus?

A

Visualizing data & creating dashboards

VS

data preparation & cleaning

42
Q

T/F

Tableau Prep has a coding interface.

43
Q

A Table is a dataset

44
Q

A file is a dataset

45
Q

A dataset is what?

A

A dataset is simply the collection of data, often organized as tables or files, that you use to create visualizations and dashboards.

46
Q

T/F

Dataset, datasource, data frame, and database are all the same thing.

47
Q

What are 3 other names for data source

A

Dataset, data frame, and database

48
Q

A database with multiple tables is called a

A

Relational Database