ETL Data Pipelines Flashcards
What is Process Intelligence ETL?
The data ingestion component. It automates data extraction and transformation from external source systems and loads the data directly into SAP Signavio Process Intelligence.
How is this set up in Process Intelligence?
There is no need to configure a staging environment. However, if you're extracting from an on-premise system, additional setup is required on the system side.
[creation of ETL Data Pipelines] What happens during the Extract phase?
- configure data sources
- configure integrations
- click extract
[creation of ETL Data Pipelines] What happens during the Transform phase?
- configure business objects [SQL scripts]
- preview the SQL scripts
[creation of ETL Data Pipelines] What happens during the Load phase?
- select or create a process
- click run
What does SAP Signavio ETL use to carry out ETL?
It uses standard connectors and provides an interface to extract, transform and load data. All interaction stays within the system.
What are the 3 main components of PI ETL?
1. data source management
2. integration management
3. data model management
What do the 3 main components do?
They are the integrated features of ETL which, together, set up the data pipelines.
What is data source management?
The framework to manage online data sources. It includes credential management and scheduling.
What does a data source establish?
A connection to the source system.
What is integration management?
The framework to define what, how and when to extract data. It includes pseudonymisation (techniques that replace, remove or transform information that identifies individuals, keeping that information separate) and partitioning schemas.
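For intuition, here is a minimal sketch of what pseudonymisation amounts to at the SQL level, assuming a dialect with a SHA2 function; in SAP Signavio it is configured as part of the integration rather than hand-written, and the table/column names (VBAK, VBELN, ERNAM) are illustrative SAP ECC examples:

```sql
-- Pseudonymisation sketch: replace the identifying user name with a one-way hash.
SELECT
    VBELN,                               -- sales document number (not personal data)
    SHA2(ERNAM, 256) AS ERNAM_PSEUDONYM  -- creator's user name, replaced by a hash
FROM VBAK
```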
How do you extract when the system is on-premise?
On-premises extractors are needed. The extraction can then be set up under integrations, where the specific tables and the schedules for continuous loads are defined.
What are the two options for integration?
- simple method
- intricate method
What is the simple method?
Select the tables and fields through a graphical interface.
You can add a partitioning strategy for large tables and define a field as the delta criterion for continuous loads.
What is the intricate method?
Write your own extraction scripts in SQL.
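A minimal sketch of such a script, assuming an SAP ECC source; the table and field names (VBAK, VBELN, ERDAT, ERNAM, AUART) are illustrative, and the exact SQL dialect and delta-placeholder syntax depend on the connector:

```sql
-- Custom extraction script sketch: pull sales order headers from SAP ECC.
SELECT
    VBELN,  -- sales document number (candidate case ID)
    ERDAT,  -- creation date
    ERNAM,  -- creator's user name
    AUART   -- sales document type
FROM VBAK
WHERE ERDAT >= '2024-01-01'  -- stands in for the delta criterion of a continuous load
```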
When integrating, what is the default extraction time and why?
Midnight (although the schedule for continuous loads can be customised),
because data extraction should run when it has the least impact on the source system's load.
What is data model management?
The framework to transform your data into an event log, starting from the connected integrations and extractions.
This is also where you connect your data to an investigation to start the process analysis.
It includes process-oriented data modelling, SQL editors for the transformations, and live previews of the transformed data.
What do you do in a data model?
Define how the ETL data pipeline extracts and transforms data, and where to load the data.
5 steps to creating and using a data model
1. create a new data model
2. select the source system
3. select the data model template
4. select the integration
5. select the configured data source
A data model in SAP Signavio ETL has different sections - what is shown in the extraction section?
- the connector: how the data on the source system is accessed, e.g. SAP ECC
- the integration: what data was extracted from the source system; new data can be added or a preselected model template used
A data model in SAP Signavio ETL has different sections - what is shown in the transformation section?
[It looks like a BPMN model.]
1. process events: representing the main events
2. business objects: the transformation rules for the case attributes and events
What is an event collector?
The SQL script that defines how the records for one event type are generated for the event log.
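As a hedged sketch, an event collector's script typically emits one row per event occurrence with a case ID, event name and timestamp; the output column names below are assumptions based on a generic event log, not Signavio's exact schema:

```sql
-- Event collector sketch: derive "Create Sales Order" events from VBAK.
SELECT
    VBELN                AS case_id,     -- the case the event belongs to
    'Create Sales Order' AS event_name,  -- fixed name of this event type
    ERDAT                AS event_time   -- when the event happened
FROM VBAK
```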
How can you adjust transformation rules?
Using SQL.
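For example, a rule could be adjusted to add a case attribute. The sketch below assumes a hypothetical cases table produced by earlier transformation steps; all names are illustrative:

```sql
-- Adjusted transformation rule sketch: attach the sales document type
-- as a case attribute by joining the extracted VBAK rows onto the cases.
SELECT
    c.case_id,
    v.AUART AS document_type  -- new case attribute
FROM cases AS c
JOIN VBAK  AS v
    ON v.VBELN = c.case_id
```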