Data Requirements and Setup Flashcards

1
Q

What is data transformation?

A

changing the format, structure or values of raw data from ERP-systems in order to upload a file to the process mining system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the file uploaded to the process mining system called?

A

event log

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does event log contain?

A

all recorded events with their timestamp assigned to certain case IDs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Examples of how data transformation is typically achieved?

A

-Translation and mapping
-Filtering, aggregation, and summarization
-Enrichment and imputation
-Indexing and ordering
-Anonymization and encryption
-Modeling, typecasting, formatting, and renaming

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What happens to every step in a system?

A

they get recorded and leave behind a trace

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What happens to all changes and transactions referring to business objects/steps?

A

they’re stored in a database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does process intelligence do with the data objects/steps stored?

A

those details can be explored, extracted and transformed in a way that allows backtracking of all the steps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The recreated steps are stored in what

A

the event log

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

For analysis purpose, what format should the data be?

A

uniform and standardised, there can be differences especially if they come from different source systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Steps for data transformation

A
  1. Definition of the target format
  2. Conversion of the extracted data
  3. Saving the converted data into a new file
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is data transformation necessary?

A

all data is stored in different tables and we need to ensure the extracted data is linked to specific cases

eg. system should know that Order ID 123 in the order table and Invoice ID 456 in the invoice table belong to the same case

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does case ID define?

A

the scope of the process. It determines where the process starts and ends

eg. in the procurement process, case ID would be the puchase document ID and every single request is a new case

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the last part of ETL?

A

data load phase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the data load phase cover?

A

This covers the tasks to upload the transformed data into the process mining system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What 3 questions need to be addressed for your data load.

A
  1. which upload method is required?
    could be manual eg. csv or automatic eg. API
  2. should the existing data be replaced, or should new ones be attached?
    depends on scenario, appending new data to existing might be useful eg. annual data set created from extracting at the end of each quarter
  3. how often should new data be uploaded
    depends on availability of data, could be hourly if live
How well did you know this?
1
Not at all
2
3
4
5
Perfectly