1. Extracting and Pre-Processing Event Logs Flashcards
Why do we have to extract event logs?
- Event data is recorded as it occurs and thus never grouped into traces or event logs
- An event records multiple attributes (not just the name)
What is an event table/ stream?
A raw logging format for events
How do you show that an event e attribute a is undefined?
π(e,a) = ⊥
What do we require for each event?
What do we require for each event in an event table?
What do we need (in addition) for an event log?
- The attribute time is defined
- e has a value for some attribute other than time (so has at least one meaningful observation)
- Event table = finite sequence of events that all have the same attribute a defined (this could be an activity name of other measurement)
- A case identifier (this is usually an entity type attribute but could technically be any attribute except time- depends on the question we want to answer)
What is the difference between a general attribute and an event type?
Event types (i.e. users, orders, customers, deliveries) refer to specific/ unique entities or objects
To figure out if something is an entity type we need domain knowledge and additional context
If we use id as a case identifier c and π_id(e) = c, how do we describe this?
event e is correlated to case c
If all events correlated to a case c carry the same value v for a certain attribute x then what do we call x?
A case attribute of c
If that applies to every case c then x is a global case attribute
What is a trace?
A sequence of events correlated to a case and ordered by time
What is a structured event log?
What structure does it have?
Set of cases where each case is associated with exactly one trace with this case as a case attribute
Hierarchical structure
1. Cases
2. Case attributes as children (one of them being the trace)
3. Each event as children including their attributes (i.e. timestamp, observed activity, case identifier)
Can cases share events?
No
What is the difference between an activity and an event?
An event e ∈ E describes that a specific discrete ob-servation has been made (by a sensor, a system, a human observer, etc.). E.g. from an event table: it is an event that has a time: 19/12/2018 15:46, when payment was received. Handled by user System. The attribute time is thus defined and carries a value for some other attribute.
An activity is a specific action that can be executed or observed (so think it will be more generic like - making payment)
What is an event attribute?
An attribute only specified for certain types of events (i.e. with a certain kind of activity) or where the value is specific to the event (i.e. not all events in the same case share the same value)
Is an event atomic?
Yes, event should have single timestamp (not a start/ end timestamp)
To handle this we introduce lifecycle transitions- start/ complete (describe status of longer running activity- for example, two events with same activity name but with different lifecycle transitions- a start and end)
What is an event classifier?
Function that maps each event to a value. The value of an event classifier for a specific event will be an event class. If two events have the same value then they belong to the same event class.
For example, the standard event classifier is the activity name classifier class.
We can make a simple trace from any event classifier as a sequence of the values (ignoring the missing values).
If we want to use a particular event classifier for an analysis but an event has no value defined for that particular classifier, what should we do?
Omit the event from our analysis.