W11 Validating Simulation Models Flashcards
Verification
does the code ask what the specification asks?
Structural Validation?
Does model correctly represent problem space?
Input/Output Validation
Does simulation correctly represent an instance of hte problem.
Calibration:
adjust input values to get valid output values
Simulation Data (3 types)
Input Data
Process Data
Output (Result) Data
Input Data
collected or inferred from empirical status quo
Process Data
generated during simulation process
provides insights into model that are not empirically available or relevant for simulation purpose
Output (Result) Data
Indicators calculated for validation or what-if analysis, matching empirical indicators
Input Data:
kinds of data?
direct observations
->transparent systems
indirect inference
->empirical process and result indicators, e.g. customer choice
Scenarios:
stochastic
best/worst case
discrete
Railway Ticket as Input.
What data?
directly observable:
supply: products capacity, price categories, availabilities
demand: historical sales
indirect inference:
customer loyalty
reference prices
wtp
Input Data on the Future
1 generate insights about empirical data
2 simulate future scenarios by forecasting from 1
Process Data Types
empirically-available
- forecasts
- plans
- documented events
simulation-exclusive process data
- decisions
- learned experience
- communications
Result Data Types
empirically available
- system reports
- transaction data
simulation exclusive
- long term developments
- what if developments
- internal states
Usage of Results Data
output validation (compare results to empirical data)
sensitivity and meta modeling: analyze relationship in the black box
data farming: simulations generate artificial transaction data
Data Farming
simulation models run multiple times to provide insights into different consequences of different options
- generate much data
- success dependent on validity
- simple: monte carlo
- mechanik: discrete
emergent: agent-based
Monte Carlo Simulations
Problem: too little empirical data
Idea: create data based on fitting distributions
Change: meaningful conclusions from enriched sata
Risk: tainted by assumptions about distribution