# GEDI
**G**enerating **E**vent **D**ata with **I**ntentional Features for Benchmarking Process Mining
## Table of Contents
- [Requirements](#requirements)
- [Installation](#installation)
- [Usage](#usage)
- [References](#references)
## Requirements
- [Miniconda](https://docs.conda.io/en/latest/miniconda.html)
- Graphviz installed on your OS, e.g. on macOS:
```console
brew install graphviz
brew install swig
```
- For smac:
```console
conda install pyrfr swig
```
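Before creating the environment, it can help to confirm that the required command-line tools are on your `PATH`. The snippet below is a minimal sketch, not part of GEDI: the helper `missing_tools` and the tool list are illustrative assumptions (`dot` is the executable shipped with Graphviz, `swig` the one shipped with SWIG).

```python
import shutil

# Assumption: these executables correspond to the requirements listed above.
# `dot` comes with Graphviz, `swig` with SWIG, `conda` with Miniconda.
REQUIRED_TOOLS = ("conda", "dot", "swig")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names of the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]
```

Calling `missing_tools()` returns an empty list when everything is installed, and otherwise the names of the tools that still need to be installed.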
## Installation
- `conda env create -f .conda.yml`
- Install [Feature Extractor for Event Data (feeed)](https://github.com/lmu-dbs/feeed) in the newly installed conda environment: `pip install feeed`
### Startup
```console
conda activate gedi
python main.py -o config_files/options/baseline.json -a config_files/algorithm/experiment_test.json
```
## Usage
The pipeline consists of several steps, which can be run in sequence or individually:
- feature_extraction
- generation
- benchmark
- evaluation_plotter
We also include two notebooks that reproduce the experimental results from our paper.
To run a particular step of the GEDI pipeline, adapt the `.json` configuration file passed via `-a` accordingly.
```console
conda activate gedi
python main.py -o config_files/options/baseline.json -a config_files/algorithm/<pipeline-step>.json
```
For a reference of the possible keys and values for each step, see `config_files/algorithm/experiment_test.json`.
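To get a quick overview of which keys a configuration file uses, you can list them with a few lines of Python. This is a schema-agnostic sketch: the helpers `summarize_config` and `summarize_file` are hypothetical and not part of GEDI, and the code makes no assumption about which keys GEDI expects. It only handles the two common JSON layouts, a single object or a list of objects.

```python
import json


def summarize_config(config):
    """Return the sorted top-level keys of each object in a parsed config."""
    # Accept either a single JSON object or a list of objects.
    entries = config if isinstance(config, list) else [config]
    return [sorted(entry) for entry in entries if isinstance(entry, dict)]


def summarize_file(path):
    """Load a JSON config file and summarize its top-level keys."""
    with open(path, encoding="utf-8") as handle:
        return summarize_config(json.load(handle))
```

For example, `summarize_file("config_files/algorithm/experiment_test.json")` returns one list of key names per configured entry.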
To run the whole pipeline, create a new `.json` file that lists all the steps you want to run and specifies the desired keys and values for each step.
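As an alternative to a single combined configuration, the individual steps can be chained from a small script that calls `main.py` once per step. The sketch below assumes there is one configuration file per step named `config_files/algorithm/<pipeline-step>.json`. Those file names, and the helpers `build_command` and `run_steps`, are illustrative assumptions; only the command-line form is taken from the commands above.

```python
import subprocess
import sys
from pathlib import Path

# Options file as used in the commands above.
OPTIONS = Path("config_files/options/baseline.json")
# Assumption: one config file per step, named after the step.
ALGORITHM_DIR = Path("config_files/algorithm")
STEPS = ["feature_extraction", "generation", "benchmark", "evaluation_plotter"]


def build_command(step, python=sys.executable):
    """Return the main.py invocation for one pipeline step as an argument list."""
    config = ALGORITHM_DIR / f"{step}.json"
    return [python, "main.py", "-o", OPTIONS.as_posix(), "-a", config.as_posix()]


def run_steps(steps=STEPS):
    """Run the given steps in order, stopping at the first failing step."""
    for step in steps:
        # check=True raises CalledProcessError if main.py exits with a non-zero code.
        subprocess.run(build_command(step), check=True)
```

Run `run_steps()` from the repository root with the `gedi` environment activated, or pass a shorter list such as `run_steps(["feature_extraction", "generation"])` to run only part of the pipeline.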
## References
The framework used by `GEDI` is taken directly from the original paper by [Maldonado](mailto:[email protected]), Frey, Tavares, Rehwald and Seidl. If you would like to discuss the paper or related research questions on benchmarking process mining tasks, please email the authors.