Spaces:

andreamalhera
/

igedi

Running

File size: 1,850 Bytes

bdf9096
d9456f6
bdf9096
d9456f6
 
 
 
 
 
 
b061aff
bdf9096
 
 
 
 
b061aff
bdf9096
 
 
 
 
468cde1
bdf9096
 
 
 
 
9aa529a
bdf9096
 
d9456f6
 
 
e1621a0
8742124
d9456f6
 
 
 
 
 
9aa529a
d9456f6
 
 
 
 
 
15d25aa

# GEDI
**G**enerating **E**vent **D**ata with **I**ntentional Features for Benchmarking Process Mining

## Table of Contents

- [Requirements](#requirements)
- [Installation](#installation)
- [Usage](#usage)
- [References](#references)

## Requirements
- [Miniconda](https://docs.conda.io/en/latest/miniconda.html)
- Graphviz on your OS e.g.
For MacOS:
```console
brew install graphviz
brew install swig
```
- For smac:
```console
conda install pyrfr swig
```
## Installation
- `conda env create -f .conda.yml`
- Install [Feature Extractor for Event Data (feeed)](https://github.com/lmu-dbs/feeed) in the newly installed conda environment: `pip install feeed`

### Startup
```console
conda activate gedi
python main.py -o config_files/options/baseline.json -a config_files/algorithm/experiment_test.json
```
## Usage
Our pipeline offers several pipeline steps, which can be run sequentially or partially:
- feature_extraction
- generation
- benchmark
- evaluation_plotter

We also include two notebooks, which output experimental results as in our paper.

To run different steps of the GEDI pipeline, please adapt the `.json` accordingly.
```console
conda activate gedi
python main.py -o config_files/options/baseline.json -a config_files/algorithm/<pipeline-step>.json
```
For reference of possible keys and values for each step, please see `config_files/algorithm/experiment_test.json`.
To run the whole pipeline please create a new `.json` file, specifying all steps you want to run and specify desired keys and values for each step. 

## References
The framework used by `GEDI` is taken directly from the original paper by [Maldonado](mailto:[email protected]), Frey, Tavares, Rehwald and Seidl. If you would like to discuss the paper, or corresponding research questions on benchmarking process mining tasks please email the authors.