README.md · hivex-research/hivex-DBR-PPO-baseline-task-0-difficulty-10 at main

metadata

library_name: hivex
original_train_name: DroneBasedReforestation_difficulty_10_task_0_run_id_2_train
tags:
  - hivex
  - hivex-drone-based-reforestation
  - reinforcement-learning
  - multi-agent-reinforcement-learning
model-index:
  - name: hivex-DBR-PPO-baseline-task-0-difficulty-10
    results:
      - task:
          type: main-task
          name: main_task
          task-id: 0
          difficulty-id: 10
        dataset:
          name: hivex-drone-based-reforestation
          type: hivex-drone-based-reforestation
        metrics:
          - type: cumulative_distance_reward
            value: 2.4901976776123047 +/- 0.7106346342581482
            name: Cumulative Distance Reward
            verified: true
          - type: cumulative_distance_until_tree_drop
            value: 73.15180267333984 +/- 16.01239171149343
            name: Cumulative Distance Until Tree Drop
            verified: true
          - type: cumulative_distance_to_existing_trees
            value: 59.689389877319336 +/- 11.847134878664495
            name: Cumulative Distance to Existing Trees
            verified: true
          - type: cumulative_normalized_distance_until_tree_drop
            value: 0.2490197652578354 +/- 0.07106346368414662
            name: Cumulative Normalized Distance Until Tree Drop
            verified: true
          - type: cumulative_tree_drop_reward
            value: 6.189901051521301 +/- 2.069236630928566
            name: Cumulative Tree Drop Reward
            verified: true
          - type: out_of_energy_count
            value: 0.9284761929512024 +/- 0.0666754640473818
            name: Out of Energy Count
            verified: true
          - type: recharge_energy_count
            value: 9.823968200683593 +/- 1.0843417843839367
            name: Recharge Energy Count
            verified: true
          - type: tree_drop_count
            value: 1.0422539913654327 +/- 0.06928386006526491
            name: Tree Drop Count
            verified: true
          - type: cumulative_reward
            value: 10.091075601577758 +/- 2.9491417551616106
            name: Cumulative Reward
            verified: true

This model serves as the baseline for the Drone-Based Reforestation environment, trained and tested on task 0 with difficulty 10 using the Proximal Policy Optimization (PPO) algorithm.

Environment: Drone-Based Reforestation
Task: 0
Difficulty: 10
Algorithm: PPO
Episode Length: 2000
Training max_steps: 1200000
Testing max_steps: 300000

Train & Test Scripts
Download the Environment