Model Card for ahalev/mcuu-table-2-o55132xz

This model corresponds to run(s) in Table 2, specifically that with the hyperparameters:

1) {'scenario': 0, 'forecast_horizon': 6, 'intrinsic_reward_weight': 0.01, 'bound_reward_weight': nan, 'noise_std': 0.01}

Usage

from trainer import Trainer
trainer = Trainer.from_pretrained('ahalev/mcuu-table-2-o55132xz')
algo, env = trainer.algo, trainer.env

# Get an action from a random observation
action, _ = algo.policy.get_action(env.observation_space.sample())

# Evaluate the policy over 2920 timesteps
evaluation = trainer.evaluate()

For more information, see the repo and the paper.

This model was created by @ahalev.

Downloads last month
2
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model’s pipeline type.

Collection including ahalev/mcuu-table-2-o55132xz