krasserm
/

gba-planner-7B-v0.1-GGUF

Model card Files Files and versions Community

krasserm commited on Jun 2, 2024

Commit

70fd12b

·

verified ·

1 Parent(s): bf5cb22

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -3,7 +3,7 @@ license: apache-2.0
 datasets:
 - krasserm/gba-trajectories
 ---
-A planner LLM [fine-tuned on synthetic trajectories](https://krasserm.github.io/2024/05/31/planner-fine-tuning/) from an agent simulation. It can be used in [ReAct](https://arxiv.org/abs/2210.03629)-style LLM agents where [planning is separated from function calling](https://krasserm.github.io/2024/03/06/modular-agent/). Trajectory generation and planner fine-tuning are described in the [grammar-based-agents](https://github.com/krasserm/grammar-based-agents) project.
 The planner has been fine-tuned on the [krasserm/gba-trajectories](https://huggingface.co/datasets/krasserm/gba-trajectories) dataset. The original QLoRA adapters of this model are available at [krasserm/gba-planner-7B-v0.1](https://huggingface.co/krasserm/gba-planner-7B-v0.1).
@@ -87,4 +87,4 @@ The planner learned a (static) set of available tools during fine-tuning. These
 | `use_bash`         | Useful for executing commands in a Linux bash.                                            |
 | `final_answer`     | Useful for providing the final answer to a request. Must always be used in the last step. |
-The framework provided by the [grammar-based-agents](https://github.com/krasserm/grammar-based-agents) project can easily be adjusted to a different set of tools for specialization to other application domains.

 datasets:
 - krasserm/gba-trajectories
 ---
+A planner LLM [fine-tuned on synthetic trajectories](https://krasserm.github.io/2024/05/31/planner-fine-tuning/) from an agent simulation. It can be used in [ReAct](https://arxiv.org/abs/2210.03629)-style LLM agents where [planning is separated from function calling](https://krasserm.github.io/2024/03/06/modular-agent/). Trajectory generation and planner fine-tuning are described in the [bot-with-plan](https://github.com/krasserm/bot-with-plan) project.
 The planner has been fine-tuned on the [krasserm/gba-trajectories](https://huggingface.co/datasets/krasserm/gba-trajectories) dataset. The original QLoRA adapters of this model are available at [krasserm/gba-planner-7B-v0.1](https://huggingface.co/krasserm/gba-planner-7B-v0.1).
 | `use_bash`         | Useful for executing commands in a Linux bash.                                            |
 | `final_answer`     | Useful for providing the final answer to a request. Must always be used in the last step. |
+The framework provided by the [bot-with-plan](https://github.com/krasserm/bot-with-plan) project can easily be adjusted to a different set of tools for specialization to other application domains.