|
--- |
|
license: apache-2.0 |
|
--- |
|
|
|
# Tiger Model Card |
|
|
|
## Model details |
|
|
|
Tactic-guided reasoner (Tiger) is a language model that solves *reasoning in the wild* task proposed in paper [Can LLMs Reason in the Wild with Programs](https://arxiv.org/abs/2406.13764). |
|
It is trained by fine-tuning the LLaMA3-8B model on the [ReWild](https://huggingface.co/datasets/yuan-yang/ReWild) dataset. |
|
|
|
**Model type:** |
|
This repo contains the LoRA delta weights for `Tiger-PJ-8B` |
|
|
|
We also provide the delta weights of other versions: |
|
- [Tiger-Routing-8B](https://huggingface.co/yuan-yang/Tiger-Routing-8B/) |
|
- [Tiger-PJ-8B](https://huggingface.co/yuan-yang/Tiger-PJ-8B) |
|
- [Tiger-IPJ-8B](https://huggingface.co/yuan-yang/Tiger-IPJ-8B) |
|
|
|
**License:** |
|
Apache License 2.0 |
|
|
|
## Using the model |
|
|
|
Check out how to use the model on our project page: https://github.com/gblackout/Reason-in-the-Wild/ |
|
|
|
|
|
**Primary intended uses:** |
|
Tiger is intended to be used for research. |
|
|
|
|
|
## Citation |
|
|
|
``` |
|
@article{yang2024can, |
|
title={Can LLMs Reason in the Wild with Programs?}, |
|
author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz}, |
|
journal={arXiv preprint arXiv:2406.13764}, |
|
year={2024} |
|
} |
|
``` |