license: apache-2.0

Tiger Model Card

Model details

Tactic-guided reasoner (Tiger) is a language model that solves the reasoning-in-the-wild task proposed in the paper "Can LLMs Reason in the Wild with Programs?". It is trained by fine-tuning the LLaMA3-8B model on the ReWild dataset.

Model type: This repo contains the LoRA delta weights for Tiger-PJ-8B, not the full model weights; they are applied on top of the LLaMA3-8B base model.

We also provide the delta weights of other versions:

License: Apache License 2.0

Using the model

Check out how to use the model on our project page: https://github.com/gblackout/Reason-in-the-Wild/
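
Because this repo ships only LoRA delta weights, they have to be attached to a LLaMA3-8B base checkpoint before use. The following is a minimal sketch, not the project's official loading code: it assumes the weights are stored in a PEFT-compatible adapter format, and both Hub identifiers (`BASE_MODEL_ID` and `ADAPTER_ID`) are assumptions that this model card does not confirm. The project page remains the authoritative reference.

```python
# Assumed identifiers: neither is confirmed by this model card.
BASE_MODEL_ID = "meta-llama/Meta-Llama-3-8B"  # assumed LLaMA3-8B base checkpoint
ADAPTER_ID = "yuan-yang/Tiger-PJ-8B"          # assumed Hub ID of this repo


def load_tiger(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base model and attach the LoRA delta weights on top of it.

    Assumes the delta weights are in a format that PEFT can read.
    """
    # Imported lazily so this file can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    # Wrap the base model with the adapter weights from the Hub.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    return tokenizer, model
```

Calling `tokenizer, model = load_tiger()` downloads both checkpoints, so it needs network access, permission to fetch the base model, and the `transformers` and `peft` packages installed.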

Primary intended uses: Tiger is intended to be used for research.

Citation

@article{yang2024can,
  title={Can LLMs Reason in the Wild with Programs?},
  author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz},
  journal={arXiv preprint arXiv:2406.13764},
  year={2024}
}