	---
	license: apache-2.0
	---

	# Tiger Model Card

	## Model details

	Tactic-guided reasoner (Tiger) is a language model that solves the reasoning-in-the-wild task proposed in the paper [Can LLMs Reason in the Wild with Programs?](https://arxiv.org/abs/2406.13764).
	It is trained by fine-tuning the LLaMA3-8B model on the [ReWild](https://huggingface.co/datasets/yuan-yang/ReWild) dataset.

	Model type:
	This repo contains the LoRA delta weights for `Tiger-PJ-8B`.

	Delta weights are available for the following versions:
	- [Tiger-Routing-8B](https://huggingface.co/yuan-yang/Tiger-Routing-8B/)
	- [Tiger-PJ-8B](https://huggingface.co/yuan-yang/Tiger-PJ-8B)
	- [Tiger-IPJ-8B](https://huggingface.co/yuan-yang/Tiger-IPJ-8B)

	License:
	Apache License 2.0

	## Using the model

	Check out how to use the model on our project page: https://github.com/gblackout/Reason-in-the-Wild/
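
The project page remains the authoritative guide. As a rough illustration of what applying LoRA delta weights to a base model can look like, here is a minimal sketch using the `transformers` and `peft` libraries. It rests on two assumptions that this card does not confirm: that the base checkpoint is `meta-llama/Meta-Llama-3-8B`, and that this repo is stored in the standard PEFT adapter format. The helper name `load_tiger` is hypothetical.

```python
# Assumption: the base checkpoint is the Llama 3 8B base model on the Hub.
BASE_MODEL_ID = "meta-llama/Meta-Llama-3-8B"
# This repository, which holds the LoRA delta weights.
ADAPTER_ID = "yuan-yang/Tiger-PJ-8B"


def load_tiger(base_model_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base model and attach the LoRA adapter on top of it.

    Hypothetical helper: it assumes the adapter repo is in PEFT format.
    """
    # Heavy dependencies are imported lazily, only when loading is requested.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base_model = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        torch_dtype=torch.bfloat16,  # half precision to reduce memory use
    )
    # Wrap the base model with the LoRA weights from the adapter repo.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()
    return tokenizer, model
```

Calling `tokenizer, model = load_tiger()` downloads both the base model and the adapter, so it needs access to the base checkpoint and enough memory for an 8B-parameter model. Prompt formats and inference scripts are documented on the project page.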


	Primary intended uses:
	Tiger is intended to be used for research.


	## Citation

	```
	@article{yang2024can,
	title={Can LLMs Reason in the Wild with Programs?},
	author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz},
	journal={arXiv preprint arXiv:2406.13764},
	year={2024}
	}
	```