yuan-yang
/

Tiger-PJ-8B

Model card Files Files and versions Community

yuan-yang commited on Jun 27, 2024

Commit

4b114df

·

verified ·

1 Parent(s): d7bea15

Update README.md

Files changed (1) hide show

README.md +41 -3

README.md CHANGED Viewed

@@ -1,3 +1,41 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+# Tiger Model Card
+## Model details
+Tactic-guided reasoner (Tiger) is a language model that solves *reasoning in the wild* task proposed in paper [Can LLMs Reason in the Wild with Programs](https://arxiv.org/abs/2406.13764).
+It is trained by fine-tuning the LLaMA3-8B model on the [ReWild](https://huggingface.co/datasets/yuan-yang/ReWild) dataset.
+**Model type:**
+This repo contains the LoRA delta weights for `Tiger-PJ-8B`
+We also provide the delta weights of other versions:
+- [Tiger-Routing-8B](https://huggingface.co/yuan-yang/Tiger-Routing-8B/)
+- [Tiger-PJ-8B](https://huggingface.co/yuan-yang/Tiger-PJ-8B)
+- [Tiger-IPJ-8B](https://huggingface.co/yuan-yang/Tiger-IPJ-8B)
+**License:**
+Apache License 2.0
+## Using the model
+Check out how to use the model on our project page:  https://github.com/gblackout/Reason-in-the-Wild/
+**Primary intended uses:**
+Tiger is intended to be used for research.
+## Citation
+```
+@article{yang2024can,
+  title={Can LLMs Reason in the Wild with Programs?},
+  author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz},
+  journal={arXiv preprint arXiv:2406.13764},
+  year={2024}
+}
+```