bradhiltonendercorp committed on
Commit e355c71 · verified · 1 Parent(s): 4009aac

Update README.md

  1. README.md +6 -4
README.md CHANGED
@@ -15,9 +15,11 @@ library_name: transformers
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/674a1d102c0f27a385772cfe/JauBmEQM0FpOdShBMSfst.png)
 
-Deductive Reasoning Qwen 14B is a reinforcement fine-tune of Qwen 2.5 14B Instruct to solve challenging deduction problems from the Temporal Clue dataset, trained by [OpenPipe](https://openpipe.ai)!
+Deductive Reasoning Qwen 14B is a reinforcement fine-tune of [Qwen 2.5 14B Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) to solve challenging deduction problems from the [Temporal Clue](https://github.com/bradhilton/temporal-clue) dataset, trained by [OpenPipe](https://openpipe.ai)!
+
+Here are some additional resources to check out:
 
 - Blog Post
-- Training Recipe
-- Raw Experiments Codebase
-- Deductive Reasoning Qwen 32B
+- [Training Recipe](https://github.com/openpipe/deductive-reasoning)
+- [RL Experiments](https://github.com/openpipe/rl-experiments)
+- [Deductive Reasoning Qwen 32B](https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-32B)