SJTULean
/

LeanFormalizer_PPO

Model card Files Files and versions

Inuyasha2023ch commited on Dec 25, 2024

Commit

0638917

·

verified ·

1 Parent(s): 4d206b5

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -9,4 +9,5 @@ metrics:
 - accuracy
 base_model:
 - Qwen/Qwen2.5-7B-Instruct
----

 - accuracy
 base_model:
 - Qwen/Qwen2.5-7B-Instruct
+---
+Through PPO using a reward model trained on merely 5% of the LeanStatement_RL dataset, our model achieved enhanced performance, reaching a **pass@1 compilation success rate** of **95.3%** (465/488) on MiniF2F and **76.0%** (284/374) on ProofNet benchmarks.