A dataset and RL-zero pipeline for advanced mathematical reasoning in informal theorem proving.
Jiahao Xu
Jiahao004
AI & ML interests
Sentence Embeddings; Neural Machine Translation
Recent Activity
updated
a dataset
about 14 hours ago
Jiahao004/agentllm_trainingset
commented on
a paper
29 days ago
Reasoning with Exploration: An Entropy Perspective