Le Yu

vanillaOVO

https://yule-buaa.github.io/

yule-BUAA

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Agentic Reinforced Policy Optimization

upvoted a paper 17 days ago

Group Sequence Policy Optimization

authored a paper 18 days ago

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback

Organizations

None yet

Collections 1

Papers 10

arxiv:2507.15024

arxiv:2506.01939

arxiv:2505.10527

arxiv:2505.09388

Models 16

vanillaOVO/WizardLM-13B-V1.2

Text Generation • Updated Jun 20, 2024 • 2

vanillaOVO/WizardCoder-Python-13B-V1.0

Text Generation • Updated Jun 20, 2024 • 2

vanillaOVO/WizardMath-13B-V1.0

Text Generation • Updated Jun 20, 2024 • 239 • 1

vanillaOVO/WizardLM-7B-V1.0

Text Generation • Updated Jun 20, 2024 • 84 • 1

vanillaOVO/WizardMath-7B-V1.0

Text Generation • Updated Jun 20, 2024 • 2

vanillaOVO/WizardCoder-Python-7B-V1.0

Text Generation • Updated Jun 20, 2024 • 909 • 1

vanillaOVO/roberta_base_glue_ckpts

Updated Apr 9, 2024 • 1

vanillaOVO/supermario_v4

Text Generation • 7B • Updated Apr 4, 2024 • 5 • 1

vanillaOVO/supermario_v3

Text Generation • 7B • Updated Apr 4, 2024 • 7

vanillaOVO/supermario_v2

Text Generation • 7B • Updated Apr 4, 2024 • 17 • 1

Datasets 0

None public yet