arxiv:2412.18619
Ge Zhang
zhangysk
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Agent-R: Training Language Model Agents to Reflect via Iterative
Self-Training
authored
a paper
17 days ago
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive
Survey
upvoted
a
paper
18 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language
Models
Organizations
Papers
70
models
None public yet
datasets
None public yet