Jianbo Wu's picture

Jianbo Wu

jwu323

AI & ML interests

None yet

Recent Activity

Organizations

whoisking's profile picture SimpleBerry Research Lab's profile picture

jwu323's activity

reacted to qq8933's post with ๐Ÿ‘€ 15 days ago
view post
Post
2521
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.
ยท
reacted to qq8933's post with ๐Ÿš€ 16 days ago
reacted to qq8933's post with ๐Ÿš€ 24 days ago
view post
Post
3037
  • 3 replies
ยท
reacted to qq8933's post with ๐Ÿš€ 27 days ago
view post
Post
1343
LLaMA-O1 Base and SFT model will be uploaded to HF today.
RLHF pipeline already ready, still waiting for data sampling.
  • 1 reply
ยท
reacted to their post with ๐Ÿš€ 30 days ago
view post
Post
1351
We are excited to announce a new internal project, Rome, focused on advancing LLM reasoning. The code and accompanying paper will be released soon. Stay tuned!
  • 2 replies
ยท
posted an update 30 days ago
view post
Post
1351
We are excited to announce a new internal project, Rome, focused on advancing LLM reasoning. The code and accompanying paper will be released soon. Stay tuned!
  • 2 replies
ยท