Di Zhang

qq8933

AI & ML interests

AI4Chem, LLM, Green LLM

Organizations

AI4Chem, SimpleBerry Research Lab

qq8933's activity

posted an update 6 days ago
replied to their post 14 days ago
posted an update 14 days ago
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will be released this weekend.
We have implemented a novel reinforcement fine-tuning (RFT) pipeline that teaches models to reason and to label rewards without human annotation.
posted an update 15 days ago
New activity in qq8933/AIME_1983_2024 21 days ago

how about 2024 I

#2 opened 3 months ago by hl0737