Reinforced Token Optimization

AI & ML interests

None defined yet.

Recent Activity

zkshan2002  published a model about 1 month ago
RTO-RL/Llama3-8B-TDPO
zkshan2002  updated a model about 1 month ago
RTO-RL/Llama3-8B-TDPO
zkshan2002  published a model about 1 month ago
RTO-RL/Llama3-8B-SimPO
View all activity