Reinforced Token Optimization
Team members: 1
RTO-RL's activity
zkshan2002 published 2 models 4 months ago
RTO-RL/Llama3-8B-RTO_RPP • 8B • Updated Apr 10 • 4 • 1
RTO-RL/Llama3-8B-RPP • 8B • Updated Apr 10 • 4 • 1
zkshan2002 published a model 6 months ago
RTO-RL/Llama3-8B-TDPO • 8B • Updated Feb 11 • 2 • 1
zkshan2002 updated a model 6 months ago
RTO-RL/Llama3-8B-TDPO • 8B • Updated Feb 11 • 2 • 1
zkshan2002 published a model 6 months ago
RTO-RL/Llama3-8B-SimPO • 8B • Updated Feb 11 • 1
zkshan2002 updated a model 6 months ago
RTO-RL/Llama3-8B-SimPO • 8B • Updated Feb 11 • 1
zkshan2002 published a model 6 months ago
RTO-RL/Llama3-8B-RDPO • 8B • Updated Feb 11 • 1 • 1
zkshan2002 updated a model 6 months ago
RTO-RL/Llama3-8B-RDPO • 8B • Updated Feb 11 • 1 • 1
zkshan2002 published a model 6 months ago
RTO-RL/Llama3-8B-PPO • 8B • Updated Feb 11 • 10 • 1
zkshan2002 updated 5 models 6 months ago
RTO-RL/Llama3-8B-PPO • 8B • Updated Feb 11 • 10 • 1
RTO-RL/Llama3-8B-RTO • 8B • Updated Feb 11 • 5 • 1
RTO-RL/Llama3.2-1B-RewardModel • 1B • Updated Feb 11 • 1.76k
RTO-RL/Llama3-8B-RewardModel • 8B • Updated Feb 11 • 5
RTO-RL/Llama3-8B-DPO • 8B • Updated Feb 11 • 1.49k
zkshan2002 published a model 6 months ago
RTO-RL/Llama3-8B-RTO • 8B • Updated Feb 11 • 5 • 1
zkshan2002 updated a model 6 months ago
RTO-RL/Llama3-8B-RTO • 8B • Updated Feb 11 • 5 • 1