prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
Transformers
Safetensors
English
llama
GRPO
Reinforcement learning
trl
SFT
conversational
text-generation-inference
License: llama3.2
prithivMLmods committed on Feb 2
Commit 40ce38e · verified · 1 parent: f793075
Update README.md
Files changed (1)
README.md (+1, -0)
@@ -10,6 +10,7 @@ tags:
 - GRPO
 - Reinforcement learning
 - trl
+- SFT
 ---
 # **Bellatrix-Tiny-1B-R1**
 