tecosys/Nutaan-RL1

Reinforcement Learning

India's Own LLM

tecosys committed 14 days ago

Commit 1cc0eb4 · verified · 1 Parent(s): d9a8745

Update README.md

Files changed (1)

README.md +20 -11

@@ -1,6 +1,21 @@
 ---
 base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
 library_name: peft
 ---
 # Model Card for Model ID
@@ -13,17 +28,11 @@ library_name: peft
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
 ### Model Sources [optional]

 ---
 base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
 library_name: peft
+license: mit
+datasets:
+- openai/gsm8k
+metrics:
+- accuracy
+pipeline_tag: reinforcement-learning
+tags:
+- RL
+- LLM
+- Nutaan
+- India's Own LLM
+- Indian LLM
+- Indian RL
+- GRPO
+- LORA
 ---
 # Model Card for Model ID
 ### Model Description
+Nutaan-RL1 is a large language model fine-tuned with reinforcement learning, specifically GRPO (Group Relative Policy Optimization), under a limited compute budget. It was trained on Google Colab using Unsloth.
+- **Developed by:** [Tecosys Team](https://tecosys.in/)
+- **Model type:** Large Language Model
+- **Finetuned from model:** unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
+- **Trained on:** Google Colab (A100 GPU with 40 GB VRAM, 83.5 GB system RAM, 235.7 GB disk)
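
The description names GRPO and the metadata lists `openai/gsm8k` with an accuracy metric, but the card does not say which reward functions or hyperparameters were used. As a rough illustration only, the sketch below shows the core idea of GRPO: several completions are sampled for one prompt, each is scored, and each score is normalized against the group's mean and standard deviation. The helper names (`extract_final_number`, `correctness_reward`, `group_relative_advantages`), the exact-match reward, the use of the population standard deviation, and the `eps` value are all assumptions made for this example, not the authors' training code.

```python
import re
import statistics


def extract_final_number(text):
    """Return the last number found in `text` (commas removed), or None."""
    matches = re.findall(r"-?\d[\d,]*(?:\.\d+)?", text)
    if not matches:
        return None
    return matches[-1].replace(",", "")


def correctness_reward(completion, reference_answer):
    """Hypothetical reward: 1.0 if the completion's last number equals the
    reference answer, else 0.0. GSM8K reference solutions end with
    '#### <answer>', so the text after the last '####' is the target."""
    target = reference_answer.split("####")[-1].strip().replace(",", "")
    return 1.0 if extract_final_number(completion) == target else 0.0


def group_relative_advantages(rewards, eps=1e-4):
    """Normalize rewards within one group of completions for the same prompt.
    Completions that beat the group mean get a positive advantage."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # assumption: population std
    return [(r - mean) / (std + eps) for r in rewards]


# Made-up completions for a single prompt, scored against one reference.
reference = "She sells 9 eggs at $2 each, so 9 * 2 = 18.\n#### 18"
completions = [
    "9 eggs times 2 dollars is 18.",
    "She makes 20 dollars.",
    "The answer is 18",
    "I am not sure.",
]
rewards = [correctness_reward(c, reference) for c in completions]
advantages = group_relative_advantages(rewards)
print(rewards)
print(advantages)
```

In a full training loop these advantages would weight the policy-gradient update for each completion's tokens. When every completion in a group gets the same reward, all advantages are zero and that prompt contributes no learning signal.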
 ### Model Sources [optional]