tecosys commited on
Commit
1cc0eb4
·
verified ·
1 Parent(s): d9a8745

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -11
README.md CHANGED
@@ -1,6 +1,21 @@
1
  ---
2
  base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
3
  library_name: peft
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
  ---
5
 
6
  # Model Card for Model ID
@@ -13,17 +28,11 @@ library_name: peft
13
 
14
  ### Model Description
15
 
16
- <!-- Provide a longer summary of what this model is. -->
17
-
18
-
19
-
20
- - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
- - **Model type:** [More Information Needed]
24
- - **Language(s) (NLP):** [More Information Needed]
25
- - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
 
28
  ### Model Sources [optional]
29
 
 
1
  ---
2
  base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
3
  library_name: peft
4
+ license: mit
5
+ datasets:
6
+ - openai/gsm8k
7
+ metrics:
8
+ - accuracy
9
+ pipeline_tag: reinforcement-learning
10
+ tags:
11
+ - RL
12
+ - LLM
13
+ - Nutaan
14
+ - India's Own LLM
15
+ - Indian LLM
16
+ - Indian RL
17
+ - GRPO
18
+ - LORA
19
  ---
20
 
21
  # Model Card for Model ID
 
28
 
29
  ### Model Description
30
 
31
+ Nutaan-RL1 is an innovate reinforcement model with the reinforcement learning & GRPO with limited number of resources. The model is trained on google colab with unsloth.
32
+ - **Developed by:** Tecosy Team [Check Here](https://tecosys.in/)
33
+ - **Model type:** Large Language Model
34
+ - **Finetuned from model:** meta-llama-3.1-8b-instruct-bnb-4bit
35
+ - **Trained on:** Google Colab [40 GB VRAM A100 GPU, System RAM: 83.5 GB, 235.7 GB Disk]
 
 
 
 
 
 
36
 
37
  ### Model Sources [optional]
38