File size: 2,974 Bytes
0982a20 bc5ba67 28a4471 e5bc2df 0982a20 9755981 0982a20 28a4471 0982a20 5af760d 0982a20 9755981 0982a20 28a4471 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 |
---
license: llama3
datasets:
- ajibawa-2023/Code-290k-ShareGPT
- m-a-p/CodeFeedback-Filtered-Instruction
- m-a-p/Code-Feedback
- microsoft/orca-math-word-problems-200k
language:
- en
tags:
- code
- Python
- Cpp
- PHP
- JS
- Java
- Rust
- Ruby
- SQL
- MySql
- R
- Julia
---
**Code-Llama-3-8B**
This Model is trained on refined version of my dataset [Code-290k-ShareGPT](https://huggingface.co/datasets/ajibawa-2023/Code-290k-ShareGPT).
Besides this it is trained on following datasets:
[Code-Feedback](https://huggingface.co/datasets/m-a-p/Code-Feedback)
[orca-math-word-problems-200k](https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k)
[CodeFeedback-Filtered-Instruction](https://huggingface.co/datasets/m-a-p/CodeFeedback-Filtered-Instruction)
The idea was to check how this Model will perform with both Code & Maths datasets. This model is very good with Coding.
Maths outputs are also very good. You can test out this model.
It is very very good in Code generation in various languages such as **Python, Java, JavaScript, GO, C++, Rust, Ruby, Sql, MySql, R, Julia, Haskell**, etc..
This model will also generate detailed explanation/logic behind each code.
This Model is trained on massive datasets so the results are very good. You can check the Examples given below.
I have used ChatML prompt format.
This is Fully Finetuned Model. Quantized model will be updated very soon.
**GGUF & Exllama**
GGUF: TBA
Exllama v2: TBA
**Training:**
Entire dataset was trained on 4 x A100 80GB. For 2 epoch, training took more than 160 Hours. Axolotl & Deepspeed codebase was used for training purpose.
Entire data is trained on Llama-3-8B by Meta.
**Example Prompt:**
This model uses **ChatML** prompt format.
```
<|im_start|>system
You are a helpful Coding assistant.<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant
```
You can modify above Prompt as per your requirement.
One example will be:
```
This is a conversation with your helpful Coding assistant. Assistant can generate Code in various Programming Languages along with necessary explanation.
```
I want to say special Thanks to the Open Source community for helping & guiding me to better understand the AI/Model development.
Thank you for your love & support.
**Example Output**
Example 1
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/KJpeOSl9CHaBSqAp09iG5.jpeg)
Example 2
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/XYnq7bzKSGj4reMr7sYmU.jpeg)
Example 3
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/bdJZvHtrG7cSBKXOEbHa2.jpeg)
Example 4
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/xViwKIT3laev5F9xZbs3-.jpeg)
Example 5
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/O33l9tj93EVDPE34Q7ZHY.jpeg) |