stabilityai
/

stable-code-instruct-3b

Text Generation

Model card Files Files and versions

reshinthadith commited on Mar 25, 2024

Commit

fd8e2d7

·

verified ·

1 Parent(s): 7beb3b0

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -82,7 +82,11 @@ model-index:
 `stable-code-instruct-3b` is a 2.7B billion parameter decoder-only language model tuned from [`stable-code-3b`](https://huggingface.co/stabilityai/stable-code-3b/). This model was trained on a mix of publicly available datasets, synthetic datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
 This instruct tune demonstrates state-of-the-art performance (compared to models of similar size) on the MultiPL-E metrics across multiple programming languages tested using [BigCode's Evaluation Harness](https://github.com/bigcode-project/bigcode-evaluation-harness/tree/main), and on the code portions of
-[MT Bench](https://klu.ai/glossary/mt-bench-eval)
 ## Usage
@@ -152,8 +156,8 @@ output = tokenizer.batch_decode(tokens[:, inputs.input_ids.shape[-1]:], skip_spe
 | DeepSeek Coder              | 1.3B | 4.6             |
 | Stable Code Instruct (DPO)  | 3B   | **5.8**(ours)             |
 | Stable Code Instruct (SFT)  | 3B   | 5.5             |
-| CodeLlama Instruct          | 7B   | 3.55            |
 | DeepSeek Coder              | 6.7B | **6.9**             |
 | StarChat2                   | 15B  | 5.7             |

 `stable-code-instruct-3b` is a 2.7B billion parameter decoder-only language model tuned from [`stable-code-3b`](https://huggingface.co/stabilityai/stable-code-3b/). This model was trained on a mix of publicly available datasets, synthetic datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
 This instruct tune demonstrates state-of-the-art performance (compared to models of similar size) on the MultiPL-E metrics across multiple programming languages tested using [BigCode's Evaluation Harness](https://github.com/bigcode-project/bigcode-evaluation-harness/tree/main), and on the code portions of
+[MT Bench](https://klu.ai/glossary/mt-bench-eval).
+The model is finetuned to make it useable in tasks like,
+  - General purpose Code/Software Engineering like conversations.
+  - Function Calling
+  - SQL related generation and conversation.
 ## Usage
 | DeepSeek Coder              | 1.3B | 4.6             |
 | Stable Code Instruct (DPO)  | 3B   | **5.8**(ours)             |
 | Stable Code Instruct (SFT)  | 3B   | 5.5             |
 | DeepSeek Coder              | 6.7B | **6.9**             |
+| CodeLlama Instruct          | 7B   | 3.55            |
 | StarChat2                   | 15B  | 5.7             |