Update README.md
README.md CHANGED
@@ -81,6 +81,8 @@ model-index:
 ---
 # `stable-code-3b`
 
+Please note: For commercial use, please refer to https://stability.ai/membership.
+
 ## Model Description
 
 `stable-code-3b` is a 2.7 billion parameter decoder-only language model pre-trained on 1.3 trillion tokens of diverse textual and code datasets. `stable-code-3b` is trained on 18 programming languages (selected based on the 2023 StackOverflow Developer Survey) and demonstrates state-of-the-art performance (compared to models of similar size) on the MultiPL-E metrics across multiple programming languages tested using [BigCode's Evaluation Harness](https://github.com/bigcode-project/bigcode-evaluation-harness/tree/main).
@@ -184,7 +186,8 @@ print(tokenizer.decode(tokens[0], skip_special_tokens=True))
 * **Model type**: `stable-code-3b` models are auto-regressive language models based on the transformer decoder architecture.
 * **Language(s)**: English, Code
 * **Library**: [GPT-NeoX](https://github.com/EleutherAI/gpt-neox)
-* **License**:
+* **License**: Stability AI Non-Commercial Research Community License.
+* **Commercial License**: To use this model commercially, please refer to https://stability.ai/membership
 * **Contact**: For questions and comments about the model, please email `[email protected]`
 
 ### Model Architecture
@@ -238,7 +241,7 @@ The model is pre-trained on the aforementioned datasets in `bfloat16` precision,
 
 ### Intended Use
 
-The model is intended to be used as a foundational base model for application-specific fine-tuning. Developers must evaluate and fine-tune the model for safe performance in downstream applications.
+The model is intended to be used as a foundational base model for application-specific fine-tuning. Developers must evaluate and fine-tune the model for safe performance in downstream applications. For commercial use, please refer to https://stability.ai/membership.
 
 ### Limitations and Bias
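For context, the second hunk's header references the README's existing generation example (`print(tokenizer.decode(tokens[0], skip_special_tokens=True))`). Below is a minimal sketch of that kind of usage with Hugging Face `transformers`, assuming the repo id `stabilityai/stable-code-3b` and a CUDA device; the prompt, sampling parameters, and device placement are illustrative assumptions, not the card's exact snippet.

```python
# Minimal usage sketch (assumed repo id: stabilityai/stable-code-3b).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("stabilityai/stable-code-3b")
model = AutoModelForCausalLM.from_pretrained(
    "stabilityai/stable-code-3b",
    torch_dtype=torch.bfloat16,  # the card notes the model was pre-trained in bfloat16
)
model.to("cuda")  # assumes a CUDA device is available

# Prompt with the start of a Python file and let the model continue it.
inputs = tokenizer("import torch\nimport torch.nn as nn\n", return_tensors="pt").to(model.device)
tokens = model.generate(
    **inputs,
    max_new_tokens=48,
    temperature=0.2,
    do_sample=True,
)
print(tokenizer.decode(tokens[0], skip_special_tokens=True))
```

Note that under the new license terms described in this commit, such usage is limited to non-commercial research unless a Stability AI membership covers the deployment.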