keras
/

opt_1.3b_en

Text Generation

KerasHub

Keras

English

Model card Files Files and versions Community

Divyasreepat commited on Mar 24

Commit

9d49d33

verified ·

1 Parent(s): 258503d

Update README.md with new model card content

Browse files

Files changed (1) hide show

README.md +28 -0

README.md CHANGED Viewed

@@ -22,6 +22,34 @@ warranties or conditions of any kind. The underlying model is provided by a
 third party and subject to a separate license, available
 [here](https://github.com/facebookresearch/fairseq/).
 __Arguments__

 third party and subject to a separate license, available
 [here](https://github.com/facebookresearch/fairseq/).
+## Links
+* [OPT Quickstart Notebook](https://www.kaggle.com/code/laxmareddypatlolla/opt-quickstart-notebook)
+* [OPT API Documentation](https://keras.io/keras_hub/api/models/opt/)
+* [KerasHub Beginner Guide](https://keras.io/guides/keras_hub/getting_started/)
+* [KerasHub Model Publishing Guide](https://keras.io/guides/keras_hub/upload/)
+## Installation
+Keras and KerasHub can be installed with:
+```
+pip install -U -q keras-Hub
+pip install -U -q keras
+```
+Jax, TensorFlow, and Torch come preinstalled in Kaggle Notebooks. For instructions on installing them in another environment see the [Keras Getting Started](https://keras.io/getting_started/) page.
+## Presets
+The following model checkpoints are provided by the Keras team. Full code examples for each are available below.
+| Preset name    | Parameters | Description                                      |
+|----------------|------------|--------------------------------------------------|
+| opt_1.3b_en | 125.24M     | 12-layer OPT model where case in maintained. Trained on BookCorpus, CommonCrawl, Pile, and PushShift.io corpora. |
+| opt_125m_en | 1.32B    | 24-layer OPT model where case in maintained. Trained on BookCorpus, CommonCrawl, Pile, and PushShift.io corpora. |
+| opt_2.7b_en| 2.70B    | 32-layer OPT model where case in maintained. Trained on BookCorpus, CommonCrawl, Pile, and PushShift.io corpora. |
+| opt_6.7b_en| 6.70B    | 32-layer OPT model where case in maintained. Trained on BookCorpus, CommonCrawl, Pile, and PushShift.io corpora. |
 __Arguments__