Update README.md with new model card content
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ T5 encoder-decoder backbone model.
|
|
9 |
T5 is a LLM pretrained on a mix of unsupervised and supervised tasks,
|
10 |
where each task is converted to a sequence-to-sequence format.
|
11 |
T5 works well on a variety of tasks out-of-the-box by prepending
|
12 |
-
various
|
13 |
`"translate English to German: ..."`, for summarization:
|
14 |
`"summarize: ..."`.
|
15 |
|
@@ -23,6 +23,41 @@ preset architectures and weights, use the `from_preset` constructor.
|
|
23 |
Disclaimer: Pre-trained models are provided on an "as is" basis, without
|
24 |
warranties or conditions of any kind.
|
25 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
__Arguments__
|
28 |
|
|
|
9 |
T5 is a LLM pretrained on a mix of unsupervised and supervised tasks,
|
10 |
where each task is converted to a sequence-to-sequence format.
|
11 |
T5 works well on a variety of tasks out-of-the-box by prepending
|
12 |
+
various prefixes to the input sequence, e.g., for translation:
|
13 |
`"translate English to German: ..."`, for summarization:
|
14 |
`"summarize: ..."`.
|
15 |
|
|
|
23 |
Disclaimer: Pre-trained models are provided on an "as is" basis, without
|
24 |
warranties or conditions of any kind.
|
25 |
|
26 |
+
## Links
|
27 |
+
|
28 |
+
* [T5 Quickstart Notebook](coming soon)
|
29 |
+
* [T5 API Documentation](https://keras.io/keras_hub/api/models/t5/)
|
30 |
+
* [T5 Model Card](https://github.com/google-research/text-to-text-transfer-transformer/tree/main)
|
31 |
+
* [KerasHub Beginner Guide](https://keras.io/guides/keras_hub/getting_started/)
|
32 |
+
* [KerasHub Model Publishing Guide](https://keras.io/guides/keras_hub/upload/)
|
33 |
+
|
34 |
+
## Installation
|
35 |
+
|
36 |
+
Keras and KerasHub can be installed with:
|
37 |
+
|
38 |
+
```
|
39 |
+
pip install -U -q keras-hub
|
40 |
+
pip install -U -q keras
|
41 |
+
```
|
42 |
+
|
43 |
+
Jax, TensorFlow, and Torch come preinstalled in Kaggle Notebooks. For instructions on installing them in another environment see the [Keras Getting Started](https://keras.io/getting_started/) page.
|
44 |
+
|
45 |
+
## Presets
|
46 |
+
|
47 |
+
The following model checkpoints are provided by the Keras team. Full code examples for each are available below.
|
48 |
+
| Preset name | Parameters | Description |
|
49 |
+
|----------------|------------|--------------------------------------------------|
|
50 |
+
| t5_small_multi | 0 | 8-layer T5 model. Trained on the Colossal Clean Crawled Corpus (C4).|
|
51 |
+
| t5_base_multi| 0 | 12-layer T5 model. Trained on the Colossal Clean Crawled Corpus (C4). |
|
52 |
+
| t5_large_multi | 0 | 24-layer T5 model. Trained on the Colossal Clean Crawled Corpus (C4). |
|
53 |
+
| flan_small_multi | 0 | 8-layer T5 model. Trained on the Colossal Clean Crawled Corpus (C4). |
|
54 |
+
| flan_base_multi | 0 | 12-layer T5 model. Trained on the Colossal Clean Crawled Corpus (C4). |
|
55 |
+
| flan_large_multi | 0 | 24-layer T5 model. Trained on the Colossal Clean Crawled Corpus (C4). |
|
56 |
+
| t5_1.1_small | 60.51M | |
|
57 |
+
| tt5_1.1_base | 247.58M | |
|
58 |
+
| t5_1.1_large | 750.25M | |
|
59 |
+
| t5_1.1_xl | 2.85B | |
|
60 |
+
| t5_1.1_xxl | 11.14B | |
|
61 |
|
62 |
__Arguments__
|
63 |
|