MT5 release The MT5 release follows the T5 family, but is pretrained on multilingual data. The update UMT5 models are pretrained on an updated corpus. Collection by google 18 days ago 16 google/mt5-base Text2Text Generation • Updated Jan 24, 2023 • 98.4k • 204 google/mt5-large Text2Text Generation • Updated Jan 24, 2023 • 81.7k • 85 google/umt5-small Text2Text Generation • Updated Jul 6, 2023 • 7.34k • 21 google/umt5-xl Text2Text Generation • Updated Jul 3, 2023 • 1.74k • 16
My work Collection by quchenyuan Sep 12, 2023 - Multi-view Self-supervised Disentanglement for General Image Denoising Paper • 2309.05049 • Published Sep 10, 2023 • 1
Multi-view Self-supervised Disentanglement for General Image Denoising Paper • 2309.05049 • Published Sep 10, 2023 • 1
AI_MultiModal Collection by ET01 Oct 12, 2023 - Build error 125 ⚡ Qwen VL liuhaotian/llava-v1.5-13b Image-Text-to-Text • Updated May 9, 2024 • 104k • 490
T5 release The original T5 transformer release was done in two steps, the original T5 checkpoints and the improved T5v1 Collection by google 18 days ago 11 google-t5/t5-base Translation • Updated Feb 14, 2024 • 2M • 654 google-t5/t5-small Translation • Updated Jun 30, 2023 • 6.03M • 374 google-t5/t5-large Translation • Updated Apr 6, 2023 • 333k • 185 Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Paper • 1910.10683 • Published Oct 23, 2019 • 10
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Paper • 1910.10683 • Published Oct 23, 2019 • 10
nlp Collection by netapy Sep 12, 2023 - When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale Paper • 2309.04564 • Published Sep 8, 2023 • 15
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale Paper • 2309.04564 • Published Sep 8, 2023 • 15
Flan-T5 release The Flan-T5 covers 4 checkpoints of different sizes each time. It also includes upgrades versions trained using Universal sampling Collection by google 18 days ago 21 google/flan-t5-small Text2Text Generation • Updated Oct 10, 2023 • 592k • 295 google/flan-t5-base Text2Text Generation • Updated Jul 17, 2023 • 598k • 827 google/flan-t5-large Text2Text Generation • Updated Jul 17, 2023 • 1.47M • • 649 google/flan-t5-xxl Text2Text Generation • Updated Jul 27, 2023 • 735k • 1.22k
aa Collection by liankafohali Sep 12, 2023 - csukuangfj/sherpa-ncnn-streaming-zipformer-small-bilingual-zh-en-2023-02-16 Updated May 23, 2023 • 3