Muennighoff
commited on
Commit
•
b21a48a
1
Parent(s):
81d56cf
Update README.md
Browse files
README.md
CHANGED
@@ -671,6 +671,64 @@ model-index:
|
|
671 |
- **Point of Contact:** [Niklas Muennighoff](mailto:[email protected])
|
672 |
- **Languages:** Refer to [BLOOM](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
|
673 |
- **BLOOMZ & mT0 Model Family:**
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
674 |
|Name|Explanation|
|
675 |
|----|-----------|
|
676 |
|[bloomz-560m](https://huggingface.co/bigscience/bloomz-560m)| 560M parameter multitask finetuned version of [bloom-560m](https://huggingface.co/bigscience/bloom-560m) on [xP3](https://huggingface.co/datasets/bigscience/xP3)|
|
|
|
671 |
- **Point of Contact:** [Niklas Muennighoff](mailto:[email protected])
|
672 |
- **Languages:** Refer to [BLOOM](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
|
673 |
- **BLOOMZ & mT0 Model Family:**
|
674 |
+
|
675 |
+
<table>
|
676 |
+
<tr>
|
677 |
+
<th colspan="11">Multitask finetuned on xP3 - Recommended for prompting in English.
|
678 |
+
</tr>
|
679 |
+
<tr>
|
680 |
+
<th>Parameters</th>
|
681 |
+
<td>560M</td>
|
682 |
+
<td>560M</td>
|
683 |
+
<td>560M</td>
|
684 |
+
<td>560M</td>
|
685 |
+
<td>560M</td>
|
686 |
+
<td>560M</td>
|
687 |
+
<td>560M</td>
|
688 |
+
<td>560M</td>
|
689 |
+
<td>560M</td>
|
690 |
+
<td>560M</td>
|
691 |
+
</tr>
|
692 |
+
<tr>
|
693 |
+
<th>Finetuned Model</th>
|
694 |
+
<td>560M</td>
|
695 |
+
<td>560M</td>
|
696 |
+
<td>560M</td>
|
697 |
+
<td>560M</td>
|
698 |
+
<td>560M</td>
|
699 |
+
<td>560M</td>
|
700 |
+
<td>560M</td>
|
701 |
+
<td>560M</td>
|
702 |
+
<td>560M</td>
|
703 |
+
<td>560M</td>
|
704 |
+
</tr>
|
705 |
+
</tr>
|
706 |
+
<tr>
|
707 |
+
<th>Original pretrained checkpoint</th>
|
708 |
+
<td>560M</td>
|
709 |
+
<td>560M</td>
|
710 |
+
<td>560M</td>
|
711 |
+
<td>560M</td>
|
712 |
+
<td>560M</td>
|
713 |
+
<td>560M</td>
|
714 |
+
<td>560M</td>
|
715 |
+
<td>560M</td>
|
716 |
+
<td>560M</td>
|
717 |
+
<td>560M</td>
|
718 |
+
</tr>
|
719 |
+
</table>
|
720 |
+
|
721 |
+
<table>
|
722 |
+
<tr>
|
723 |
+
<td>One</td>
|
724 |
+
<td>Two</td>
|
725 |
+
</tr>
|
726 |
+
<tr>
|
727 |
+
<td colspan="2">Three</td>
|
728 |
+
</tr>
|
729 |
+
</table>
|
730 |
+
|
731 |
+
|
732 |
|Name|Explanation|
|
733 |
|----|-----------|
|
734 |
|[bloomz-560m](https://huggingface.co/bigscience/bloomz-560m)| 560M parameter multitask finetuned version of [bloom-560m](https://huggingface.co/bigscience/bloom-560m) on [xP3](https://huggingface.co/datasets/bigscience/xP3)|
|