Muennighoff commited on
Commit
b21a48a
1 Parent(s): 81d56cf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -0
README.md CHANGED
@@ -671,6 +671,64 @@ model-index:
671
  - **Point of Contact:** [Niklas Muennighoff](mailto:[email protected])
672
  - **Languages:** Refer to [BLOOM](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
673
  - **BLOOMZ & mT0 Model Family:**
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
674
  |Name|Explanation|
675
  |----|-----------|
676
  |[bloomz-560m](https://huggingface.co/bigscience/bloomz-560m)| 560M parameter multitask finetuned version of [bloom-560m](https://huggingface.co/bigscience/bloom-560m) on [xP3](https://huggingface.co/datasets/bigscience/xP3)|
 
671
  - **Point of Contact:** [Niklas Muennighoff](mailto:[email protected])
672
  - **Languages:** Refer to [BLOOM](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
673
  - **BLOOMZ & mT0 Model Family:**
674
+
675
+ <table>
676
+ <tr>
677
+ <th colspan="11">Multitask finetuned on xP3 - Recommended for prompting in English.
678
+ </tr>
679
+ <tr>
680
+ <th>Parameters</th>
681
+ <td>560M</td>
682
+ <td>560M</td>
683
+ <td>560M</td>
684
+ <td>560M</td>
685
+ <td>560M</td>
686
+ <td>560M</td>
687
+ <td>560M</td>
688
+ <td>560M</td>
689
+ <td>560M</td>
690
+ <td>560M</td>
691
+ </tr>
692
+ <tr>
693
+ <th>Finetuned Model</th>
694
+ <td>560M</td>
695
+ <td>560M</td>
696
+ <td>560M</td>
697
+ <td>560M</td>
698
+ <td>560M</td>
699
+ <td>560M</td>
700
+ <td>560M</td>
701
+ <td>560M</td>
702
+ <td>560M</td>
703
+ <td>560M</td>
704
+ </tr>
705
+ </tr>
706
+ <tr>
707
+ <th>Original pretrained checkpoint</th>
708
+ <td>560M</td>
709
+ <td>560M</td>
710
+ <td>560M</td>
711
+ <td>560M</td>
712
+ <td>560M</td>
713
+ <td>560M</td>
714
+ <td>560M</td>
715
+ <td>560M</td>
716
+ <td>560M</td>
717
+ <td>560M</td>
718
+ </tr>
719
+ </table>
720
+
721
+ <table>
722
+ <tr>
723
+ <td>One</td>
724
+ <td>Two</td>
725
+ </tr>
726
+ <tr>
727
+ <td colspan="2">Three</td>
728
+ </tr>
729
+ </table>
730
+
731
+
732
  |Name|Explanation|
733
  |----|-----------|
734
  |[bloomz-560m](https://huggingface.co/bigscience/bloomz-560m)| 560M parameter multitask finetuned version of [bloom-560m](https://huggingface.co/bigscience/bloom-560m) on [xP3](https://huggingface.co/datasets/bigscience/xP3)|