Zyphra/Zamba-7B-v1

BerenMillidge committed on Jun 3

Commit 8dd106c • 1 Parent(s): 88a4c87

Update README.md

Files changed (1)

README.md +13 -0

@@ -7,6 +7,8 @@ Zamba-7B-v1 is a hybrid model between Mamba, a state-space model, and transforme
 Note: the current Huggingface implementation of Zamba performs slower than our internal implementation. We are working to fix this with the Huggingface team.
+Our technical report describing the training of Zamba is available [here](https://arxiv.org/abs/2405.16712)
 ## Quick start
 ### Presequities
@@ -43,6 +45,17 @@ outputs = model.generate(**input_ids, max_new_tokens=100)
 print(tokenizer.decode(outputs[0]))
 ```
+## Citation
+If you find Zamba useful in your work please cite it as:
+@article{glorioso2024zamba,
+  title={Zamba: A Compact 7B SSM Hybrid Model},
+  author={Glorioso, Paolo and Anthony, Quentin and Tokpanov, Yury and Whittington, James and Pilault, Jonathan and Ibrahim, Adam and Millidge, Beren},
+  journal={arXiv preprint arXiv:2405.16712},
+  year={2024}
+}
 ## Notice
 Zamba is a pretrained base model and therefore does not have any moderation mechanism. In addition, one should not expect good chat performance, as this model was not fine-tuned for chat.