---
license: apache-2.0
---
# Model Card

Parrot is a multilingual, multimodal large language model (MLLM) that uses textual guidance to align visual tokens at the language level, achieving strong performance on multilingual multimodal benchmarks. For a comprehensive introduction, please refer to the [Parrot paper](https://arxiv.org/abs/2406.02539) and [Parrot GitHub](https://github.com/AIDC-AI/Parrot).

# Model Details
![](https://github.com/AIDC-AI/Parrot/images/teaser.png)

# Performance
![](https://github.com/AIDC-AI/Parrot/images/teaser.png)

# Usage

Below is a minimal example of running Parrot with multimodal inputs. For additional usage instructions, including the inference wrapper and the Gradio UI, please refer to [Parrot GitHub](https://github.com/AIDC-AI/Parrot).

First, install the dependencies:

```bash
pip install torch==2.1.2 transformers==4.43.2 pillow==10.3.0
```
Then import the required packages:

```python
import torch                                    # tensors, dtypes, no_grad
from PIL import Image                           # image loading
from transformers import AutoModelForCausalLM   # Hugging Face model loading
```

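The sketch below is a minimal, self-contained illustration of loading the checkpoint and running generation; it is not the official API. The model ID `AIDC-AI/Parrot-7B`, the use of `AutoModelForCausalLM`/`AutoTokenizer` with `trust_remote_code=True`, the example image path, and the text-only `generate` call are all assumptions made for illustration. The supported multimodal pipeline is the inference wrapper in [Parrot GitHub](https://github.com/AIDC-AI/Parrot).

```python
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical model ID; check the model page / GitHub repo for released checkpoints.
model_path = "AIDC-AI/Parrot-7B"

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.bfloat16,  # half precision to reduce GPU memory
    device_map="auto",           # place weights on the available GPU(s)
    trust_remote_code=True,      # assumed: the repo ships custom Parrot modeling code
).eval()

# Visual input: the official inference wrapper handles the multimodal preprocessing
# (vision encoder, image-token insertion); the image is loaded here only for illustration.
image = Image.open("example.jpg").convert("RGB")
prompt = "Describe this image."

# Placeholder text-only generation to verify that the checkpoint loads and decodes.
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

`bfloat16` and `device_map="auto"` are used here only to keep memory usage reasonable on a single GPU; adjust them to your hardware.
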
# Citation

If you find Parrot useful, please cite the paper:

```bibtex
@article{sun2024parrot,
  title={Parrot: Multilingual Visual Instruction Tuning},
  author={Sun, Hai-Long and Zhou, Da-Wei and Li, Yang and Lu, Shiyin and Yi, Chao and Chen, Qing-Guo and Xu, Zhao and Luo, Weihua and Zhang, Kaifu and Zhan, De-Chuan and others},
  journal={arXiv preprint arXiv:2406.02539},
  year={2024}
}
```

# License

The project is licensed under the Apache License, Version 2.0, and is restricted to uses that comply with the license agreements of Qwen and CLIP.