Update README.md
README.md CHANGED
@@ -18,9 +18,11 @@ tags:
 
 # PaliGemma-3B-Chat-v0.1
 
-This model is fine-tuned from [google/paligemma-3b-mix-448](https://huggingface.co/google/paligemma-3b-mix-448)
+This model is fine-tuned from [google/paligemma-3b-mix-448](https://huggingface.co/google/paligemma-3b-mix-448) for multiturn chat completions.
 
-![
+![example_en](assets/example_en.png)
+![example_zh](assets/example_zh.png)
+![example_ja](assets/example_ja.png)
 
 ## Usage
 
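The Usage section itself falls outside this diff; only the tail of its final `model.generate(...)` call is visible in the next hunk's header. As a minimal sketch of that inference flow with Transformers, where the hub id, prompt handling, and image path are assumptions rather than the README's exact code:

```python
# Minimal inference sketch; the hub id, prompt format, and image path are
# assumptions. Only the final generate(...) call is confirmed by this diff.
import torch
from PIL import Image
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration, TextStreamer

model_id = "PaliGemma-3B-Chat-v0.1"  # placeholder: use this model's actual hub id
model = PaliGemmaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

image = Image.open("example.png")  # any local image
prompt = "Describe the image."     # chat-template handling is an assumption

inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
input_ids = inputs["input_ids"]
pixel_values = inputs["pixel_values"].to(model.dtype)

# Stream tokens to stdout as they are generated, mirroring the call
# visible in the hunk header below.
streamer = TextStreamer(processor.tokenizer, skip_prompt=True)
generate_ids = model.generate(
    input_ids, pixel_values=pixel_values, streamer=streamer, max_new_tokens=128
)
```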
@@ -54,6 +56,8 @@ generate_ids = model.generate(input_ids, pixel_values=pixel_values, streamer=str
 
 ## Training procedure
 
+We used [LLaMA Factory](https://github.com/hiyouga/LLaMA-Factory) to fine-tune this model. During fine-tuning, we froze the vision tower and tuned the parameters of the language model and the projector layer.
+
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
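LLaMA Factory's training configuration is not part of this diff, so here is a rough sketch of the described freeze strategy expressed in plain Transformers; the module names (`vision_tower`, `language_model`, `multi_modal_projector`) follow the Transformers PaliGemma implementation and are not a confirmed detail of the actual run:

```python
# Sketch of the described freeze strategy: vision tower frozen, language
# model and multimodal projector left trainable. Illustration only; the
# actual run used LLaMA Factory, whose config is not shown in this diff.
import torch
from transformers import PaliGemmaForConditionalGeneration

model = PaliGemmaForConditionalGeneration.from_pretrained(
    "google/paligemma-3b-mix-448", torch_dtype=torch.bfloat16
)

for name, param in model.named_parameters():
    # Freeze every parameter in the SigLIP vision tower; everything else
    # (language_model and multi_modal_projector) stays trainable.
    param.requires_grad = not name.startswith("vision_tower")

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable params: {trainable:,} / {total:,}")
```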