metadata
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
🧾 Qwen2.5-VL Invoice OCR - LoRA Adapter
Fine-tuned LoRA adapter for Qwen2.5-VL-3B-Instruct specialized in extracting structured JSON data from invoice and receipt images.
Performance
- Training Loss: 35.78 → 0.024 (99.3% improvement)
- Eval Loss: 0.010
- Steps: 150+ with engineered prompt
- Dataset: 2,043 samples
Usage
from transformers import AutoModelForVision2Seq, AutoProcessor
from peft import PeftModel
base_model = AutoModelForVision2Seq.from_pretrained("Qwen/Qwen2.5-VL-3B-Instruct")
model = PeftModel.from_pretrained(base_model, "kahua-ml/invoice1")