invoice1 / README.md
avewright's picture
Update README.md
39326ce verified
metadata
base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct

🧾 Qwen2.5-VL Invoice OCR - LoRA Adapter

Fine-tuned LoRA adapter for Qwen2.5-VL-3B-Instruct specialized in extracting structured JSON data from invoice and receipt images.

Performance

  • Training Loss: 35.78 → 0.024 (99.3% improvement)
  • Eval Loss: 0.010
  • Steps: 150+ with engineered prompt
  • Dataset: 2,043 samples

Usage

from transformers import AutoModelForVision2Seq, AutoProcessor
from peft import PeftModel

base_model = AutoModelForVision2Seq.from_pretrained("Qwen/Qwen2.5-VL-3B-Instruct")
model = PeftModel.from_pretrained(base_model, "kahua-ml/invoice1")