metadata
language:
- ko
pipeline_tag: image-to-text
deplot_kr
deplot_kr is a Korean image-to-data model based on Google's Pix2Struct architecture: given a chart image, it outputs the underlying data table as text. It was fine-tuned from DePlot on a dataset of 300,000 Korean chart image-text pairs.
How to use
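A minimal sketch of inference through the `transformers` Pix2Struct classes, which is how DePlot checkpoints are normally loaded. Several things here are assumptions rather than facts from this card: `MODEL_ID` is a hypothetical placeholder for the actual repository id, the `prompt` argument is only needed if the checkpoint's processor is configured for VQA-style header text, and the `<0x0A>` row separator and `|` cell separator follow the linearized-table format that DePlot emits.

```python
# Hypothetical placeholder: replace with the real repository id of this model.
MODEL_ID = "your-namespace/deplot_kr"


def parse_linearized_table(text, row_sep="<0x0A>", cell_sep="|"):
    """Split a DePlot-style linearized table into a list of rows of cells.

    Assumes rows are separated by the literal token "<0x0A>" and cells by "|".
    """
    rows = []
    for line in text.split(row_sep):
        line = line.strip()
        if not line:
            continue  # skip empty rows produced by leading/trailing separators
        rows.append([cell.strip() for cell in line.split(cell_sep)])
    return rows


def extract_table(image_path, model_id=MODEL_ID, prompt=None, max_new_tokens=1024):
    """Run the model on one chart image and return the parsed table.

    Heavy dependencies are imported lazily so the parser above can be used
    without them. `prompt` is passed as header text only when provided.
    """
    from PIL import Image
    from transformers import Pix2StructForConditionalGeneration, Pix2StructProcessor

    processor = Pix2StructProcessor.from_pretrained(model_id)
    model = Pix2StructForConditionalGeneration.from_pretrained(model_id)

    image = Image.open(image_path).convert("RGB")
    extra = {"text": prompt} if prompt else {}
    inputs = processor(images=image, return_tensors="pt", **extra)

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    text = processor.decode(output_ids[0], skip_special_tokens=True)
    return parse_linearized_table(text)
```

Calling `extract_table("chart.png")` downloads the checkpoint on first use; `parse_linearized_table` can be applied on its own to any already-decoded output string.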
Training data
Preprocessing
Train
The model was trained in a TPU environment.
- num_warmup_steps : 1,000
- num_training_steps : 40,000
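The two step counts above describe a learning-rate schedule with a warmup phase. The card does not state the shape of the schedule or the peak learning rate, so the sketch below assumes linear warmup followed by cosine decay and returns only a multiplier in [0, 1] to be applied to whatever peak rate was used. The function name `lr_scale` is illustrative.

```python
import math

NUM_WARMUP_STEPS = 1_000     # from the card
NUM_TRAINING_STEPS = 40_000  # from the card


def lr_scale(step, warmup=NUM_WARMUP_STEPS, total=NUM_TRAINING_STEPS):
    """Learning-rate multiplier at a given step.

    Assumption: linear warmup from 0 to 1, then cosine decay from 1 to 0.
    The actual decay shape used for this model is not documented.
    """
    if step < warmup:
        return step / max(1, warmup)
    progress = (step - warmup) / max(1, total - warmup)
    progress = min(1.0, progress)  # clamp steps past the end of training
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))
```

Under this assumption the multiplier reaches 1.0 at step 1,000 and falls back to 0 at step 40,000.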
Evaluation Results
This model achieves the following results:
| Metric | Score (%) |
|---|---|
| RNSS (Relative Number Set Similarity) | 99.5483 |
| RMS F1 (Relative Mapping Similarity) | 16.6401 |
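For reference, RNSS as it is usually defined in the chart-to-table literature (this card does not restate it, so treat the notation below as an assumption) compares the set of predicted numbers $P = \{p_i\}_{i=1}^{N}$ with the set of target numbers $T = \{t_j\}_{j=1}^{M}$ under a minimal-cost one-to-one matching $X \in \{0, 1\}^{N \times M}$:

```latex
D(p, t) = \min\!\left(1, \frac{\lvert p - t \rvert}{\lvert t \rvert}\right)
\qquad
\mathrm{RNSS} = 1 - \frac{\sum_{i=1}^{N} \sum_{j=1}^{M} X_{ij}\, D(p_i, t_j)}{\max(N, M)}
```

RNSS looks only at the unordered numeric values and ignores where they sit in the table, whereas RMS also matches each value to its row and column headers. The two scores therefore measure different things and can diverge widely.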