deplot_kr / README.md
brainventures's picture
Update README.md
735bb9d verified
|
raw
history blame
947 Bytes
---
language:
- ko
pipeline_tag: image-to-text
---
# **deplot_kr**
deplot_kr is a Image-to-Data(Text) model based on the google's pix2struct architecture.
It was fine-tuned from [DePlot](https://huggingface.co/google/deplot), using korean chart image-text pairs.
deplot_kr์€ google์˜ pix2struct ๊ตฌ์กฐ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ํ•œ๊ตญ์–ด image-to-data(ํ…์ŠคํŠธ ํ˜•ํƒœ์˜ ๋ฐ์ดํ„ฐ ํ…Œ์ด๋ธ”) ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
[DePlot](https://huggingface.co/google/deplot) ๋ชจ๋ธ์„ ํ•œ๊ตญ์–ด ์ฐจํŠธ ์ด๋ฏธ์ง€-ํ…์ŠคํŠธ ์Œ ๋ฐ์ดํ„ฐ์„ธํŠธ(30๋งŒ ๊ฐœ)๋ฅผ ์ด์šฉํ•˜์—ฌ fine-tuning ํ–ˆ์Šต๋‹ˆ๋‹ค.
## How to use
### Training data
### Preprocessing
### Train
The model was trained in a TPU environment.
- num_warmup_steps : 1,000
- num_training_steps : 40,000
## Evaluation Results
This model achieves the following results:
|metrics name | % |
|:---:|:---:|
| RNSS (Relative Number Set Similarity)| 99.5483 |
| RMS F1 (Relative Mapping Similarity)| 16.6401 |