Update README.md
README.md
CHANGED
@@ -2,4 +2,17 @@
 datasets:
 - Caraaaaa/non_text_image_captioning
 pipeline_tag: image-to-text
----
+---
+
+This model is fine-tuned on non-text images extracted from documents (e.g., PDFs). It analyzes the content of an image and produces a descriptive caption.
+It is part of a project to build a software solution that processes offline documents (PDF, Word, PowerPoint, etc.) to detect WCAG accessibility issues.
+
+Example document with non-text images:
+
+Extracted Image:
+
+Generated caption:
+"Indication of correct signature"
+
+- [Training script](https://colab.research.google.com/drive/1QYvXdi0V1AXqlBMR8MpyydNMnK_Vt4dU?usp=sharing)
+- [GitHub repo](https://github.com/caraaaaa/doc_accessibility?tab=readme-ov-file)
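
The added README text describes captioning extracted images but does not show how to call the model. A minimal usage sketch might look like the following, assuming the model loads through the `transformers` `image-to-text` pipeline (consistent with the `pipeline_tag` above) and that `transformers`, Pillow, and a backend such as PyTorch are installed. `MODEL_ID` is a hypothetical placeholder, since the diff does not state the model's Hub id, and the helper names `load_captioner` and `caption_images` are illustrative, not taken from the project.

```python
from typing import Callable, Dict, Iterable

# Hypothetical placeholder: replace with the actual Hub id of the model.
MODEL_ID = "your-username/your-captioning-model"


def load_captioner(model_id: str = MODEL_ID):
    """Build an image-to-text pipeline for the given model id."""
    # Imported lazily so caption_images() can be used without transformers.
    from transformers import pipeline

    return pipeline("image-to-text", model=model_id)


def caption_images(image_paths: Iterable[str], captioner: Callable) -> Dict[str, str]:
    """Return a mapping from each image path to its generated caption."""
    captions = {}
    for path in image_paths:
        # For a single image, an image-to-text pipeline returns a list of
        # dicts, each carrying the caption under "generated_text".
        outputs = captioner(path)
        captions[path] = outputs[0]["generated_text"].strip()
    return captions
```

With a real model id filled in, `caption_images(["extracted_image.png"], load_captioner())` would return one caption per extracted image, which could then serve as candidate alternative text when checking a document for missing image descriptions.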