Update README.md
README.md
CHANGED
@@ -37,15 +37,15 @@ demonstrate their fine-tuning potential in various downstream tasks.
 - SightationVQA
 - SightationReasoning
 
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="80%" height="80%" title="visual_abstract" alt="visual_abstract"></img>
 The key benefit of utilizing sighted user feedback lies in their assessments that are based on solid visual
 grounding. The compiled assessments prove an effective training substance for steering VLMs towards more
 accessible descriptions.
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/8oYvtq7dtv_Ck-U6OlcAE.png" width="
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/8oYvtq7dtv_Ck-U6OlcAE.png" width="70%" height="70%" title="dimensions_assignment" alt="dimensions_assignment"></img>
 The description qualities assessed by their respective evaluator groups.
 
 ## Results
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="80%" height="80%" title="spider_chart" alt="spider_chart"></img>
 Tuning VLMs on Sightation enhanced various qualities of the diagram descriptions, evaluated by BLV educators, and shown here as normalized ratings averaged in each aspect.
 The capability of the dataset is most strongly pronounced with Qwen2-VL-2B model, shown above.
 