Spaces: Sightation/README

Jaime-Choi committed 21 days ago

Commit 5c48dad (verified) · 1 parent: 34556ee

Update README.md

Files changed (1)

README.md

@@ -37,7 +37,7 @@ demonstrate their fine-tuning potential in various downstream tasks.
 - SightationVQA
 - SightationReasoning
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="80%" height="80%" title="visual_abstract" alt="visual_abstract"></img>
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="100%" height="100%" title="visual_abstract" alt="visual_abstract"></img>
 The key benefit of utilizing sighted user feedback lies in their assessments that are based on solid visual
 grounding. The compiled assessments prove an effective training substance for steering VLMs towards more
 accessible descriptions.
@@ -45,7 +45,7 @@ accessible descriptions.
 The description qualities assessed by their respective evaluator groups.
 ## Results
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="80%" height="80%" title="spider_chart" alt="spider_chart"></img>
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="90%" height="90%" title="spider_chart" alt="spider_chart"></img>
 Tuning VLMs on Sightation enhanced various qualities of the diagram descriptions, evaluated by BLV educators, and shown here as normalized ratings averaged in each aspect.
 The capability of the dataset is most strongly pronounced with Qwen2-VL-2B model, shown above.