Spaces:

Sightation
/

README

Running

Jaime-Choi commited on 21 days ago

Commit

34556ee

verified ·

1 Parent(s): c8eae73

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -37,15 +37,15 @@ demonstrate their fine-tuning potential in various downstream tasks.
 - SightationVQA
 - SightationReasoning
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="70%" height="70%" title="visual_abstract" alt="visual_abstract"></img>
 The key benefit of utilizing sighted user feedback lies in their assessments that are based on solid visual
 grounding. The compiled assessments prove an effective training substance for steering VLMs towards more
 accessible descriptions.
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/8oYvtq7dtv_Ck-U6OlcAE.png" width="50%" height="50%" title="dimensions_assignment" alt="dimensions_assignment"></img>
 The description qualities assessed by their respective evaluator groups.
 ## Results
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="60%" height="60%" title="spider_chart" alt="spider_chart"></img>
 Tuning VLMs on Sightation enhanced various qualities of the diagram descriptions, evaluated by BLV educators, and shown here as normalized ratings averaged in each aspect.
 The capability of the dataset is most strongly pronounced with Qwen2-VL-2B model, shown above.

 - SightationVQA
 - SightationReasoning
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="80%" height="80%" title="visual_abstract" alt="visual_abstract"></img>
 The key benefit of utilizing sighted user feedback lies in their assessments that are based on solid visual
 grounding. The compiled assessments prove an effective training substance for steering VLMs towards more
 accessible descriptions.
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/8oYvtq7dtv_Ck-U6OlcAE.png" width="70%" height="70%" title="dimensions_assignment" alt="dimensions_assignment"></img>
 The description qualities assessed by their respective evaluator groups.
 ## Results
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="80%" height="80%" title="spider_chart" alt="spider_chart"></img>
 Tuning VLMs on Sightation enhanced various qualities of the diagram descriptions, evaluated by BLV educators, and shown here as normalized ratings averaged in each aspect.
 The capability of the dataset is most strongly pronounced with Qwen2-VL-2B model, shown above.