Update README.md
Browse files
README.md
CHANGED
@@ -15,16 +15,17 @@ Omni-Vision is a sub-billion (968M) multimodal model capable of processing both
|
|
15 |
- **9x Token Reduction through Token Compression**: Significant decrease in image token count, reducing latency and computational cost, ideal for on-device applications.
|
16 |
- **Minimal-Edit DPO for Enhanced Response Quality**: Improves model responses by using targeted edits, maintaining core capabilities without significant behavior shifts.
|
17 |
|
18 |
-
Quick Links
|
19 |
-
1. Interact
|
20 |
2. [Quickstart to run locally](#how-to-run-locally)
|
21 |
-
3. Learn more
|
22 |
|
23 |
**Feedback:** Send questions or comments about the model in our [Discord](https://discord.gg/nexa-ai)
|
24 |
|
25 |
## Intended Use Cases
|
26 |
-
OmniVision is intended for Visual Question Answering (answering questions about images) and Image Captioning (describing scenes in photos), optimized for edge devices.
|
27 |
|
|
|
28 |
Omni-Vision generated captions for a 1046×1568 pixel poster | **Processing time: <2s** | Device: MacBook M4 Pro
|
29 |
|
30 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/PTG3_n_p7_atBHCwRLOEE.png" alt="Example" style="width:700px;"/>
|
|
|
15 |
- **9x Token Reduction through Token Compression**: Significant decrease in image token count, reducing latency and computational cost, ideal for on-device applications.
|
16 |
- **Minimal-Edit DPO for Enhanced Response Quality**: Improves model responses by using targeted edits, maintaining core capabilities without significant behavior shifts.
|
17 |
|
18 |
+
**Quick Links:**
|
19 |
+
1. Interact in our HuggingFace Space.
|
20 |
2. [Quickstart to run locally](#how-to-run-locally)
|
21 |
+
3. Learn more in [blogs](https://nexa.ai)
|
22 |
|
23 |
**Feedback:** Send questions or comments about the model in our [Discord](https://discord.gg/nexa-ai)
|
24 |
|
25 |
## Intended Use Cases
|
26 |
+
OmniVision is intended for Visual Question Answering (answering questions about images) and Image Captioning (describing scenes in photos), optimized for edge devices.
|
27 |
|
28 |
+
**Example Demo:**
|
29 |
Omni-Vision generated captions for a 1046×1568 pixel poster | **Processing time: <2s** | Device: MacBook M4 Pro
|
30 |
|
31 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/PTG3_n_p7_atBHCwRLOEE.png" alt="Example" style="width:700px;"/>
|