alanzhuly commited on
Commit
0067dce
1 Parent(s): 6e0b0dd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -4
README.md CHANGED
@@ -15,16 +15,17 @@ Omni-Vision is a sub-billion (968M) multimodal model capable of processing both
15
  - **9x Token Reduction through Token Compression**: Significant decrease in image token count, reducing latency and computational cost, ideal for on-device applications.
16
  - **Minimal-Edit DPO for Enhanced Response Quality**: Improves model responses by using targeted edits, maintaining core capabilities without significant behavior shifts.
17
 
18
- Quick Links:
19
- 1. Interact directly in the HuggingFace Space.
20
  2. [Quickstart to run locally](#how-to-run-locally)
21
- 3. Learn more details in our blogs
22
 
23
  **Feedback:** Send questions or comments about the model in our [Discord](https://discord.gg/nexa-ai)
24
 
25
  ## Intended Use Cases
26
- OmniVision is intended for Visual Question Answering (answering questions about images) and Image Captioning (describing scenes in photos), optimized for edge devices. See example below:
27
 
 
28
  Omni-Vision generated captions for a 1046×1568 pixel poster | **Processing time: <2s** | Device: MacBook M4 Pro
29
 
30
  <img src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/PTG3_n_p7_atBHCwRLOEE.png" alt="Example" style="width:700px;"/>
 
15
  - **9x Token Reduction through Token Compression**: Significant decrease in image token count, reducing latency and computational cost, ideal for on-device applications.
16
  - **Minimal-Edit DPO for Enhanced Response Quality**: Improves model responses by using targeted edits, maintaining core capabilities without significant behavior shifts.
17
 
18
+ **Quick Links:**
19
+ 1. Interact in our HuggingFace Space.
20
  2. [Quickstart to run locally](#how-to-run-locally)
21
+ 3. Learn more in [blogs](https://nexa.ai)
22
 
23
  **Feedback:** Send questions or comments about the model in our [Discord](https://discord.gg/nexa-ai)
24
 
25
  ## Intended Use Cases
26
+ OmniVision is intended for Visual Question Answering (answering questions about images) and Image Captioning (describing scenes in photos), optimized for edge devices.
27
 
28
+ **Example Demo:**
29
  Omni-Vision generated captions for a 1046×1568 pixel poster | **Processing time: <2s** | Device: MacBook M4 Pro
30
 
31
  <img src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/PTG3_n_p7_atBHCwRLOEE.png" alt="Example" style="width:700px;"/>