Update README.md
README.md
@@ -19,20 +19,20 @@ We evaluated Granite Vision 3.2 alongside other vision-language models (VLMs) in
 | | Molmo-E | InternVL2 | Phi3v | Phi3.5v | Granite Vision |
 |-----------|--------------|----------------|-------------|------------|------------|
 | **Document benchmarks** |
-| DocVQA | 0.66 | 0.87 | 0.87 |
-| ChartQA | 0.60 | 0.75 | 0.81 | 0.82 | **0.
-| TextVQA | 0.62 | 0.72 | 0.69 | 0.7 | **0.
-| AI2D | 0.63 | 0.74 | **0.79** | **0.79** | 0.
-| InfoVQA | 0.44 | 0.58 | 0.55 | 0.61 | **0.
-| OCRBench | 0.65 |
+| DocVQA | 0.66 | 0.87 | 0.87 | 0.88 | **0.89** |
+| ChartQA | 0.60 | 0.75 | 0.81 | 0.82 | **0.87** |
+| TextVQA | 0.62 | 0.72 | 0.69 | 0.7 | **0.78** |
+| AI2D | 0.63 | 0.74 | **0.79** | **0.79** | 0.76 |
+| InfoVQA | 0.44 | 0.58 | 0.55 | 0.61 | **0.64** |
+| OCRBench | 0.65 | 0.75 | 0.64 | 0.64 | **0.77** |
 | LiveXiv VQA | 0.47 | 0.51 | **0.61** | - | **0.61** |
 | LiveXiv TQA | 0.36 | 0.38 | 0.48 | - | **0.55** |
 | **Other benchmarks** |
-| MMMU | 0.32 | 0.35 | 0.42 | **0.44** | 0.
-| VQAv2 | 0.57 | 0.75 | 0.76 | 0.77 | **0.
-| RealWorldQA | 0.55 | 0.34 | 0.60 | 0.58 | **0.
-| VizWiz VQA | 0.49 | 0.46 | 0.57 | 0.57 | **0.
-| OK VQA | 0.40 | 0.44 | 0.51 | 0.53 | **0.
+| MMMU | 0.32 | 0.35 | 0.42 | **0.44** | 0.37 |
+| VQAv2 | 0.57 | 0.75 | 0.76 | 0.77 | **0.78** |
+| RealWorldQA | 0.55 | 0.34 | 0.60 | 0.58 | **0.63** |
+| VizWiz VQA | 0.49 | 0.46 | 0.57 | 0.57 | **0.63** |
+| OK VQA | 0.40 | 0.44 | 0.51 | 0.53 | **0.56** |
 
 
 - **Paper:** [Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence](https://arxiv.org/abs/2502.09927)