dhirajjoshi116 committed on
Commit cf86415 · verified · 1 parent: 48d9307

Update README.md

Files changed (1): README.md +11 -11
README.md CHANGED
@@ -19,20 +19,20 @@ We evaluated Granite Vision 3.2 alongside other vision-language models (VLMs) in
  | | Molmo-E | InternVL2 | Phi3v | Phi3.5v | Granite Vision |
  |-----------|--------------|----------------|-------------|------------|------------|
  | **Document benchmarks** |
- | DocVQA | 0.66 | 0.87 | 0.87 | **0.88** | **0.88** |
- | ChartQA | 0.60 | 0.75 | 0.81 | 0.82 | **0.86** |
- | TextVQA | 0.62 | 0.72 | 0.69 | 0.7 | **0.76** |
- | AI2D | 0.63 | 0.74 | **0.79** | **0.79** | 0.78 |
- | InfoVQA | 0.44 | 0.58 | 0.55 | 0.61 | **0.63** |
- | OCRBench | 0.65 | **0.75** | 0.64 | 0.64 | **0.75** |
+ | DocVQA | 0.66 | 0.87 | 0.87 | 0.88 | **0.89** |
+ | ChartQA | 0.60 | 0.75 | 0.81 | 0.82 | **0.87** |
+ | TextVQA | 0.62 | 0.72 | 0.69 | 0.7 | **0.78** |
+ | AI2D | 0.63 | 0.74 | **0.79** | **0.79** | 0.76 |
+ | InfoVQA | 0.44 | 0.58 | 0.55 | 0.61 | **0.64** |
+ | OCRBench | 0.65 | 0.75 | 0.64 | 0.64 | **0.77** |
  | LiveXiv VQA | 0.47 | 0.51 | **0.61** | - | **0.61** |
  | LiveXiv TQA | 0.36 | 0.38 | 0.48 | - | **0.55** |
  | **Other benchmarks** |
- | MMMU | 0.32 | 0.35 | 0.42 | **0.44** | 0.35 |
- | VQAv2 | 0.57 | 0.75 | 0.76 | 0.77 | **0.81** |
- | RealWorldQA | 0.55 | 0.34 | 0.60 | 0.58 | **0.65** |
- | VizWiz VQA | 0.49 | 0.46 | 0.57 | 0.57 | **0.64** |
- | OK VQA | 0.40 | 0.44 | 0.51 | 0.53 | **0.57** |
+ | MMMU | 0.32 | 0.35 | 0.42 | **0.44** | 0.37 |
+ | VQAv2 | 0.57 | 0.75 | 0.76 | 0.77 | **0.78** |
+ | RealWorldQA | 0.55 | 0.34 | 0.60 | 0.58 | **0.63** |
+ | VizWiz VQA | 0.49 | 0.46 | 0.57 | 0.57 | **0.63** |
+ | OK VQA | 0.40 | 0.44 | 0.51 | 0.53 | **0.56** |
 
 
  - **Paper:** [Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence](https://arxiv.org/abs/2502.09927)