Visual Document Retrieval
ColPali
Safetensors
English
qwen2_vl
vidore-exclude
tonywu71 commited on
Commit
14ed9f0
·
verified ·
1 Parent(s): 140365f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -3
README.md CHANGED
@@ -3,7 +3,7 @@ base_model: vidore/colqwen2-base
3
  language:
4
  - en
5
  library_name: colpali
6
- license: mit
7
  tags:
8
  - colpali
9
  - vidore-exclude
@@ -11,11 +11,10 @@ pipeline_tag: visual-document-retrieval
11
  ---
12
  # ColQwen2: Visual Retriever based on Qwen2-VL-2B-Instruct with ColBERT strategy
13
 
14
- ColQwen is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
15
  It is a [Qwen2-VL-2B](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
16
  It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
17
 
18
- This version is the untrained base version to guarantee deterministic projection layer initialization.
19
  <p align="center"><img width=800 src="https://github.com/illuin-tech/colpali/blob/main/assets/colpali_architecture.webp?raw=true"/></p>
20
 
21
  ## Version specificity
 
3
  language:
4
  - en
5
  library_name: colpali
6
+ license: apache-2.0
7
  tags:
8
  - colpali
9
  - vidore-exclude
 
11
  ---
12
  # ColQwen2: Visual Retriever based on Qwen2-VL-2B-Instruct with ColBERT strategy
13
 
14
+ ColQwen2 is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
15
  It is a [Qwen2-VL-2B](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
16
  It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
17
 
 
18
  <p align="center"><img width=800 src="https://github.com/illuin-tech/colpali/blob/main/assets/colpali_architecture.webp?raw=true"/></p>
19
 
20
  ## Version specificity