vidore
/

colqwen2-v1.0-merged

Visual Document Retrieval

Model card Files Files and versions Community

tonywu71 commited on 22 days ago

Commit

14ed9f0

·

verified ·

1 Parent(s): 140365f

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -3,7 +3,7 @@ base_model: vidore/colqwen2-base
 language:
 - en
 library_name: colpali
-license: mit
 tags:
 - colpali
 - vidore-exclude
@@ -11,11 +11,10 @@ pipeline_tag: visual-document-retrieval
 ---
 # ColQwen2: Visual Retriever based on Qwen2-VL-2B-Instruct with ColBERT strategy
-ColQwen is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
 It is a [Qwen2-VL-2B](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
 It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
-This version is the untrained base version to guarantee deterministic projection layer initialization.
 <p align="center"><img width=800 src="https://github.com/illuin-tech/colpali/blob/main/assets/colpali_architecture.webp?raw=true"/></p>
 ## Version specificity

 language:
 - en
 library_name: colpali
+license: apache-2.0
 tags:
 - colpali
 - vidore-exclude
 ---
 # ColQwen2: Visual Retriever based on Qwen2-VL-2B-Instruct with ColBERT strategy
+ColQwen2 is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
 It is a [Qwen2-VL-2B](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
 It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
 <p align="center"><img width=800 src="https://github.com/illuin-tech/colpali/blob/main/assets/colpali_architecture.webp?raw=true"/></p>
 ## Version specificity