mPLUG
/

DocOwl2

@@ -1,8 +1,23 @@
 ---
 license: apache-2.0
 ---
 ```
@@ -26,7 +41,7 @@ class DocOwlInfer():
 docowl = DocOwlInfer(ckpt_path='mPLUG/DocOwl2')
-images = ['/nas-alinlp/anwenhu/tmp/paper.png']
 query = "What is this paper about, provide detailed information."

 ---
 license: apache-2.0
+language:
+- en
+pipeline_tag: image-text-to-text
+tags:
+- chat
 ---
+# mPLUG-DocOwl2
+## Introduction
+mPLUG-DocOwl2 is a state-of-the-art Multimodal LLM for OCR-free Multi-page Document Understanding.
+Through a compressing module named High-resolution DocCompressor, each page is encoded with just 324 tokens.
+Github: [mPLUG-DocOwl](https://github.com/X-PLUG/mPLUG-DocOwl)
+## Quickstart
 ```
 docowl = DocOwlInfer(ckpt_path='mPLUG/DocOwl2')
+images = ['./examples/paper_page1.png', './examples/paper_page2.png', './examples/paper_page3.png']
 query = "What is this paper about, provide detailed information."