pascalmusabyimana
pascal-maker
AI & ML interests
computer vision, NLP, machine learning, and deep learning
Recent Activity
reacted to prithivMLmods's post with ❤️ about 24 hours ago
Introducing Camel-Doc-OCR-080125 (v2), a document content-structure retrieval VLM designed for content extraction and summarization. This is the second model in the Camel Doc OCR VLM series, following Camel-Doc-OCR-062825 (v1). The new version fixes formal table reconstruction issues in both English (en) and Chinese (zh), achieving optimal performance for long-context inference. 🤗🐪
⤷ Camel-Doc-OCR(v2) : https://huggingface.co/prithivMLmods/Camel-Doc-OCR-080125
⤷ Camel-Doc-OCR(v1) : https://huggingface.co/prithivMLmods/Camel-Doc-OCR-062825
⤷ Demo : https://huggingface.co/spaces/prithivMLmods/core-OCR
Multimodal Model Collections and Spaces:
➝ Camel-Doc-OCR : https://huggingface.co/collections/prithivMLmods/camel-doc-ocr-080125-688c0c61c5dba648756f31f8
➝ Vision-Language (VLr) : https://huggingface.co/collections/prithivMLmods/vision-language-for-reasoning-vlr-6889b3f45917352b5e3a6f7a
➝ Multimodal Spaces : https://huggingface.co/collections/prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
➝ Multimodal VLMs : https://huggingface.co/collections/prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027
.
.
.
To learn more, visit the model card of the respective model.