Image-to-Text
Transformers
PyTorch
phi3_v
text-generation
latex
custom_code
mjbuehler's picture
Update README.md
8cdc03b verified
|
raw
history blame
682 Bytes
---
library_name: transformers
tags:
- latex
- image-to-text
datasets:
- lamm-mit/OleehyO-latex-formulas
- OleehyO/latex-formulas
license: apache-2.0
---
## Model Summary
Cephalo is a series of multimodal materials science focused vision large language models (V-LLMs) designed to integrate visual and linguistic data for advanced understanding and interaction in human-AI or multi-agent AI frameworks.
![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/kl5GWBP9WS0D4uwd1t3S7.png)
## Model Capabilities
This version of Cephalo, lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha, is trained to convert images of equations to LaTeX code.