Shawon Ashraf
shawon
AI & ML interests
Multi-Modal NLP, LLM and RAG
Recent Activity
liked a model about 9 hours ago: answerdotai/ModernBERT-large
liked a model about 9 hours ago: answerdotai/ModernBERT-base
liked a model about 22 hours ago: allenai/olmOCR-7B-0225-preview
Collections
3
- Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
  Paper • 2402.17177 • Published • 87
- Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
  Paper • 2403.13248 • Published • 78
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
  Paper • 2311.05437 • Published • 50
- UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
  Paper • 2409.20551 • Published • 15
Models
None public yet