ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper โข 2507.22827 โข Published 18 days ago โข 90 โข 4
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Paper โข 2506.18898 โข Published Jun 23 โข 33 โข 1