Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Paper • 2504.21356 • Published Apr 30 • 1
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published Apr 24 • 39
EliGen: Entity-Level Controlled Image Generation with Regional Attention Paper • 2501.01097 • Published Jan 2
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Paper • 2412.03558 • Published Dec 4, 2024 • 19
SWIFT: A Scalable lightWeight Infrastructure for Fine-Tuning Paper • 2408.05517 • Published Aug 10, 2024 • 2
HIE-SQL: History Information Enhanced Network for Context-Dependent Text-to-SQL Semantic Parsing Paper • 2203.07376 • Published Mar 14, 2022
FaceChain: A Playground for Human-centric Artificial Intelligence Generated Content Paper • 2308.14256 • Published Aug 28, 2023 • 1
Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key Paper • 2410.10210 • Published Oct 14, 2024 • 6
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images Paper • 2407.06191 • Published Jul 8, 2024 • 14
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training Paper • 2210.01055 • Published Oct 3, 2022 • 1