OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Paper • 2502.18041 • Published Feb 25 • 1
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control Paper • 2406.16038 • Published Jun 23, 2024 • 1
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model Paper • 2501.15830 • Published Jan 27 • 14
Exploring the Potential of Encoder-free Architectures in 3D LMMs Paper • 2502.09620 • Published Feb 13 • 26