Running 2 2 Goekdeniz Guelmez Josiefied Qwen3 8B Abliterated V1 π Generate text using a large language model
Running on Zero MCP 305 305 Wan 2.2 5B π Generate high-quality videos from text prompts and images
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper β’ 2507.05964 β’ Published Jul 8 β’ 115
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper β’ 2506.23918 β’ Published Jun 30 β’ 86
WebSailor: Navigating Super-human Reasoning for Web Agent Paper β’ 2507.02592 β’ Published Jul 3 β’ 110
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper β’ 2506.21656 β’ Published Jun 26 β’ 14
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models Paper β’ 2506.21356 β’ Published Jun 26 β’ 22
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation Paper β’ 2506.21416 β’ Published Jun 26 β’ 28
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper β’ 2506.17450 β’ Published Jun 20 β’ 62