knightnemo's picture

8 1

knightnemo

knightnemo

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Unified Video Action Model

upvoted a paper 5 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

upvoted a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

View all activity

Organizations

None yet

knightnemo's activity

upvoted a paper 4 days ago

Unified Video Action Model

Paper • 2503.00200 • Published 9 days ago • 11

upvoted a paper 5 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 10 days ago • 26

upvoted a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted a collection 10 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 8 items • Updated 14 days ago • 389

upvoted a paper 12 days ago

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Paper • 2502.15894 • Published 16 days ago • 20

upvoted a paper 13 days ago

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Paper • 2502.16707 • Published 14 days ago • 11

upvoted a paper 20 days ago

SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors

Paper • 2502.11167 • Published 21 days ago • 10

upvoted a paper 6 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123