Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding • Paper • 2507.19427 • Published 7 days ago • 8
ML4CO-KIDA: Knowledge Inheritance in Dataset Aggregation • Paper • 2201.10328 • Published Jan 25, 2022
Collaborative Neural Rendering using Anime Character Sheets • Paper • 2207.05378 • Published Jul 12, 2022 • 1
Real-Time Intermediate Flow Estimation for Video Frame Interpolation • Paper • 2011.06294 • Published Nov 12, 2020 • 1
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction • Paper • 2502.11946 • Published Feb 17, 2025 • 3
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization • Paper • 2505.24862 • Published May 30, 2025 • 31
Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model • Paper • 2506.08967 • Published Jun 10, 2025 • 2
Recent Advances in Attack and Defense Approaches of Large Language Models • Paper • 2409.03274 • Published Sep 5, 2024 • 1
Advancing Video Self-Supervised Learning via Image Foundation Models • Paper • 2505.19218 • Published May 25, 2025
A Survey on Future Frame Synthesis: Bridging Deterministic and Generative Approaches • Paper • 2401.14718 • Published Jan 26, 2024 • 1