Kyu Song

kyunocap

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Automated Movie Generation via Multi-Agent CoT Planning

upvoted a paper 1 day ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

upvoted a paper 9 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Organizations

None yet

kyunocap's activity

upvoted 2 papers 1 day ago

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published 3 days ago • 34

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 8 days ago • 203

upvoted a paper 9 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 10 days ago • 72

upvoted a paper 15 days ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published 16 days ago • 69

liked 2 models 16 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 7 days ago • 3.38M • 662

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 6 days ago • 293k • 369

upvoted a paper 22 days ago

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published 25 days ago • 52

liked a Space 22 days ago

2.23k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 24 days ago

On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices

Paper • 2502.04363 • Published Feb 5 • 12

upvoted a paper 29 days ago

Magic 1-For-1: Generating One Minute Video Clips within One Minute

Paper • 2502.07701 • Published about 1 month ago • 34

liked a model about 1 month ago

DAMO-NLP-SG/VideoLLaMA3-7B

Visual Question Answering • Updated 1 day ago • 22.8k • 39

upvoted 5 papers about 1 month ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 189

liked a Space about 1 month ago

1.18k

FLUX Prompt Generator

😻

Display a user interface for various tasks

upvoted a paper about 1 month ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 63

upvoted 2 papers about 2 months ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 57

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346