hongyu

learn12138

AI & ML interests

None yet

Organizations

None yet

upvoted 20 papers 5 months ago

DreamRelation: Relation-Centric Video Customization

Paper • 2503.07602 • Published Mar 10 • 14

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Paper • 2503.07027 • Published Mar 10 • 29

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 233

LoRACode: LoRA Adapters for Code Embeddings

Paper • 2503.05315 • Published Mar 7 • 13

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Paper • 2503.05638 • Published Mar 7 • 19

VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Paper • 2503.05639 • Published Mar 7 • 24

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published Mar 3 • 32

Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer

Paper • 2503.02495 • Published Mar 4 • 8

Diverse Controllable Diffusion Policy with Signal Temporal Logic

Paper • 2503.02924 • Published Mar 4 • 3

Remasking Discrete Diffusion Models with Inference-Time Scaling

Paper • 2503.00307 • Published Mar 1 • 11

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification

Paper • 2503.02537 • Published Mar 4 • 12

Unified Video Action Model

Paper • 2503.00200 • Published Feb 28 • 14

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Paper • 2503.01103 • Published Mar 3 • 5

Training Consistency Models with Variational Noise Coupling

Paper • 2502.18197 • Published Feb 25 • 7

Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting

Paper • 2502.19459 • Published Feb 26 • 11

Mobius: Text to Seamless Looping Video Generation via Latent Shift

Paper • 2502.20307 • Published Feb 27 • 19

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Paper • 2502.20126 • Published Feb 27 • 20

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Paper • 2502.20388 • Published Feb 27 • 16

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27 • 30

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Paper • 2502.17363 • Published Feb 24 • 38