8 12 26

Kunchang Li

Andy1621

https://github.com/Andy1621

Andy1621

AI & ML interests

computer vision

Recent Activity

upvoted a paper 5 days ago

Qwen2.5 Technical Report

upvoted a paper 8 days ago

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

authored a paper 8 days ago

Causal Diffusion Transformers for Generative Modeling

View all activity

Organizations

Andy1621's activity

upvoted a paper 5 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 325

upvoted a paper 8 days ago

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Paper • 2412.11279 • Published 9 days ago • 12

authored a paper 8 days ago

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published 8 days ago • 23

upvoted a paper 8 days ago

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published 8 days ago • 23

commented a paper 8 days ago

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published 8 days ago • 23 •

authored a paper 9 days ago

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

Paper • 2412.08467 • Published 14 days ago • 5

upvoted a paper 11 days ago

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published 12 days ago • 21

upvoted a paper 13 days ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published 13 days ago • 17

authored a paper 2 months ago

TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration

Paper • 2410.12183 • Published Oct 16 • 3

liked a model 4 months ago

OpenGVLab/UMT

Video Classification • Updated Aug 17 • 1

updated a model 5 months ago

Andy1621/VideoChat2_VicunaV0_7B_stage3_noLoRA

Updated Jul 30

liked a model 7 months ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12 • 29.3k • 4.64k

upvoted a paper 7 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 65

liked 2 models 8 months ago

internlm/internlm2-chat-20b

Text Generation • Updated Aug 20 • 11.8k • 87

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated 7 days ago • 2.38k • 405

upvoted a paper 9 months ago

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Paper • 2404.02905 • Published Apr 3 • 65

authored a paper 9 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22 • 22

upvoted a paper 9 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22 • 22

New activity in OpenGVLab/VideoMamba 9 months ago

Local demo on the repo

#4 opened 9 months ago by

ysharma

Upload IMG_20240316_204018.jpg

#5 opened 9 months ago by

Hammer88