6 29 60

zhang yuechen

julianjuaner

https://julianjuaner.github.io/

julianjuaner

AI & ML interests

Controllable Generation (Customization)

Recent Activity

upvoted a collection 6 days ago

X2I Dataset

liked a model 6 days ago

FastVideo/FastHunyuan

upvoted a paper 7 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

View all activity

Organizations

julianjuaner's activity

upvoted a collection 6 days ago

X2I Dataset

Collection

Datasets used in OmniGen-v1 • 5 items • Updated 3 days ago • 7

liked a model 6 days ago

FastVideo/FastHunyuan

Text-to-Video • Updated 8 days ago • 368 • 113

upvoted a paper 7 days ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 16 days ago • 45

authored a paper 10 days ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published 13 days ago • 43

upvoted a paper 11 days ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22 • 53

commented a paper 13 days ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published 13 days ago • 17 •

upvoted 2 papers 13 days ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published 13 days ago • 17

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published 13 days ago • 43

upvoted a paper 20 days ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published 20 days ago • 104

liked a model 23 days ago

Yuanshi/OminiControl

Image-to-Image • Updated 16 days ago • 11.6k • 100

liked a model 30 days ago

ali-vilab/In-Context-LoRA

Text-to-Image • Updated 9 days ago • 125k • • 500

liked 2 models about 2 months ago

THUDM/CogVideoX1.5-5B-SAT

Image-to-Video • Updated Nov 8 • 143

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16 • 1.27M • • 7.5k

liked a model 2 months ago

genmo/mochi-1-preview

Text-to-Video • Updated 7 days ago • 34.3k • 1.12k

liked a model 3 months ago

alibaba-pai/CogVideoX-Fun-2b-InP

Image-to-Video • Updated Sep 23 • 930 • 19

liked a model 4 months ago

THUDM/CogVideoX-5b

Text-to-Video • Updated Nov 23 • 116k • 541

upvoted a paper 4 months ago

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

Paper • 2408.08189 • Published Aug 15 • 15

liked a Space 4 months ago

Running

143

👩‍🎨

UniPortrait

upvoted a paper 4 months ago

UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization

Paper • 2408.05939 • Published Aug 12 • 13

authored a paper 4 months ago

ControlNeXt: Powerful and Efficient Control for Image and Video Generation

Paper • 2408.06070 • Published Aug 12 • 53