2 13 23

Huang

Jinfa

AI & ML interests

None yet

Recent Activity

liked a Space 16 days ago

ssocean/Newborn_Article_Impact_Predict

liked a dataset 21 days ago

BestWishYsh/ConsisID-preview-Data

liked a model 25 days ago

BestWishYsh/ConsisID-preview

View all activity

Organizations

Jinfa's activity

liked a Space 16 days ago

Running on Zero

💻

Newborn Article Impact Predict

Use title and abstract to predict future academic impact

liked a dataset 21 days ago

BestWishYsh/ConsisID-preview-Data

Viewer • Updated 4 days ago • 31.9k • 829 • 17

liked a model 25 days ago

BestWishYsh/ConsisID-preview

Image-to-Video • Updated 2 days ago • 1.23k • 23

liked a Space 27 days ago

Running on L40S

🔥

ConsisID-preview

Identity-Preserving Text-to-Video Generation

upvoted a paper 28 days ago

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published 30 days ago • 35

authored a paper 28 days ago

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published 30 days ago • 35

liked a dataset 28 days ago

Xkev/LLaVA-CoT-100k

Viewer • Updated 28 days ago • 98.6k • 2.3k • 58

upvoted 2 papers 29 days ago

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

Paper • 2411.15411 • Published Nov 23 • 7

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published 30 days ago • 10

liked a model about 1 month ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated 10 days ago • 11.4k • 133

upvoted a paper about 1 month ago

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8 • 16

commented a paper about 1 month ago

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8 • 16 •

liked a model about 2 months ago

genmo/mochi-1-preview

Text-to-Video • Updated 7 days ago • 34.3k • 1.12k

upvoted a paper 2 months ago

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14 • 51

liked 2 datasets 2 months ago

BestWishYsh/ChronoMagic-ProH

Viewer • Updated 23 days ago • 145k • 363 • 15

BestWishYsh/ChronoMagic-Bench

Viewer • Updated 23 days ago • 1.8k • 83 • 10

upvoted a paper 3 months ago

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Paper • 2409.18042 • Published Sep 26 • 36

liked a model 5 months ago

zheyangqin/VADER_VideoCrafter_PickScore

Text-to-Video • Updated Jul 23 • 48 • 13

upvoted 2 papers 6 months ago

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11 • 48

MAVIS: Mathematical Visual Instruction Tuning

Paper • 2407.08739 • Published Jul 11 • 30