YannQi's picture

2 4

YannQi

YannQi

·

https://yannqi.github.io/

yannqi

AI & ML interests

Computer vision, AGI, Multi-modality.

Recent Activity

liked a Space about 1 month ago

MRAMG/README

upvoted a collection 5 months ago

upvoted a paper 6 months ago

Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis

View all activity

Organizations

YannQi's activity

liked a Space about 1 month ago

README

upvoted a collection 5 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 16 days ago • 560

upvoted a paper 6 months ago

Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis

Paper • 2409.06135 • Published Sep 10, 2024 • 16

authored 3 papers 6 months ago

Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis

Paper • 2409.06135 • Published Sep 10, 2024 • 16

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Paper • 2408.01708 • Published Aug 3, 2024 • 4

Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation

Paper • 2312.06462 • Published Dec 11, 2023

liked a model 12 months ago

YannQi/COMBO-AVS-checkpoints

Updated Mar 19, 2024 • 2

updated a model 12 months ago

YannQi/COMBO-AVS-checkpoints

Updated Mar 19, 2024 • 2

liked a model about 1 year ago

stabilityai/stable-video-diffusion-img2vid

Image-to-Video • Updated Jul 10, 2024 • 101k • 897

liked a Space almost 2 years ago

MiniGPT-4