Kyle Tuft

Chilangosta

AI & ML interests

None yet

Recent Activity

liked a Space about 13 hours ago

Qwen/Qwen2.5-VL-32B-Instruct

liked a model 1 day ago

ydeng9/OpenVLThinker-7B

upvoted a paper 1 day ago

InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Organizations

None yet

Chilangosta's activity

liked a Space about 13 hours ago

Qwen2.5 VL 32B Instruct Demo

🏃

Chat with images and videos using Qwen

liked a model 1 day ago

ydeng9/OpenVLThinker-7B

Image-Text-to-Text • Updated about 6 hours ago • 21 • 6

upvoted a paper 1 day ago

InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Paper • 2503.16418 • Published 5 days ago • 32

liked a Space 4 days ago

Scale Wise Distillation

🖼

Generate images from text prompts

liked a model 4 days ago

ByteDance/InfiniteYou

Text-to-Image • Updated about 9 hours ago • 350

upvoted a paper 7 days ago

VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published 11 days ago • 20

liked a model 7 days ago

Skywork/Skywork-R1V-38B

Image-Text-to-Text • Updated 7 days ago • 2.98k • 102

liked a model 8 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Image-Text-to-Text • Updated 3 days ago • 66.6k • 969

upvoted 2 papers 8 days ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published 13 days ago • 41

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published 11 days ago • 75

liked a model 11 days ago

NousResearch/DeepHermes-3-Llama-3-3B-Preview-GGUF

Updated 12 days ago • 3.33k • 23

liked 2 models 14 days ago

google/siglip-so400m-patch14-384

Zero-Shot Image Classification • Updated Sep 26, 2024 • 8M • 505

trashpanda-org/QwQ-32B-Snowdrop-v0

Text Generation • Updated 13 days ago • 3.73k • 46

liked a model 15 days ago

Lightricks/LTX-Video

Text-to-Video • Updated 13 days ago • 173k • 1.09k

upvoted a paper 15 days ago

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Paper • 2503.07027 • Published 16 days ago • 26

liked a model 15 days ago

TencentARC/VideoPainter

Updated about 11 hours ago • 16

upvoted a paper 15 days ago

VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Paper • 2503.05639 • Published 18 days ago • 22

upvoted 2 papers 17 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 20 days ago • 38

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 20 days ago • 84

upvoted an article 19 days ago

Remote VAEs for decoding with HF endpoints 🤗

Article • about 1 month ago • 37