Ivelin Ivanov's picture

6 6 21

Ivelin Ivanov PRO

ivelin

·

AI & ML interests

computer vision, vision-language models, multi modal transformers

Recent Activity

updated a model 12 days ago

ivelin/SmolVLM-Instruct-vqav2

liked a Space 16 days ago

donb-hf/eduscape

reacted to merve's post with 😎 20 days ago

small but mighty 🔥 you can fine-tune SmolVLM on an L4 with batch size of 4 and it will only take 16.4 GB VRAM 🫰🏻 also with gradient accumulation simulated batch size is 16 ✨ I made a notebook that includes all the goodies: QLoRA, gradient accumulation, gradient checkpointing with explanations on how they work 💝 https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb

View all activity

Organizations

ivelin's activity

updated a model 12 days ago

ivelin/SmolVLM-Instruct-vqav2

Updated 12 days ago • 4

liked a Space 16 days ago

EduScape

reacted to merve's post with 😎 20 days ago

Post

2631

small but mighty 🔥
you can fine-tune SmolVLM on an L4 with batch size of 4 and it will only take 16.4 GB VRAM 🫰🏻 also with gradient accumulation simulated batch size is 16 ✨
I made a notebook that includes all the goodies: QLoRA, gradient accumulation, gradient checkpointing with explanations on how they work 💝 https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb

liked a model 29 days ago

rail-berkeley/crossformer

Robotics • Updated Aug 26 • 52 • 5

upvoted 2 papers 30 days ago

One to rule them all: natural language to bind communication, perception and action

Paper • 2411.15033 • Published Nov 22 • 3

WildLMa: Long Horizon Loco-Manipulation in the Wild

Paper • 2411.15131 • Published Nov 22 • 6

upvoted a paper about 1 month ago

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 17

liked 3 models about 1 month ago

notmahi/dobb-e

Robotics • Updated Nov 28, 2023 • 173 • 80

Embodied-CoT/ecot-openvla-7b-bridge

Robotics • Updated Jul 25 • 4.93k • 4

robotics-diffusion-transformer/rdt-1b

Robotics • Updated Oct 17 • 3.77k • 56

liked a Space about 1 month ago

Visualize Dataset (v2.0+ latest dataset format)

upvoted a paper about 2 months ago

Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Paper • 2303.04137 • Published Mar 7, 2023 • 3

upvoted an article 3 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 81

updated a Space 7 months ago

UI RefExp Click (by GuardianUI)

liked a Space 11 months ago

Gradio Lite Classify

updated a Space 11 months ago

Darts Demo

liked a model almost 2 years ago

google/pix2struct-base

Image-to-Text • Updated Dec 24, 2023 • 5.95k • 66

updated a Space almost 2 years ago

LabelStudio

liked 2 models almost 2 years ago

hkunlp/instructor-xl

Sentence Similarity • Updated Jan 21, 2023 • 17.2k • 554

intfloat/e5-base

Sentence Similarity • Updated Aug 7, 2023 • 73.4k • 20