merve's picture

merve PRO

merve

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

updated a collection about 1 hour ago

Releases August 9

liked a dataset about 1 hour ago

jxm/gpt-oss20b-samples

updated a collection about 1 hour ago

Releases August 9

View all activity

Organizations

Posts 153

Post

2326

GPT-4.1-mini level model right in your iPhone 🤯

openbmb/MiniCPM-V-4 is only 4B while surpassing GPT-4.1-mini in vision benchmarks 🔥

allows commercial use as well!

Articles 33

Article

37

Vision Language Model Alignment in TRL ⚡️

View all Articles

Collections 68

View 68 collections

spaces 107

Vision Papers

All paper summaries read by Merve

No application file

Test2

Llama Guard 4

Check if text and images are safe

Running on Zero

ShieldGemma2 VLM

Demo for ShieldGemma 2, multimodal safety model

UDOP

Generate text from document images

Running on Zero

Paligemma2 Vqav2

PaliGemma2 LoRA finetuned on VQAv2

View 107 Spaces

models 98

merve/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v

Updated 19 days ago

merve/smol-vision

Image-Text-to-Text • Updated 19 days ago • 94

merve/Qwen2.5-VL-7B-Instruct-trl-mpo-rlaif-v

Updated 20 days ago

merve/gemma-3n-finevideo

Updated 26 days ago • 7

merve/vjepa2-vitl-fpc16-256-ssv2-ucf101

Video Classification • 0.4B • Updated Jun 13 • 10

merve/test

merve/SmolVLM2-500M-Video-Instruct-video-feedback

Image-Text-to-Text • 0.5B • Updated Feb 20 • 4

merve/SmolVLM2-500M-Video-Instruct-videofeedback

Image-Text-to-Text • 0.5B • Updated Feb 20 • 4

merve/SmolVLM2-500M-Video-Instruct-emotions

Image-Text-to-Text • 0.5B • Updated Feb 20 • 5

merve/colpali_ufo

Updated Dec 20, 2024 • 6

datasets 30

merve/vlm_test_images

Viewer • Updated 7 days ago • 19 • 1.17k • 2

merve/finevideo-split

Viewer • Updated Jul 9 • 3.14k • 103

merve/test2

Updated Jun 20 • 5

merve/retail-in-the-wild

Viewer • Updated Mar 6 • 20 • 56 • 3

merve/model-test-inputs

Updated Oct 21, 2024 • 25

merve/vqav2-small

Viewer • Updated Aug 8, 2024 • 21.4k • 1.2k • 12

merve/SGinW

Viewer • Updated Jul 11, 2024 • 16.7k • 372 • 1

merve/pascal-voc

Viewer • Updated Jul 6, 2024 • 336k • 704 • 1

merve/YouCook2

Viewer • Updated May 28, 2024 • 2k • 62

merve/faiss_embeddings

Updated Jan 25, 2024 • 18

View 30 datasets