martin PRO

martintomov

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

SmolVLM2: Bringing Video Understanding to Every Device

liked a model 5 days ago

microsoft/wham

liked a model 7 days ago

Skywork/SkyReels-V1-Hunyuan-I2V

martintomov's activity

upvoted an article 3 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

5 days ago

• 146

upvoted a collection 15 days ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 4 days ago • 48

upvoted an article 18 days ago

Article

Open-source DeepResearch – Freeing our search agents

21 days ago

• 1.09k

upvoted a collection about 1 month ago

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 262

upvoted a collection 3 months ago

[MASK] is All You Need

Code, dataset, and pretrained model • 6 items • Updated 18 days ago • 9

upvoted a paper 3 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 128

upvoted a collection 3 months ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 142

upvoted a paper 3 months ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 55

upvoted a collection 3 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 527

upvoted 3 papers 3 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 65

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published Nov 11, 2024 • 65

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 141

upvoted 4 papers 4 months ago

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published Nov 7, 2024 • 50

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 46

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 111

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published Oct 25, 2024 • 23

upvoted 3 papers 5 months ago

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Paper • 2409.18124 • Published Sep 26, 2024 • 32

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published Sep 20, 2024 • 69

A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis

Paper • 2409.08947 • Published Sep 13, 2024 • 14

upvoted a collection 6 months ago

Hermes 3

The Hermes 3 Series of Models • 12 items • Updated 11 days ago • 108