Aritra Roy Gosthipaty PRO

ariG23498

https://arig23498.github.io/

AI & ML interests

Deep Representation Learning

Recent Activity

liked a model about 2 hours ago

Qwen/Qwen2.5-Omni-7B

upvoted a collection 1 day ago

TxGemma Release

liked a Space 6 days ago

lvwerra/distill-blog-template

ariG23498's activity

upvoted a collection 1 day ago

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 1 day ago • 18

upvoted a collection 6 days ago

Florence

9 items • Updated Jan 8 • 166

upvoted an article 8 days ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

9 days ago

• 29

upvoted an article 14 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

15 days ago

• 346

upvoted a collection 14 days ago

Gemma 3

4 items • Updated 15 days ago • 15

upvoted a collection 15 days ago

Gemma 3 Release

9 items • Updated 13 days ago • 294

upvoted a collection 20 days ago

Shot categorizer

Fine-tune of Florence-2 to generate shot categories, useful for data curation. Code: https://github.com/huggingface/movie-shot-categorizer. • 3 items • Updated 20 days ago • 2

upvoted a collection 22 days ago

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 22 days ago • 68

upvoted an article 22 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

23 days ago

• 69

upvoted 2 articles 27 days ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 145

Article

HuggingFace, IISc partner to supercharge model building on India's diverse languages

28 days ago

• 17

upvoted a paper 28 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 114

upvoted a collection 28 days ago

Phi-4

Phi-4 family of small language and multi-modal models. • 7 items • Updated 23 days ago • 112

upvoted an article about 1 month ago

Article

Remote VAEs for decoding with HF endpoints 🤗

about 1 month ago

• 37

upvoted 2 collections about 1 month ago

SigLIP 2

OpenCLIP and timm SigLIP 2 models • 45 items • Updated Feb 21 • 14

SigLIP2

36 items • Updated 15 days ago • 65

upvoted a paper about 1 month ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 138

upvoted an article about 1 month ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 218