
Anthonny Olime

Aviv-anthonnyolime

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1

liked a dataset 2 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1



Aviv-anthonnyolime's activity

upvoted a paper 2 days ago

Vision-Speech Models: Teaching Speech Models to Converse about Images

Paper • 2503.15633 • Published 8 days ago • 1

upvoted 3 papers 23 days ago

Self-Guided Diffusion Models

Paper • 2210.06462 • Published Oct 12, 2022 • 3

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 24 days ago • 77

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 79

upvoted 3 papers 27 days ago

upvoted an article about 1 month ago

SigLIP 2: A better multilingual vision language encoder

Article • Published Feb 21 • 146

upvoted 7 papers about 1 month ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 138

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 173

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published Feb 6 • 30

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 47

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 149

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 52

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 107

upvoted an article about 1 month ago

Fixing Open LLM Leaderboard with Math-Verify

Article • Published Feb 14 • 27

upvoted a paper about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 214