1 19 389

trevor PRO

TrevorJS

TrevorS

AI & ML interests

small models

Recent Activity

liked a model about 13 hours ago

sesame/csm-1b

liked a model 2 days ago

google/gemma-3-27b-it

liked a model 2 days ago

open-r1/OlympicCoder-32B

View all activity

Organizations

None yet

TrevorJS's activity

upvoted an article 2 days ago

Article

Open R1: Update #3

and 9 others •

3 days ago

• 207

upvoted a paper 6 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 107

upvoted a paper 9 days ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published 10 days ago • 23

upvoted a collection 10 days ago

C4AI Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 10 days ago • 63

upvoted a paper 11 days ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published 16 days ago • 38

upvoted an article 19 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

22 days ago

• 205

upvoted a collection 20 days ago

SigLIP2

Collection

36 items • Updated 2 days ago • 62

upvoted 2 papers about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 52

upvoted an article about 2 months ago

Article

We now support VLMs in smolagents!

Jan 24

• 92

upvoted a collection about 2 months ago

Sana

Collection

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated Feb 10 • 88

upvoted 4 papers 5 months ago

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 26

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Paper • 2409.20566 • Published Sep 30, 2024 • 56

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28, 2024 • 19

UniMuMo: Unified Text, Music and Motion Generation

Paper • 2410.04534 • Published Oct 6, 2024 • 19

upvoted a paper 6 months ago

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19, 2024 • 37

upvoted 2 papers 11 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 69

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24, 2024 • 28

upvoted a paper 12 months ago

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 70