view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 207
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs By wenhuach and 8 others • Apr 29 • 39
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 115
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • Jul 1 • 108
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • Mar 10 • 146
view article Article wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • Sep 27, 2024 • 48
RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts Paper • 2305.17679 • Published May 28, 2023 • 2
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
view article Article Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 487
view article Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 66
view article Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • Jan 24 • 34
view article Article Distributed SFT with trl and DeepSpeed Part 1: Starting Locally By jlzhou • Jan 23 • 4
ProLIP Collection Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18 • 10
view article Article How to Expand Your AI Music Generations of 30 Seconds to Several Minutes By theeseus-ai • Dec 13, 2024 • 3
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • Jan 19 • 27
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 41