Community Articles
view allSpeeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques
By
and 8 others
•
•
4Revisão da Monica AI: Extensão de IA de monica.im
By
•
•
1Llama-3.1-Nemotron-Nano-8B-v1-bnb-4bit unsloth Train examples
By
•
Llama-3.1-Nemotron-Nano-8B-v1-bnb-4bit
By
•
mistral.rs v0.5.0
By
•
•
3🎙️🧩 TP/Inference: Sharon Zhou on AI Hallucinations, Agents Hype, and Giving Developers the Keys to GenAI
By
•
•
2PangolinGuard: Fine-Tuning ModernBERT as a Lightweight Approach to AI Guardrails
By
•
•
4Making any LLM model "reasoning"
By
•
•
1Judge Assisted GRPO Tuning: The Pirates, Knights, and Vikings Experiment
By
•
DeepSearch Using Visual RAG in Agentic Frameworks 🔎
By
and 1 other
•
•
13Build Your Own Timeline Algorithm: A Blueprint
By
and 1 other
•
•
2Monica AI Review: AI Extension from monica.im
By
•
What is Qwen-Agent framework? Inside the Qwen family
By
and 1 other
•
•
6Key conributions from top AI Papers strait to your inbox
By
•
•
2LLM Routing for Batched Instructions
By
•
•
5Gradio’s Dataframe has been upgraded! 🎨
By
•
•
7Digest of models based on YandexGPT 5 Lite
By
•
•
25Undress AI: Technical Frameworks and Responsible Implementation in the Age of Generative Models
By
•
•
1Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm
By
•
•
1🌁#92: Fight for Developers and the Year of Orchestration
By
•
•
4