Blog, Articles, and discussions

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By July 16, 2025 • 53

Community Articles

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

18 days ago • 130

Introduction to State Space Models (SSM)

Jul 19, 2024 • 158

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30 • 107

Uncensor any LLM with abliteration

Jun 13, 2024 • 638

Code a simple RAG from scratch

Oct 29, 2024 • 137

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 196

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17 • 324

From Zero to MCP: Three Lessons I Learned Building Tools for LLMs

4 days ago • 5

Small Language Models (SLM): A Comprehensive Overview

Feb 22 • 51

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16 • 29

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

and 5 others • May 21 • 32

Why We Built the OpenMDW License: A Comprehensive License for ML Models

Jul 2 • 22

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024 • 283

Mastering Tensor Dimensions in Transformers

Jan 12 • 82

Common AI Model Formats

Feb 27 • 47

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others • Jun 11 • 74

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

By March 15, 2024 guest • 10

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

By March 15, 2024 • 11

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

By February 29, 2024 guest • 1

StarCoder2 and The Stack v2

By February 28, 2024 • 9

AI Watermarking 101: Tools and Techniques

By February 26, 2024 • 20

Fine-Tuning Gemma Models in Hugging Face

By February 23, 2024 guest • 36

🪆 Introduction to Matryoshka Embedding Models

By February 23, 2024 • 153

Welcome Gemma - Google's new open LLM

By February 21, 2024 • 25

🤗 PEFT welcomes new merging methods

By February 19, 2024 • 22

Synthetic data: save money, time and carbon with open source

By February 16, 2024 • 78

From OpenAI to Open LLMs with Messages API

By February 8, 2024 • 20

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

By January 30, 2024 guest • 9

Open-source LLMs as LangChain Agents

By January 24, 2024 • 69

Preference Tuning LLMs with Direct Preference Optimization Methods

By January 18, 2024 • 69