Blog, Articles, and discussions

Generate Images with Claude and Hugging Face

By August 19, 2025 • 17

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

By August 18, 2025 • 34

MCP for Research: How to Connect AI to Research Tools

By August 18, 2025 • 30

TextQuests: How Good are LLMs at Text-Based Video Games?

By August 12, 2025 guest • 26

Introducing AI Sheets: a tool to work with datasets using open AI models!

By August 8, 2025 • 65

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By August 8, 2025 • 50

🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?

By August 12, 2025 • 13

Vision Language Model Alignment in TRL ⚡️

By August 7, 2025 • 69

Welcome GPT OSS, the new open-source model family from OpenAI!

By August 5, 2025 • 470

Build an AI Shopping Assistant with Gradio MCP Servers

By July 31, 2025 • 50

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By July 29, 2025 • 158

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

By July 25, 2025 • 79

Parquet Content-Defined Chunking

By July 25, 2025 • 61

TimeScope: How Long Can Your Video Large Multimodal Model Go?

By July 23, 2025 • 39

Fast LoRA inference for Flux with Diffusers and PEFT

By July 23, 2025 • 45

Community Articles

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

11 days ago

• 63

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

and 9 others •

4 days ago

• 15

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

•

13 days ago

• 26

Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era

and 1 other •

2 days ago

• 12

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

•

14 days ago

• 18

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

•

8 days ago

• 19

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 658

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

and 4 others •

2 days ago

• 9

Code a simple RAG from scratch

•

Oct 29, 2024

• 160

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

and 9 others •

11 days ago

• 25

AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org

•

2 days ago

• 6

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 164

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

•

May 17

• 9

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 82

From GRPO to DAPO and GSPO: What, Why, and How

•

14 days ago

• 13

Kimina-Prover-RL

and 18 others •

8 days ago

• 9

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 116

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 209

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

and 5 others •

19 days ago

• 27

The GPT-OSS models are here… and they’re energy-efficient!

•

15 days ago

• 19
