Blog, Articles, and discussions

HuggingFace, IISc partner to supercharge model building on India's diverse languages

By February 27, 2025 • 20

Community Articles

Interactive Tools for machine learning, deep learning, and math

5 days ago • 33

Uncensor any LLM with abliteration

Jun 13, 2024 • 593

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

3 days ago • 13

Mitigating False Negatives in Multiple Negatives Ranking Loss for Retriever Training

6 days ago • 6

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Feb 11 • 37

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30 • 69

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11 • 41

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16 • 15

DeepWiki: Best AI Documentation Generator for Any Github Repo

Apr 28 • 27

AgenticSeek: Running Manus AI Locally with Deepseek & Qwen (Open Source Tool)

7 days ago • 5

🌙 Introducing Moon: Storytelling Generator Model

1 day ago • 4

Introduction to State Space Models (SSM)

Jul 19, 2024 • 139

Let's talk about LLM evaluation

May 23, 2024 • 174

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024 • 324

Code a simple RAG from scratch

Oct 29, 2024 • 81

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 142

Retrieval Augmented Generation with Huggingface Transformers and Ray

By February 10, 2021 guest • 6

Hugging Face on PyTorch / XLA TPUs

By February 9, 2021 guest • 3

Porting fairseq wmt19 translation system to transformers

By November 3, 2020

Hyperparameter Search with Transformers and Ray Tune

By November 2, 2020 guest • 4

Community Articles

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17 • 273

Falcon-Arabic: A Breakthrough in Arabic Language Models

and 7 others • 10 days ago • 28

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

11 days ago • 18

Major PHYBench Update Released

and 1 other • 6 days ago • 7
