hbfreed committed on Jan 31

Commit 2ace760 (verified) · 1 parent: 10911fe

Update README.md

Files changed (1)

README.md

@@ -1,3 +1,12 @@
+---
+license: mit
+title: Open Concept Steering
+sdk: static
+emoji: 👁
+colorFrom: indigo
+colorTo: indigo
+short_description: Training SAEs
+---
 # Open Concept Steering
 Open Concept Steering is an open-source library for discovering and manipulating interpretable features in large language models using Sparse Autoencoders (SAEs). Inspired by Anthropic's work on [Scaling Monosemanticity](https://transformer-circuits.pub/2024/scaling-monosemanticity/) and [Golden Gate Claude](https://www.anthropic.com/news/golden-gate-claude), this project aims to make concept steering accessible to the broader research community.