Jordan Taylor

JordanTensor
https://sites.google.com/view/jordantensor
  • jordantensor
  • jordansauce
  • jordan-tensor

AI & ML interests

Mechanistic interpretability, mechanistic anomaly detection, model-internals techniques, and AI safety techniques generally.

Organizations

Mechanistic Anomaly Detection

JordanTensor's collections (1)

Sandbagging research sprint 1
https://github.com/jordansauce/sandbagging-research-sprint/ https://wandb.ai/jordantensor/gemma-sandbagging
  • JordanTensor/sandbagging-sciq

    Viewer • Updated Feb 14 • 13.7k • 14 • 1
  • JordanTensor/sandbagging-prefixes

    Viewer • Updated Dec 7, 2024 • 9.9k • 12 • 1
  • JordanTensor/gemma-sandbagging-0w4j7rba-step512

    Updated Feb 14
  • JordanTensor/gemma-sandbagging-0w4j7rba-step1024

    Updated Feb 14