Timaeus

non-profit

https://timaeus.co

TimaeusResearch

timaeus-research

AI & ML interests

Singular Learning Theory & Developmental Interpretability

Organization Card

Timaeus

About Us

Timaeus is a non-profit research organization founded in October 2023, focused on applying singular learning theory (SLT) to AI alignment. Our mission is to make fundamental breakthroughs in technical AI alignment using deep ideas from mathematics and the sciences.

Our Research Focus

We concentrate on understanding the relationship between internal structure in neural networks and the geometry of the loss landscape, as revealed by singular learning theory. This connection provides a basis for developing scalable tools for interpretability, mechanistic anomaly detection, and beyond.

Key Research Areas:

Developmental Interpretability (DevInterp): Applying SLT to interpret the development of structure in neural networks, aiming to identify when, where, and what circuits form.
Structural Generalization (StrucGen): Using SLT to study out-of-distribution generalization, with the goal of building tools to predict how circuits will "break".
Geometry of Program Synthesis (GPS): Applying SLT to study inductive biases, advancing our understanding of how to predict and measure alignment-relevant risks.

Connect With Us

Website: https://timaeus.co

For collaboration inquiries or more information about our research, please contact us at [email protected]

Collections 14

Models 76

timaeus/tetrahedron-space-dedupe

Updated 16 days ago

timaeus/prune-runs-random_heads-0.5-F_random_heads_0.50_pretrained_50000

0.0B • Updated Jul 16 • 5

timaeus/prune-runs-random_heads-0.5-T_random_heads_0.50_pretrained_50000

0.0B • Updated Jul 16 • 2

timaeus/prune-runs-head_norm-0.5-F_head_norm_0.50_pretrained_50000

0.0B • Updated Jul 16 • 2

timaeus/prune-runs-head_norm-0.5-T_head_norm_0.50_pretrained_50000

0.0B • Updated Jul 16 • 2

timaeus/prune-runs-head_attention-0.5-F_head_attention_0.50_pretrained_50000

0.0B • Updated Jul 16 • 1

timaeus/prune-runs-head_attention-0.5-T_head_attention_0.50_pretrained_50000

0.0B • Updated Jul 16 • 1

timaeus/test-distillation

0.0B • Updated Jul 16 • 1

timaeus/triangle-40k-og

timaeus/dsir-pile-1m-filtered-no-github-or-dm_mathematics

Datasets 737

timaeus/dsir-pile-1m-filtered-no-github-or-dm_mathematics

Viewer • Updated Jul 7 • 1M • 40

timaeus/dsir-pile-13m-filtered-no-github-or-dm_mathematics

Viewer • Updated Jun 26 • 12.8M • 23

timaeus/pile-ubuntu_irc-broken

Viewer • Updated Apr 21 • 13.7k • 29

timaeus/pile-philpapers

Viewer • Updated Apr 3 • 49k • 1.06k

timaeus/pile-europarl

Viewer • Updated Apr 3 • 101k • 1.08k

timaeus/pile-gutenberg_pg-19

Viewer • Updated Apr 3 • 4k • 10

timaeus/pile-nih_exporter

Viewer • Updated Apr 3 • 100k • 1.04k

timaeus/pile-hackernews

Viewer • Updated Apr 3 • 100k • 1.02k

timaeus/pile-enron_emails

Viewer • Updated Apr 3 • 100k • 1.12k • 2

timaeus/pile-dm_mathematics

Viewer • Updated Apr 3 • 100k • 1.12k • 1
