t.d.a.g. PRO

sequelbox

AI & ML interests

open source, infinite games. (they/them)

Recent Activity

Organizations

Valiant Labs's profile picture

Posts 21

view post
Post
248
EARLY RELEASE PREVIEW of Esper 3 for Qwen 3 8b!

- Reasoning finetune focused on coding, architecture, DevOps, and general reasoning
- Trained using DeepSeek-R1 685b synthetic data
- Official Apache 2.0 release coming soon on Valiant Labs: try out the preview for now and see what you think!

Try it out: sequelbox/Qwen3-8B-Esper3-PREVIEW

with my love,
allegra
view post
Post
1720
TITANIUM 2 Deepseek-R1 dataset is here! Open-source synthetic architecture and DevOps dataset: sequelbox/Titanium2-DeepSeek-R1

Esper 3 will be coming out soon for multiple base models, trained on Titanium, Raiden, and more :)

with my love,
allegra