Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Paper • 2501.14334 • Published Jan 24 • 20
We Can't Understand AI Using our Existing Vocabulary Paper • 2502.07586 • Published Feb 2025 • 10
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published Jan 30 • 17
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 2025 • 142
AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Paper • 2502.05176 • Published Feb 2025 • 32
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 200
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets Paper • 2502.01506 • Published Feb 3 • 33
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published Feb 2025 • 34
MoM: Linear Sequence Modeling with Mixture-of-Memories Paper • 2502.13685 • Published Feb 2025 • 33
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published Jan 30 • 27
The Ultra-Scale Playbook 🌌 Space • Running • 2.16k • The ultimate guide to training LLMs on large GPU clusters
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published Jan 28 • 36
DeepFlow: Serverless Large Language Model Serving at Scale Paper • 2501.14417 • Published Jan 24 • 3