view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 902
view post Post 2563 Lightweight (nanoGPT) implementation of hybrid norm - an intuitive normalization method that combines the strength of both pre-norm (i.e QKV-norm in MHA) and post-norm in the feed-forward network.Code: https://github.com/Jaykef/ai-algorithms/blob/main/hybrid_normalization.ipynb See translation 👀 5 5 🔥 2 2 + Reply