Article nanoJAXGPT: A pedagogical introduction to JAX/Equinox By sachithgunasekara • Oct 23 • 4
Article 🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT By asoria • Sep 24 • 12
How FaR Are Large Language Models From Agents with Theory-of-Mind? Paper • 2310.03051 • Published Oct 4, 2023 • 34
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19 • 135
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 101
Open-Bezoar Collection Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data • 7 items • Updated Apr 19 • 6