view article Article Introducing RWKV — An RNN with the advantages of a transformer May 15, 2023 • 14
view article Article GaLore: Advancing Large Model Training on Consumer-grade Hardware Mar 20, 2024 • 26
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 606
StableIdentity: Inserting Anybody into Anywhere at First Sight Paper • 2401.15975 • Published Jan 29, 2024 • 17