Introducing RWKV — An RNN with the advantages of a transformer (May 15, 2023)
GaLore: Advancing Large Model Training on Consumer-grade Hardware (Mar 20, 2024)