Scaling Image Tokenizers with Grouped Spherical Quantization Paper • 2412.02632 • Published 23 days ago • 10
TransNormerLLM Collection TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer • 11 items • Updated Jun 25 • 3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models Paper • 2401.04658 • Published Jan 9 • 25