Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published 6 days ago • 34 • 3
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 13 days ago • 43 • 8
Scaling Image Tokenizers with Grouped Spherical Quantization Paper • 2412.02632 • Published Dec 3, 2024 • 10 • 2
Training and Evaluating Language Models with Template-based Data Generation Paper • 2411.18104 • Published Nov 27, 2024 • 3 • 3
General Preference Modeling with Preference Representations for Aligning Language Models Paper • 2410.02197 • Published Oct 3, 2024 • 9 • 4
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper • 2402.03620 • Published Feb 6, 2024 • 115 • 10
Contrastive Learning Is Spectral Clustering On Similarity Graph Paper • 2303.15103 • Published Mar 27, 2023 • 2 • 3