GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models Paper • 2411.05830 • Published Nov 5 • 20
Attention Is All You Need But You Don't Need All Of It For Inference of Large Language Models Paper • 2407.15516 • Published Jul 22 • 1
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22 • 45
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12 • 65
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models Paper • 2307.06440 • Published Jul 12, 2023 • 3
Challenges and Applications of Large Language Models Paper • 2307.10169 • Published Jul 19, 2023 • 47