Rethinking the Influence of Source Code on Test Case Generation Paper • 2409.09464 • Published Sep 14, 2024 • 1
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge Paper • 2412.13670 • Published Dec 18, 2024 • 5
CodeArena: A Collective Evaluation Platform for LLM Code Generation Paper • 2503.01295 • Published 7 days ago • 7
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 25 days ago • 143
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 27
Mercury: An Efficiency Benchmark for LLM Code Synthesis Paper • 2402.07844 • Published Feb 12, 2024 • 1