B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 2 days ago • 31
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 6 days ago • 66 • 2
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing Paper • 2305.11738 • Published May 19, 2023 • 8
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 231 items • Updated 4 days ago • 35
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 12 days ago • 74
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper • 2412.12606 • Published 8 days ago • 41
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published 8 days ago • 40
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper • 2412.12606 • Published 8 days ago • 41
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 10 days ago • 24
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 10 days ago • 24
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 10 days ago • 24 • 2
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published 9 days ago • 33
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published 9 days ago • 33 • 4