Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published Nov 18, 2024 • 19
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper • 2410.22304 • Published Oct 29, 2024 • 16
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 136
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14, 2024 • 12
MIRAI: Evaluating LLM Agents for Event Forecasting Paper • 2407.01231 • Published Jul 1, 2024 • 16 • 3
Enhancing Large Vision Language Models with Self-Training on Image Comprehension Paper • 2405.19716 • Published May 30, 2024
Check out our new benchmark paper on LLM agents for global events forecasting!
MIRAI: Evaluating LLM Agents for Event Forecasting (2407.01231)
Arxiv: https://arxiv.org/abs/2407.01231
Project page: https://mirai-llm.github.io
GitHub Repo: https://github.com/yecchen/MIRAI
Dataset: https://drive.google.com/file/d/1xmSEHZ_wqtBu1AwLpJ8wCDYmT-jRpfrN/view?usp=sharing
Interactive Demo Notebook: https://colab.research.google.com/drive/1QyqT35n6NbtPaNtqQ6A7ILG_GMeRgdnO?usp=sharing