SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning Paper • 2506.01713 • Published Jun 2 • 47
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published Apr 22 • 13
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 301
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper • 2501.02045 • Published Jan 3 • 21
Game-theoretic LLM: Agent Workflow for Negotiation Games Paper • 2411.05990 • Published Nov 8, 2024 • 8
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published Dec 11, 2024 • 55
D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation Paper • 2409.14365 • Published Sep 22, 2024
SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects Paper • 2312.01307 • Published Dec 3, 2023
On Retrieval Augmentation and the Limitations of Language Model Training Paper • 2311.09615 • Published Nov 16, 2023 • 1
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4, 2024 • 6
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations Paper • 2404.01266 • Published Apr 1, 2024 • 4
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 30
FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs Paper • 2402.05904 • Published Feb 8, 2024