A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models Paper • 2303.10420 • Published Mar 18, 2023 • 1
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published Jan 21 • 56
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published Jan 20 • 99
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published Jan 17 • 48
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Paper • 2501.02506 • Published Jan 5 • 11
InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction Paper • 2304.08085 • Published Apr 17, 2023
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios Paper • 2401.00741 • Published Jan 1, 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback Paper • 2401.11458 • Published Jan 21, 2024 • 2
MouSi: Poly-Visual-Expert Vision-Language Models Paper • 2401.17221 • Published Jan 30, 2024 • 9
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation Paper • 2402.11251 • Published Feb 17, 2024 • 1
LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition Paper • 2402.14568 • Published Feb 22, 2024