FinSage: A Multi-aspect RAG System for Financial Filings Question Answering Paper • 2504.14493 • Published Apr 20
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Paper • 2506.14028 • Published Jun 16 • 91
Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows Paper • 2505.24189 • Published May 30 • 5
System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts Paper • 2505.18962 • Published May 25 • 13
FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval Paper • 2410.21012 • Published Oct 28, 2024
R$^3$Mem: Bridging Memory Retention and Retrieval via Reversible Compression Paper • 2502.15957 • Published Feb 21
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks Paper • 2504.12764 • Published Apr 17 • 41
Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation Paper • 2003.14166 • Published Mar 23, 2020
StarVector: Generating Scalable Vector Graphics Code from Images Paper • 2312.11556 • Published Dec 17, 2023 • 36
Capture the Flag: Uncovering Data Insights with Large Language Models Paper • 2312.13876 • Published Dec 21, 2023 • 1
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? Paper • 2403.07718 • Published Mar 12, 2024 • 2