OnePiece123
's Collections
Unlocking Continual Learning Abilities in Language Models
Paper
•
2406.17245
•
Published
•
29
A Closer Look into Mixture-of-Experts in Large Language Models
Paper
•
2406.18219
•
Published
•
16
Symbolic Learning Enables Self-Evolving Agents
Paper
•
2406.18532
•
Published
•
12
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of
LLMs
Paper
•
2406.18629
•
Published
•
42
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for
Retrieval-Augmented Generation
Paper
•
2406.19251
•
Published
•
9
LiteSearch: Efficacious Tree Search for LLM
Paper
•
2407.00320
•
Published
•
38
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
•
2407.00653
•
Published
•
11
We-Math: Does Your Large Multimodal Model Achieve Human-like
Mathematical Reasoning?
Paper
•
2407.01284
•
Published
•
76
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
•
2407.01489
•
Published
•
59
Planetarium: A Rigorous Benchmark for Translating Text to Structured
Planning Languages
Paper
•
2407.03321
•
Published
•
16
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for
LLM Agents
Paper
•
2407.04363
•
Published
•
28
DotaMath: Decomposition of Thought with Code Assistance and
Self-correction for Mathematical Reasoning
Paper
•
2407.04078
•
Published
•
18
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
•
2407.03502
•
Published
•
51
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large
Language Models -- The Story Goes On
Paper
•
2407.08348
•
Published
•
51
Towards Building Specialized Generalist AI with System 1 and System 2
Fusion
Paper
•
2407.08642
•
Published
•
9
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Paper
•
2407.09435
•
Published
•
22
Paper
•
2407.10671
•
Published
•
161
Sibyl: Simple yet Effective Agent Framework for Complex Real-world
Reasoning
Paper
•
2407.10718
•
Published
•
18