booydar/RMT-Llama-3.2-1B-Instruct-2x1024-mem16-lora-smol_qa1-5-1_9-qa1-5-distill_1.0-1000_steps Updated about 7 hours ago
booydar/RMT-Llama-3.2-1B-Instruct-2x1024-mem16-lora-smol_qa1-5-1_9-qa1-5-distill_1.0-1000_steps Updated about 7 hours ago
booydar/RMT-Llama-3.2-1B-Instruct-2x1024-mem16-lora-dolly_qa1-5-1_9-qa1-5-distill_1.0-1000_steps Updated about 7 hours ago
booydar/RMT-Llama-3.2-1B-Instruct-2x1024-mem16-lora-dolly_qa1-5-1_9-qa1-5-distill_1.0-1000_steps Updated about 7 hours ago
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 6 days ago • 139
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 8 days ago • 62
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 8 days ago • 62
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding Paper • 2501.13200 • Published Jan 22 • 64
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71