Yury Kuratov

yurakuratov

yurakuratov's activity

upvoted a paper 15 days ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 16 days ago • 71

upvoted a collection 27 days ago

Qwen2.5

Collection

Qwen2.5 language models, pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 27 days ago • 444

New activity in RMT-team/babilong about 1 month ago

Why does the GitHub link to https://github.com/booydar/recurrent-memory-transformer/?

#2 opened 3 months ago by zhiminy

updated a Space about 1 month ago

🏆🤖 BABILong Leaderboard

Space • Running • LLM extra long context benchmark

liked a dataset about 1 month ago

princeton-nlp/prolong-data-64K

Updated Oct 5 • 9.68k • 10

liked a Space 2 months ago

🤯 Whisper Turbo

Space • Running on Zero • 716

upvoted 2 papers 4 months ago

MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale

Paper • 2409.00134 • Published Aug 29 • 2

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2 • 94

New activity in microsoft/Phi-3.5-mini-instruct 4 months ago

Evaluation of Phi-3.5 on long-context BABILong bench

#12 opened 4 months ago by yurakuratov

upvoted a collection 4 months ago

DNA language models

Collection

9 items • Updated Apr 17 • 5

New activity in google/recurrentgemma-9b-it 4 months ago

RecurrentGemmaForCausalLM.forward() got an unexpected keyword argument 'position_ids'

#14 opened 4 months ago by yurakuratov

upvoted a paper 5 months ago

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation

Paper • 2407.14931 • Published Jul 20 • 20

upvoted 3 papers 6 months ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5 • 27

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5 • 31

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5 • 27

authored a paper 6 months ago

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5 • 31

upvoted a paper 6 months ago

Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task

Paper • 2406.14213 • Published Jun 20 • 20

authored a paper 6 months ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14 • 48

upvoted a paper 6 months ago

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13 • 86

updated a dataset 6 months ago

RMT-team/babilong-1k-samples

Viewer • Updated Jun 17 • 59.8k • 1.81k • 3