David Khosid PRO

davidkh

AI & ML interests

Intelligent robotics, reinforcement learning, physical simulations, generative art

Recent Activity

updated a dataset 13 days ago

davidkh/so101_orange_ron_2

published a dataset 13 days ago

davidkh/so101_orange_ron_2

reacted to IliaLarchenko's post with 🔥 3 months ago

I am presenting Decoder-Only Transformer (DOT) Policy, a simple Behavioral Control policy that outperforms SOTA models on two simple benchmark tasks:

✅ PushT (pushing an object to a goal): 84% success on keypoints, 74% on images (previous best: 75% / 69%)
✅ ALOHA Insert (precise bimanual insertion): 30% success (previous best: ~21%)

The best part? DOT is much smaller than previous SOTA models (sometimes with 100 times fewer parameters), trains faster, and avoids complexity:

🚫 No generative models (Diffusion, VAE, GANs)
🚫 No discretization/tokenization of actions
🚫 No reinforcement learning or multi-stage training
✅ Just learns from human demos, plain and simple

This is still early; more complex real-life tasks need testing, and there are no guarantees it will actually work well there, but I think it's interesting to share. Sometimes, simpler approaches can be just as effective as complex ones, or even better.

🔗 Open-source code and detailed description: https://github.com/IliaLarchenko/dot_policy

Trained models on Hugging Face:
https://huggingface.co/IliaLarchenko/dot_pusht_keypoints
https://huggingface.co/IliaLarchenko/dot_pusht_images
https://huggingface.co/IliaLarchenko/dot_bimanual_insert

Organizations

Collections 1

spaces 1

Runtime error

First Agent Template

⚡

Fetch current time in any timezone

models 16

datasets 1

davidkh/so101_orange_ron_2

Viewer • Updated 13 days ago • 1.8k • 148

Collections 1

Medical SAM 2: Segment medical images as video via Segment Anything Model 2

models 16

davidkh/rl_course_vizdoom_health_gathering_supreme

davidkh/ppo-CartPole-v1

davidkh/poca-SoccerTwos

davidkh/ppo-unity_pyraminds

davidkh/ppo-SnowballTarget

davidkh/Reinforce-Pixelcopter-PLE-v0

davidkh/Reinforce-CartPole-v1

davidkh/dqn-SpaceInvadersNoFrameskip-v4

davidkh/q-taxi-v3

davidkh/q-FrozenLake-v1-4x4-noSlippery
