Perusha Moodley
moodlep
AI & ML interests
RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods
Recent Activity
upvoted an article
about 1 month ago
SmolLM - blazingly fast and remarkably powerful
liked a Space
about 1 month ago
nanotron/ultrascale-playbook
liked a dataset
2 months ago
Anthropic/hh-rlhf
Organizations
Collections
1
Models
9
moodlep/smollm2-17b-dpo-cai-v1
Updated
•
1
moodlep/smollm2-1.7b-instr-sft-cai-v1
Updated
moodlep/smollm2-1.7b-instr-sft-cai
Updated
•
2
moodlep/mistral-7b-sft-constitutional-ai
Updated
moodlep/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
Updated
moodlep/output
Updated
moodlep/a2c-AntBulletEnv-v0
Reinforcement Learning
Updated
•
1
moodlep/ppo-Huggy
Reinforcement Learning
Updated
•
62
moodlep/ppo-LunarLander-v2
Reinforcement Learning
Updated