Kevin King PRO

NeoCodes-dev

king112

AI & ML interests

Deep RL, RL for LLMs

Recent Activity

updated a collection about 15 hours ago

VLMs - Robotics

updated a collection about 15 hours ago

VLMs

updated a collection about 15 hours ago

Research Papers

Organizations

Find the current time in any timezone

models 20

NeoCodes-dev/Qwen2-0.5B-GRPO-test

Updated Apr 21

NeoCodes-dev/SmolLM_135M_GRPO

Text Generation • 0.1B • Updated Mar 14 • 2

NeoCodes-dev/Qwen2_7B-GRPO-test

Updated Mar 5

NeoCodes-dev/Qwen2.5_3B-GRPO-test

Updated Mar 5

NeoCodes-dev/codeparrot-ds

0.1B • Updated Mar 3 • 7

NeoCodes-dev/gemma-2-2B-it-thinking-function_calling-V0

Updated Feb 25

NeoCodes-dev/Unit8_part1_V1

Reinforcement Learning • Updated Jul 28, 2024

NeoCodes-dev/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Jul 24, 2024

NeoCodes-dev/poca-SoccerTwos

Reinforcement Learning • Updated Nov 9, 2023 • 7

NeoCodes-dev/a2c-PandaReachDense-v2

Reinforcement Learning • Updated Jul 28, 2023 • 2

datasets 0

None public yet

Collections 21

xlangai/AgentNet

Salesforce/Llama-xLAM-2-70b-fc-r

xlangai/OpenCUA-32B

spaces 1

First Agent Template
