Hugging Face

Kevin King PRO

NeoCodes-dev

AI & ML interests

Deep RL, RL for LLMs

Recent Activity

updated a collection about 23 hours ago
LLMs
updated a collection about 23 hours ago
Agent-Specific/Function-Calling Models
liked a model about 23 hours ago
mistralai/Devstral-Small-2505

Organizations

AI Starter Pack

Collections 18

Datasets - MultiModal
  • SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

    Paper • 2407.09413 • Published Jul 12, 2024 • 11
ARC-AGI2
  • state-spaces/mamba2attn-2.7b

    Updated Jun 3, 2024 • 245 • 7

Spaces 1

Sleeping

First Agent Template

⚡

Find the current time in any timezone

Feb 13

Models 20

NeoCodes-dev/Qwen2-0.5B-GRPO-test

Updated Apr 21

NeoCodes-dev/SmolLM_135M_GRPO

Text Generation • Updated Mar 14 • 7

NeoCodes-dev/Qwen2_7B-GRPO-test

Updated Mar 5

NeoCodes-dev/Qwen2.5_3B-GRPO-test

Updated Mar 5

NeoCodes-dev/codeparrot-ds

Updated Mar 3

NeoCodes-dev/gemma-2-2B-it-thinking-function_calling-V0

Updated Feb 25

NeoCodes-dev/Unit8_part1_V1

Reinforcement Learning • Updated Jul 28, 2024

NeoCodes-dev/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Jul 24, 2024

NeoCodes-dev/poca-SoccerTwos

Reinforcement Learning • Updated Nov 9, 2023 • 11

NeoCodes-dev/a2c-PandaReachDense-v2

Reinforcement Learning • Updated Jul 28, 2023 • 2

Datasets 0

None public yet