Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

text-generation-inference

AutoTrain Compatible

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

12

Full-text search

Active filters: reward

TIGER-Lab/AceCodeRM-7B

Updated Feb 8 • 244 • 3

TIGER-Lab/AceCodeRM-32B

Updated Feb 8 • 172 • 4

eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel

Text Classification • Updated 6 days ago • 66 • 2

li-jay-cs/test2-rlhf-rm-checkpoint

Updated Dec 21, 2023 • 6

li-jay-cs/gpt2-medium-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 9

li-jay-cs/test3-rlhf-rm-checkpoint

Updated Dec 24, 2023 • 35

li-jay-cs/gpt2-rlhf-rm-checkpoint

Updated Dec 24, 2023 • 42

li-jay-cs/gpt2-training-full-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 13

li-jay-cs/gpt2-last_token_reward_and_full_training-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 6

li-jay-cs/1gpu-gpt2-myepoch1-gcp-reward-model

Updated Jan 12, 2024 • 12

thobauma/opt-350m

Text Classification • Updated Apr 25, 2024 • 20

ZhangNy/2024-11-18_10-58-28

Updated Nov 18, 2024 • 8