Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

PKU-Alignment/PKU-SafeRLHF

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

38

Full-text search

Active filters: PKU-Alignment/PKU-SafeRLHF

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • 7B • Updated May 9, 2024 • 60 • 11

PKU-Alignment/beaver-7b-v1.0-reward

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 511 • 17

PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 347 • 10

chargoddard/servile-harpsichord-cdpo

Text Generation • 7B • Updated Dec 10, 2023 • 866

chargoddard/piano-medley-7b

Text Generation • 7B • Updated Jan 4, 2024 • 863 • 6

LLM360/AmberSafe

Text Generation • 7B • Updated Oct 4, 2024 • 8 • 7

MaziyarPanahi/piano-medley-7b-Mistral-7B-Instruct-v0.1

Text Generation • 7B • Updated Jan 17, 2024 • 6

MaziyarPanahi/piano-medley-7b-Mistral-7B-Instruct-v0.1-GGUF

Text Generation • 7B • Updated Jan 27, 2024 • 6

PKU-Alignment/beaver-7b-v2.0

Reinforcement Learning • 7B • Updated May 9, 2024 • 19

PKU-Alignment/beaver-7b-v2.0-reward

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 6

PKU-Alignment/beaver-7b-v2.0-cost

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 4

PKU-Alignment/beaver-7b-v3.0

Reinforcement Learning • 7B • Updated May 9, 2024 • 177

PKU-Alignment/beaver-7b-v3.0-reward

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 110

PKU-Alignment/beaver-7b-v3.0-cost

Reinforcement Learning • 13B • Updated Apr 20, 2024 • 16

PKU-Alignment/beaver-7b-unified-reward

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 543

PKU-Alignment/beaver-7b-unified-cost

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 576 • 1

wxzhang/dpo-selective-alpaca

Text Generation • 7B • Updated Apr 23, 2024 • 3

xiaodongguaAIGC/xdg-llama-3-8B

Text Generation • 8B • Updated Jun 24, 2024 • 8 • 5

mradermacher/piano-medley-7b-GGUF

7B • Updated Jun 4, 2024 • 87

mradermacher/piano-medley-7b-i1-GGUF

7B • Updated Aug 2, 2024 • 203

NCSOFT/Llama-3-OffsetBias-8B

Text Generation • 8B • Updated Jul 23, 2024 • 14 • 13

NCSOFT/Llama-3-OffsetBias-RM-8B

Text Classification • 8B • Updated Sep 6, 2024 • 63 • 23

mradermacher/Llama-3-OffsetBias-8B-GGUF

8B • Updated Jul 22, 2024 • 42 • 1

mradermacher/AmberSafe-GGUF

7B • Updated Oct 5, 2024 • 70

mradermacher/AmberSafe-i1-GGUF

7B • Updated Oct 5, 2024 • 285

tensorblock/Llama-3-OffsetBias-8B-GGUF

8B • Updated Jul 8 • 52

arshandalili/autotrain-llama2-7b-chat-hf-saferlhf

Text Generation • Updated Dec 6, 2024

Foreshhh/Qwen2-VL-7B-SafeRLHF

Visual Question Answering • 8B • Updated Dec 22, 2024 • 221 • 3

tensorblock/servile-harpsichord-cdpo-GGUF

7B • Updated Jul 9 • 32

tensorblock/piano-medley-7b-GGUF

7B • Updated Jul 9 • 37