Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

alignment-handbook

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

4,592

Full-text search

Active filters: alignment-handbook

DUAL-GPO/phi-2-gpo-test-longest-iter-random2-1

Updated Mar 27, 2024 • 2

DUAL-GPO/phi-2-gpo-test-longest-iter-random2-2

Updated Mar 27, 2024 • 5

DUAL-GPO/phi-2-gpo-test-longest-iter-random2-3

Updated Mar 27, 2024 • 4

alvarobartt/mistral-7b-orpo-alignment-handbook

Text Generation • 7B • Updated Mar 27, 2024 • 6

DUAL-GPO/phi-2-gpo-test-longest-iter-random2-4

Updated Mar 27, 2024 • 2

DUAL-GPO/phi-2-dpo-test-iter-0

Updated Mar 28, 2024 • 3

kykim0/gemma-2b-ultrachat-sft

Text Generation • 3B • Updated Mar 28, 2024 • 6 • 1

alvarobartt/mistral-7b-orpo-airoboros-pref-10k

Text Generation • 7B • Updated Mar 28, 2024 • 6

kykim0/gemma-7b-ultrachat-sft

Text Generation • 9B • Updated Mar 29, 2024 • 8

shineil/zephyr-7b-gemma-dpo

Text Generation • 9B • Updated Mar 29, 2024 • 6

mradermacher/mistral-7b-orpo-capybara-reproduction-GGUF

7B • Updated May 6, 2024 • 245

EllieS/zephyr-7b-sft-lora-timedial

Updated Mar 29, 2024 • 2

EllieS/zephyr-7b-dpo-lora-timedial

Updated Mar 29, 2024 • 2

Minbyul/selfbiorag-7b-dpo-full-wo-healthsearch_qa-ep3

Text Generation • 7B • Updated Apr 9, 2024 • 3

DUAL-GPO/zephyr-7b-gpo-iter1

Updated Mar 29, 2024 • 1

DUAL-GPO-2/phi-2-ipo-test-iter-0

Updated Mar 30, 2024 • 4

jetmoe/jetmoe-8b-sft

Text Generation • 9B • Updated Apr 15, 2024 • 8 • 6

jetmoe/jetmoe-8b-chat

Text Generation • 9B • Updated May 11, 2024 • 15 • 29

pkarypis/gpt2-sft-port

Text Generation • 0.1B • Updated Apr 25, 2024 • 3

DUAL-GPO/zephyr-7b-gpo-iter2

Updated Apr 1, 2024 • 2

nthakur/mistral-7b-instruct-v0.2-dpo-multilingual-mix-1st-apr-final

Updated Apr 2, 2024 • 3

Shamane/mistral-instruct-v2-sec-cpt-qlora

Updated Apr 2, 2024 • 3

Serega6678/My_script_50pct_LLM_pretraining

Updated Apr 4, 2024 • 3

objects76/zephyr-7b-dpo-qlora

Updated Apr 5, 2024 • 3

Serega6678/prototype_joint_trained

Updated Apr 4, 2024 • 2

DUAL-GPO/zephyr-7b-ipo-qlora-v0

Updated Apr 6, 2024 • 2

DUAL-GPO/zephyr-7b-gpo-update3-i0

Updated Apr 6, 2024 • 3

DUAL-GPO/zephyr-7b-gpo-update4-i0

Updated Apr 6, 2024 • 4

DUAL-GPO/zephyr-7b-dpo-qlora-v1

Updated Apr 6, 2024 • 10

ShenaoZ/0.0_dataup_iter_1

Text Generation • 7B • Updated Apr 5, 2024 • 5