Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ContextualAI
/
Contextual_KTO_Mistral_PairRM
like
31
Follow
ContextualAI
59
Text Generation
Transformers
Safetensors
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
English
mistral
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.01306
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
Contextual_KTO_Mistral_PairRM
Commit History
Update README.md
98bee13
verified
xwinxu
commited on
Apr 26
Update README.md
bdf7fe0
verified
xwinxu
commited on
Mar 7
Update README.md
31efc9a
verified
xwinxu
commited on
Mar 7
Update README.md
8d0fec9
verified
xwinxu
commited on
Mar 7
Update README.md
8b7e5cc
verified
xwinxu
commited on
Mar 7
Fix tokenizer chat template
8cffcfe
verified
shikib
commited on
Mar 6
Update README.md
06fc6e3
verified
xwinxu
commited on
Mar 5
Upload MistralForCausalLM
d8380f4
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
eb151d5
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
66b1fa9
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
e531f7b
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
0d81ff6
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
231bafb
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
dba0d32
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
257fdd0
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
c96f499
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
c47c194
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
0b9cba1
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
f652cba
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
cbf882a
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
45df619
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
2c7e3b1
verified
Muennighoff
commited on
Mar 5
Upload tokenizer
8922fc2
verified
Muennighoff
commited on
Mar 5
Upload README.md with huggingface_hub
927e33a
verified
Muennighoff
commited on
Mar 5
initial commit
4564abd
verified
xwinxu
commited on
Mar 5