ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
Transformers
Safetensors
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
English
mistral
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
conversational
text-generation-inference
Inference Endpoints
arxiv: 2402.01306
License: apache-2.0
Contextual_KTO_Mistral_PairRM/special_tokens_map.json
Commit History
Fix tokenizer chat template
8cffcfe (verified), shikib committed on Mar 6, 2024
Upload tokenizer
c47c194 (verified), Muennighoff committed on Mar 5, 2024
Upload tokenizer
8922fc2 (verified), Muennighoff committed on Mar 5, 2024