metadata
license: apache-2.0
datasets:
- HuggingFaceH4/ultrafeedback_binarized
language:
- en
- fr
base_model:
- NousResearch/Nous-Hermes-llama-2-7b
- meta-llama/Llama-2-7b
pipeline_tag: text-generation
metrics:
- accuracy
- bertscore
- bleurt
- brier_score
tags:
- biology
- chemistry
This model is NousResearch/Nous-Hermes-llama-2-7b fine-tuned with Direct Preference Optimization (DPO) on preference data built from UltraFeedback. Only preference pairs in which the chosen response's score exceeds the rejected response's score by at least 5 were used for training.
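
The score-gap filter described above can be sketched as follows. This is a minimal illustration, not the script used to train this model: the field names `score_chosen` and `score_rejected`, the helper names, and the toy records are all assumptions made for the example.

```python
from typing import Iterable

# Threshold taken from the description: chosen score - rejected score >= 5.
MIN_SCORE_GAP = 5


def has_large_gap(example: dict, min_gap: float = MIN_SCORE_GAP) -> bool:
    """Return True when the chosen score beats the rejected score by at least min_gap.

    Assumes each record carries numeric `score_chosen` and `score_rejected`
    fields (hypothetical names for this sketch).
    """
    return example["score_chosen"] - example["score_rejected"] >= min_gap


def filter_pairs(examples: Iterable[dict], min_gap: float = MIN_SCORE_GAP) -> list:
    """Keep only the preference pairs whose score gap meets the threshold."""
    return [ex for ex in examples if has_large_gap(ex, min_gap)]


# Toy records, invented purely for illustration.
toy_pairs = [
    {"prompt": "a", "score_chosen": 9.0, "score_rejected": 3.5},  # gap 5.5, kept
    {"prompt": "b", "score_chosen": 8.0, "score_rejected": 3.0},  # gap 5.0, kept
    {"prompt": "c", "score_chosen": 7.0, "score_rejected": 4.0},  # gap 3.0, dropped
]

kept = filter_pairs(toy_pairs)
print([ex["prompt"] for ex in kept])
```

If the data is loaded with the `datasets` library, a predicate such as `has_large_gap` could be passed to `Dataset.filter` to get the same effect, provided the dataset actually exposes those score columns.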