Update README.md
README.md CHANGED
````diff
@@ -15,12 +15,19 @@ tags:
 - NLP
 - DPO
 ---
-# Model
-This model is a result of merging two models A and B.
-The method used for merging is "slerp" with [mergekit](https://github.com/cg123/mergekit).
-
-A
-
+# Model Overview
+
+This model is the result of merging two models, Model A and Model B. The merge uses the "slerp" method, performed with [mergekit](https://github.com/cg123/mergekit).
+
+## Component Models
+
+### Model A
+- **Source**: [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)
+- **Description**: An instruction-tuned model designed for clear and precise response generation.
+
+### Model B
+- **Source**: Based on [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)
+- **Enhancements**: Fine-tuned with DPO (Direct Preference Optimization) training for more adaptive and context-aware responses.
 
 ### Jinja Prompt Template
 ```
````
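For context on the merge method: "slerp" is spherical linear interpolation applied parameter-by-parameter to the two models' weights. The sketch below illustrates the underlying math only; it is not mergekit's implementation, and the interpolation factor `t = 0.5` and the state-dict usage shown in the comments are illustrative assumptions.

```python
import torch

def slerp(t: float, v0: torch.Tensor, v1: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Spherical linear interpolation between two same-shaped weight tensors."""
    a, b = v0.flatten().float(), v1.flatten().float()
    # Angle between the two weight vectors, computed on unit-normalized copies
    cos_omega = torch.dot(a / (a.norm() + eps), b / (b.norm() + eps)).clamp(-1.0, 1.0)
    omega = torch.acos(cos_omega)
    sin_omega = torch.sin(omega)
    if sin_omega.abs() < eps:
        # Nearly (anti-)parallel vectors: fall back to plain linear interpolation
        out = (1.0 - t) * a + t * b
    else:
        out = (torch.sin((1.0 - t) * omega) / sin_omega) * a \
            + (torch.sin(t * omega) / sin_omega) * b
    return out.reshape(v0.shape).to(v0.dtype)

# Hypothetical usage: interpolate matching parameters from the two checkpoints.
# merged_state[name] = slerp(0.5, state_dict_a[name], state_dict_b[name])
```

In practice the merge is driven by a mergekit configuration file rather than hand-written code like this.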
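Similarly, Model B's "DPO" fine-tuning refers to Direct Preference Optimization, which trains the policy to prefer chosen over rejected responses relative to a frozen reference model. The function below is a generic sketch of that objective, not the actual training code used for Model B.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """DPO loss over per-example summed log-probs of chosen/rejected responses."""
    # How much the policy prefers each response relative to the reference model
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    # Reward margin between preferred and dispreferred responses, scaled by beta
    logits = beta * (chosen_logratios - rejected_logratios)
    # Minimize the negative log-sigmoid of the margin
    return -F.logsigmoid(logits).mean()
```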