solidrust
/

dolphin-2.9-llama3-8b-1m-AWQ

Text Generation

4-bit precision

Inference Endpoints

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

Suparious commited on May 4, 2024

Commit

fc452af

·

verified ·

1 Parent(s): 79383df

Update README.md

Files changed (1) hide show

README.md +37 -0

README.md CHANGED Viewed

@@ -1,4 +1,6 @@
 ---
 library_name: transformers
 tags:
 - 4-bit
@@ -6,6 +8,22 @@ tags:
 - text-generation
 - autotrain_compatible
 - endpoints_compatible
 pipeline_tag: text-generation
 inference: false
 quantized_by: Suparious
@@ -15,7 +33,26 @@ quantized_by: Suparious
 - Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
 - Original model: [dolphin-2.9-llama3-8b-1m](https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b-1m)
 ## How to use

 ---
+license: other
+base_model: meta-llama/Meta-Llama-3-8B
 library_name: transformers
 tags:
 - 4-bit
 - text-generation
 - autotrain_compatible
 - endpoints_compatible
+- generated_from_trainer
+- axolotl
+model-index:
+- name: out
+  results: []
+datasets:
+- cognitivecomputations/Dolphin-2.9
+- teknium/OpenHermes-2.5
+- m-a-p/CodeFeedback-Filtered-Instruction
+- cognitivecomputations/dolphin-coder
+- cognitivecomputations/samantha-data
+- HuggingFaceH4/ultrachat_200k
+- microsoft/orca-math-word-problems-200k
+- abacusai/SystemChat-1.1
+- Locutusque/function-calling-chatml
+- internlm/Agent-FLAN
 pipeline_tag: text-generation
 inference: false
 quantized_by: Suparious
 - Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
 - Original model: [dolphin-2.9-llama3-8b-1m](https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b-1m)
+<img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png" width="600" />
+## Model Summary
+Curated and trained by Eric Hartford, Lucas Atkins, and Fernando Fernandes, and Cognitive Computations
+This version of Dolphin has a 1 million token context.  I have applied `winglian/llama-3-1m-context-gradient-lora` - created by @gradientai and @winglian and sponsored by @CrusoeCloud
+A bug has been found in the Dolphin 2.9 dataset in SystemConversations that causes the model to overly talk about the "SYSTEM MESSAGE".  To counter this, we recommend you add a statement in the system message directing the model not to mention the system message. An example system message is "The assistant is named Dolphin.  A helpful and friendly AI assistant, Dolphin avoids discussing the system message unless directly asked about it."
+My appreciation for the sponsors of Dolphin 2.9:
+- [Crusoe Cloud](https://crusoe.ai/) - provided excellent on-demand 10xL40S node
+This model is based on Llama-3-8b, and is governed by [META LLAMA 3 COMMUNITY LICENSE AGREEMENT](LICENSE)
+The base model has 8k context, and the full-weight fine-tuning was with 4k sequence length.
+It took 2.5 days on 8x L40S provided by Crusoe Cloud
+This model was trained FFT on all parameters, using ChatML prompt template format.
 ## How to use