Update README.md
datasets:
- Ba2han/databricks-dolly_rated
- Open-Orca/OpenOrca
---

The training dataset consists of the 2k longest examples from no_robots, reddit_instruct, dolly, and OpenOrca, plus two other personal datasets.
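
As a rough illustration of that selection, here is a minimal sketch using the `datasets` library; the dataset ID, the `messages` column, and character count as the length proxy are assumptions for illustration, not the exact preprocessing used for this model:

```python
# Hypothetical sketch: keep the 2k longest examples from one source dataset.
# "HuggingFaceH4/no_robots", the "messages" column, and character length as
# the size metric are assumptions, not this model's actual pipeline.
from datasets import load_dataset

ds = load_dataset("HuggingFaceH4/no_robots", split="train")
ds = ds.map(lambda ex: {"n_chars": len(str(ex["messages"]))})
longest_2k = ds.sort("n_chars", reverse=True).select(range(2000))
```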
Please use ChatML with the default system message, or enter your own. The model was trained with various system messages; the one in the config is the default.
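
For reference, ChatML wraps each turn in `<|im_start|>role ... <|im_end|>` markers. A minimal inference sketch, assuming the repo ships a ChatML chat template in its tokenizer config (the model ID below is a placeholder):

```python
# Minimal ChatML inference sketch; the model ID is a placeholder.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Ba2han/<this-model>"  # replace with this repo's ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

messages = [
    # Replace the system turn to supply your own system message; omitting it
    # leaves whatever default the chat template provides.
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain the LIMA hypothesis in two sentences."},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```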

The model is:

- Not great with short text, both in input and generation.

The aim is to see how well the **"Long is More for Alignment"** paper holds up. This is basically a combination of LIMA + LMA. There should be no benchmark contamination as far as I am aware. Around 70% of the data comes from the datasets mentioned above. I am happy with how it turned out.