kubernetes-bad commited on
Commit
c95cea4
·
1 Parent(s): 1fe3de4

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - HuggingFaceH4/no_robots
4
+ language:
5
+ - en
6
+ license: cc-by-nc-4.0
7
+ ---
8
+
9
+
10
+ The model "Good Robot" had one simple goal in mind: to be a good instruction-following model that doesn't talk like ChatGPT.
11
+
12
+ Built upon the Mistral 7b base, this model aims to provide responses that are as human-like as possible, thanks to some DPO training using the (for now, private) `minerva-ai/yes-robots-dpo` dataset.
13
+
14
+
15
+ HuggingFaceH4/no-robots was used as the base for generating a custom dataset to create DPO pairs.
16
+
17
+ It should follow instructions and be generally as smart as a typical Mistral model - just not as soulless and full of GPT slop.
18
+
19
+ ## Prompt Format:
20
+
21
+ Alpaca, my beloved ❤️
22
+ ```
23
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
24
+
25
+ ### Instruction:
26
+ {your prompt goes here}
27
+
28
+ ### Response:
29
+ ```
30
+
31
+ ## Training Data:
32
+ - [HuggingFaceH4/no_robots](https://huggingface.co/datasets/HuggingFaceH4/no_robots)
33
+ - [MinervaAI/yes-robots-dpo](https://huggingface.co/MinervaAI)
34
+ - private datasets with common GPTisms
35
+
36
+
37
+ ## Limitations:
38
+
39
+ While I did my best to minimize GPTisms, no model is perfect, and there may still be instances where the generated content has GPT's common phrases - I have a suspicion that's due to them being engrained into Mistral model itself.
40
+
41
+ ## License:
42
+ cc-by-nc-4.0