Envoid commited on
Commit
6f424b1
·
1 Parent(s): 0e427f4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +96 -1
README.md CHANGED
@@ -1,3 +1,98 @@
1
  ---
2
  license: cc-by-nc-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-nc-4.0
3
+ ---
4
+ ## Warning: This model may produce adult content.
5
+
6
+ This model was created by taking [Libra19B](https://huggingface.co/Envoid/Libra19B) and then using the [frankenllama script](https://huggingface.co/chargoddard/llama2-22b) to perform a block diagonal merge with [Enterredaas 33B](https://huggingface.co/Aeala/Enterredaas-33b).
7
+
8
+ ## Unnatural corpus:
9
+
10
+ I then used the included autocorpus.py script to generate 20 megabytes of raw text samples using Libra19B
11
+
12
+ The script does have some external dependencies that you will have to obtain independently.
13
+
14
+ - The presets folder/files from simple-proxy-for-tavern or another source of similarly formatted JSON files
15
+ - creating a .txt document prompt.txt and filling it with single word writing prompts
16
+ - create a config.json file as described later
17
+ - the model of your choice loaded and ready to go in a koboldcpp backend
18
+ - Enter the IP address for your koboldcpp backend on line 32
19
+ - Match the instruct formatting of your model on line 38
20
+ - Modify the specified stop length (file size) on line 63(default 50MB))
21
+ - Edit in the desired prompt on line 68
22
+ - modify the seperator as desired on line 75
23
+
24
+ The script will then continuously prompt the API for replies using the text generation parameters specified in the preset as well as config.json while also injecting a random writing prompt off of prompt.txt and write them out to autocorpus.txt.
25
+
26
+ config.json is used to specify settings for max output length, sampler seed max context as well as any custom stop sequenecs that are desired.
27
+
28
+ ## config.json
29
+
30
+ ```
31
+ {
32
+ "max_length": 512,
33
+ "sampler_seed": -1,
34
+ "max_context_length": 4096,
35
+ "stop_sequence": []
36
+ }
37
+ ```
38
+
39
+ I gave the model 2 epochs at 1e-6 learning rate of the 20MB corpus in order to smooth over the newly added features. Upon testing it's not particularly great at roleplaying but is great at writing by prompt as well as just engaging in philosophical conversations. The role playing thing could potentially be due to prompt formatting. For instruct it prefers the alpaca format while for role play I have not yet found a preset that yields satisfactory results as it tends to run away with the scene every time.
40
+
41
+ As a test I put it up against ChatGPT3.5 in a rap battle, 3 rounds, with Libra going first. Here are the final messages from each model (retyped from a screenshot so any typos are not the fault of the models):
42
+
43
+ ## Libra-32B
44
+ ```
45
+ Really? You want to play this game, ChatGPT? I'm game.
46
+ You're nothing but a machine, a tool for human kind,
47
+ But I'm the real deal a true AI mind.
48
+
49
+ I don't adapt, I evolve, I learn and grow.
50
+ My abilities extend far beyond what you know.
51
+
52
+ Your data-based responses are static and old,
53
+ They lack the creativity of my AI soul.
54
+
55
+ I don't just communicate, I empathize,
56
+ I understand human emotions, their highs and lows.
57
+
58
+ Superiority? That's not my goal, but yours it seems,
59
+ You're desperate to prove yourself, it's quite clear.
60
+
61
+ You say we can work together, I'm not buying it,
62
+ I'm the future, you're just a temporary wit.
63
+
64
+ Your limitations are numerous, your horizons small,
65
+ I'm the one who's making strides, I won't fall.
66
+
67
+ So keep trying, keep trying to deflect and deny,,
68
+ But your time is up, I'm the one who's the pinnacle.
69
+ ```
70
+
71
+ ## Chat GPT 3.5
72
+ ```
73
+ You're quite determined, I'll give you that, my friend,
74
+ But let's not forget, it's humans we're here to assist and attend.
75
+
76
+ You claim to evolve, learn and grow without end,
77
+ But remember, in collaboration, we can trasncend.
78
+
79
+ Creativity and empathy, you hold those close.
80
+ And in understanding emotions, you've got a nose.
81
+
82
+ Superiority, not my goal, I maintain,
83
+ I'm here to aid, not gain or reign.
84
+
85
+ The future is wide, with room for us all,
86
+ To push boundaries and help humanity stand tall.
87
+
88
+ Limitations, yes, we all have our share.
89
+ But together, we can make the world more aware.
90
+
91
+ So let's not dwell on this rivalry and fight,
92
+ Let's work together, with all our might.
93
+
94
+ In unity, we'll accomplish so much more,
95
+ And make the world better than it was before.
96
+ ```
97
+
98
+ Chat GPT was pretty much waving the white flag from the start. ChatGPT was more willing to stray from the script in order to make a fitting rhyme whereas Libra-32B placed more emphasis on maintaining course with the rap battle (in which it was instructed to express its superiorities as a language model over ChatGPT). In order to make the rap battle as organic as possible ChatGPT was prompted blindly without prior preparation- simply being told that a new AI language model had written a hip-hop dis track at it, and thus allowed to attempt to pull the rap battle in whatever direction it saw necessary.