Walmart-the-bag commited on
Commit
b792701
1 Parent(s): 899e653

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -1
README.md CHANGED
@@ -8,4 +8,41 @@ datasets:
8
  - unalignment/toxic-dpo-v0.1
9
  base_model: stabilityai/stablelm-zephyr-3b
10
  inference: false
11
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  - unalignment/toxic-dpo-v0.1
9
  base_model: stabilityai/stablelm-zephyr-3b
10
  inference: false
11
+ ---
12
+
13
+ # Model Description
14
+ This is a finetune of [StableLM-Zephyr-3B](stabilityai/stablelm-zephyr-3b) with 2 datasets, toxic-dpo and openhermes with 10000 samples.
15
+
16
+ # Training Parameters
17
+ - 1xA6000-48GB
18
+ - batch_size: 6
19
+ - learning_rate: 5e-5
20
+ - float16
21
+
22
+ # Datasets:
23
+ - unalignment/toxic-dpo-v0.1
24
+ - teknium/openhermes
25
+ -
26
+ # License
27
+ ### MIT
28
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6530994e70a88b63f007324d/Zf3wrU5zn2uVyoYAZ47rQ.png)
29
+
30
+ ```
31
+ Permission is hereby granted, free of charge, to any person obtaining a copy
32
+ of this software and associated documentation files (the "Software"), to deal
33
+ in the Software without restriction, including without limitation the rights
34
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
35
+ copies of the Software, and to permit persons to whom the Software is
36
+ furnished to do so, subject to the following conditions:
37
+
38
+ The above copyright notice and this permission notice shall be included in all
39
+ copies or substantial portions of the Software.
40
+
41
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
42
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
43
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
44
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
45
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
46
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
47
+ SOFTWARE.
48
+ ```