Walmart-the-bag
commited on
Commit
•
b792701
1
Parent(s):
899e653
Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,41 @@ datasets:
|
|
8 |
- unalignment/toxic-dpo-v0.1
|
9 |
base_model: stabilityai/stablelm-zephyr-3b
|
10 |
inference: false
|
11 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
- unalignment/toxic-dpo-v0.1
|
9 |
base_model: stabilityai/stablelm-zephyr-3b
|
10 |
inference: false
|
11 |
+
---
|
12 |
+
|
13 |
+
# Model Description
|
14 |
+
This is a finetune of [StableLM-Zephyr-3B](stabilityai/stablelm-zephyr-3b) with 2 datasets, toxic-dpo and openhermes with 10000 samples.
|
15 |
+
|
16 |
+
# Training Parameters
|
17 |
+
- 1xA6000-48GB
|
18 |
+
- batch_size: 6
|
19 |
+
- learning_rate: 5e-5
|
20 |
+
- float16
|
21 |
+
|
22 |
+
# Datasets:
|
23 |
+
- unalignment/toxic-dpo-v0.1
|
24 |
+
- teknium/openhermes
|
25 |
+
-
|
26 |
+
# License
|
27 |
+
### MIT
|
28 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6530994e70a88b63f007324d/Zf3wrU5zn2uVyoYAZ47rQ.png)
|
29 |
+
|
30 |
+
```
|
31 |
+
Permission is hereby granted, free of charge, to any person obtaining a copy
|
32 |
+
of this software and associated documentation files (the "Software"), to deal
|
33 |
+
in the Software without restriction, including without limitation the rights
|
34 |
+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
35 |
+
copies of the Software, and to permit persons to whom the Software is
|
36 |
+
furnished to do so, subject to the following conditions:
|
37 |
+
|
38 |
+
The above copyright notice and this permission notice shall be included in all
|
39 |
+
copies or substantial portions of the Software.
|
40 |
+
|
41 |
+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
42 |
+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
43 |
+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
44 |
+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
45 |
+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
46 |
+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
47 |
+
SOFTWARE.
|
48 |
+
```
|