Update README.md
README.md
CHANGED
@@ -4,4 +4,72 @@ language:
 - ko
 - en
 library_name: transformers
----
+---
+
+---
+license: llama2
+language:
+- ko
+- en
+library_name: transformers
+base_model: mncai/llama2-13b-dpo-v7
+pipeline_tag: text-generation
+---
+
+# **mnsim-dpo-peftmerged-2-eos**
+
+
+## Our Team
+
+| Research & Engineering | Product Management |
+| :--------------------: | :----------------: |
+| David Sohn | David Sohn |
+
+
+## **Model Details**
+
+### **Base Model**
+
+[mncai/llama2-13b-dpo-v7](https://huggingface.co/mncai/llama2-13b-dpo-v7)
+
+### **Trained On**
+
+- **OS**: Ubuntu 22.04
+- **GPU**: 1x A100 40GB
+- **transformers**: v4.35.2
+
+### **Instruction format**
+
+The model follows a **custom** instruction format.
+
+For example:
+
+```python
+text = """\
+<s>
+<|user|>
+What is a good way to build healthy eating habits?
+<|assistant|>
+To build healthy eating habits, it helps to try preparing the foods or meals you want yourself.
+</s>
+"""
+```
+
+
+## **Implementation Code**
+
+This model includes a `chat_template` that implements the instruction format.
+You can load the model with the code below.
+
+```python
+# Option 1: use a pipeline as a high-level helper
+from transformers import pipeline
+
+pipe = pipeline("text-generation", model="msy127/mnsim-dpo-peftmerged-2-eos")
+
+# Option 2: load the tokenizer and model directly
+from transformers import AutoTokenizer, AutoModelForCausalLM
+
+tokenizer = AutoTokenizer.from_pretrained("msy127/mnsim-dpo-peftmerged-2-eos")
+model = AutoModelForCausalLM.from_pretrained("msy127/mnsim-dpo-peftmerged-2-eos")
+```
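
The instruction format added in this change can be reproduced with a small string helper. The sketch below is not part of the commit: `build_prompt` and its constants are hypothetical names, and it assumes a single user turn with the same newline layout as the README example. The README does not say whether the tokenizer adds `<s>` on its own; if it does, tokenizing this string with `add_special_tokens=False` would avoid a duplicated start token.

```python
from typing import Optional

# Special markers copied from the README example.
BOS = "<s>"
EOS = "</s>"
USER_TAG = "<|user|>"
ASSISTANT_TAG = "<|assistant|>"


def build_prompt(user_message: str, assistant_reply: Optional[str] = None) -> str:
    """Format one turn in the custom layout from the README example.

    Hypothetical helper: with assistant_reply=None the string stops right
    after the assistant tag, which is an assumed layout for generation.
    """
    lines = [BOS, USER_TAG, user_message.strip(), ASSISTANT_TAG]
    if assistant_reply is not None:
        # A complete training-style example ends with the reply and EOS.
        lines += [assistant_reply.strip(), EOS]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_prompt("What is a good way to build healthy eating habits?"))
```

Passing both arguments reproduces the full example string from the README; passing only the user message leaves the reply open for the model to complete.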