Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,9 @@ language:
|
|
10 |
-->
|
11 |
<img src="https://i.imgur.com/E1vqCIw.png" alt="PicToModel" width="330"/>
|
12 |
|
|
|
|
|
|
|
13 |
|
14 |
## Model Details
|
15 |
Jellyfish-8B is a large language model equipped with 8 billion parameters.
|
@@ -26,6 +29,7 @@ More details about the model can be found in the [Jellyfish paper](https://arxiv
|
|
26 |
- **Language(s) (NLP):** English
|
27 |
- **License:** Non-Commercial Creative Commons license (CC BY-NC-4.0)
|
28 |
- **Finetuned from model:** [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
|
|
|
29 |
## Citation
|
30 |
|
31 |
If you find our work useful, please give us credit by citing:
|
@@ -105,3 +109,84 @@ _Few-shot is disabled for Jellyfish models._
|
|
105 |
<|start_header_id|>user<|end_header_id|>{prompt}<|eot_id|>
|
106 |
<|start_header_id|>assistant<|end_header_id|>
|
107 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
-->
|
11 |
<img src="https://i.imgur.com/E1vqCIw.png" alt="PicToModel" width="330"/>
|
12 |
|
13 |
+
Other versions of Jellyfish:
|
14 |
+
[Jellyfish-7B](https://huggingface.co/NECOUDBFM/Jellyfish-7B)
|
15 |
+
[Jellyfish-13B](https://huggingface.co/NECOUDBFM/Jellyfish-13B)
|
16 |
|
17 |
## Model Details
|
18 |
Jellyfish-8B is a large language model equipped with 8 billion parameters.
|
|
|
29 |
- **Language(s) (NLP):** English
|
30 |
- **License:** Non-Commercial Creative Commons license (CC BY-NC-4.0)
|
31 |
- **Finetuned from model:** [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
|
32 |
+
|
33 |
## Citation
|
34 |
|
35 |
If you find our work useful, please give us credit by citing:
|
|
|
109 |
<|start_header_id|>user<|end_header_id|>{prompt}<|eot_id|>
|
110 |
<|start_header_id|>assistant<|end_header_id|>
|
111 |
```
|
112 |
+
|
113 |
+
## Prompts
|
114 |
+
|
115 |
+
We provide the prompts used for both the model's fine-tuning and inference.
|
116 |
+
You can structure your data according to these prompts.
|
117 |
+
|
118 |
+
### System Message
|
119 |
+
```
|
120 |
+
You are an AI assistant that follows instruction extremely well.
|
121 |
+
User will give you a question. Your task is to answer as faithfully as you can.
|
122 |
+
```
|
123 |
+
|
124 |
+
### For Entity Matching
|
125 |
+
```
|
126 |
+
You are tasked with determining whether two records listed below are the same based on the information provided.
|
127 |
+
Carefully compare the {attribute 1}, {attribute 2}... for each record before making your decision.
|
128 |
+
Note that missing values (N/A or \"nan\") should not be used as a basis for your decision.
|
129 |
+
Record A: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
|
130 |
+
Record B: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
|
131 |
+
Are record A and record B the same entity? Choose your answer from: [Yes, No].
|
132 |
+
```
|
133 |
+
|
134 |
+
### For Data Imputation
|
135 |
+
```
|
136 |
+
You are presented with a {keyword} record that is missing a specific attribute: {attribute X}.
|
137 |
+
Your task is to deduce or infer the value of {attribute X} using the available information in the record.
|
138 |
+
You may be provided with fields like {attribute 1}, {attribute 2}, ... to help you in the inference.
|
139 |
+
Record: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
|
140 |
+
Based on the provided record, what would you infer is the value for the missing attribute {attribute X}?
|
141 |
+
Answer only the value of {attribute X}.
|
142 |
+
```
|
143 |
+
|
144 |
+
### For Data Imputation
|
145 |
+
```
|
146 |
+
You are presented with a {keyword} record that is missing a specific attribute: {attribute X}.
|
147 |
+
Your task is to deduce or infer the value of {attribute X} using the available information in the record.
|
148 |
+
You may be provided with fields like {attribute 1}, {attribute 2}, ... to help you in the inference.
|
149 |
+
Record: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
|
150 |
+
Based on the provided record, what would you infer is the value for the missing attribute {attribute X}?
|
151 |
+
Answer only the value of {attribute X}.
|
152 |
+
```
|
153 |
+
|
154 |
+
### For Error Detection
|
155 |
+
_There are two forms of the error detection task.
|
156 |
+
In the first form, a complete record row is provided, and the task is to determine if a specific value is erroneous.
|
157 |
+
In the second form, only the value of a specific attribute is given, and the decision about its correctness is based solely on the attribute's name and value.
|
158 |
+
The subsequent prompt examples pertain to these two forms, respectively._
|
159 |
+
```
|
160 |
+
Your task is to determine if there is an error in the value of a specific attribute within the whole record provided.
|
161 |
+
The attributes may include {attribute 1}, {attribute 2}, ...
|
162 |
+
Errors may include, but are not limited to, spelling errors, inconsistencies, or values that don't make sense given the context of the whole record.
|
163 |
+
Record [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
|
164 |
+
Attribute for Verification: [{attribute X}: {attribute X value}]
|
165 |
+
Question: Is there an error in the value of {attribute X}? Choose your answer from: [Yes, No].
|
166 |
+
```
|
167 |
+
```
|
168 |
+
Your task is to determine if there is an error in the value of a specific attribute.
|
169 |
+
The attributes may belong to a {keyword} record and could be one of the following: {attribute 1}, {attribute 2}, ...
|
170 |
+
Errors can include, but are not limited to, spelling errors, inconsistencies, or values that don't make sense for that attribute.
|
171 |
+
Note: Missing values (N/A or \"nan\") are not considered errors.
|
172 |
+
Attribute for Verification: [{attribute X}: {attribute X value}]
|
173 |
+
Question: Is there an error in the value of {attribute X}? Choose your answer from: [Yes, No].
|
174 |
+
```
|
175 |
+
|
176 |
+
### For Schema Matching
|
177 |
+
```
|
178 |
+
Your task is to determine if the two attributes (columns) are semantically equivalent in the context of merging two tables.
|
179 |
+
Each attribute will be provided by its name and a brief description.
|
180 |
+
Your goal is to assess if they refer to the same information based on these names and descriptions provided.
|
181 |
+
Attribute A is [name: {value of name}, description: {value of description}].
|
182 |
+
Attribute B is [name: {value of name}, description: {value of description}].
|
183 |
+
Are Attribute A and Attribute B semantically equivalent? Choose your answer from: [Yes, No].
|
184 |
+
```
|
185 |
+
|
186 |
+
### For Column Type Annotation
|
187 |
+
|
188 |
+
We follow the prompt in [Column Type Annotation using ChatGPT](https://arxiv.org/abs/2306.00745) (text+inst+2-step).
|
189 |
+
|
190 |
+
### For Attribute Value Extraction
|
191 |
+
|
192 |
+
We follow the prompt in [Product Attribute Value Extraction using Large Language Models](https://arxiv.org/abs/2310.12537) (textual, w/o examples).
|