HCZhang commited on
Commit
13cbca0
·
verified ·
1 Parent(s): 4b5fb1a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +85 -0
README.md CHANGED
@@ -10,6 +10,9 @@ language:
10
  -->
11
  <img src="https://i.imgur.com/E1vqCIw.png" alt="PicToModel" width="330"/>
12
 
 
 
 
13
 
14
  ## Model Details
15
  Jellyfish-8B is a large language model equipped with 8 billion parameters.
@@ -26,6 +29,7 @@ More details about the model can be found in the [Jellyfish paper](https://arxiv
26
  - **Language(s) (NLP):** English
27
  - **License:** Non-Commercial Creative Commons license (CC BY-NC-4.0)
28
  - **Finetuned from model:** [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
 
29
  ## Citation
30
 
31
  If you find our work useful, please give us credit by citing:
@@ -105,3 +109,84 @@ _Few-shot is disabled for Jellyfish models._
105
  <|start_header_id|>user<|end_header_id|>{prompt}<|eot_id|>
106
  <|start_header_id|>assistant<|end_header_id|>
107
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  -->
11
  <img src="https://i.imgur.com/E1vqCIw.png" alt="PicToModel" width="330"/>
12
 
13
+ Other versions of Jellyfish:
14
+ [Jellyfish-7B](https://huggingface.co/NECOUDBFM/Jellyfish-7B)
15
+ [Jellyfish-13B](https://huggingface.co/NECOUDBFM/Jellyfish-13B)
16
 
17
  ## Model Details
18
  Jellyfish-8B is a large language model equipped with 8 billion parameters.
 
29
  - **Language(s) (NLP):** English
30
  - **License:** Non-Commercial Creative Commons license (CC BY-NC-4.0)
31
  - **Finetuned from model:** [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
32
+
33
  ## Citation
34
 
35
  If you find our work useful, please give us credit by citing:
 
109
  <|start_header_id|>user<|end_header_id|>{prompt}<|eot_id|>
110
  <|start_header_id|>assistant<|end_header_id|>
111
  ```
112
+
113
+ ## Prompts
114
+
115
+ We provide the prompts used for both the model's fine-tuning and inference.
116
+ You can structure your data according to these prompts.
117
+
118
+ ### System Message
119
+ ```
120
+ You are an AI assistant that follows instruction extremely well.
121
+ User will give you a question. Your task is to answer as faithfully as you can.
122
+ ```
123
+
124
+ ### For Entity Matching
125
+ ```
126
+ You are tasked with determining whether two records listed below are the same based on the information provided.
127
+ Carefully compare the {attribute 1}, {attribute 2}... for each record before making your decision.
128
+ Note that missing values (N/A or \"nan\") should not be used as a basis for your decision.
129
+ Record A: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
130
+ Record B: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
131
+ Are record A and record B the same entity? Choose your answer from: [Yes, No].
132
+ ```
133
+
134
+ ### For Data Imputation
135
+ ```
136
+ You are presented with a {keyword} record that is missing a specific attribute: {attribute X}.
137
+ Your task is to deduce or infer the value of {attribute X} using the available information in the record.
138
+ You may be provided with fields like {attribute 1}, {attribute 2}, ... to help you in the inference.
139
+ Record: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
140
+ Based on the provided record, what would you infer is the value for the missing attribute {attribute X}?
141
+ Answer only the value of {attribute X}.
142
+ ```
143
+
144
+ ### For Data Imputation
145
+ ```
146
+ You are presented with a {keyword} record that is missing a specific attribute: {attribute X}.
147
+ Your task is to deduce or infer the value of {attribute X} using the available information in the record.
148
+ You may be provided with fields like {attribute 1}, {attribute 2}, ... to help you in the inference.
149
+ Record: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
150
+ Based on the provided record, what would you infer is the value for the missing attribute {attribute X}?
151
+ Answer only the value of {attribute X}.
152
+ ```
153
+
154
+ ### For Error Detection
155
+ _There are two forms of the error detection task.
156
+ In the first form, a complete record row is provided, and the task is to determine if a specific value is erroneous.
157
+ In the second form, only the value of a specific attribute is given, and the decision about its correctness is based solely on the attribute's name and value.
158
+ The subsequent prompt examples pertain to these two forms, respectively._
159
+ ```
160
+ Your task is to determine if there is an error in the value of a specific attribute within the whole record provided.
161
+ The attributes may include {attribute 1}, {attribute 2}, ...
162
+ Errors may include, but are not limited to, spelling errors, inconsistencies, or values that don't make sense given the context of the whole record.
163
+ Record [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
164
+ Attribute for Verification: [{attribute X}: {attribute X value}]
165
+ Question: Is there an error in the value of {attribute X}? Choose your answer from: [Yes, No].
166
+ ```
167
+ ```
168
+ Your task is to determine if there is an error in the value of a specific attribute.
169
+ The attributes may belong to a {keyword} record and could be one of the following: {attribute 1}, {attribute 2}, ...
170
+ Errors can include, but are not limited to, spelling errors, inconsistencies, or values that don't make sense for that attribute.
171
+ Note: Missing values (N/A or \"nan\") are not considered errors.
172
+ Attribute for Verification: [{attribute X}: {attribute X value}]
173
+ Question: Is there an error in the value of {attribute X}? Choose your answer from: [Yes, No].
174
+ ```
175
+
176
+ ### For Schema Matching
177
+ ```
178
+ Your task is to determine if the two attributes (columns) are semantically equivalent in the context of merging two tables.
179
+ Each attribute will be provided by its name and a brief description.
180
+ Your goal is to assess if they refer to the same information based on these names and descriptions provided.
181
+ Attribute A is [name: {value of name}, description: {value of description}].
182
+ Attribute B is [name: {value of name}, description: {value of description}].
183
+ Are Attribute A and Attribute B semantically equivalent? Choose your answer from: [Yes, No].
184
+ ```
185
+
186
+ ### For Column Type Annotation
187
+
188
+ We follow the prompt in [Column Type Annotation using ChatGPT](https://arxiv.org/abs/2306.00745) (text+inst+2-step).
189
+
190
+ ### For Attribute Value Extraction
191
+
192
+ We follow the prompt in [Product Attribute Value Extraction using Large Language Models](https://arxiv.org/abs/2310.12537) (textual, w/o examples).