Tanvi03 commited on
Commit
b67b098
·
verified ·
1 Parent(s): 40bcaaf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +76 -2
README.md CHANGED
@@ -69,7 +69,8 @@ print(generated_text)
69
 
70
  <!-- -->
71
  ### Training Data
72
-
 
73
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
74
 
75
  ### Training Procedure
@@ -193,4 +194,77 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
193
 
194
  ## Model Card Contact
195
 
196
- [More Information Needed]--->
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
69
 
70
  <!-- -->
71
  ### Training Data
72
+ This link provides the Evol-Instruct question-and-answer dataset
73
+ https://raw.githubusercontent.com/M-e-e-n-a/Synthetic-Dataset-Creation/main/combined_dataset.json
74
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
75
 
76
  ### Training Procedure
 
194
 
195
  ## Model Card Contact
196
 
197
+ [More Information Needed]--->
198
+ ## Results
199
+
200
+ ## Evaluation Metrics
201
+
202
+ <table>
203
+ <thead>
204
+ <tr>
205
+ <th>Metrics</th>
206
+ <th>Llama-2-7b</th>
207
+ <th>Mistral-7b</th>
208
+ <th>Mixtral-47B</th>
209
+ <th>ReidLM</th>
210
+ </tr>
211
+ </thead>
212
+ <tbody>
213
+ <tr>
214
+ <td>ROUGE-1</td>
215
+ <td>0.3117</td>
216
+ <td>0.3188</td>
217
+ <td>0.2637</td>
218
+ <td>0.3281</td>
219
+ </tr>
220
+ <tr>
221
+ <td>ROUGE-2</td>
222
+ <td>0.1867</td>
223
+ <td>0.1176</td>
224
+ <td>0.1573</td>
225
+ <td>0.1270</td>
226
+ </tr>
227
+ <tr>
228
+ <td>ROUGE-L</td>
229
+ <td>0.1818</td>
230
+ <td>0.1449</td>
231
+ <td>0.2637</td>
232
+ <td>0.2031</td>
233
+ </tr>
234
+ <tr>
235
+ <td>ROUGE-LSUM</td>
236
+ <td>0.1818</td>
237
+ <td>0.1449</td>
238
+ <td>0.2637</td>
239
+ <td>0.2031</td>
240
+ </tr>
241
+ <tr>
242
+ <td>METEOR</td>
243
+ <td>0.0693</td>
244
+ <td>0.3088</td>
245
+ <td>0.4377</td>
246
+ <td>0.3662</td>
247
+ </tr>
248
+ <tr>
249
+ <td>BERTScore</td>
250
+ <td>0.8262</td>
251
+ <td>0.8538</td>
252
+ <td>0.9070</td>
253
+ <td>0.8782</td>
254
+ </tr>
255
+ <tr>
256
+ <td>G-Eval</td>
257
+ <td>0.35</td>
258
+ <td>0.42</td>
259
+ <td>0.78</td>
260
+ <td>0.87</td>
261
+ </tr>
262
+ <tr>
263
+ <td>QAG Score</td>
264
+ <td>0.1046</td>
265
+ <td>0.2061</td>
266
+ <td>0.3762</td>
267
+ <td>0.2609</td>
268
+ </tr>
269
+ </tbody>
270
+ </table>