ggbetz committed on
Commit
a118b67
·
verified ·
1 Parent(s): 1eee035

Update README.md

  1. README.md +206 -6
README.md CHANGED
@@ -122,6 +122,11 @@ model-index:
122
  This model is a fine-tuned version of [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct).
123
  It has been trained using [TRL](https://github.com/huggingface/trl).
124

125
  ## Quick start
126
 
127
  ```python
@@ -133,15 +138,210 @@ output = generator([{"role": "user", "content": question}], max_new_tokens=128,
133
  print(output["generated_text"])
134
  ```
135
 
136
- ## Evals
137
 
138
- **⚠️ NOTE**: These self-reported results have been obtained with lm-eval-harness and using local-completions api; they deviate significantly from the official [Open LLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/) evals, which are also reported at the end of this readme.
139
 
140
- LM Eval Harness results (local completions/vllm): [wandb report](https://api.wandb.ai/links/ggbetz/3bwr0ou6)
141
 
142
- |Model|BBH|MATH|GPQA|MMLU Pro|
143
- |:--------|:---:|:---:|:---:|:---:|
144
- | Llama-3.1-Argunaut-1-8B-SFT | 44.6% | 9.0% | 32.1% | 34.5% |
145
 
146
 
147
  ## SFT dataset mixture
 
122
  This model is a fine-tuned version of [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct).
123
  It has been trained using [TRL](https://github.com/huggingface/trl).
124
 
125
+ 📘 [HF Blog Article](https://huggingface.co/blog/ggbetz/argunauts-phase-1)
126
+
127
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/ggbetz/Argunauts-1/runs/922q79gp/overview)
128
+
129
+
130
  ## Quick start
131
 
132
  ```python
 
138
  print(output["generated_text"])
139
  ```
140
 
141
+ ## Evaluation
142
+
143
+
144
+ ### Chat Experience
145
+
146
+ <details>
147
+
148
+ <code>
149
+
150
+ 👤 **You**
151
+ Hi, are you familiar with Argdown?
152
+
153
+ 🤖 **Assistant**
154
+ Yes, I'm familiar with Argdown. It's a markup language for reasoning that allows you to write structured text in a natural and readable way. With Argdown, you can create argument maps where sentences are connected by lines that represent different relations (e.g., support, reject, reason). This makes it easy to visualize complex argumentative structures.
155
+
156
+ Would you like to create an Argdown argument map? Feel free to provide a topic and/or some sentences, and I'll help you set up the map!
157
+
158
+
159
+ 👤 **You**
160
+ Can you please map the following text?
161
+
162
+ School uniforms deter crime and increase student safety.
163
+ In Long Beach, California, after two years of a district-wide K-8 mandatory uniform policy, reports of assault and battery in the district’s schools decreased by 34%, assault with a deadly weapon dropped by 50%, fighting incidents went down by 51%, sex offenses were cut by 74%, robbery dropped by 65%, possession of weapons (or weapon “look-alikes”) decreased by 52%, possession of drugs went down by 69%, and vandalism was lowered by 18%. [64]
164
+
165
+ One year after Sparks Middle School in Nevada instituted a uniform policy, school police data showed a 63% drop in police log reports, and decreases were also noted in gang activity, student fights, graffiti, property damage, and battery. A peer-reviewed study found that schools with uniform policies had 12% fewer firearm-related incidents and 15% fewer drug-related incidents than schools without uniforms. [25][69]
166
+
167
+ School uniforms also prevent students from concealing weapons under baggy clothing, make it easier to keep track of students on field trips, and make intruders on campus more visible. Frank Quatrone, superintendent in the Lodi school district of New Jersey, states, “When you have students dressed alike, you make them safer. If someone were to come into a building, the intruder could easily be recognized.” [6][38]
168
+
169
+ Further, school uniforms create a level playing field among students, reducing peer pressure and bullying. When all students are dressed alike, competition between students over clothing choices and the teasing of those who are dressed in less expensive or less fashionable outfits can be eliminated. Research by the Schoolwear Association found that 83% of teachers thought “a good school uniform…could prevent bullying based on appearance or economic background.” Arminta Jacobson, founder and director of the Center for Parent Education at the University of North Texas, states that uniforms put “all kids on the same playing field in terms of their appearance. I think it probably gives them a sense of belonging and a feeling of being socially accepted.” [5][91]
170
+
171
+ 🤖 **Assistant**
172
+ ```argdown
173
+ School uniforms deter crime and increase student safety.
174
+ <+ In Long Beach, California, after two years of a district-wide K-8 mandatory uniform policy, reports of assault and battery in the district’s schools decreased by 34%, assault with a deadly weapon dropped by 50%, fighting incidents went down by 51%, sex offenses were cut by 74%, robbery dropped by 65%, possession of weapons (or weapon “look-alikes”) decreased by 52%, possession of drugs went down by 69%, and vandalism was lowered by 18%. [64]
175
+ <+ One year after Sparks Middle School in Nevada instituted a uniform policy, school police data showed a 63% drop in police log reports, and decreases were also noted in gang activity, student fights, graffiti, property damage, and battery. A peer-reviewed study found that schools with uniform policies had 12% fewer firearm-related incidents and 15% fewer drug-related incidents than schools without uniforms. [25][69]
176
+ <+ School uniforms also prevent students from concealing weapons under baggy clothing, make it easier to keep track of students on field trips, and make intruders on campus more visible. Frank Quatrone, superintendent in the Lodi school district of New Jersey, states, “When you have students dressed alike, you make them safer. If someone were to come into a building, the intruder could easily be recognized.” [6][38]
177
+ <+ Further, school uniforms create a level playing field among students, reducing peer pressure and bullying. When all students are dressed alike, competition between students over clothing choices and the teasing of those who are dressed in less expensive or less fashionable outfits can be eliminated. Research by the Schoolwear Association found that 83% of teachers thought “a good school uniform…could prevent bullying based on appearance or economic background.” Arminta Jacobson, founder and director of the Center for Parent Education at the University of North Texas, states that uniforms put “all kids on the same playing field in terms of their appearance. I think it probably gives them a sense of belonging and a feeling of being socially accepted.” [5][91]
178
+ ```
179
+
180
+ 👤 **You**
181
+ Please summarize, in your own words, the main argument presented in the text. No more than 2-3 sentences, please.
182
+
183
+ 🤖 **Assistant**
184
+ School uniforms are an effective solution to crime and safety issues in schools, as evidenced by the reduction of incidents in various institutions that implemented uniform policies.
185
+
186
+
187
+ 👤 **You**
188
+ Let's focus on this argument. Just for the protocol, what's the conclusion?
189
+
190
+ 🤖 **Assistant**
191
+ The text argues that:
192
+ School uniforms deter crime and increase student safety.
193
+
194
+ By examining this argument, we can now proceed to reconstruct it. Please provide the next step in this inference guide:
195
+
196
+
197
+
198
+ 👤 **You**
199
+ Can you reconstruct this argument as premise conclusion structure, using argdown syntax?
200
+
201
+ 🤖 **Assistant**
202
+ The argument can be reconstructed as follows:
203
+
204
+ ```argdown
205
+ School uniforms deter crime and increase student safety.
206
+ <+ In Long Beach, California, after two years of a district-wide K-8 mandatory uniform policy, reports of assault and battery in the district’s schools decreased by 34%, assault with a deadly weapon dropped by 50%, fighting incidents went down by 51%, sex offenses were cut by 74%, robbery dropped by 65%, possession of weapons (or weapon “look-alikes”) decreased by 52%, possession of drugs went down by 69%, and vandalism was lowered by 18%. [64]
207
+ <+ One year after Sparks Middle School in Nevada instituted a uniform policy, school police data showed a 63% drop in police log reports, and decreases were also noted in gang activity, student fights, graffiti, property damage, and battery. A peer-reviewed study found that schools with uniform policies had 12% fewer firearm-related incidents and 15% fewer drug-related incidents than schools without uniforms. [25][69]
208
+ <+ School uniforms also prevent students from concealing weapons under baggy clothing, make it easier to keep track of students on field trips, and make intruders on campus more visible. Frank Quatrone, superintendent in the Lodi school district of New Jersey, states, “When you have students dressed alike, you make them safer. If someone were to come into a building, the intruder could easily be recognized.” [6][38]
209
+ <+ Further, school uniforms create a level playing field among students, reducing peer pressure and bullying. When all students are dressed alike, competition between students over clothing choices and the teasing of those who are dressed in less expensive or less fashionable outfits can be eliminated. Research by the Schoolwear Association found that 83% of teachers thought “a good school uniform…could prevent bullying based on appearance or economic background.” Arminta Jacobson, founder and director of the Center for Parent Education at the University of North Texas, states that uniforms put “all kids on the same playing field in terms of their appearance. I think it probably gives them a sense of belonging and a feeling of being socially accepted.” [5][91]
210
+ ```
211
+
212
+ > 🛑 Poor answer
213
+ >
214
+ > That's NOT a reconstruction in standard form, but just a repetition of the previously given map.
215
+
216
+ 👤 **You**
217
+ No, I really want you to reconstruct the argument in standard form, listing premises and conclusions.
218
+
219
+ 🤖 **Assistant**
220
+ ```argdown
221
+ School uniforms deter crime and increase student safety.
222
+ <+ <Subargument 1>
223
+ <+ In Long Beach, California, after two years of a district-wide K-8 mandatory uniform policy, reports of assault and battery in the district’s schools decreased by 34%, assault with a deadly weapon dropped by 50%, fighting incidents went down by 51%, sex offenses were cut by 74%, robbery dropped by 65%, possession of weapons (or weapon “look-alikes”) decreased by 52%, possession of drugs went down by 69%, and vandalism was lowered by 18%. [64]
224
+ <+ One year after Sparks Middle School in Nevada instituted a uniform policy, school police data showed a 63% drop in police log reports, and decreases were also noted in gang activity, student fights, graffiti, property damage, and battery. A peer-reviewed study found that schools with uniform policies had 12% fewer firearm-related incidents and 15% fewer drug-related incidents than schools without uniforms. [25][69]
225
+ <+ School uniforms also prevent students from concealing weapons under baggy clothing, make it easier to keep track of students on field trips, and make intruders on campus more visible. Frank Quatrone, superintendent in the Lodi school district of New Jersey, states, “When you have students dressed alike, you make them safer. If someone were to come into a building, the intruder could easily be recognized.” [6][38]
226
+ <+ Further, school uniforms create a level playing field among students, reducing peer pressure and bullying. When all students are dressed alike, competition between students over clothing choices and the teasing of those who are dressed in less expensive or less fashionable outfits can be eliminated. Research by the Schoolwear Association found that 83% of teachers thought “a good school uniform…could prevent bullying based on appearance or economic background.” Arminta Jacobson, founder and director of the Center for Parent Education at the University of North Texas, states that uniforms put “all kids on the same playing field in terms of their appearance. I think it probably gives them a sense of belonging and a feeling of being socially accepted.” [5][91]
227
+ ```
228
+
229
+ > 🛑 Poor answer
230
+ >
231
+ > The assistant fails to correct the answer, despite the user's directions.
232
+
233
+ </code>
234
+
235
+ Another chat:
236
+
237
+ <code>
238
+
239
+ 👤 **You**
240
+ Hi, are you familiar with Argdown and rendering arguments as premise conclusion structures?
241
+
242
+ 🤖 **Assistant**
243
+ Yes, I am familiar with Argdown and can render arguments as premise-conclusion structures. What would you like to discuss or have me demonstrate?
244
+
245
+
246
+ 👤 **You**
247
+ Can you please describe the main argument presented in the following text?
248
+
249
+ School uniforms deter crime and increase student safety.
250
+ In Long Beach, California, after two years of a district-wide K-8 mandatory uniform policy, reports of assault and battery in the district’s schools decreased by 34%, assault with a deadly weapon dropped by 50%, fighting incidents went down by 51%, sex offenses were cut by 74%, robbery dropped by 65%, possession of weapons (or weapon “look-alikes”) decreased by 52%, possession of drugs went down by 69%, and vandalism was lowered by 18%. [64]
251
+
252
+ One year after Sparks Middle School in Nevada instituted a uniform policy, school police data showed a 63% drop in police log reports, and decreases were also noted in gang activity, student fights, graffiti, property damage, and battery. A peer-reviewed study found that schools with uniform policies had 12% fewer firearm-related incidents and 15% fewer drug-related incidents than schools without uniforms. [25][69]
253
+
254
+ School uniforms also prevent students from concealing weapons under baggy clothing, make it easier to keep track of students on field trips, and make intruders on campus more visible. Frank Quatrone, superintendent in the Lodi school district of New Jersey, states, “When you have students dressed alike, you make them safer. If someone were to come into a building, the intruder could easily be recognized.” [6][38]
255
+
256
+ Further, school uniforms create a level playing field among students, reducing peer pressure and bullying. When all students are dressed alike, competition between students over clothing choices and the teasing of those who are dressed in less expensive or less fashionable outfits can be eliminated. Research by the Schoolwear Association found that 83% of teachers thought “a good school uniform…could prevent bullying based on appearance or economic background.” Arminta Jacobson, founder and director of the Center for Parent Education at the University of North Texas, states that uniforms put “all kids on the same playing field in terms of their appearance. I think it probably gives them a sense of belonging and a feeling of being socially accepted.” [5][91]
257
+
258
+ 🤖 **Assistant**
259
+ School uniforms are not only a traditional and respectful way for institutions to create an environment of unity, but they also benefit students' safety and performance. The implementation of school uniforms can lead to a significant reduction in crime rates, improved student behavior, and enhanced safety on school campuses.
260
+
261
+
262
+ 👤 **You**
263
+ Can you please reconstruct this argument in standard form (premise-conclusion structure)?
264
+
265
+ 🤖 **Assistant**
266
+
267
+ ```argdown
268
+ (1) Implementing school uniforms has led to a significant reduction in crime rates in the Long Beach, California district.
269
+ (2) The reduction in crime rates is a result of the uniform policy and not just a coincidence.
270
+ (3) The reduction in crime rates is a desirable outcome that contributes to the overall safety and well-being of students.
271
+ (4) The reduction in crime rates is a result of the uniform policy and not just a coincidence.
272
+ -----
273
+ (5) Implementing school uniforms is an effective way to deter crime and increase student safety.
274
+ ```
275
+
276
+
277
+ 👤 **You**
278
+ Can you please add intermediary conclusions and add inference rule?
279
+
280
+ 🤖 **Assistant**
281
+ Sure, here are the intermediate steps leading to the final conclusion:
282
+
283
+ ```argdown
284
+ Implementing school uniforms has led to a significant reduction in crime rates in the Long Beach, California district.
285
+ The reduction in crime rates is a result of the uniform policy and not just a coincidence.
286
+ The reduction in crime rates is a desirable outcome that contributes to the overall safety and well-being of students.
287
+ Implementing school uniforms is an effective way to deter crime and increase student safety.
288
+ ```
289
+
290
+
291
+ 👤 **You**
292
+ Please render the refined reconstruction in Argdown.
293
+
294
+ 🤖 **Assistant**
295
+ The argument can be represented as follows:
296
+
297
+ ```argdown
298
+ [Conclusion 1]: Implementing school uniforms is an effective way to deter crime and increase student safety.
299
+ <+ <Subargument 1>
300
+ <+ [Premise]: Implementing school uniforms has led to a significant reduction in crime rates in the Long Beach, California district.
301
+ <+ [Premise]: The reduction in crime rates is a result of the uniform policy and not just a coincidence.
302
+ <+ [Premise]: The reduction in crime rates is a desirable outcome that contributes to the overall safety and well-being of students.
303
+ ```
304
+
305
+ 👤 **You**
306
+ And as premise-conclusion structure, with inference info added?
307
+
308
+ 🤖 **Assistant**
309
+ ```argdown
310
+ [Conclusion 1]: Implementing school uniforms is an effective way to deter crime and increase student safety.
311
+ <+ <Subargument 1>
312
+ <+ [Premise]: Implementing school uniforms has led to a significant reduction in crime rates in the Long Beach, California district.
313
+ <+ [Premise]: The reduction in crime rates is a result of the uniform policy and not just a coincidence.
314
+ <+ [Premise]: The reduction in crime rates is a desirable outcome that contributes to the overall safety and well-being of students.
315
+ ```
316
+
317
+ </code>
318
+ </details>
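
The annotations in the chats above turn on the difference between an argument map (claims linked by `<+` relations) and a standard-form reconstruction (numbered premises, an inference line, a numbered conclusion). The following is a minimal sketch of a heuristic that tells the two apart. It is not the official Argdown parser, and the function name `classify_argdown` and the regular expressions are assumptions made for illustration only.

```python
import re

# Heuristic patterns (assumptions, not the official Argdown grammar).
PREMISE_RE = re.compile(r"^\s*\((\d+)\)\s+\S")          # e.g. "(1) Some premise."
INFERENCE_RE = re.compile(r"^\s*-{2,}\s*$")              # e.g. "-----"
MAP_RELATION_RE = re.compile(r"^\s*(<\+|<-|\+>|->)\s")   # e.g. "<+ A reason"


def classify_argdown(snippet: str) -> str:
    """Roughly classify a snippet as 'standard_form', 'map', or 'unknown'."""
    lines = [ln for ln in snippet.splitlines() if ln.strip()]
    numbered = [i for i, ln in enumerate(lines) if PREMISE_RE.match(ln)]
    inference = [i for i, ln in enumerate(lines) if INFERENCE_RE.match(ln)]

    # Standard form: a numbered statement before the first inference line
    # and a numbered statement after the last one.
    if (
        inference
        and any(i < inference[0] for i in numbered)
        and any(i > inference[-1] for i in numbered)
    ):
        return "standard_form"

    # Map: at least one line that opens with a support/attack relation.
    if any(MAP_RELATION_RE.match(ln) for ln in lines):
        return "map"

    return "unknown"
```

Applied to the transcripts, a check of this kind would label the answers marked "Poor answer" as maps, and the numbered reconstruction in the second chat as standard form.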
319
+
320
+
321
 
322
+ ### Metrics
323
 
 
324
 
325
+ | |Llama-3.1-8B-Instruct|Argunaut-1-8B-SFT|
326
+ |:--|--:|--:|
327
+ | ⬇️ **Argdown Bench** | | |
328
+ | pass@1 | 80.8 | 98.9 |
329
+ | pass@5 | 98.9 | 99.8 |
330
+ | graph sim | 8.4 | 65.5 |
331
+ | isomorphic | 1.3 | 44.0 |
332
+ | 🤗 **HF Leaderboard** | | |
333
+ | MMLU Pro | 37.6 | 34.5 |
334
+ | MuSR | 40.1 | 41.6 |
335
+ |GPQA Diamond | 32.8 | 30.8 |
336
+ |GPQA Main | 28.5 | 32.1 |
337
+ |MATH | 12.5 | 9.1 |
338
+ |BBH | 54.7 | 48.2 |
339
+ | ⛓️ **CoT Leaderboard** | | |
340
+ | LogiQA | 5.9 | 1.4 |
341
+ | LogiQA2 | 15.5 | 0.8 |
342
+ | LSAT-ar | 11.7 | 3.0 |
343
+ | LSAT-lr | 20.8 | 3.9 |
344
+ | LSAT-rc | 27.5 | 13.8 |
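
The table reports pass@1 and pass@5 but this README does not say how they were computed. As a sketch under that caveat, the commonly used unbiased pass@k estimator from code-generation evaluation is shown below: with `n` samples per task of which `c` pass, it estimates the probability that at least one of `k` randomly drawn samples passes. The function name `pass_at_k` is illustrative, and whether Argdown Bench uses this exact estimator is an assumption.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: 1 - C(n - c, k) / C(n, k).

    n: number of samples generated for a task
    c: number of those samples that pass the check
    k: budget of samples considered (1 <= k <= n)
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # Fewer than k failing samples: every draw of k contains a pass.
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)
```

A benchmark-level score would then be the mean of `pass_at_k` over all tasks, scaled to a percentage if the table's values are percentages.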
345
 
346
 
347
  ## SFT dataset mixture