piotr-szleg-bards-ai committed
Commit 446174f · 1 Parent(s): 6d40f49

2024-03-08 15:33:44 Publish script update

Files changed (2)
  1. app.py +5 -3
  2. data/summary_metrics_plots.csv +0 -0
app.py CHANGED
@@ -234,8 +234,6 @@ Note that pause and resume time cost was not included in the "Cost Per Token" co
  """
  )
  general_plots[general_plots.plot_name == "execution_costs"].apply(display_filtered_plot, axis=1)
- with gr.Tab("Summary metrics"):
- summary_metrics_plots.apply(display_filtered_plot, axis=1)
  with gr.Tab("Context length and parameters count"):
  general_plots[general_plots.plot_name != "execution_costs"].apply(display_filtered_plot, axis=1)
  gr.Markdown(
@@ -247,7 +245,9 @@ A lot of models had to be omitted due to their developers not disclosing their p
  Mainly OpenAI's GPT models and Google's Palm 2.
  """
  )
- with gr.Tab("Combined plots"):
+ with gr.Tab("Summary quality metrics"):
+ summary_metrics_plots.apply(display_filtered_plot, axis=1)
+ with gr.Tab("Comprehensive models comparison"):
  with gr.Row():
  choices = combined_plots.header
  choices = choices[choices.str.contains("for model")]
@@ -275,6 +275,8 @@ Mainly OpenAI's GPT models and Google's Palm 2.
  Radial plots are used to compare the most important aspects of each model researched on this board using single images.

  All values are normalized and scaled into 0.25 to 1 range, 0 is left for unknown values.
+
+ Some metrics were reversed in order to make the plots more readable, for example "Fast execution" is `1 - execution_time` scaled to 0-1 range and moved 0.25 units up as mentioned above.

  To compare the parameters more thoroughly use the filtering box on top of this page and inspect individual tabs.
  """)
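The radial-plot note added in this commit describes a scaling rule without showing it. The sketch below is one possible reading, not the code used by the Space: it assumes min-max normalization to 0-1, an optional reversal (`1 - value`) for metrics where lower is better, and a linear remap into the 0.25 to 1 range, with 0 reserved for unknown values. The function name `scale_for_radial` and its parameters are hypothetical.

```python
from typing import List, Optional, Sequence


def scale_for_radial(
    values: Sequence[Optional[float]],
    reverse: bool = False,
    floor: float = 0.25,
) -> List[float]:
    """Map raw metric values into [floor, 1]; unknown (None) values become 0.

    Assumption: "normalized" means min-max normalization over the known values.
    """
    known = [v for v in values if v is not None]
    if not known:
        # Nothing to normalize against: every entry is unknown.
        return [0.0] * len(values)

    lo, hi = min(known), max(known)
    span = hi - lo

    scaled = []
    for v in values:
        if v is None:
            scaled.append(0.0)  # 0 is left for unknown values
            continue
        if span == 0:
            unit = 1.0  # all known values are equal; treat them as full scale
        else:
            unit = (v - lo) / span  # min-max normalization to 0-1
            if reverse:
                unit = 1.0 - unit  # e.g. "Fast execution" from execution_time
        # Compress 0-1 into floor-1 so known values never collide with 0.
        scaled.append(floor + (1.0 - floor) * unit)
    return scaled


# Hypothetical execution times in seconds; None marks a model with no measurement.
fast_execution = scale_for_radial([2.0, 4.0, None, 10.0], reverse=True)
```

Under these assumptions the fastest model sits at 1, the slowest at 0.25, and the model with no measurement at 0, which keeps "unknown" visually distinct from "worst" on the plot.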
data/summary_metrics_plots.csv CHANGED
The diff for this file is too large to render.