Spaces:
Running
Running
piotr-szleg-bards-ai
commited on
Commit
·
446174f
1
Parent(s):
6d40f49
2024-03-08 15:33:44 Publish script update
Browse files- app.py +5 -3
- data/summary_metrics_plots.csv +0 -0
app.py
CHANGED
@@ -234,8 +234,6 @@ Note that pause and resume time cost was not included in the "Cost Per Token" co
|
|
234 |
"""
|
235 |
)
|
236 |
general_plots[general_plots.plot_name == "execution_costs"].apply(display_filtered_plot, axis=1)
|
237 |
-
with gr.Tab("Summary metrics"):
|
238 |
-
summary_metrics_plots.apply(display_filtered_plot, axis=1)
|
239 |
with gr.Tab("Context length and parameters count"):
|
240 |
general_plots[general_plots.plot_name != "execution_costs"].apply(display_filtered_plot, axis=1)
|
241 |
gr.Markdown(
|
@@ -247,7 +245,9 @@ A lot of models had to be omitted due to their developers not disclosing their p
|
|
247 |
Mainly OpenAI's GPT models and Google's Palm 2.
|
248 |
"""
|
249 |
)
|
250 |
-
with gr.Tab("
|
|
|
|
|
251 |
with gr.Row():
|
252 |
choices = combined_plots.header
|
253 |
choices = choices[choices.str.contains("for model")]
|
@@ -275,6 +275,8 @@ Mainly OpenAI's GPT models and Google's Palm 2.
|
|
275 |
Radial plots are used to compare the most important aspects of each model researched on this board using single images.
|
276 |
|
277 |
All values are normalized and scaled into 0.25 to 1 range, 0 is left for unknown values.
|
|
|
|
|
278 |
|
279 |
To compare the parameters more thoroughly use the filtering box on top of this page and inspect individual tabs.
|
280 |
""")
|
|
|
234 |
"""
|
235 |
)
|
236 |
general_plots[general_plots.plot_name == "execution_costs"].apply(display_filtered_plot, axis=1)
|
|
|
|
|
237 |
with gr.Tab("Context length and parameters count"):
|
238 |
general_plots[general_plots.plot_name != "execution_costs"].apply(display_filtered_plot, axis=1)
|
239 |
gr.Markdown(
|
|
|
245 |
Mainly OpenAI's GPT models and Google's Palm 2.
|
246 |
"""
|
247 |
)
|
248 |
+
with gr.Tab("Summary quality metrics"):
|
249 |
+
summary_metrics_plots.apply(display_filtered_plot, axis=1)
|
250 |
+
with gr.Tab("Comprehensive models comparison"):
|
251 |
with gr.Row():
|
252 |
choices = combined_plots.header
|
253 |
choices = choices[choices.str.contains("for model")]
|
|
|
275 |
Radial plots are used to compare the most important aspects of each model researched on this board using single images.
|
276 |
|
277 |
All values are normalized and scaled into 0.25 to 1 range, 0 is left for unknown values.
|
278 |
+
|
279 |
+
Some metrics were reversed in order to make the plots more readable, for example "Fast execution" is `1 - execution_time` scaled to 0-1 range and moved 0.25 units up as mentioned above.
|
280 |
|
281 |
To compare the parameters more thoroughly use the filtering box on top of this page and inspect individual tabs.
|
282 |
""")
|
data/summary_metrics_plots.csv
CHANGED
The diff for this file is too large to render.
See raw diff
|
|