farimafatahi committed on
Commit 2a17a1c · verified · 1 Parent(s): 8e86e3b

Update app.py

Files changed (1):
  app.py +1 -1
app.py CHANGED
@@ -184,7 +184,7 @@ with tab1:
      <strong> 🎯 Factual Precision </strong> measures the ratio of supported units divided by all units averaged over model responses. <strong> 🌀 Hallucination Score </strong> quantifies the incorrect or inconclusive contents within a model response, as described in the paper. We also provide statistics on the average length of the response in terms of the number of tokens, the average verifiable units existing in the model responses (<strong>Avg. # Units</strong>), the average number of units labelled as undecidable (<strong>Avg. # Undecidable</strong>), and the average number of units labelled as unsupported (<strong>Avg. # Unsupported</strong>).
      </p>
      <p>
-     🔒 for closed LLMs; 🔑 for open-weights LLMs; 🚨 for newly added models"
+     🔒 for closed LLMs; 🔑 for open-weights LLMs; 🚨 for newly added models
      </p>
      </div>
      """,
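The legend text in this diff defines Factual Precision as the per-response ratio of supported units to all units, averaged over model responses. A minimal sketch of that computation, assuming per-response lists of unit labels (the function name and label strings here are illustrative, not taken from app.py):

```python
# Hedged sketch of the Factual Precision metric described in the legend:
# per response, precision = supported units / all units; the final score is
# the mean over responses. Data layout and names are assumptions.

def factual_precision(responses):
    """responses: list of per-response unit-label lists, where each label is
    'supported', 'unsupported', or 'undecidable'."""
    per_response = [
        sum(1 for unit in units if unit == "supported") / len(units)
        for units in responses
        if units  # skip empty responses to avoid division by zero
    ]
    return sum(per_response) / len(per_response)

labels = [
    ["supported", "supported", "unsupported"],  # 2/3 supported
    ["supported", "undecidable"],               # 1/2 supported
]
print(factual_precision(labels))  # mean of 2/3 and 1/2
```

The undecidable and unsupported counts reported in the table (Avg. # Undecidable, Avg. # Unsupported) would follow the same per-response-then-average pattern over the other two labels.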