Report for ProsusAI/finbert

#39
by inoki-giskard - opened

Hey Team!🤗✨
We’re thrilled to share some amazing evaluation results that’ll make your day!🎉📊

We have identified 2 potential vulnerabilities in your model based on an automated scan.

This automated analysis evaluated the model on the dataset financial_phrasebank (subset sentences_75agree, split train).

👉Robustness issues (1)
Vulnerability Level Data slice Metric Transformation Deviation
Robustness medium 🟡 Fail rate = 0.093 Add typos 93/1000 tested samples (9.3%) changed prediction after perturbation
🔍✨Examples When feature “text” is perturbed with the transformation “Add typos”, the model changes its prediction in 9.3% of the cases. We expected the predictions not to be affected by this transformation.
text Add typos(text) Original prediction Prediction after perturbation
3221 Operating result , excluding one-off items , totaled EUR 9.1 mn compared to EUR 10.6 mn in continuing operations , excluding one-off items in 2004 . Operatng resuly , exckluding one-off oitems , totlaed EUR 9.1 mn ompared to EUR 10.6 mn in continuing opefations , excluding ojne-off itens in 200R . negative (p = 0.68) positive (p = 0.89)
3257 Body The credit falls due February 24 , 2014 . Body Thd credit falls due Fbruary 24 , 2014 . neutral (p = 0.77) negative (p = 0.95)
2646 The 3C Expo is a signature show in Dongguan , which is supported by the Dongguan Municipal Government every year , featuring computer accessories , software , communication and network products . The 3C Expo s a singature show in Dongguan , whic his supported by the Dongguan Municipal Govrenment evwery year , featuring computer accessories , software , communication and network products . neutral (p = 0.90) positive (p = 0.52)
👉Performance issues (1)
Vulnerability Level Data slice Metric Transformation Deviation
Performance medium 🟡 text contains "quarter" Balanced Accuracy = 0.901 -6.20% than global
🔍✨Examples For records in the dataset where `text` contains "quarter", the Balanced Accuracy is 6.2% lower than the global Balanced Accuracy.
text label Predicted label
480 The loss for the third quarter of 2007 was EUR 0.3 mn smaller than the loss of the second quarter of 2007 . positive negative (p = 0.93)
506 Finnish power supply solutions and systems provider Efore Oyj said its net loss widened to 3.2 mln euro $ 4.2 mln for the first quarter of fiscal 2006-2007 ending October 31 , 2007 from 900,000 euro $ 1.2 mln for the same period of fiscal 2005-06 . negative positive (p = 0.93)
507 ADP News - Apr 22 , 2009 - Finnish business information systems developer Solteq Oyj HEL : STQ1V said today its net loss widened to EUR 189,000 USD 245,000 for the first quarter of 2009 from EUR 10,000 for the same peri negative positive (p = 0.93)

Disclaimer: it's important to note that automated scans may produce false positives or miss certain vulnerabilities. We encourage you to review the findings and assess the impact accordingly.

💡 What's Next?

  • Checkout the Giskard Space and improve your model.
  • The Giskard community is always buzzing with ideas. 🐢🤔 What do you want to see next? Your feedback is our favorite fuel, so drop your thoughts in the community forum! 🗣️💬 Together, we're building something extraordinary.

🙌 Big Thanks!

We're grateful to have you on this adventure with us. 🚀🌟 Here's to more breakthroughs, laughter, and code magic! 🥂✨ Keep hugging that code and spreading the love! 💻 #Giskard #Huggingface #AISafety 🌈👏 Your enthusiasm, feedback, and contributions are what seek. 🌟 Keep being awesome!

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment