Spaces:

SeaLLMs
/

README

Running

kenchan0226 commited on Jun 6

Commit

e646cbf

verified ·

1 Parent(s): 7852e18

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -31,6 +31,7 @@ After the release of SeaLLMs-v3, we've focused on extending along two directions
 - [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
 - SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam), [eval code](https://github.com/DAMO-NLP-SG/SeaExam)]
 - SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench), [eval code](https://github.com/DAMO-NLP-SG/SeaBench)]
 ## Quick Links
 - [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need

 - [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
 - SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam), [eval code](https://github.com/DAMO-NLP-SG/SeaExam)]
 - SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench), [eval code](https://github.com/DAMO-NLP-SG/SeaBench)]
+- We also release a new evalaution suite to evaluate the generalization of knowledge boundary cognition across languages. [[data (public)](https://huggingface.co/collections/SeaLLMs/evaluation-suite-for-hallucination-of-multilingual-llms-6842674a542c9011f1bfbefb), [eval code](https://github.com/DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries), [paper](https://arxiv.org/pdf/2504.13816)]
 ## Quick Links
 - [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need