Update README.md
Browse files
README.md
CHANGED
@@ -31,6 +31,7 @@ After the release of SeaLLMs-v3, we've focused on extending along two directions
|
|
31 |
- [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
|
32 |
- SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam), [eval code](https://github.com/DAMO-NLP-SG/SeaExam)]
|
33 |
- SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench), [eval code](https://github.com/DAMO-NLP-SG/SeaBench)]
|
|
|
34 |
|
35 |
## Quick Links
|
36 |
- [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need
|
|
|
31 |
- [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
|
32 |
- SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam), [eval code](https://github.com/DAMO-NLP-SG/SeaExam)]
|
33 |
- SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench), [eval code](https://github.com/DAMO-NLP-SG/SeaBench)]
|
34 |
+
- We also release a new evalaution suite to evaluate the generalization of knowledge boundary cognition across languages. [[data (public)](https://huggingface.co/collections/SeaLLMs/evaluation-suite-for-hallucination-of-multilingual-llms-6842674a542c9011f1bfbefb), [eval code](https://github.com/DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries), [paper](https://arxiv.org/pdf/2504.13816)]
|
35 |
|
36 |
## Quick Links
|
37 |
- [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need
|