kenchan0226 commited on
Commit
e646cbf
·
verified ·
1 Parent(s): 7852e18

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -31,6 +31,7 @@ After the release of SeaLLMs-v3, we've focused on extending along two directions
31
  - [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
32
  - SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam), [eval code](https://github.com/DAMO-NLP-SG/SeaExam)]
33
  - SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench), [eval code](https://github.com/DAMO-NLP-SG/SeaBench)]
 
34
 
35
  ## Quick Links
36
  - [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need
 
31
  - [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
32
  - SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam), [eval code](https://github.com/DAMO-NLP-SG/SeaExam)]
33
  - SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [[data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench), [eval code](https://github.com/DAMO-NLP-SG/SeaBench)]
34
+ - We also release a new evalaution suite to evaluate the generalization of knowledge boundary cognition across languages. [[data (public)](https://huggingface.co/collections/SeaLLMs/evaluation-suite-for-hallucination-of-multilingual-llms-6842674a542c9011f1bfbefb), [eval code](https://github.com/DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries), [paper](https://arxiv.org/pdf/2504.13816)]
35
 
36
  ## Quick Links
37
  - [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need