MMTEB: Massive Multilingual Text Embedding Benchmark Paper β’ 2502.13595 β’ Published Feb 19 β’ 33
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper β’ 2502.14922 β’ Published Feb 19 β’ 30
MMTEB: Massive Multilingual Text Embedding Benchmark Paper β’ 2502.13595 β’ Published Feb 19 β’ 33
What's the Meaning of Superhuman Performance in Today's NLU? Paper β’ 2305.08414 β’ Published May 15, 2023 β’ 1
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS Paper β’ 2411.19655 β’ Published Nov 29, 2024 β’ 20
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper β’ 2412.15035 β’ Published Dec 19, 2024 β’ 4
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper β’ 2412.15035 β’ Published Dec 19, 2024 β’ 4
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper β’ 2412.15035 β’ Published Dec 19, 2024 β’ 4 β’ 3
Word Sense Linking: Disambiguating Outside the Sandbox Paper β’ 2412.09370 β’ Published Dec 12, 2024 β’ 9
Word Sense Linking Collection Word Sense Linking is the task designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory. β’ 6 items β’ Updated Jan 13 β’ 6
Babelscape/LLM-Oasis_e2e_factuality_evaluation Viewer β’ Updated Dec 2, 2024 β’ 1.71k β’ 135 β’ 5