Update README.md
README.md
CHANGED
@@ -301,15 +301,16 @@ Llama-3-SauerkrautLM-70b-Instruct 8.6125
 
 
 **German RAG LLM Evaluation**
+corrected result after FIX: https://github.com/huggingface/lighteval/pull/171
 ```
-| Task |Version|Metric|Value
-
-|all | |acc |0.
-|community:german_rag_eval:_average:0 | |acc |0.
-|community:german_rag_eval:choose_context_by_question:0| 0|acc |0.
-|community:german_rag_eval:choose_question_by_context:0| 0|acc |
-|community:german_rag_eval:context_question_match:0 | 0|acc |0.
-|community:german_rag_eval:question_answer_match:0 | 0|acc |0.
+| Task |Version|Metric|Value| |Stderr|
+|------------------------------------------------------|------:|------|----:|---|-----:|
+|all | |acc |0.980|± |0.0034|
+|community:german_rag_eval:_average:0 | |acc |0.980|± |0.0034|
+|community:german_rag_eval:choose_context_by_question:0| 0|acc |0.998|± |0.0014|
+|community:german_rag_eval:choose_question_by_context:0| 0|acc |1.000|± |0.0000|
+|community:german_rag_eval:context_question_match:0 | 0|acc |0.973|± |0.0051|
+|community:german_rag_eval:question_answer_match:0 | 0|acc |0.949|± |0.0070|
 ```
 
 ## Disclaimer
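
A quick sanity check on the corrected table, offered as a sketch rather than as part of the commit: the `_average` row appears to be the unweighted mean of the four task accuracies. The accuracy values below are copied from the added rows; the names `task_acc` and `mean_acc` are illustrative, and the unweighted-mean reading is an assumption that the numbers happen to support.

```python
from statistics import fmean

# Accuracies copied from the corrected table rows (the "+" lines in the diff).
task_acc = {
    "choose_context_by_question": 0.998,
    "choose_question_by_context": 1.000,
    "context_question_match": 0.973,
    "question_answer_match": 0.949,
}

# Assumption: the "_average" row is the plain (unweighted) mean over tasks.
mean_acc = fmean(task_acc.values())

print(f"mean accuracy over {len(task_acc)} tasks: {mean_acc:.3f}")
```

The result rounds to the 0.980 reported in both the `all` and `_average` rows, so the corrected table is internally consistent under that assumption.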