DavidGF commited on
Commit
5a48168
·
verified ·
1 Parent(s): 24fd931

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -8
README.md CHANGED
@@ -301,15 +301,16 @@ Llama-3-SauerkrautLM-70b-Instruct 8.6125
301
 
302
 
303
  **German RAG LLM Evaluation**
 
304
  ```
305
- | Task |Version|Metric|Value | |Stderr|
306
- |------------------------------------------------------|------:|------|-----:|---|-----:|
307
- |all | |acc |0.9593|± |0.0054|
308
- |community:german_rag_eval:_average:0 | |acc |0.9593|± |0.0054|
309
- |community:german_rag_eval:choose_context_by_question:0| 0|acc |0.9050|± |0.0093|
310
- |community:german_rag_eval:choose_question_by_context:0| 0|acc |0.9980|± |0.0014|
311
- |community:german_rag_eval:context_question_match:0 | 0|acc |0.9790|± |0.0045|
312
- |community:german_rag_eval:question_answer_match:0 | 0|acc |0.9550|± |0.0066|
313
  ```
314
 
315
  ## Disclaimer
 
301
 
302
 
303
  **German RAG LLM Evaluation**
304
+ corrected result after FIX: https://github.com/huggingface/lighteval/pull/171
305
  ```
306
+ | Task |Version|Metric|Value| |Stderr|
307
+ |------------------------------------------------------|------:|------|----:|---|-----:|
308
+ |all | |acc |0.980|± |0.0034|
309
+ |community:german_rag_eval:_average:0 | |acc |0.980|± |0.0034|
310
+ |community:german_rag_eval:choose_context_by_question:0| 0|acc |0.998|± |0.0014|
311
+ |community:german_rag_eval:choose_question_by_context:0| 0|acc |1.000|± |0.0000|
312
+ |community:german_rag_eval:context_question_match:0 | 0|acc |0.973|± |0.0051|
313
+ |community:german_rag_eval:question_answer_match:0 | 0|acc |0.949|± |0.0070|
314
  ```
315
 
316
  ## Disclaimer