doberst commited on
Commit
696e380
·
verified ·
1 Parent(s): 6fbf44f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -14,10 +14,10 @@ BLING-QWEN-MINI-TOOL (1.5B)
14
  Evaluated against the benchmark test: RAG-Instruct-Benchmark-Tester
15
  1 Test Run with sample=False & temperature=0.0 (deterministic output) - 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
16
 
17
- --Accuracy Score: **93.5** correct out of 100
18
- --Not Found Classification: 75.0%
19
- --Boolean: 87.5%
20
- --Math/Logic: 70.0%
21
  --Complex Questions (1-5): 3 (Average)
22
  --Summarization Quality (1-5): 3 (Average)
23
  --Hallucinations: No hallucinations observed in test runs.
 
14
  Evaluated against the benchmark test: RAG-Instruct-Benchmark-Tester
15
  1 Test Run with sample=False & temperature=0.0 (deterministic output) - 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
16
 
17
+ --Accuracy Score: **93.5** correct out of 100
18
+ --Not Found Classification: 75.0%
19
+ --Boolean: 87.5%
20
+ --Math/Logic: 70.0%
21
  --Complex Questions (1-5): 3 (Average)
22
  --Summarization Quality (1-5): 3 (Average)
23
  --Hallucinations: No hallucinations observed in test runs.