Update README.md
README.md CHANGED
@@ -109,7 +109,7 @@ print(round(results["mean_perplexity"], 2))
 ### Harness Evaluation

 - The performance evaluation is based on the tasks being evaluated on the [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
-The model is evaluated on three benchmark datasets, which include `ARC-Challenge`, `HellaSwag` and `MMLU`.
+The model is evaluated on four benchmark datasets: `ARC-Challenge`, `HellaSwag`, `MMLU` and `IFEval`.
 The library used is [lm-evaluation-harness repository](https://github.com/EleutherAI/lm-evaluation-harness)
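For reference, an evaluation like the one this commit describes could be run with the harness either from Python or from the CLI. Below is a minimal sketch assuming lm-eval v0.4+ (`pip install lm-eval`); the model id `your-org/your-model` is a placeholder, not this repository's actual checkpoint.

```python
# Minimal sketch: scoring a model on the four tasks named in the README.
# Assumes EleutherAI lm-evaluation-harness v0.4+; the model id is a placeholder.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                   # Hugging Face transformers backend
    model_args="pretrained=your-org/your-model",  # placeholder model id
    tasks=["arc_challenge", "hellaswag", "mmlu", "ifeval"],
)

# Per-task metrics (acc, acc_norm, exact_match, ...) are keyed by task name.
for task, metrics in results["results"].items():
    print(task, metrics)
```

The equivalent CLI call would be `lm_eval --model hf --model_args pretrained=your-org/your-model --tasks arc_challenge,hellaswag,mmlu,ifeval`.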