zake7749
/

Llama-3.2-3B-it-chinese-kyara

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

zake7749 commited on Nov 20, 2024

Commit

54105b8

·

verified ·

1 Parent(s): 299bb71

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -27,6 +27,7 @@ This is a preview model, with the stable version set to be released soon.
 ## Benchmark
 | Metric                   | Kyara-3b-it    | Llama3.2-3b-it |
 |--------------------------|----------|-------------|
@@ -37,5 +38,6 @@ This is a preview model, with the stable version set to be released soon.
 | &emsp;- Social-Science     | **44.16**   | 41.98      |
 | **[MMLU-Redux](https://github.com/yuchenlin/ZeroEval)**    | **57.24**| 56.91       |
 | **[GSM8K](https://github.com/yuchenlin/ZeroEval)**         | **54.21**| 51.63       |
 | **[CRUX](https://github.com/yuchenlin/ZeroEval)**          | **31.25**| 25.25     |
 | **[AlpacaEval](https://github.com/tatsu-lab/alpaca_eval)**    | **23.87**| 19.35  |

 ## Benchmark
+All evaluations are conducted in a zero-shot setting.
 | Metric                   | Kyara-3b-it    | Llama3.2-3b-it |
 |--------------------------|----------|-------------|
 | &emsp;- Social-Science     | **44.16**   | 41.98      |
 | **[MMLU-Redux](https://github.com/yuchenlin/ZeroEval)**    | **57.24**| 56.91       |
 | **[GSM8K](https://github.com/yuchenlin/ZeroEval)**         | **54.21**| 51.63       |
+| **[MATH-L5](https://github.com/yuchenlin/ZeroEval)**         | **19.97**| 16.23       |
 | **[CRUX](https://github.com/yuchenlin/ZeroEval)**          | **31.25**| 25.25     |
 | **[AlpacaEval](https://github.com/tatsu-lab/alpaca_eval)**    | **23.87**| 19.35  |