Update README.md (#1)

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,6 +1,9 @@
 ---
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
 ---
 # Model Overview
@@ -75,39 +78,37 @@ python examples/llama/convert_checkpoint.py --model_dir Llama-3.1-8B-Instruct-FP
 trtllm-build --checkpoint_dir /ckpt --output_dir /engine
 ```
-* Accuracy evaluation:
-1) Prepare the MMLU dataset:
-```sh
-mkdir data; wget https://people.eecs.berkeley.edu/~hendrycks/data.tar -O data/mmlu.tar
-tar -xf data/mmlu.tar -C data && mv data/data data/mmlu
-```
-2) Measure MMLU:
-```sh
-python examples/mmlu.py --engine_dir ./engine --tokenizer_dir Llama-3.1-8B-Instruct-FP8/ --test_trt_llm --data_dir data/mmlu
-```
 * Throughputs evaluation:
 Please refer to the [TensorRT-LLM benchmarking documentation](https://github.com/NVIDIA/TensorRT-LLM/blob/main/benchmarks/Suite.md) for details.
 ## Evaluation
-The accuracy (MMLU, 5-shot) and throughputs (tokens per second, TPS) benchmark results are presented in the table below:
 <table>
   <tr>
    <td><strong>Precision</strong>
    </td>
    <td><strong>MMLU</strong>
    </td>
    <td><strong>TPS</strong>
    </td>
   </tr>
   <tr>
-   <td>FP16
    </td>
-   <td>68.6
    </td>
    <td>8,579.93
    </td>
@@ -115,7 +116,13 @@ The accuracy (MMLU, 5-shot) and throughputs (tokens per second, TPS) benchmark r
   <tr>
    <td>FP8
    </td>
-   <td>68.3
    </td>
    <td>11,062.90
    </td>

 ---
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
+license: llama3.1
+pipeline_tag: text-generation
+library_name: transformers
 ---
 # Model Overview
 trtllm-build --checkpoint_dir /ckpt --output_dir /engine
 ```
 * Throughputs evaluation:
 Please refer to the [TensorRT-LLM benchmarking documentation](https://github.com/NVIDIA/TensorRT-LLM/blob/main/benchmarks/Suite.md) for details.
 ## Evaluation
 <table>
   <tr>
    <td><strong>Precision</strong>
    </td>
    <td><strong>MMLU</strong>
    </td>
+   <td><strong>GSM8K (CoT) </strong>
+   </td>
+   <td><strong>ARC Challenge</strong>
+   </td>
+   <td><strong>IFEVAL</strong>
+   </td>
    <td><strong>TPS</strong>
    </td>
   </tr>
   <tr>
+   <td>BF16
+   </td>
+   <td>69.4
+   </td>
+   <td>84.5
    </td>
+   <td>83.4
+   </td>
+   <td>80.4
    </td>
    <td>8,579.93
    </td>
   <tr>
    <td>FP8
    </td>
+   <td>68.7
+   </td>
+   <td>83.1
+   </td>
+   <td>83.3
+   </td>
+   <td>81.8
    </td>
    <td>11,062.90
    </td>