Update README.md

README.md (changed):
````diff
@@ -21,12 +21,15 @@ pipeline_tag: text-generation
 
 # Installation
 ```
-pip install transformers
+pip install git+https://github.com/huggingface/transformers
 pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cu126
-pip install git@github.com:EleutherAI/lm-evaluation-harness.git
 pip install vllm --pre --extra-index-url https://wheels.vllm.ai/nightly
 ```
 
+Also need to install lm-eval from source:
+https://github.com/EleutherAI/lm-evaluation-harness#install
+
+
 # Quantization Recipe
 We used following code to get the quantized model:
 
@@ -81,11 +84,6 @@ print(f"{save_to} model:", benchmark_fn(quantized_model.generate, **inputs, max_
 
 # Model Quality
 We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) to evaluate the quality of the quantized model.
 
-## Installing the nightly version to get most recent updates
-```
-pip install git+https://github.com/EleutherAI/lm-evaluation-harness
-```
-
 ## baseline
 ```
 lm_eval --model hf --model_args pretrained=microsoft/Phi-4-mini-instruct --tasks hellaswag --device cuda:0 --batch_size 8
````
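For reference, the post-change installation sequence consolidates to the commands below. This is a sketch assembled from the updated hunk; the last three lines for the lm-eval source install are an assumption about what the linked `#install` instructions say, not commands taken from this diff.

```shell
# Nightly/source installs, as listed in the updated README
pip install git+https://github.com/huggingface/transformers
pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cu126
pip install vllm --pre --extra-index-url https://wheels.vllm.ai/nightly

# lm-eval from source, per https://github.com/EleutherAI/lm-evaluation-harness#install
# (assumed: a standard clone-and-editable-install flow)
git clone https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
pip install -e .
```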