metascroy commited on
Commit
164eb9a
·
verified ·
1 Parent(s): 0b831a8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -219,9 +219,11 @@ python -m executorch.examples.models.llama.export_llama \
219
 
220
  ## Running in a mobile app
221
  The PTE file can be run with ExecuTorch on a mobile phone. See the [instructions](https://pytorch.org/executorch/main/llm/llama-demo-ios.html) for doing this in iOS.
222
- On iPhone 15 Pro, the model runs at 17.3 tokens/sec and uses 3206 Mb of memory.
 
 
 
223
 
224
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/66049fc71116cebd1d3bdcf4/AEdAJjGK2lED7tr6seWGf.png)
225
 
226
  # Disclaimer
227
  PyTorch has not performed safety evaluations or red teamed the quantized models. Performance characteristics, outputs, and behaviors may differ from the original models. Users are solely responsible for selecting appropriate use cases, evaluating and mitigating for accuracy, safety, and fairness, ensuring security, and complying with all applicable laws and regulations.
 
219
 
220
  ## Running in a mobile app
221
  The PTE file can be run with ExecuTorch on a mobile phone. See the [instructions](https://pytorch.org/executorch/main/llm/llama-demo-ios.html) for doing this in iOS.
222
+ On iPhone 15 Pro, the model runs at 17.3 tokens/sec and uses 3206 Mb of memory.
223
+
224
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/66049fc71116cebd1d3bdcf4/521rXwIlYS9HIAEBAPJjw.png)
225
+
226
 
 
227
 
228
  # Disclaimer
229
  PyTorch has not performed safety evaluations or red teamed the quantized models. Performance characteristics, outputs, and behaviors may differ from the original models. Users are solely responsible for selecting appropriate use cases, evaluating and mitigating for accuracy, safety, and fairness, ensuring security, and complying with all applicable laws and regulations.