Update README.md
Browse files
README.md
CHANGED
@@ -219,9 +219,11 @@ python -m executorch.examples.models.llama.export_llama \
|
|
219 |
|
220 |
## Running in a mobile app
|
221 |
The PTE file can be run with ExecuTorch on a mobile phone. See the [instructions](https://pytorch.org/executorch/main/llm/llama-demo-ios.html) for doing this in iOS.
|
222 |
-
On iPhone 15 Pro, the model runs at 17.3 tokens/sec and uses 3206 Mb of memory.
|
|
|
|
|
|
|
223 |
|
224 |
-

|
225 |
|
226 |
# Disclaimer
|
227 |
PyTorch has not performed safety evaluations or red teamed the quantized models. Performance characteristics, outputs, and behaviors may differ from the original models. Users are solely responsible for selecting appropriate use cases, evaluating and mitigating for accuracy, safety, and fairness, ensuring security, and complying with all applicable laws and regulations.
|
|
|
219 |
|
220 |
## Running in a mobile app
|
221 |
The PTE file can be run with ExecuTorch on a mobile phone. See the [instructions](https://pytorch.org/executorch/main/llm/llama-demo-ios.html) for doing this in iOS.
|
222 |
+
On iPhone 15 Pro, the model runs at 17.3 tokens/sec and uses 3206 Mb of memory.
|
223 |
+
|
224 |
+

|
225 |
+
|
226 |
|
|
|
227 |
|
228 |
# Disclaimer
|
229 |
PyTorch has not performed safety evaluations or red teamed the quantized models. Performance characteristics, outputs, and behaviors may differ from the original models. Users are solely responsible for selecting appropriate use cases, evaluating and mitigating for accuracy, safety, and fairness, ensuring security, and complying with all applicable laws and regulations.
|