# Inception-v3-Quantized: Optimized for Mobile Deployment
## Quantized Imagenet classifier and general purpose backbone

InceptionNetV3 is a machine learning model that can classify images from the Imagenet dataset. It can also be used as a backbone in building more complex models for specific use cases. This model is post-training quantized to int8 using samples from Google's open images dataset.

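The int8 post-training quantization mentioned above amounts to mapping float tensors onto 8-bit integers via a scale and zero-point chosen from ranges observed on calibration samples. A minimal sketch of that affine mapping (illustrative only, not the repository's actual quantizer):

```python
def int8_qparams(lo: float, hi: float):
    """Scale and zero-point mapping [lo, hi] onto the int8 range [-128, 127]."""
    lo, hi = min(lo, 0.0), max(hi, 0.0)  # representable range must include 0.0
    scale = (hi - lo) / 255.0
    zero_point = int(round(-128 - lo / scale))
    return scale, zero_point

def quantize(x: float, scale: float, zero_point: int) -> int:
    """Float -> int8, clamped to the representable range."""
    return max(-128, min(127, int(round(x / scale)) + zero_point))

def dequantize(q: int, scale: float, zero_point: int) -> float:
    """int8 -> float approximation of the original value."""
    return (q - zero_point) * scale

# e.g. an activation range of [0.0, 2.55] observed during calibration
scale, zp = int8_qparams(0.0, 2.55)
q = quantize(1.0, scale, zp)
x = dequantize(q, scale, zp)  # recovers 1.0 up to quantization error
```

Every weight and activation is stored and computed in int8, which is what lets the NPU run the model at the latencies reported below.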
This model is an implementation of Inception-v3-Quantized found [here](https://github.com/pytorch/vision/blob/main/torchvision/models/inception.py).
This repository provides scripts to run Inception-v3-Quantized on Qualcomm® devices.

More details on model performance across various devices can be found

| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model |
| ---|---|---|---|---|---|---|---|
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 0.615 ms | 0 - 2 MB | INT8 | NPU | [Inception-v3-Quantized.tflite](https://huggingface.co/qualcomm/Inception-v3-Quantized/blob/main/Inception-v3-Quantized.tflite) |
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 0.656 ms | 0 - 67 MB | INT8 | NPU | [Inception-v3-Quantized.so](https://huggingface.co/qualcomm/Inception-v3-Quantized/blob/main/Inception-v3-Quantized.so) |

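For a sense of scale, the per-inference latencies in the table translate into a single-stream throughput ceiling by simple arithmetic (derived numbers, not additional measurements):

```python
def single_stream_fps(latency_ms: float) -> float:
    """Upper bound on frames/sec when one inference runs at a time."""
    return 1000.0 / latency_ms

fps_tflite = single_stream_fps(0.615)  # TFLite row: ~1626 fps
fps_qnn = single_stream_fps(0.656)     # QNN row: ~1524 fps
```

Real pipelines will see lower numbers once preprocessing and data transfer are included.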
## Installation

python -m qai_hub_models.models.inception_v3_quantized.export

```
Profile Job summary of Inception-v3-Quantized
--------------------------------------------------
Device: Snapdragon X Elite CRD (11)
Estimated Inference Time: 0.72 ms
Estimated Peak Memory Range: 0.39-0.39 MB
Compute Units: NPU (134) | Total (134)
```