v0.30.2

Browse files

See https://github.com/quic/ai-hub-models/releases/v0.30.2 for changelog.

Files changed (8) hide show

ControlNet_Quantized.bin +0 -3
README.md +39 -40
TextEncoder_Quantized.bin +0 -3
TextEncoder_Quantized.so +0 -3
UNet_Quantized.bin +0 -3
UNet_Quantized.so +0 -3
VAEDecoder_Quantized.bin +0 -3
VAEDecoder_Quantized.so +0 -3

ControlNet_Quantized.bin DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:d9757acb27d03ef59e7ed5b658d59f0a668b6eb896f6d280e3c9116953945da5
-size 368625928

README.md CHANGED Viewed

@@ -1,9 +1,8 @@
 ---
 library_name: pytorch
-license: apache-2.0
 tags:
 - generative_ai
-- quantized
 - android
 pipeline_tag: unconditional-image-generation
@@ -27,7 +26,7 @@ More details on model performance across various devices, can be found
 ### Model Details
-- **Model Type:** Image generation
 - **Model Stats:**
   - Input: Text prompt and input image as a reference
   - Conditioning Input: Canny-Edge
@@ -37,20 +36,20 @@ More details on model performance across various devices, can be found
   - ControlNet Number of parameters: 361M
   - Model size: 1.4GB
-| Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
-| TextEncoder_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 11.394 ms | 0 - 74 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/TextEncoder_Quantized.bin) |
-| TextEncoder_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 8.08 ms | 0 - 137 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/TextEncoder_Quantized.bin) |
-| TextEncoder_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 10.982 ms | 0 - 1 MB | UINT16 | NPU | Use Export Script |
-| UNet_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 262.52 ms | 11 - 17 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/UNet_Quantized.bin) |
-| UNet_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 192.789 ms | 3 - 1247 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/UNet_Quantized.bin) |
-| UNet_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 260.158 ms | 14 - 15 MB | UINT16 | NPU | Use Export Script |
-| VAEDecoder_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 390.243 ms | 0 - 36 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/VAEDecoder_Quantized.bin) |
-| VAEDecoder_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 294.404 ms | 0 - 88 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/VAEDecoder_Quantized.bin) |
-| VAEDecoder_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 379.548 ms | 0 - 1 MB | UINT16 | NPU | Use Export Script |
-| ControlNet_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 100.33 ms | 2 - 68 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/ControlNet_Quantized.bin) |
-| ControlNet_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 76.94 ms | 0 - 533 MB | UINT16 | NPU | [ControlNet.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/ControlNet_Quantized.bin) |
-| ControlNet_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 103.52 ms | 2 - 3 MB | UINT16 | NPU | Use Export Script |
@@ -112,39 +111,39 @@ python -m qai_hub_models.models.controlnet.export
 Profiling Results
 ------------------------------------------------------------
 TextEncoder_Quantized
-Device                          : Samsung Galaxy S23 (13)
-Runtime                         : QNN
-Estimated inference time (ms)   : 11.4
-Estimated peak memory usage (MB): [0, 74]
-Total # Ops                     : 570
-Compute Unit(s)                 : NPU (570 ops)
 ------------------------------------------------------------
 UNet_Quantized
-Device                          : Samsung Galaxy S23 (13)
-Runtime                         : QNN
-Estimated inference time (ms)   : 262.5
-Estimated peak memory usage (MB): [11, 17]
-Total # Ops                     : 5434
-Compute Unit(s)                 : NPU (5434 ops)
 ------------------------------------------------------------
 VAEDecoder_Quantized
-Device                          : Samsung Galaxy S23 (13)
-Runtime                         : QNN
-Estimated inference time (ms)   : 390.2
-Estimated peak memory usage (MB): [0, 36]
-Total # Ops                     : 409
-Compute Unit(s)                 : NPU (409 ops)
 ------------------------------------------------------------
 ControlNet_Quantized
-Device                          : Samsung Galaxy S23 (13)
-Runtime                         : QNN
-Estimated inference time (ms)   : 100.3
-Estimated peak memory usage (MB): [2, 68]
-Total # Ops                     : 2406
-Compute Unit(s)                 : NPU (2406 ops)
 ```

 ---
 library_name: pytorch
+license: other
 tags:
 - generative_ai
 - android
 pipeline_tag: unconditional-image-generation
 ### Model Details
+- **Model Type:** Model_use_case.image_generation
 - **Model Stats:**
   - Input: Text prompt and input image as a reference
   - Conditioning Input: Canny-Edge
   - ControlNet Number of parameters: 361M
   - Model size: 1.4GB
+| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
+| TextEncoder_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 10.874 ms | 0 - 3 MB | NPU | Use Export Script |
+| TextEncoder_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 7.918 ms | 0 - 18 MB | NPU | Use Export Script |
+| TextEncoder_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 10.875 ms | 0 - 3 MB | NPU | Use Export Script |
+| UNet_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 258.151 ms | 13 - 15 MB | NPU | Use Export Script |
+| UNet_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 197.629 ms | 13 - 31 MB | NPU | Use Export Script |
+| UNet_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 256.936 ms | 13 - 16 MB | NPU | Use Export Script |
+| VAEDecoder_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 397.625 ms | 0 - 2 MB | NPU | Use Export Script |
+| VAEDecoder_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 300.627 ms | 0 - 21 MB | NPU | Use Export Script |
+| VAEDecoder_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 395.006 ms | 0 - 3 MB | NPU | Use Export Script |
+| ControlNet_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 104.668 ms | 2 - 9 MB | NPU | Use Export Script |
+| ControlNet_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 77.289 ms | 2 - 23 MB | NPU | Use Export Script |
+| ControlNet_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 103.817 ms | 2 - 5 MB | NPU | Use Export Script |
 Profiling Results
 ------------------------------------------------------------
 TextEncoder_Quantized
+Device                          : cs_8_gen_2 (ANDROID 13)
+Runtime                         : QNN
+Estimated inference time (ms)   : 10.9
+Estimated peak memory usage (MB): [0, 3]
+Total # Ops                     : 569
+Compute Unit(s)                 : npu (569 ops) gpu (0 ops) cpu (0 ops)
 ------------------------------------------------------------
 UNet_Quantized
+Device                          : cs_8_gen_2 (ANDROID 13)
+Runtime                         : QNN
+Estimated inference time (ms)   : 258.2
+Estimated peak memory usage (MB): [13, 15]
+Total # Ops                     : 5433
+Compute Unit(s)                 : npu (5433 ops) gpu (0 ops) cpu (0 ops)
 ------------------------------------------------------------
 VAEDecoder_Quantized
+Device                          : cs_8_gen_2 (ANDROID 13)
+Runtime                         : QNN
+Estimated inference time (ms)   : 397.6
+Estimated peak memory usage (MB): [0, 2]
+Total # Ops                     : 408
+Compute Unit(s)                 : npu (408 ops) gpu (0 ops) cpu (0 ops)
 ------------------------------------------------------------
 ControlNet_Quantized
+Device                          : cs_8_gen_2 (ANDROID 13)
+Runtime                         : QNN
+Estimated inference time (ms)   : 104.7
+Estimated peak memory usage (MB): [2, 9]
+Total # Ops                     : 2405
+Compute Unit(s)                 : npu (2405 ops) gpu (0 ops) cpu (0 ops)
 ```

TextEncoder_Quantized.bin DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:4355278c27482577083f4afdec02b783d9d43fd9d349226265cbe455de6764d2
-size 162623336

TextEncoder_Quantized.so DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:4355278c27482577083f4afdec02b783d9d43fd9d349226265cbe455de6764d2
-size 162623336

UNet_Quantized.bin DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:a541970e46808e9d65337db4ff83022376d4acb35cb8159555b22fb65d92a0a3
-size 880611000

UNet_Quantized.so DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:a541970e46808e9d65337db4ff83022376d4acb35cb8159555b22fb65d92a0a3
-size 880611000

VAEDecoder_Quantized.bin DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5583d793b155115acfc74e32bdf87951519c5bf9f0675d177bf6d474edab1c0c
-size 72766264

VAEDecoder_Quantized.so DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5583d793b155115acfc74e32bdf87951519c5bf9f0675d177bf6d474edab1c0c
-size 72766264