v0.30.2
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.30.2 for changelog.
- ControlNet_Quantized.bin +0 -3
- README.md +39 -40
- TextEncoder_Quantized.bin +0 -3
- TextEncoder_Quantized.so +0 -3
- UNet_Quantized.bin +0 -3
- UNet_Quantized.so +0 -3
- VAEDecoder_Quantized.bin +0 -3
- VAEDecoder_Quantized.so +0 -3
ControlNet_Quantized.bin
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:d9757acb27d03ef59e7ed5b658d59f0a668b6eb896f6d280e3c9116953945da5
|
3 |
-
size 368625928
|
|
|
|
|
|
|
|
README.md
CHANGED
@@ -1,9 +1,8 @@
|
|
1 |
---
|
2 |
library_name: pytorch
|
3 |
-
license:
|
4 |
tags:
|
5 |
- generative_ai
|
6 |
-
- quantized
|
7 |
- android
|
8 |
pipeline_tag: unconditional-image-generation
|
9 |
|
@@ -27,7 +26,7 @@ More details on model performance across various devices, can be found
|
|
27 |
|
28 |
### Model Details
|
29 |
|
30 |
-
- **Model Type:**
|
31 |
- **Model Stats:**
|
32 |
- Input: Text prompt and input image as a reference
|
33 |
- Conditioning Input: Canny-Edge
|
@@ -37,20 +36,20 @@ More details on model performance across various devices, can be found
|
|
37 |
- ControlNet Number of parameters: 361M
|
38 |
- Model size: 1.4GB
|
39 |
|
40 |
-
| Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) |
|
41 |
|---|---|---|---|---|---|---|---|---|
|
42 |
-
| TextEncoder_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN |
|
43 |
-
| TextEncoder_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN |
|
44 |
-
| TextEncoder_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 10.
|
45 |
-
| UNet_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN |
|
46 |
-
| UNet_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN |
|
47 |
-
| UNet_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN |
|
48 |
-
| VAEDecoder_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN |
|
49 |
-
| VAEDecoder_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN |
|
50 |
-
| VAEDecoder_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN |
|
51 |
-
| ControlNet_Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN |
|
52 |
-
| ControlNet_Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN |
|
53 |
-
| ControlNet_Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 103.
|
54 |
|
55 |
|
56 |
|
@@ -112,39 +111,39 @@ python -m qai_hub_models.models.controlnet.export
|
|
112 |
Profiling Results
|
113 |
------------------------------------------------------------
|
114 |
TextEncoder_Quantized
|
115 |
-
Device :
|
116 |
-
Runtime : QNN
|
117 |
-
Estimated inference time (ms) :
|
118 |
-
Estimated peak memory usage (MB): [0,
|
119 |
-
Total # Ops :
|
120 |
-
Compute Unit(s) :
|
121 |
|
122 |
------------------------------------------------------------
|
123 |
UNet_Quantized
|
124 |
-
Device :
|
125 |
-
Runtime : QNN
|
126 |
-
Estimated inference time (ms) :
|
127 |
-
Estimated peak memory usage (MB): [
|
128 |
-
Total # Ops :
|
129 |
-
Compute Unit(s) :
|
130 |
|
131 |
------------------------------------------------------------
|
132 |
VAEDecoder_Quantized
|
133 |
-
Device :
|
134 |
-
Runtime : QNN
|
135 |
-
Estimated inference time (ms) :
|
136 |
-
Estimated peak memory usage (MB): [0,
|
137 |
-
Total # Ops :
|
138 |
-
Compute Unit(s) :
|
139 |
|
140 |
------------------------------------------------------------
|
141 |
ControlNet_Quantized
|
142 |
-
Device :
|
143 |
-
Runtime : QNN
|
144 |
-
Estimated inference time (ms) :
|
145 |
-
Estimated peak memory usage (MB): [2,
|
146 |
-
Total # Ops :
|
147 |
-
Compute Unit(s) :
|
148 |
```
|
149 |
|
150 |
|
|
|
1 |
---
|
2 |
library_name: pytorch
|
3 |
+
license: other
|
4 |
tags:
|
5 |
- generative_ai
|
|
|
6 |
- android
|
7 |
pipeline_tag: unconditional-image-generation
|
8 |
|
|
|
26 |
|
27 |
### Model Details
|
28 |
|
29 |
+
- **Model Type:** Model_use_case.image_generation
|
30 |
- **Model Stats:**
|
31 |
- Input: Text prompt and input image as a reference
|
32 |
- Conditioning Input: Canny-Edge
|
|
|
36 |
- ControlNet Number of parameters: 361M
|
37 |
- Model size: 1.4GB
|
38 |
|
39 |
+
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
40 |
|---|---|---|---|---|---|---|---|---|
|
41 |
+
| TextEncoder_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 10.874 ms | 0 - 3 MB | NPU | Use Export Script |
|
42 |
+
| TextEncoder_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 7.918 ms | 0 - 18 MB | NPU | Use Export Script |
|
43 |
+
| TextEncoder_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 10.875 ms | 0 - 3 MB | NPU | Use Export Script |
|
44 |
+
| UNet_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 258.151 ms | 13 - 15 MB | NPU | Use Export Script |
|
45 |
+
| UNet_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 197.629 ms | 13 - 31 MB | NPU | Use Export Script |
|
46 |
+
| UNet_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 256.936 ms | 13 - 16 MB | NPU | Use Export Script |
|
47 |
+
| VAEDecoder_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 397.625 ms | 0 - 2 MB | NPU | Use Export Script |
|
48 |
+
| VAEDecoder_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 300.627 ms | 0 - 21 MB | NPU | Use Export Script |
|
49 |
+
| VAEDecoder_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 395.006 ms | 0 - 3 MB | NPU | Use Export Script |
|
50 |
+
| ControlNet_Quantized | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 104.668 ms | 2 - 9 MB | NPU | Use Export Script |
|
51 |
+
| ControlNet_Quantized | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 77.289 ms | 2 - 23 MB | NPU | Use Export Script |
|
52 |
+
| ControlNet_Quantized | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 103.817 ms | 2 - 5 MB | NPU | Use Export Script |
|
53 |
|
54 |
|
55 |
|
|
|
111 |
Profiling Results
|
112 |
------------------------------------------------------------
|
113 |
TextEncoder_Quantized
|
114 |
+
Device : cs_8_gen_2 (ANDROID 13)
|
115 |
+
Runtime : QNN
|
116 |
+
Estimated inference time (ms) : 10.9
|
117 |
+
Estimated peak memory usage (MB): [0, 3]
|
118 |
+
Total # Ops : 569
|
119 |
+
Compute Unit(s) : npu (569 ops) gpu (0 ops) cpu (0 ops)
|
120 |
|
121 |
------------------------------------------------------------
|
122 |
UNet_Quantized
|
123 |
+
Device : cs_8_gen_2 (ANDROID 13)
|
124 |
+
Runtime : QNN
|
125 |
+
Estimated inference time (ms) : 258.2
|
126 |
+
Estimated peak memory usage (MB): [13, 15]
|
127 |
+
Total # Ops : 5433
|
128 |
+
Compute Unit(s) : npu (5433 ops) gpu (0 ops) cpu (0 ops)
|
129 |
|
130 |
------------------------------------------------------------
|
131 |
VAEDecoder_Quantized
|
132 |
+
Device : cs_8_gen_2 (ANDROID 13)
|
133 |
+
Runtime : QNN
|
134 |
+
Estimated inference time (ms) : 397.6
|
135 |
+
Estimated peak memory usage (MB): [0, 2]
|
136 |
+
Total # Ops : 408
|
137 |
+
Compute Unit(s) : npu (408 ops) gpu (0 ops) cpu (0 ops)
|
138 |
|
139 |
------------------------------------------------------------
|
140 |
ControlNet_Quantized
|
141 |
+
Device : cs_8_gen_2 (ANDROID 13)
|
142 |
+
Runtime : QNN
|
143 |
+
Estimated inference time (ms) : 104.7
|
144 |
+
Estimated peak memory usage (MB): [2, 9]
|
145 |
+
Total # Ops : 2405
|
146 |
+
Compute Unit(s) : npu (2405 ops) gpu (0 ops) cpu (0 ops)
|
147 |
```
|
148 |
|
149 |
|
TextEncoder_Quantized.bin
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:4355278c27482577083f4afdec02b783d9d43fd9d349226265cbe455de6764d2
|
3 |
-
size 162623336
|
|
|
|
|
|
|
|
TextEncoder_Quantized.so
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:4355278c27482577083f4afdec02b783d9d43fd9d349226265cbe455de6764d2
|
3 |
-
size 162623336
|
|
|
|
|
|
|
|
UNet_Quantized.bin
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:a541970e46808e9d65337db4ff83022376d4acb35cb8159555b22fb65d92a0a3
|
3 |
-
size 880611000
|
|
|
|
|
|
|
|
UNet_Quantized.so
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:a541970e46808e9d65337db4ff83022376d4acb35cb8159555b22fb65d92a0a3
|
3 |
-
size 880611000
|
|
|
|
|
|
|
|
VAEDecoder_Quantized.bin
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:5583d793b155115acfc74e32bdf87951519c5bf9f0675d177bf6d474edab1c0c
|
3 |
-
size 72766264
|
|
|
|
|
|
|
|
VAEDecoder_Quantized.so
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:5583d793b155115acfc74e32bdf87951519c5bf9f0675d177bf6d474edab1c0c
|
3 |
-
size 72766264
|
|
|
|
|
|
|
|