Update README.md
README.md
CHANGED
@@ -146,10 +146,10 @@ We can use the following code to get a sense of peak memory usage during inferen
 
 ## Results
 
-| Benchmark
-
-|
-| Peak Memory
+| Benchmark        |                |                                |
+|------------------|----------------|--------------------------------|
+|                  | Phi-4 mini-Ins | Phi-4-mini-instruct-int4wo-hqq |
+| Peak Memory (GB) | 8.91           | 2.98 (67% reduction)           |
 
 
 ## Benchmark Peak Memory
 
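The "67% reduction" figure in the new table row can be checked from the two peak-memory values it reports. The following is a minimal sketch of that arithmetic only; the helper name `percent_reduction` is hypothetical and is not part of the README or of any library.

```python
def percent_reduction(baseline_gb: float, quantized_gb: float) -> float:
    """Return the relative memory saving of the quantized model, in percent.

    Hypothetical helper for illustration; not taken from the README.
    """
    if baseline_gb <= 0:
        raise ValueError("baseline_gb must be positive")
    return (baseline_gb - quantized_gb) / baseline_gb * 100.0


# Peak memory values (GB) copied from the table added in this diff.
baseline_peak_gb = 8.91   # baseline column
quantized_peak_gb = 2.98  # Phi-4-mini-instruct-int4wo-hqq column

reduction = percent_reduction(baseline_peak_gb, quantized_peak_gb)
print(f"{reduction:.0f}% reduction")  # prints "67% reduction"
```

The unrounded value is about 66.6%, so the table's 67% is consistent with rounding to the nearest whole percent.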