Update README.md

README.md CHANGED
@@ -36,7 +36,7 @@ All models are evaluated in non-thinking mode.
 | Gemma 3n E2B | 1G, theoretically | 36.88 | 27.06 | 12.50 | 6.66 |
 | Gemma 3n E4B | 2G, theoretically | 21.93 | 16.58 | 7.37 | 4.01 |
 
-Note
+Note: the i9 14900 and 1+13 8ge4 use 4 threads; all other platforms use the number of threads that achieves the maximum speed. All models here have been quantized to q4_0.
 
 You can deploy SmallThinker with offloading support using [PowerInfer](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker)
 
 ## Model Card