PowerInfer/SmallThinker-3B-Preview
Text Generation
•
Updated
•
26.3k
•
•
335
Ah I see, hope so too - Catching both size and speed means alot in mobile! Gotta preemptively thanks your future IQ4_NL quants ;)
Interesting, in this case will description "Legacy format, generally not worth using over similarly sized formats" of Q4_0
change to something like "ARM recommended (Do not use in Apple Silicons)" - or will IQ4_NL
added in list and recommend that over Q4_0
?