Xenova HF Staff whitphx HF Staff commited on
Commit
9916cc9
·
verified ·
1 Parent(s): 6eaa46b

Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)

Browse files

- Add/update the quantized ONNX model files and README.md for Transformers.js v3 (b06fb7fa94030780bd6f74b66508cfdab242d096)


Co-authored-by: Yuichiro Tachibana <[email protected]>

README.md CHANGED
@@ -5,4 +5,20 @@ library_name: transformers.js
5
 
6
  https://huggingface.co/EleutherAI/pythia-31m with ONNX weights to be compatible with Transformers.js.
7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
 
5
 
6
  https://huggingface.co/EleutherAI/pythia-31m with ONNX weights to be compatible with Transformers.js.
7
 
8
+ ## Usage (Transformers.js)
9
+
10
+ If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
11
+ ```bash
12
+ npm i @huggingface/transformers
13
+ ```
14
+
15
+ **Example:** Text generation.
16
+
17
+ ```js
18
+ import { pipeline } from '@huggingface/transformers';
19
+
20
+ const generator = await pipeline('text-generation', 'Xenova/pythia-31m');
21
+ const output = await generator('Once upon a time, there was', { max_new_tokens: 10 });
22
+ ```
23
+
24
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
onnx/decoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48daca08832028d49af7cdab70b10000d920fc15e261f114784d3b7b7dfc3d53
3
+ size 66022175
onnx/decoder_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f71c76cb5a13f28396312e4b3e41c3776a41c11c1d73bd50b02cf2b0f315385e
3
+ size 65451550
onnx/decoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c39d7cf9d49d50e9fc16b30cfe21b6d6a0d54fe2549fcfa3a1d5eb4610e097b6
3
+ size 35115842
onnx/decoder_model_merged_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3d4e23d74f529d4b497b2d65c9e789c916e88d28fcb6a7526061528f154afbfc
3
+ size 66359333
onnx/decoder_model_merged_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b5427cdce54a1e31f91a8708c3e8814e7655f8fbdbf21599a1dbd09d6f4a8a8
3
+ size 65788426
onnx/decoder_model_merged_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eabdfdf570e1204d9f45a53dafe63126957ab77a17f19a317db7883e9dc08a95
3
+ size 35481190
onnx/decoder_model_merged_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b246484485804ec996fd4674f299574470a342330aaa3d1c080d4cd8b7c68330
3
+ size 67458696
onnx/decoder_model_merged_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9567fdc8ed403cc02a7031cab353d5c103cd9f10a59ed831a4bbfc7553d3f31d
3
+ size 40500628
onnx/decoder_model_merged_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:75d65054832a4ae51ff30fdb7b8e98e8aebfad86fe7739834c84d7b918a78d7d
3
+ size 35481201
onnx/decoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:016b687e7497c68a047ebdda8a095602139683df0b639753a9b615b2543b13a0
3
+ size 67121763
onnx/decoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7a1588f9c01f0ec7bed341bf4ec6cda82f93ed40f7ef00fcfd903a168b64ea79
3
+ size 40160425
onnx/decoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c4d86cd934e21b6644f46ba032157db1a8d5c6b44bc1e4f15172f39954686b0
3
+ size 35115853
onnx/decoder_with_past_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fef4cfe1159872f274f264ab5a6350693a4b3532fd12cfc9c34584591bb068ae
3
+ size 66050207
onnx/decoder_with_past_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6952f106a64efcce74a3869dd2050194ea62ff5e3821e80cca1c682a1e5a78bc
3
+ size 65481753
onnx/decoder_with_past_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:21469f894e8e7015d70d12747cedec00643a7921f75235e6fec7064fb133a343
3
+ size 35143874
onnx/decoder_with_past_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c871eaa0b207423b5051750573568129d3f7e13cd8f8679036276f86d5ee90f
3
+ size 67149795
onnx/decoder_with_past_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e8c65d25e0316bfded34767a97adac68ea838d1f60a93625355410446ef5ffdc
3
+ size 40190628
onnx/decoder_with_past_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb70609c792868baaa65c14d69524f81ac8ab8c7ed3be5b2274d88309e46c6bf
3
+ size 35143885