Add/update the quantized ONNX model files and README.md for Transformers.js v3
#1
by
whitphx
HF Staff
- opened
Applied Quantizations
β Based on model.onnx
with slimming
0%| | 0/1 [00:00<?, ?it/s]
Processing /tmp/tmpbl__miwg/model.onnx: 0%| | 0/1 [00:00<?, ?it/s]
0%| | 0/5 [00:00<?, ?it/s][A
- Quantizing to int8: 0%| | 0/5 [00:00<?, ?it/s][A2025-07-22 08:06:58,873 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:06:58,879 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:06:58,879 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:06:58,880 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:58,883 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:06:58,900 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:58,908 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:06:58,914 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:06:58,914 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:06:58,915 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:58,918 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:06:58,934 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:58,942 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:06:58,948 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:06:58,948 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:06:58,949 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:58,952 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:06:58,969 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:58,977 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:06:58,983 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:06:58,983 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:06:58,984 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:58,987 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,002 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,011 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,018 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:06:59,018 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:06:59,019 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,022 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,038 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,047 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,054 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:06:59,054 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:06:59,055 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,058 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,076 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,084 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,091 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:06:59,091 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:06:59,092 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,095 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,112 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,121 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,128 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:06:59,128 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:06:59,129 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,132 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,150 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,158 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,165 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:06:59,166 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:06:59,167 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,170 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,188 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,196 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,203 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:06:59,203 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:06:59,204 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,207 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,225 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,235 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,242 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:06:59,242 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:06:59,243 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,246 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,264 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:06:59,273 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:06:59,280 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:06:59,280 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:06:59,282 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:06:59,285 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:06:59,303 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified
- Quantizing to int8: 20%|ββ | 1/5 [00:05<00:20, 5.19s/it][A
- Quantizing to uint8: 20%|ββ | 1/5 [00:05<00:20, 5.19s/it][A2025-07-22 08:07:03,551 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:07:03,557 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:07:03,557 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:07:03,558 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,561 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,577 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,585 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,591 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:07:03,591 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:07:03,592 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,595 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,610 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,619 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,625 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:07:03,626 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:07:03,627 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,629 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,644 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,652 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,658 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:07:03,659 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:07:03,660 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,663 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,679 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,687 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,693 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:07:03,693 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:07:03,694 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,697 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,715 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,724 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,730 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:07:03,731 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:07:03,732 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,735 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,752 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,762 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,768 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:07:03,769 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:07:03,770 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,773 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,789 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,799 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,806 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:07:03,806 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:07:03,807 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,810 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,828 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,837 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,844 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:07:03,844 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:07:03,845 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,848 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,866 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,874 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,882 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:07:03,882 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:07:03,883 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,886 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,904 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,914 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,921 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:07:03,921 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:07:03,922 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,925 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,944 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:03,952 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:07:03,960 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:07:03,960 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:07:03,961 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:03,964 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:07:03,983 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified
- Quantizing to uint8: 40%|ββββ | 2/5 [00:09<00:14, 4.89s/it][A
- Quantizing to q4: 40%|ββββ | 2/5 [00:09<00:14, 4.89s/it] [A2025-07-22 08:07:05,947 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:07:05,948 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:05,954 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:05,954 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:07:05,955 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:07:05,956 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:07:05,957 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:07:05,960 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:07:05,961 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:07:05,961 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:07:05,961 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:07:05,961 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:05,966 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:05,966 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:05,973 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:05,973 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:07:05,973 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:07:05,973 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:07:05,973 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:07:05,979 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:05,985 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:05,986 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:07:05,987 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:07:05,991 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:05,998 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:05,998 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:06,004 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:06,004 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:07:06,004 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:07:06,004 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:07:06,004 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:07:06,011 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,017 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:07:06,018 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:07:06,019 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:07:06,023 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:06,029 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:06,029 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:06,035 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:06,036 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:07:06,036 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:07:06,036 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:07:06,036 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:07:06,042 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,048 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,049 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:07:06,050 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:07:06,054 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:06,061 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:06,061 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:06,067 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:06,068 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:07:06,068 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:07:06,068 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:07:06,068 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:07:06,074 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,080 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,081 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:06,082 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:07:06,083 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:07:06,086 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:06,093 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:06,093 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:06,099 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:06,099 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:07:06,099 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:07:06,099 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:07:06,100 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:07:06,106 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,112 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:07:06,113 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:07:06,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:07:06,118 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:06,124 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:06,124 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:06,131 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:06,131 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:07:06,131 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:07:06,131 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:07:06,131 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:07:06,137 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,143 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:07:06,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:07:06,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:07:06,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:06,156 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:06,156 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:06,162 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:06,163 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:07:06,163 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:07:06,163 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:07:06,163 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:07:06,169 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,175 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:07:06,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:07:06,181 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:06,187 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:06,188 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:06,194 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:06,194 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:07:06,194 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:07:06,194 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:07:06,194 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:07:06,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:07:06,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:07:06,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:06,220 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:06,220 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:06,226 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:06,227 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:07:06,227 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:07:06,227 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:07:06,227 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:07:06,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,242 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:07:06,242 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:07:06,242 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:07:06,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:07:06,246 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:06,252 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:06,252 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:06,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:06,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:07:06,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:07:06,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:07:06,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:06,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:07:06,265 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:07:06,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:07:06,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:07:06,277 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:07:06,278 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:07:06,278 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:07:06,278 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:06,284 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:06,284 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:06,290 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:06,290 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:07:06,290 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:07:06,290 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:07:06,290 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:06,296 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:07:06,297 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,302 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:06,302 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:06,303 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:07:06,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:07:06,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:07:06,309 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:06,315 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:06,315 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:06,322 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:06,322 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:07:06,322 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:07:06,322 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:07:06,322 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:07:06,328 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:07:06,329 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...
- Quantizing to q4: 60%|ββββββ | 3/5 [00:11<00:06, 3.38s/it][A
- Quantizing to q4f16: 60%|ββββββ | 3/5 [00:11<00:06, 3.38s/it][A2025-07-22 08:07:07,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:07:07,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,534 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:07:07,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:07:07,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:07:07,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:07:07,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:07:07,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:07,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:07,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:07,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:07,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:07:07,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:07:07,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:07:07,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:07:07,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:07:07,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:07:07,571 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:07,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:07,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:07,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:07,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:07:07,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:07:07,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:07:07,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:07:07,590 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:07:07,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:07,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:07:07,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:07,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:07,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:07,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:07,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:07:07,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:07:07,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:07:07,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:07:07,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:07:07,630 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:07:07,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:07,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:07,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:07,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:07,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:07:07,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:07:07,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:07:07,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:07:07,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:07:07,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:07:07,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:07:07,667 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:07,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:07,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:07,680 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:07,680 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:07:07,680 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:07:07,680 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:07:07,680 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:07:07,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:07:07,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:07:07,698 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:07,704 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:07,705 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:07,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:07,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:07:07,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:07:07,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:07:07,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:07:07,717 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:07:07,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:07:07,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:07:07,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:07,736 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:07,736 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:07,742 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:07,742 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:07:07,742 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:07:07,742 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:07:07,742 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:07,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:07,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:07:07,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:07:07,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:07:07,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:07:07,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:07:07,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:07:07,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:07:07,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:07:07,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:07,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:07,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:07,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:07,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:07:07,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:07:07,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:07:07,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:07:07,780 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:07:07,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:07:07,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:07:07,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:07:07,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:07,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:07,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:07,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:07,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:07:07,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:07:07,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:07:07,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:07:07,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:07:07,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:07:07,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:07:07,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:07:07,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:07:07,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:07:07,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:07,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:07,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:07,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:07,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:07:07,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:07:07,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:07:07,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:07:07,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:07:07,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:07:07,853 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,853 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,853 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:07:07,853 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:07:07,853 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:07:07,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:07:07,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:07:07,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:07:07,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:07:07,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:07,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:07,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:07,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:07,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:07:07,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:07:07,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:07:07,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:07:07,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:07,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:07,883 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:07:07,884 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:07:07,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:07:07,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:07:07,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:07,888 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:07,888 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:07:07,888 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:07:07,888 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:07:07,888 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:07:07,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:07,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:07,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:07,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:07,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:07:07,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:07:07,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:07:07,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:07:07,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 1.544654404384005e-09 will be truncated to 1e-07
warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:92: UserWarning: the float32 number -3.8726669093769317e-10 will be truncated to -1e-07
warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:85: UserWarning: the float32 number -3.4028234663852886e+38 will be truncated to -10000.0
warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 9.999999960041972e-13 will be truncated to 1e-07
warnings.warn(
- Quantizing to q4f16: 60%|ββββββ | 3/5 [00:14<00:09, 4.68s/it]
Processing /tmp/tmpbl__miwg/model.onnx: 0%| | 0/1 [00:14<?, ?it/s]
Traceback (most recent call last):
File "<frozen runpy>", line 198, in _run_module_as_main
File "<frozen runpy>", line 88, in _run_code
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 377, in <module>
main()
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 374, in main
quantize(input_folder, output_folder, quantization_args)
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 326, in quantize
quantize_fp16(
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 223, in quantize_fp16
check_and_save_model(model_fp16, save_path)
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 29, in check_and_save_model
strict_check_model(model)
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 21, in strict_check_model
raise e
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 16, in strict_check_model
onnx.checker.check_model(model_or_path, full_check=True)
File "/home/ubuntu/.cache/uv/archive-v0/iAncxVR1WPOl_8LkA6LpD/lib/python3.12/site-packages/onnx/checker.py", line 179, in check_model
C.check_model(
onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type:Range, node name: /encoder/layers.0/attn/rotary_emb/Range): start typestr: T, has unsupported type: tensor(float16)
β Based on model.onnx
without slimming
0%| | 0/1 [00:00<?, ?it/s]
Processing /tmp/tmpe2xdvb4_/model.onnx: 0%| | 0/1 [00:00<?, ?it/s]
0%| | 0/5 [00:00<?, ?it/s][A
- Quantizing to int8: 0%| | 0/5 [00:00<?, ?it/s][A2025-07-22 08:07:16,107 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:07:16,114 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:07:16,114 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:07:16,116 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,119 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,137 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,146 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,153 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:07:16,153 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:07:16,155 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,159 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,176 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,184 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,192 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:07:16,192 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:07:16,194 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,197 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,215 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,224 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,232 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:07:16,232 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:07:16,234 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,237 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,254 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,263 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,271 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:07:16,271 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:07:16,273 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,277 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,294 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,304 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,312 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:07:16,312 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:07:16,314 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,318 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,337 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,347 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,355 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:07:16,355 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:07:16,357 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,361 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,379 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,389 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,398 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:07:16,398 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:07:16,400 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,404 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,423 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,433 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,442 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:07:16,442 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:07:16,444 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,448 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,467 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,478 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,487 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:07:16,487 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:07:16,489 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,493 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,511 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,522 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,531 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:07:16,531 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:07:16,533 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,537 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,557 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:16,568 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:07:16,577 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:07:16,577 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:07:16,579 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:16,584 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:07:16,603 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified
- Quantizing to int8: 20%|ββ | 1/5 [00:05<00:21, 5.30s/it][A
- Quantizing to uint8: 20%|ββ | 1/5 [00:05<00:21, 5.30s/it][A2025-07-22 08:07:20,348 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:07:20,355 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:07:20,355 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:07:20,357 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,360 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,377 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,388 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,395 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:07:20,395 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:07:20,397 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,401 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,416 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,425 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,432 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:07:20,432 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:07:20,434 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,438 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,455 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,463 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,471 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:07:20,471 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:07:20,473 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,477 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,494 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,503 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,511 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:07:20,511 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:07:20,513 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,517 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,535 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,545 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,553 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:07:20,553 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:07:20,555 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,559 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,578 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,587 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,595 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:07:20,595 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:07:20,597 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,601 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,620 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,629 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,637 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:07:20,638 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:07:20,640 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,644 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,662 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,673 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,681 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:07:20,682 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:07:20,683 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,687 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,706 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,716 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,725 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:07:20,725 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:07:20,727 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,731 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,751 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,760 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,769 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:07:20,769 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:07:20,771 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,775 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,794 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:07:20,804 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:07:20,814 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:07:20,814 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:07:20,816 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:07:20,820 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:07:20,840 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified
- Quantizing to uint8: 40%|ββββ | 2/5 [00:09<00:13, 4.62s/it][A
- Quantizing to q4: 40%|ββββ | 2/5 [00:09<00:13, 4.62s/it] [A2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Constant ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant_1 ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant ...
2025-07-22 08:07:22,425 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_1 ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze_1 ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_2 ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_3 ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:07:22,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:07:22,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant ...
2025-07-22 08:07:22,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:07:22,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_3 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_4 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_5 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,434 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div ...
2025-07-22 08:07:22,435 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_75 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_6 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_7 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_4 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_8 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_5 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_9 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_8 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_6 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_10 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_9 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_11 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul_1 ...
2025-07-22 08:07:22,439 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,440 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,440 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,440 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:07:22,440 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:07:22,440 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant_1 ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:07:22,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:22,450 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:22,450 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:22,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:22,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:07:22,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:07:22,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:07:22,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:22,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast_1 ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant_1 ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:07:22,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_1 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_1 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_2 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_2 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_2 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_3 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast_1 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_4 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_5 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,470 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:07:22,474 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_6 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_7 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_4 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_8 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_5 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_9 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_8 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_6 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_10 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_9 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_11 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:07:22,475 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant_1 ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:07:22,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:07:22,480 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:22,486 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:22,486 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:22,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:22,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:07:22,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:07:22,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:07:22,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast_1 ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant_1 ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:07:22,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:07:22,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:07:22,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:07:22,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:07:22,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_1 ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_1 ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_2 ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_2 ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_2 ...
2025-07-22 08:07:22,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_3 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast_1 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_4 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_5 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,510 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_6 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_7 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_4 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_8 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_5 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_9 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_8 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_6 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_10 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_9 ...
2025-07-22 08:07:22,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_11 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul_1 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:07:22,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant ...
2025-07-22 08:07:22,516 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant_1 ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:07:22,517 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:22,523 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:22,523 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:22,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:22,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:07:22,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:07:22,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:07:22,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:22,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:22,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:07:22,535 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast_1 ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant_1 ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:07:22,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:07:22,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant ...
2025-07-22 08:07:22,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:07:22,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_3 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_4 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_5 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,546 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,547 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_6 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_7 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_4 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_8 ...
2025-07-22 08:07:22,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_5 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_9 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_8 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_6 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_10 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_9 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_11 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul_1 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:07:22,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,552 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant_1 ...
2025-07-22 08:07:22,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:07:22,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:07:22,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:07:22,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:07:22,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:07:22,553 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:22,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:22,559 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:22,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:22,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:07:22,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:07:22,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:07:22,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast_1 ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant ...
2025-07-22 08:07:22,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant_1 ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:07:22,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_1 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_1 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_2 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_2 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_2 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_3 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast_1 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_4 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_5 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,583 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:07:22,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_6 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_7 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_4 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_8 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_5 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_9 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_8 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_6 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_10 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_9 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_11 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul_1 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:07:22,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant_1 ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:07:22,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:22,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:22,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:22,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:22,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:07:22,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:07:22,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:07:22,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:22,608 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast_1 ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant_1 ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:07:22,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_1 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_1 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_2 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_2 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_2 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_3 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast_1 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_4 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_5 ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,619 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_6 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:07:22,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_7 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_4 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_8 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_5 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_9 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_8 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_6 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_10 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_9 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_11 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul_1 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:07:22,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant_1 ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:07:22,626 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:22,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:22,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:22,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:22,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:07:22,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:07:22,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:07:22,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast_1 ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant ...
2025-07-22 08:07:22,645 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant_1 ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:07:22,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:07:22,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant ...
2025-07-22 08:07:22,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_3 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_4 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_5 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add ...
2025-07-22 08:07:22,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,655 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,656 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_6 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_7 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_4 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_8 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_5 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_9 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_8 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_6 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_10 ...
2025-07-22 08:07:22,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_9 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_11 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul_1 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:07:22,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:22,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:22,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:07:22,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast ...
2025-07-22 08:07:22,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:07:22,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:07:22,662 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant_1 ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:07:22,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:22,669 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:22,669 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:22,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:22,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:07:22,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:07:22,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:07:22,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:22,681 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast_1 ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant_1 ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:07:22,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_1 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_1 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_2 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_2 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_2 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_3 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast_1 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_4 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_5 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_6 ...
2025-07-22 08:07:22,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_7 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_4 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_8 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_5 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_9 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_8 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_6 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_10 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_9 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_11 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul_1 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:07:22,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant_1 ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:07:22,699 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:22,705 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:22,705 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:22,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:22,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:07:22,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:07:22,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:07:22,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast_1 ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant_1 ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:07:22,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_1 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_1 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_2 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_2 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_2 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_3 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast_1 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_4 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_5 ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:07:22,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,728 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,729 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_6 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:07:22,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_7 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_4 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_8 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_5 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_9 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_8 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_6 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_10 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_9 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_11 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul_1 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:07:22,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant_1 ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:07:22,735 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:22,741 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:22,741 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:22,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:22,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:07:22,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:07:22,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:07:22,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast_1 ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant_1 ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:07:22,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_1 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_1 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_2 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_2 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_2 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_3 ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div ...
2025-07-22 08:07:22,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast_1 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_4 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_5 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,765 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:07:22,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_6 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_7 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_4 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_8 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_5 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_9 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_8 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_6 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_10 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_9 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_11 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul_1 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:07:22,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:07:22,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant ...
2025-07-22 08:07:22,771 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant_1 ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:07:22,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:22,778 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:22,778 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:22,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:22,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:07:22,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:07:22,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:07:22,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:22,790 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:22,790 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast_1 ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant_1 ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:07:22,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:07:22,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant ...
2025-07-22 08:07:22,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_3 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_4 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_5 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,802 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_6 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_7 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_4 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_8 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_5 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_9 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_8 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_6 ...
2025-07-22 08:07:22,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_10 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_9 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_11 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul_1 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:07:22,804 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:22,807 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant_1 ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:07:22,808 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:22,814 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:22,814 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:22,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:22,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:07:22,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:07:22,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:07:22,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast_1 ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant_1 ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:07:22,827 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_1 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_1 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_2 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_2 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_2 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_3 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast_1 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_4 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_2 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_5 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:07:22,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:22,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:22,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:22,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:22,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:22,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_6 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:07:22,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_7 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_4 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_8 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_5 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_9 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_8 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_6 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_10 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_9 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_11 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul_1 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_5 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:07:22,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant_1 ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:07:22,844 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:22,850 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:22,851 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:22,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:22,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:07:22,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:07:22,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:07:22,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast_1 ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant_1 ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:07:22,863 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:07:22,864 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:07:22,864 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:07:22,864 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...
- Quantizing to q4: 60%|ββββββ | 3/5 [00:10<00:05, 2.97s/it][A
- Quantizing to q4f16: 60%|ββββββ | 3/5 [00:10<00:05, 2.97s/it][A2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Constant ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant_1 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_1 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze_1 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_2 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_3 ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:07:23,420 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_1 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_1 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_2 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_2 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_2 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_3 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast_1 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_4 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_5 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,426 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,427 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,428 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,430 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,431 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_75 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_6 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,432 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_7 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_4 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_8 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_5 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_9 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_8 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_6 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_10 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_9 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_11 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul_1 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:07:23,433 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant_1 ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:07:23,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:23,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:07:23,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:23,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:07:23,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:07:23,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:07:23,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:07:23,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast_1 ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant_1 ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:07:23,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_1 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_1 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_2 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_2 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_2 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_3 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast_1 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_4 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_5 ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,462 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,463 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,465 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_6 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:07:23,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_7 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_4 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_8 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_5 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_9 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_8 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_6 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_10 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_9 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_11 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul_1 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:07:23,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant_1 ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:07:23,476 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:23,482 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:07:23,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:23,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:07:23,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:07:23,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:07:23,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:07:23,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast_1 ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant_1 ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:07:23,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_1 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_1 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_2 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_2 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_2 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_3 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast_1 ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:07:23,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_4 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_5 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:07:23,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_6 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_7 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_4 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_8 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_5 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_9 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_8 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_6 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_10 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_9 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_11 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul_1 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:07:23,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:07:23,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant_1 ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:07:23,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:07:23,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:07:23,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:07:23,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:23,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:07:23,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:23,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:07:23,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:07:23,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:07:23,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:07:23,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast_1 ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant_1 ...
2025-07-22 08:07:23,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:07:23,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:07:23,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:07:23,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:07:23,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:07:23,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_1 ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_1 ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_2 ...
2025-07-22 08:07:23,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_3 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast_1 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_4 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_5 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div ...
2025-07-22 08:07:23,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_6 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_7 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_4 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_8 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_5 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_9 ...
2025-07-22 08:07:23,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_8 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_6 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_10 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_9 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_11 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul_1 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:07:23,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:23,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:07:23,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:07:23,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast ...
2025-07-22 08:07:23,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant_1 ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:07:23,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:23,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:07:23,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:23,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:07:23,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:07:23,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:07:23,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:07:23,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:23,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast_1 ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant_1 ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:07:23,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_2 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_2 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_2 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_3 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_4 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_5 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_6 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:07:23,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_7 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_4 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_8 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_5 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_9 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_8 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_6 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_10 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_9 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_11 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul_1 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:07:23,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant_1 ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:07:23,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:23,591 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:07:23,591 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:23,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:07:23,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:07:23,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:07:23,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:07:23,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast_1 ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant_1 ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:07:23,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_1 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_1 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_2 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_2 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_2 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_3 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast_1 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_4 ...
2025-07-22 08:07:23,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_5 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:07:23,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_6 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_7 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_4 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_8 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_5 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_9 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_8 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_6 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_10 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_9 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_11 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul_1 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:07:23,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:07:23,618 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant_1 ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:07:23,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:07:23,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:07:23,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:07:23,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:07:23,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:23,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:07:23,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:23,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:07:23,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:07:23,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:07:23,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:07:23,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:23,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast_1 ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant_1 ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:07:23,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_2 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_2 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_2 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_3 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_4 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_5 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_6 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:07:23,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_7 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_4 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_8 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_5 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_9 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_8 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_6 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_10 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_9 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_11 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul_1 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:07:23,654 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant_1 ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:07:23,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:23,664 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:07:23,664 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:23,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:07:23,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:07:23,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:07:23,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:07:23,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast_1 ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant_1 ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:07:23,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_1 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_1 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_2 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_2 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_2 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_3 ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast ...
2025-07-22 08:07:23,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast_1 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_4 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_5 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_6 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_7 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_4 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_8 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_5 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_9 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_8 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_6 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_10 ...
2025-07-22 08:07:23,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_9 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_11 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul_1 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:07:23,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant ...
2025-07-22 08:07:23,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant_1 ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:07:23,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:23,701 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:07:23,701 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:23,707 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:07:23,707 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:07:23,707 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:07:23,707 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:07:23,707 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:23,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast_1 ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant_1 ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:07:23,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_1 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_1 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_2 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_2 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_2 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_3 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast_1 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_4 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_5 ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:07:23,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_6 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_7 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_4 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_8 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_5 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_9 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_8 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_6 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_10 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_9 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_11 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul_1 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:07:23,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant_1 ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:07:23,731 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:07:23,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:07:23,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:07:23,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:23,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:07:23,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:23,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:07:23,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:07:23,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:07:23,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:07:23,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast_1 ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant_1 ...
2025-07-22 08:07:23,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:07:23,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:07:23,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:07:23,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:07:23,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:07:23,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_1 ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_1 ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:07:23,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_3 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast_1 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_4 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_5 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_6 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:07:23,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_7 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_4 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_8 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_5 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_9 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_8 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_6 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_10 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_9 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_11 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul_1 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:07:23,764 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant_1 ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:07:23,768 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:23,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:07:23,774 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:23,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:07:23,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:07:23,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:07:23,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:07:23,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast_1 ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant_1 ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:07:23,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_1 ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_1 ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_2 ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_2 ...
2025-07-22 08:07:23,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_2 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_3 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast_1 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_4 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_5 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add ...
2025-07-22 08:07:23,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_6 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_7 ...
2025-07-22 08:07:23,800 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_4 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_8 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_5 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_9 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_8 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_6 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_10 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_9 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_11 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul_1 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:07:23,801 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant_1 ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:07:23,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:23,811 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:07:23,811 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:23,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:07:23,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:07:23,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:07:23,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:07:23,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast_1 ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:07:23,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant_1 ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:07:23,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:07:23,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:07:23,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_2 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_2 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_2 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_3 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_4 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_2 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_5 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_2 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_2 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_1 ...
2025-07-22 08:07:23,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_3 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_2 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_4 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_5 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_6 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_7 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_8 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_9 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_10 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_11 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_3 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_12 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_4 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_4 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_13 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_5 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_14 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_1 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_15 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_16 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_17 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_2 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_18 ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where ...
2025-07-22 08:07:23,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_19 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_5 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_20 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_6 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_6 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_21 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_7 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_22 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_3 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_23 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_24 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_25 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_4 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_26 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_1 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_1 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_27 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_1 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_28 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_29 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_30 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:07:23,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_7 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_31 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_8 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_32 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_33 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_34 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_35 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_6 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_36 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_7 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_37 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_38 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_39 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_40 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_41 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_42 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_43 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_44 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_45 ...
2025-07-22 08:07:23,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_46 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_9 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_47 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_11 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_10 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_48 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_12 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_49 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_9 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_50 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_51 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_52 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_10 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_53 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_2 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_2 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_54 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_4 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_11 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_55 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_13 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_12 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_56 ...
2025-07-22 08:07:23,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_14 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_57 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_11 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_58 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_59 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_60 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_12 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_61 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_3 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_3 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_62 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_5 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_63 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_64 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_65 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_13 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_66 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_15 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_67 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_68 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_2 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_69 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div_1 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_70 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_14 ...
2025-07-22 08:07:23,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_71 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_15 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_72 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_73 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_74 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_6 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:07:23,837 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_7 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_4 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_8 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_5 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_9 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_8 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_6 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_10 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_9 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_11 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul_1 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_5 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:07:23,838 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant_1 ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:07:23,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:23,849 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:07:23,849 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:23,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:07:23,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:07:23,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:07:23,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:07:23,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:23,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast_1 ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant_1 ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:07:23,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 9.999999960041972e-13 will be truncated to 1e-07
warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:85: UserWarning: the float32 number -3.4028234663852886e+38 will be truncated to -10000.0
warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 1.544654404384005e-09 will be truncated to 1e-07
warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:92: UserWarning: the float32 number -3.8726669093769317e-10 will be truncated to -1e-07
warnings.warn(
- Quantizing to q4f16: 60%|ββββββ | 3/5 [00:12<00:08, 4.20s/it]
Processing /tmp/tmpe2xdvb4_/model.onnx: 0%| | 0/1 [00:12<?, ?it/s]
Traceback (most recent call last):
File "<frozen runpy>", line 198, in _run_module_as_main
File "<frozen runpy>", line 88, in _run_code
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 377, in <module>
main()
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 374, in main
quantize(input_folder, output_folder, quantization_args)
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 326, in quantize
quantize_fp16(
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 223, in quantize_fp16
check_and_save_model(model_fp16, save_path)
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 29, in check_and_save_model
strict_check_model(model)
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 21, in strict_check_model
raise e
File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 16, in strict_check_model
onnx.checker.check_model(model_or_path, full_check=True)
File "/home/ubuntu/.cache/uv/archive-v0/iAncxVR1WPOl_8LkA6LpD/lib/python3.12/site-packages/onnx/checker.py", line 179, in check_model
C.check_model(
onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type:Range, node name: /encoder/layers.0/attn/rotary_emb/Range): start typestr: T, has unsupported type: tensor(float16)
Xenova
changed pull request status to
merged