Add/update the quantized ONNX model files and README.md for Transformers.js v3

#1
by whitphx HF Staff - opened

Applied Quantizations

❌ Based on model.onnx with slimming

0%|          | 0/1 [00:00<?, ?it/s]
Processing /tmp/tmpfxbvrttu/model.onnx:   0%|          | 0/1 [00:00<?, ?it/s]

  0%|          | 0/5 [00:00<?, ?it/s]

 - Quantizing to int8:   0%|          | 0/5 [00:00<?, ?it/s]2025-07-22 08:08:18,400 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:08:18,407 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:08:18,407 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:08:18,408 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,411 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,427 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,435 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,441 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:08:18,441 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:08:18,442 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,445 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,462 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,470 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,476 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:08:18,476 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:08:18,477 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,480 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,496 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,504 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,510 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:08:18,510 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:08:18,511 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,514 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,529 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,538 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,545 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:08:18,545 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:08:18,546 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,549 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,566 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,575 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,582 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:08:18,582 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:08:18,583 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,586 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,603 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,611 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,618 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:08:18,619 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:08:18,620 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,622 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,640 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,649 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,656 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:08:18,656 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:08:18,657 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,660 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,678 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,686 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,693 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:08:18,693 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:08:18,695 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,698 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,715 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,724 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,731 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:08:18,731 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:08:18,732 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,735 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,753 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,762 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,770 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:08:18,770 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:08:18,771 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,774 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,792 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:18,801 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:08:18,808 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:08:18,808 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:08:18,809 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:18,813 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:08:18,831 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified


 - Quantizing to int8:  20%|β–ˆβ–ˆ        | 1/5 [00:05<00:20,  5.18s/it]

 - Quantizing to uint8:  20%|β–ˆβ–ˆ        | 1/5 [00:05<00:20,  5.18s/it]2025-07-22 08:08:23,006 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:08:23,012 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:08:23,013 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:08:23,014 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,016 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,033 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,040 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,046 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:08:23,046 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:08:23,047 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,050 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,065 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,074 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,080 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:08:23,080 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:08:23,081 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,084 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,099 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,106 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,113 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:08:23,113 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:08:23,114 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,117 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,134 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,141 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,148 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:08:23,148 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:08:23,149 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,152 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,169 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,178 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,184 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:08:23,185 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:08:23,186 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,189 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,206 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,216 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,222 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:08:23,223 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:08:23,224 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,227 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,244 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,254 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,261 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:08:23,261 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:08:23,262 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,265 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,283 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,292 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,299 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:08:23,299 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:08:23,300 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,303 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,322 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,331 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,338 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:08:23,338 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:08:23,339 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,342 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,360 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,369 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,376 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:08:23,376 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:08:23,377 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,381 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,399 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:23,408 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:08:23,415 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:08:23,415 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:08:23,416 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:23,420 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:08:23,438 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified


 - Quantizing to uint8:  40%|β–ˆβ–ˆβ–ˆβ–ˆ      | 2/5 [00:09<00:14,  4.86s/it]

 - Quantizing to q4:  40%|β–ˆβ–ˆβ–ˆβ–ˆ      | 2/5 [00:09<00:14,  4.86s/it]   2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:08:25,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:08:25,438 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:08:25,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:25,448 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:25,448 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:25,454 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:25,454 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:08:25,455 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:08:25,455 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:08:25,455 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:08:25,461 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,466 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,467 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:08:25,468 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:08:25,469 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:25,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:25,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:08:25,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:08:25,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:08:25,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:08:25,472 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:08:25,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:08:25,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:08:25,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:08:25,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:08:25,473 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:25,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:25,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:25,487 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:25,487 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:08:25,487 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:08:25,487 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:08:25,487 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:08:25,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:08:25,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:08:25,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:25,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:25,511 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:25,518 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:25,518 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:08:25,518 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:08:25,518 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:08:25,518 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:08:25,524 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:08:25,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:08:25,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:08:25,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:08:25,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:08:25,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:08:25,533 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,533 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,533 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:08:25,533 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:08:25,533 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:08:25,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:25,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:25,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:25,550 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:25,550 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:08:25,550 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:08:25,550 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:08:25,550 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:08:25,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:08:25,557 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:08:25,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:25,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:08:25,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:08:25,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:25,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:25,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:25,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:25,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:08:25,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:08:25,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:08:25,582 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:08:25,588 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:08:25,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:08:25,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:08:25,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:08:25,589 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,594 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,595 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:08:25,596 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:08:25,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:25,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:25,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:08:25,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:08:25,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:08:25,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:08:25,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:25,607 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:25,607 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:25,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:25,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:08:25,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:08:25,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:08:25,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:25,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:25,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:08:25,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:08:25,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:08:25,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:08:25,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:08:25,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:08:25,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:25,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:25,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:25,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:25,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:08:25,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:08:25,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:08:25,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:08:25,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:08:25,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:08:25,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:08:25,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:08:25,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,660 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:08:25,661 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:25,664 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:25,664 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:08:25,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:25,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:25,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:25,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:25,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:08:25,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:08:25,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:08:25,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:08:25,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:08:25,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:08:25,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:08:25,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:08:25,691 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:08:25,692 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:08:25,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:25,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:25,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:25,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:25,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:08:25,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:08:25,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:08:25,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:08:25,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:08:25,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:08:25,727 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:25,733 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:25,734 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:25,740 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:25,740 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:08:25,740 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:08:25,740 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:08:25,740 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:25,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:08:25,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,753 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:08:25,754 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:08:25,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:08:25,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:25,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:25,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:25,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:25,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:08:25,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:08:25,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:08:25,772 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:08:25,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:25,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:08:25,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:08:25,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:08:25,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:25,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:25,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:25,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:25,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:08:25,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:08:25,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:08:25,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:25,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:25,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:08:25,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:08:25,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:08:25,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:08:25,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:08:25,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:08:25,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:08:25,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:08:25,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:08:25,813 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...


 - Quantizing to q4:  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 3/5 [00:11<00:06,  3.41s/it]

 - Quantizing to q4f16:  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 3/5 [00:11<00:06,  3.41s/it]2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:08:27,108 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,114 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,115 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:08:27,116 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:08:27,120 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:27,126 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:27,126 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:27,132 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:27,132 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:08:27,132 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:08:27,132 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:08:27,132 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,138 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:08:27,139 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:08:27,139 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:08:27,139 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:08:27,139 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:08:27,139 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,144 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,145 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:08:27,146 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:08:27,147 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:27,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:27,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:08:27,150 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:08:27,151 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:27,157 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:27,157 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:27,164 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:27,164 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:08:27,164 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:08:27,164 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:08:27,164 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:08:27,170 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,176 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,177 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:08:27,178 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:08:27,182 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:27,189 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:27,189 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:27,195 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:27,195 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:08:27,195 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:08:27,195 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:08:27,195 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,201 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:08:27,202 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:08:27,202 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:08:27,202 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:08:27,202 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:08:27,202 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:08:27,207 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,208 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:27,209 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:08:27,210 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:27,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:27,213 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:08:27,214 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:27,220 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:27,220 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:27,226 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:27,226 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:08:27,226 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:08:27,226 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:08:27,226 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:08:27,233 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,239 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:08:27,240 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:08:27,241 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:08:27,245 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:27,251 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:27,251 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:27,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:27,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:08:27,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:08:27,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:08:27,258 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:08:27,264 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,270 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,271 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:08:27,272 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:08:27,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:08:27,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:08:27,273 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:08:27,276 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:27,284 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:27,284 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:27,292 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:27,292 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:08:27,292 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:08:27,292 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:08:27,292 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:27,298 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:08:27,299 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,304 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,305 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:08:27,306 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:08:27,307 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:08:27,311 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:27,318 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:27,318 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:27,326 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:27,326 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:08:27,326 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:08:27,326 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:08:27,326 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:08:27,333 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,339 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,340 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,341 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,342 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:08:27,342 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:08:27,342 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:08:27,345 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:08:27,346 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:08:27,346 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:08:27,346 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:08:27,346 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:27,353 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:27,353 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:27,360 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:27,360 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:08:27,360 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:08:27,360 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:08:27,360 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:27,367 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:27,367 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:08:27,367 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:08:27,367 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:08:27,367 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:08:27,367 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,368 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:08:27,368 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:08:27,368 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:08:27,368 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:08:27,368 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:08:27,368 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,373 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,373 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:08:27,373 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:08:27,373 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:08:27,373 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,374 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:08:27,375 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:08:27,376 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:08:27,380 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:27,388 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:27,388 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:27,395 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:27,395 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:08:27,395 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:08:27,395 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:08:27,395 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:08:27,402 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,408 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,409 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:08:27,410 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,411 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,411 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:08:27,411 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:08:27,411 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:08:27,414 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:08:27,415 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:08:27,415 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:27,422 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:27,422 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:27,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:27,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:08:27,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:08:27,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:08:27,429 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:27,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:27,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:08:27,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:08:27,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:08:27,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:08:27,436 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:08:27,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:08:27,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:08:27,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:08:27,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:08:27,437 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,442 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,443 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:08:27,444 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:08:27,445 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:08:27,449 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:27,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:27,456 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:27,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:27,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:08:27,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:08:27,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:08:27,464 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:08:27,471 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:27,477 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:27,478 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:08:27,479 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:08:27,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:27,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:27,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:27,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:27,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:08:27,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:08:27,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:08:27,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:08:27,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 1.7979866484552076e-08 will be truncated to 1e-07
  warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:92: UserWarning: the float32 number -4.1298839903447515e-09 will be truncated to -1e-07
  warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:85: UserWarning: the float32 number -3.4028234663852886e+38 will be truncated to -10000.0
  warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 9.999999960041972e-13 will be truncated to 1e-07
  warnings.warn(

 - Quantizing to q4f16:  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 3/5 [00:14<00:09,  4.81s/it]

Processing /tmp/tmpfxbvrttu/model.onnx:   0%|          | 0/1 [00:14<?, ?it/s]
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 377, in <module>
    main()
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 374, in main
    quantize(input_folder, output_folder, quantization_args)
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 326, in quantize
    quantize_fp16(
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 223, in quantize_fp16
    check_and_save_model(model_fp16, save_path)
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 29, in check_and_save_model
    strict_check_model(model)
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 21, in strict_check_model
    raise e
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 16, in strict_check_model
    onnx.checker.check_model(model_or_path, full_check=True)
  File "/home/ubuntu/.cache/uv/archive-v0/iAncxVR1WPOl_8LkA6LpD/lib/python3.12/site-packages/onnx/checker.py", line 179, in check_model
    C.check_model(
onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type:Range, node name: /encoder/layers.0/attn/rotary_emb/Range): start typestr: T, has unsupported type: tensor(float16)

❌ Based on model.onnx without slimming

0%|          | 0/1 [00:00<?, ?it/s]
Processing /tmp/tmp6syzxlml/model.onnx:   0%|          | 0/1 [00:00<?, ?it/s]

  0%|          | 0/5 [00:00<?, ?it/s]

 - Quantizing to int8:   0%|          | 0/5 [00:00<?, ?it/s]2025-07-22 08:08:36,089 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:08:36,096 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:08:36,096 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:08:36,098 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,102 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,118 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,127 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,134 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:08:36,134 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:08:36,136 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,140 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,157 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,166 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,174 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:08:36,174 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:08:36,176 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,179 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,196 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,205 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,213 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:08:36,213 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:08:36,215 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,218 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,236 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,245 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,253 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:08:36,253 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:08:36,255 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,258 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,276 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,286 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,294 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:08:36,294 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:08:36,296 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,300 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,317 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,328 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,336 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:08:36,336 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:08:36,338 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,342 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,360 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,371 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,379 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:08:36,379 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:08:36,381 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,385 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,404 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,414 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,423 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:08:36,423 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:08:36,425 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,430 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,448 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,459 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,468 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:08:36,468 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:08:36,470 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,474 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,492 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,503 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,512 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:08:36,512 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:08:36,514 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,518 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,537 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:36,548 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:08:36,558 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:08:36,558 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:08:36,560 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:36,564 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:08:36,583 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified


 - Quantizing to int8:  20%|β–ˆβ–ˆ        | 1/5 [00:05<00:21,  5.31s/it]

 - Quantizing to uint8:  20%|β–ˆβ–ˆ        | 1/5 [00:05<00:21,  5.31s/it]2025-07-22 08:08:40,338 root [INFO] - Quantization parameters for tensor:"/emb_ln/Add_1_output_0" not specified
2025-07-22 08:08:40,345 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul]
2025-07-22 08:08:40,346 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.0/attn/MatMul_1]
2025-07-22 08:08:40,347 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,351 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,368 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,376 root [INFO] - Quantization parameters for tensor:"/encoder/layers.0/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,384 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul]
2025-07-22 08:08:40,384 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.1/attn/MatMul_1]
2025-07-22 08:08:40,386 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,389 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,406 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,415 root [INFO] - Quantization parameters for tensor:"/encoder/layers.1/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,422 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul]
2025-07-22 08:08:40,422 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.2/attn/MatMul_1]
2025-07-22 08:08:40,424 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,428 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,443 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,454 root [INFO] - Quantization parameters for tensor:"/encoder/layers.2/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,461 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul]
2025-07-22 08:08:40,462 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.3/attn/MatMul_1]
2025-07-22 08:08:40,463 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,467 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,483 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,492 root [INFO] - Quantization parameters for tensor:"/encoder/layers.3/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,500 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul]
2025-07-22 08:08:40,500 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.4/attn/MatMul_1]
2025-07-22 08:08:40,502 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,505 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,524 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,533 root [INFO] - Quantization parameters for tensor:"/encoder/layers.4/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,541 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul]
2025-07-22 08:08:40,541 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.5/attn/MatMul_1]
2025-07-22 08:08:40,543 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,547 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,565 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,575 root [INFO] - Quantization parameters for tensor:"/encoder/layers.5/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,583 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul]
2025-07-22 08:08:40,584 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.6/attn/MatMul_1]
2025-07-22 08:08:40,586 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,590 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,608 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,618 root [INFO] - Quantization parameters for tensor:"/encoder/layers.6/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,626 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul]
2025-07-22 08:08:40,627 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.7/attn/MatMul_1]
2025-07-22 08:08:40,629 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,633 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,651 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,662 root [INFO] - Quantization parameters for tensor:"/encoder/layers.7/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,671 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul]
2025-07-22 08:08:40,671 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.8/attn/MatMul_1]
2025-07-22 08:08:40,673 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,677 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,696 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,706 root [INFO] - Quantization parameters for tensor:"/encoder/layers.8/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,715 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul]
2025-07-22 08:08:40,715 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.9/attn/MatMul_1]
2025-07-22 08:08:40,717 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,721 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,740 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,751 root [INFO] - Quantization parameters for tensor:"/encoder/layers.9/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,760 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul]
2025-07-22 08:08:40,760 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.10/attn/MatMul_1]
2025-07-22 08:08:40,762 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,767 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,785 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/mlp/Mul_1_output_0" not specified
2025-07-22 08:08:40,795 root [INFO] - Quantization parameters for tensor:"/encoder/layers.10/norm2/Add_1_output_0" not specified
2025-07-22 08:08:40,805 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul]
2025-07-22 08:08:40,805 root [INFO] - Ignore MatMul due to non constant B: /[/encoder/layers.11/attn/MatMul_1]
2025-07-22 08:08:40,807 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/attn/Reshape_1_output_0" not specified
2025-07-22 08:08:40,811 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/norm1/Add_1_output_0" not specified
2025-07-22 08:08:40,831 root [INFO] - Quantization parameters for tensor:"/encoder/layers.11/mlp/Mul_1_output_0" not specified


 - Quantizing to uint8:  40%|β–ˆβ–ˆβ–ˆβ–ˆ      | 2/5 [00:09<00:14,  4.68s/it]

 - Quantizing to q4:  40%|β–ˆβ–ˆβ–ˆβ–ˆ      | 2/5 [00:09<00:14,  4.68s/it]   2025-07-22 08:08:42,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:08:42,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:08:42,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Constant ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant_1 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_1 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze_1 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_2 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_3 ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:08:42,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:08:42,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant ...
2025-07-22 08:08:42,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:08:42,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_1 ...
2025-07-22 08:08:42,498 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_1 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_2 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_2 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_2 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_3 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast_1 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_4 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_5 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,499 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,501 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,502 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,503 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,504 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,505 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_75 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,507 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_6 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_7 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:08:42,508 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_4 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_8 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_5 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_9 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_8 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_6 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_10 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_9 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_11 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul_1 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:08:42,509 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:42,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:42,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:08:42,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast ...
2025-07-22 08:08:42,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant_1 ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:08:42,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:42,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:42,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:42,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:42,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:08:42,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:08:42,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:08:42,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:42,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast_1 ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant_1 ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:08:42,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:08:42,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant ...
2025-07-22 08:08:42,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:08:42,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_1 ...
2025-07-22 08:08:42,537 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_3 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_4 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_5 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,538 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,539 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,540 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,541 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,543 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_6 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_7 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_4 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_8 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_5 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_9 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_8 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_6 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_10 ...
2025-07-22 08:08:42,544 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_9 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_11 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul_1 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:08:42,545 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant_1 ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:08:42,548 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:42,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:42,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:42,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:42,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:08:42,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:08:42,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:08:42,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:42,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:42,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:08:42,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast_1 ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant_1 ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:08:42,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_1 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_1 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_2 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_2 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_2 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_3 ...
2025-07-22 08:08:42,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast_1 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_4 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_5 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,574 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,575 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div ...
2025-07-22 08:08:42,576 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,577 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,578 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_6 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_7 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_4 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_8 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:08:42,580 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_5 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_9 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_8 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_6 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_10 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_9 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_11 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul_1 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:08:42,581 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant_1 ...
2025-07-22 08:08:42,584 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:08:42,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:08:42,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:08:42,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:08:42,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:08:42,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:42,591 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:42,591 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:42,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:42,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:08:42,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:08:42,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:08:42,597 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast_1 ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant_1 ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:08:42,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_1 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_1 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_2 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_2 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_2 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_3 ...
2025-07-22 08:08:42,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast_1 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_4 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_5 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,610 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,611 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,612 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,613 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,614 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,615 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_6 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_7 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_4 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_8 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_5 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_9 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_8 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_6 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_10 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_9 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_11 ...
2025-07-22 08:08:42,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul_1 ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:08:42,617 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:42,620 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant_1 ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:08:42,621 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:42,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:42,627 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:42,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:42,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:08:42,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:08:42,633 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:08:42,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast_1 ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant_1 ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:08:42,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_1 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_1 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_2 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_2 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_2 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_3 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast_1 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_4 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_5 ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,647 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,648 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,649 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,650 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,651 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:42,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_6 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_7 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_4 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_8 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_5 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_9 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_8 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_6 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_10 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_9 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_11 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul_1 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:08:42,653 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant_1 ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:08:42,657 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:42,663 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:42,664 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:42,670 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:42,670 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:08:42,670 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:08:42,670 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:08:42,670 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast_1 ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant_1 ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:08:42,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:08:42,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_1 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_1 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_2 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_2 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_2 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_3 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast_1 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_4 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_5 ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:08:42,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,684 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,685 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,686 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,687 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,688 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_6 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_7 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_4 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_8 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_5 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_9 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_8 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_6 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_10 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_9 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_11 ...
2025-07-22 08:08:42,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul_1 ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:08:42,690 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant_1 ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:08:42,693 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:08:42,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:08:42,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:08:42,694 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:42,700 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:42,700 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:42,706 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:42,706 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:08:42,706 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:08:42,706 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:08:42,706 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:42,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:42,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:08:42,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast_1 ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant_1 ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:08:42,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,718 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_2 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_2 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_2 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_3 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_4 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_5 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,720 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,721 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,722 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,723 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,724 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_6 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:08:42,725 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_7 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_4 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_8 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_5 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_9 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_8 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_6 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_10 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_9 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_11 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul_1 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:08:42,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant_1 ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:08:42,730 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:42,736 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:42,736 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:42,742 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:42,743 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:08:42,743 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:08:42,743 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:08:42,743 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast_1 ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant_1 ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:08:42,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_1 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_1 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_2 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_2 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_2 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_3 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast_1 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_4 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_5 ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:08:42,755 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,757 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,758 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,759 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,760 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:08:42,761 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_6 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_7 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_4 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_8 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_5 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_9 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_8 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_6 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_10 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_9 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_11 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul_1 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:08:42,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:08:42,763 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant_1 ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:08:42,766 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:08:42,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:08:42,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:08:42,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:08:42,767 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:42,773 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:42,773 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:42,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:42,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:08:42,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:08:42,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:08:42,779 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast_1 ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant_1 ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:08:42,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:08:42,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:08:42,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:08:42,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_1 ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_1 ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_2 ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_2 ...
2025-07-22 08:08:42,791 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_2 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_3 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast_1 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_4 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_5 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,792 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,794 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,795 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,796 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,797 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_6 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_7 ...
2025-07-22 08:08:42,798 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_4 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_8 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_5 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_9 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_8 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_6 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_10 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_9 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_11 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul_1 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:08:42,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant_1 ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:08:42,803 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:42,809 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:42,809 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:42,815 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:42,816 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:08:42,816 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:08:42,816 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:08:42,816 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast_1 ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant_1 ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:08:42,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_1 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_1 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_2 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_2 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_2 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_3 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast_1 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_4 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_5 ...
2025-07-22 08:08:42,828 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,830 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,831 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,832 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,833 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:08:42,834 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_6 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_7 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_4 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_8 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_5 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_9 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_8 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_6 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_10 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_9 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_11 ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul ...
2025-07-22 08:08:42,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul_1 ...
2025-07-22 08:08:42,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:08:42,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:08:42,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:42,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:42,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:08:42,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast ...
2025-07-22 08:08:42,839 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant_1 ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:08:42,840 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:42,846 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:42,846 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:42,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:42,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:08:42,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:08:42,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:08:42,852 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:42,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:42,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast_1 ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant_1 ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:08:42,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,864 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,864 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:08:42,864 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_2 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_2 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_2 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_3 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_4 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_5 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,867 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,868 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,869 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,870 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_6 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:08:42,871 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_7 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_4 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_8 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_5 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_9 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_8 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_6 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_10 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_9 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_11 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul_1 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:08:42,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant_1 ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:08:42,876 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:42,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:42,882 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:42,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:42,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:08:42,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:08:42,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:08:42,889 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast_1 ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant_1 ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:08:42,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:08:42,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:08:42,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_1 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_1 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_2 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_2 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_2 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_3 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast_1 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_4 ...
2025-07-22 08:08:42,901 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_2 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_5 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:42,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:42,903 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:08:42,904 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:42,905 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:42,906 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:08:42,907 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_6 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_7 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_4 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_8 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_5 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_9 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_8 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_6 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_10 ...
2025-07-22 08:08:42,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_9 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_11 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul_1 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_5 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:08:42,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:42,912 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:42,912 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:08:42,912 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast ...
2025-07-22 08:08:42,912 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:08:42,912 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant_1 ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:08:42,913 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:42,919 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:42,919 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:42,925 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:42,925 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:08:42,925 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:08:42,925 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:08:42,925 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:42,931 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast_1 ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant_1 ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:08:42,932 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...


 - Quantizing to q4:  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 3/5 [00:10<00:05,  2.99s/it]

 - Quantizing to q4f16:  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 3/5 [00:10<00:05,  2.99s/it]2025-07-22 08:08:43,482 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/word_embeddings/Gather ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/token_type_embeddings/Gather ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Constant ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /embeddings/Add ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sub ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Pow ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/ReduceMean_1 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Constant_1 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Sqrt ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Div ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Mul ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /emb_ln/Add_1 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_1 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Unsqueeze_1 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Cast ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_2 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Sub ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Constant_3 ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /Mul ...
2025-07-22 08:08:43,483 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_1 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_1 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_1 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_2 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_2 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_2 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_3 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Cast_1 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_4 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_5 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Range ...
2025-07-22 08:08:43,489 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,490 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,491 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,492 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,493 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,494 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Constant_75 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_3 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_4 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_5 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_1 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_2 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_6 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Div_1 ...
2025-07-22 08:08:43,495 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Add ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Softmax ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/MatMul_1 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Transpose_3 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_3 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_7 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_6 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_4 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_8 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_7 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_5 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_9 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_8 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Shape_6 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_10 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Gather_9 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Constant_11 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Mul_1 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Concat_1 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/attn/Reshape_1 ...
2025-07-22 08:08:43,496 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/attn/out_proj/MatMul ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sub ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Pow ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Constant_1 ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Sqrt ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Div ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Mul ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm1/Add_1 ...
2025-07-22 08:08:43,500 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:43,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc11/MatMul ...
2025-07-22 08:08:43,506 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:43,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc12/MatMul ...
2025-07-22 08:08:43,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Sigmoid ...
2025-07-22 08:08:43,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul ...
2025-07-22 08:08:43,512 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/mlp/Mul_1 ...
2025-07-22 08:08:43,513 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.0/mlp/fc2/MatMul ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Add_1 ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/Cast_1 ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sub ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Pow ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Constant_1 ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Sqrt ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Div ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Mul ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.0/norm2/Add_1 ...
2025-07-22 08:08:43,519 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_1 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_1 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_1 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_2 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_2 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_2 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_3 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Cast_1 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_4 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_5 ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,525 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Range ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,526 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,527 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,528 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,529 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,530 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_3 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_4 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_5 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_1 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_2 ...
2025-07-22 08:08:43,531 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_6 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Div_1 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Add ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Softmax ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/MatMul_1 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Transpose_3 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_3 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_7 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_6 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_4 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_8 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_7 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_5 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_9 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_8 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Shape_6 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_10 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Gather_9 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Constant_11 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Mul_1 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Concat_1 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/attn/Reshape_1 ...
2025-07-22 08:08:43,532 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/attn/out_proj/MatMul ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sub ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Pow ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Constant_1 ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Sqrt ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Div ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Mul ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm1/Add_1 ...
2025-07-22 08:08:43,536 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:43,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc11/MatMul ...
2025-07-22 08:08:43,542 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:43,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc12/MatMul ...
2025-07-22 08:08:43,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Sigmoid ...
2025-07-22 08:08:43,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul ...
2025-07-22 08:08:43,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/mlp/Mul_1 ...
2025-07-22 08:08:43,549 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.1/mlp/fc2/MatMul ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Add_1 ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/Cast_1 ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sub ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Pow ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Constant_1 ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Sqrt ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Div ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Mul ...
2025-07-22 08:08:43,555 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.1/norm2/Add_1 ...
2025-07-22 08:08:43,556 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_1 ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_1 ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_1 ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_2 ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_2 ...
2025-07-22 08:08:43,561 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_2 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_3 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Cast_1 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_4 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_5 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Range ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,562 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,563 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,564 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,565 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,566 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,567 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_3 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_4 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_5 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_1 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_2 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_6 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Div_1 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Add ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Softmax ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/MatMul_1 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Transpose_3 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_3 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_7 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_6 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_4 ...
2025-07-22 08:08:43,568 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_8 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_7 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_5 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_9 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_8 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Shape_6 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_10 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Gather_9 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Constant_11 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Mul_1 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Concat_1 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/attn/Reshape_1 ...
2025-07-22 08:08:43,569 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/attn/out_proj/MatMul ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sub ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Pow ...
2025-07-22 08:08:43,572 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Constant_1 ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Sqrt ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Div ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Mul ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm1/Add_1 ...
2025-07-22 08:08:43,573 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:43,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc11/MatMul ...
2025-07-22 08:08:43,579 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:43,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc12/MatMul ...
2025-07-22 08:08:43,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Sigmoid ...
2025-07-22 08:08:43,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul ...
2025-07-22 08:08:43,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/mlp/Mul_1 ...
2025-07-22 08:08:43,585 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.2/mlp/fc2/MatMul ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Add_1 ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/Cast_1 ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sub ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Pow ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Constant_1 ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Sqrt ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Div ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Mul ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.2/norm2/Add_1 ...
2025-07-22 08:08:43,592 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_2 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_2 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_2 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_3 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Cast_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_4 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_5 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Range ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,598 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,599 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,600 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,601 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,602 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,603 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_3 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_4 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_5 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_1 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_2 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_6 ...
2025-07-22 08:08:43,604 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Div_1 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Add ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Softmax ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/MatMul_1 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Transpose_3 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_3 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_7 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_6 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_4 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_8 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_7 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_5 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_9 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_8 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Shape_6 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_10 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Gather_9 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Constant_11 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Mul_1 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Concat_1 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/attn/Reshape_1 ...
2025-07-22 08:08:43,605 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/attn/out_proj/MatMul ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sub ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Pow ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Constant_1 ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Sqrt ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Div ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Mul ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm1/Add_1 ...
2025-07-22 08:08:43,609 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:43,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc11/MatMul ...
2025-07-22 08:08:43,616 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:43,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc12/MatMul ...
2025-07-22 08:08:43,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Sigmoid ...
2025-07-22 08:08:43,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul ...
2025-07-22 08:08:43,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/mlp/Mul_1 ...
2025-07-22 08:08:43,622 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:43,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.3/mlp/fc2/MatMul ...
2025-07-22 08:08:43,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Add_1 ...
2025-07-22 08:08:43,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/Cast_1 ...
2025-07-22 08:08:43,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean ...
2025-07-22 08:08:43,628 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sub ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Pow ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Constant_1 ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Sqrt ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Div ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Mul ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.3/norm2/Add_1 ...
2025-07-22 08:08:43,629 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_1 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_1 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_1 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_2 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_2 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_2 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_3 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Cast_1 ...
2025-07-22 08:08:43,634 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_4 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_5 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Range ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,635 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,636 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,637 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,638 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,639 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,640 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_3 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_4 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_5 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_1 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_2 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_6 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Div_1 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Add ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Softmax ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/MatMul_1 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Transpose_3 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_3 ...
2025-07-22 08:08:43,641 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_7 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_6 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_4 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_8 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_7 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_5 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_9 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_8 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Shape_6 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_10 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Gather_9 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Constant_11 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Mul_1 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Concat_1 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/attn/Reshape_1 ...
2025-07-22 08:08:43,642 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/attn/out_proj/MatMul ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sub ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Pow ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Constant_1 ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Sqrt ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Div ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Mul ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm1/Add_1 ...
2025-07-22 08:08:43,646 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:43,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc11/MatMul ...
2025-07-22 08:08:43,652 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:43,658 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc12/MatMul ...
2025-07-22 08:08:43,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Sigmoid ...
2025-07-22 08:08:43,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul ...
2025-07-22 08:08:43,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/mlp/Mul_1 ...
2025-07-22 08:08:43,659 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.4/mlp/fc2/MatMul ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Add_1 ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/Cast_1 ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sub ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Pow ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Constant_1 ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Sqrt ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Div ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Mul ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.4/norm2/Add_1 ...
2025-07-22 08:08:43,665 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_1 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_1 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_1 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_2 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_2 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_2 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_3 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Cast_1 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_4 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_5 ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,671 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Range ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,672 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,673 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,674 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,675 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,676 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,677 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_3 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_4 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_5 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_1 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_2 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_6 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Div_1 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Add ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Softmax ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/MatMul_1 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Transpose_3 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_3 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_7 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_6 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_4 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_8 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_7 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_5 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_9 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_8 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Shape_6 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_10 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Gather_9 ...
2025-07-22 08:08:43,678 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Constant_11 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Mul_1 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Concat_1 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/attn/Reshape_1 ...
2025-07-22 08:08:43,679 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/attn/out_proj/MatMul ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sub ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Pow ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,682 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Constant_1 ...
2025-07-22 08:08:43,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add ...
2025-07-22 08:08:43,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Sqrt ...
2025-07-22 08:08:43,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Div ...
2025-07-22 08:08:43,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Mul ...
2025-07-22 08:08:43,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm1/Add_1 ...
2025-07-22 08:08:43,683 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:43,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc11/MatMul ...
2025-07-22 08:08:43,689 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:43,695 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc12/MatMul ...
2025-07-22 08:08:43,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Sigmoid ...
2025-07-22 08:08:43,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul ...
2025-07-22 08:08:43,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/mlp/Mul_1 ...
2025-07-22 08:08:43,696 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.5/mlp/fc2/MatMul ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Add_1 ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/Cast_1 ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sub ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Pow ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Constant_1 ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Sqrt ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Div ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Mul ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.5/norm2/Add_1 ...
2025-07-22 08:08:43,702 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_1 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_1 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_1 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_2 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_2 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_2 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_3 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Cast_1 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_4 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_5 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Range ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,708 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,709 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,710 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,711 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,712 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,713 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_3 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_4 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_5 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_1 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_2 ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul ...
2025-07-22 08:08:43,714 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_6 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Div_1 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Add ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Softmax ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/MatMul_1 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Transpose_3 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_3 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_7 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_6 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_4 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_8 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_7 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_5 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_9 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_8 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Shape_6 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_10 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Gather_9 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Constant_11 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Mul_1 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Concat_1 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/attn/Reshape_1 ...
2025-07-22 08:08:43,715 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/attn/out_proj/MatMul ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sub ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Pow ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Constant_1 ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Sqrt ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Div ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Mul ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm1/Add_1 ...
2025-07-22 08:08:43,719 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:43,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc11/MatMul ...
2025-07-22 08:08:43,726 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:43,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc12/MatMul ...
2025-07-22 08:08:43,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Sigmoid ...
2025-07-22 08:08:43,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul ...
2025-07-22 08:08:43,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/mlp/Mul_1 ...
2025-07-22 08:08:43,732 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.6/mlp/fc2/MatMul ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Add_1 ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/Cast_1 ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sub ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant ...
2025-07-22 08:08:43,738 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Pow ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Constant_1 ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Sqrt ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Div ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Mul ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.6/norm2/Add_1 ...
2025-07-22 08:08:43,739 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape ...
2025-07-22 08:08:43,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant ...
2025-07-22 08:08:43,744 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_2 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_2 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_2 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_3 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Cast_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_4 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_5 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Range ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,745 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,746 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,747 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,748 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,749 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,750 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_3 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_4 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_5 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_1 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_2 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_6 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Div_1 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Add ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Softmax ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/MatMul_1 ...
2025-07-22 08:08:43,751 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Transpose_3 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_3 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_7 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_6 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_4 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_8 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_7 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_5 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_9 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_8 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Shape_6 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_10 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Gather_9 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Constant_11 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Mul_1 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Concat_1 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/attn/Reshape_1 ...
2025-07-22 08:08:43,752 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/attn/out_proj/MatMul ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sub ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Pow ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Constant_1 ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Sqrt ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Div ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Mul ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm1/Add_1 ...
2025-07-22 08:08:43,756 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:43,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc11/MatMul ...
2025-07-22 08:08:43,762 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:43,769 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc12/MatMul ...
2025-07-22 08:08:43,769 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Sigmoid ...
2025-07-22 08:08:43,769 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul ...
2025-07-22 08:08:43,769 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/mlp/Mul_1 ...
2025-07-22 08:08:43,769 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.7/mlp/fc2/MatMul ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Add_1 ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/Cast_1 ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sub ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Pow ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Constant_1 ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Sqrt ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Div ...
2025-07-22 08:08:43,775 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Mul ...
2025-07-22 08:08:43,776 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.7/norm2/Add_1 ...
2025-07-22 08:08:43,776 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_1 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_1 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_1 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_2 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_2 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_2 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_3 ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast ...
2025-07-22 08:08:43,781 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Cast_1 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_4 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_5 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Range ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,782 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,783 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,784 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,785 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,786 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,787 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_3 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_4 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_5 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_1 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_2 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_6 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Div_1 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Add ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Softmax ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/MatMul_1 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Transpose_3 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_3 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_7 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_6 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_4 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_8 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_7 ...
2025-07-22 08:08:43,788 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_5 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_9 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_8 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Shape_6 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_10 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Gather_9 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Constant_11 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Mul_1 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Concat_1 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/attn/Reshape_1 ...
2025-07-22 08:08:43,789 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/attn/out_proj/MatMul ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sub ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Pow ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Constant_1 ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Sqrt ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Div ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Mul ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm1/Add_1 ...
2025-07-22 08:08:43,793 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:43,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc11/MatMul ...
2025-07-22 08:08:43,799 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:43,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc12/MatMul ...
2025-07-22 08:08:43,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Sigmoid ...
2025-07-22 08:08:43,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul ...
2025-07-22 08:08:43,805 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/mlp/Mul_1 ...
2025-07-22 08:08:43,806 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.8/mlp/fc2/MatMul ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Add_1 ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/Cast_1 ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sub ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Pow ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Constant_1 ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Sqrt ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Div ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Mul ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.8/norm2/Add_1 ...
2025-07-22 08:08:43,812 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_1 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_1 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_1 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_2 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_2 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_2 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_3 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Cast_1 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_4 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_5 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,818 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Range ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,819 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,820 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,821 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,822 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,823 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_3 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_4 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_5 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_1 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_2 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_6 ...
2025-07-22 08:08:43,824 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Div_1 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Add ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Softmax ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/MatMul_1 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Transpose_3 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_3 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_7 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_6 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_4 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_8 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_7 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_5 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_9 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_8 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Shape_6 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_10 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Gather_9 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Constant_11 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Mul_1 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Concat_1 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/attn/Reshape_1 ...
2025-07-22 08:08:43,825 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/attn/out_proj/MatMul ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sub ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Pow ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Constant_1 ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Sqrt ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Div ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Mul ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm1/Add_1 ...
2025-07-22 08:08:43,829 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:43,835 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc11/MatMul ...
2025-07-22 08:08:43,836 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:43,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc12/MatMul ...
2025-07-22 08:08:43,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Sigmoid ...
2025-07-22 08:08:43,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul ...
2025-07-22 08:08:43,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/mlp/Mul_1 ...
2025-07-22 08:08:43,842 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.9/mlp/fc2/MatMul ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Add_1 ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/Cast_1 ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sub ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Pow ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Constant_1 ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Sqrt ...
2025-07-22 08:08:43,848 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Div ...
2025-07-22 08:08:43,849 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Mul ...
2025-07-22 08:08:43,849 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.9/norm2/Add_1 ...
2025-07-22 08:08:43,849 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_1 ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_1 ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_1 ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_2 ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_2 ...
2025-07-22 08:08:43,854 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_2 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_3 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Cast_1 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_4 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_5 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Range ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,855 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,856 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,857 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,858 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,859 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,860 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_3 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_4 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_5 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_1 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_2 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_6 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Div_1 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Add ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Softmax ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/MatMul_1 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Transpose_3 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_3 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_7 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_6 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_4 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_8 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_7 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_5 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_9 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_8 ...
2025-07-22 08:08:43,861 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Shape_6 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_10 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Gather_9 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Constant_11 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Mul_1 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Concat_1 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/attn/Reshape_1 ...
2025-07-22 08:08:43,862 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:43,865 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/attn/out_proj/MatMul ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sub ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Pow ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Constant_1 ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Sqrt ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Div ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Mul ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm1/Add_1 ...
2025-07-22 08:08:43,866 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:43,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc11/MatMul ...
2025-07-22 08:08:43,872 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:43,878 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc12/MatMul ...
2025-07-22 08:08:43,878 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Sigmoid ...
2025-07-22 08:08:43,878 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul ...
2025-07-22 08:08:43,878 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/mlp/Mul_1 ...
2025-07-22 08:08:43,878 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.10/mlp/fc2/MatMul ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Add_1 ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/Cast_1 ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sub ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Pow ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Constant_1 ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Sqrt ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Div ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Mul ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.10/norm2/Add_1 ...
2025-07-22 08:08:43,885 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/Wqkv/MatMul ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_1 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_1 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_1 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_2 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_2 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_2 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_3 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Cast_1 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_1 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_4 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_2 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_5 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_1 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_2 ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Range ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Einsum ...
2025-07-22 08:08:43,891 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cos ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_1 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Sin ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Cast_2 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_1 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_1 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_3 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_2 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_4 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_2 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_5 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_3 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_6 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_7 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_8 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_9 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_10 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_1 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_11 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_1 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_3 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_12 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_4 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_4 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_13 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_5 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_14 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_1 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_2 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_15 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_16 ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape ...
2025-07-22 08:08:43,892 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_17 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_2 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_18 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_3 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_19 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_4 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_5 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_20 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_6 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_6 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_21 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_7 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_22 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_3 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_5 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_23 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_24 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_1 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_25 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_4 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_26 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_1 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_1 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_1 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_6 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_27 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_7 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_1 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_1 ...
2025-07-22 08:08:43,893 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_28 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_29 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_8 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_30 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_2 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_5 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_7 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_31 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_8 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_32 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_33 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_34 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_35 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_6 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_3 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_36 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_7 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_4 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_2 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_8 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_1 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_37 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_9 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_38 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_39 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_5 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_3 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_9 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_8 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_40 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_10 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_41 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_42 ...
2025-07-22 08:08:43,894 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_10 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_43 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_6 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_44 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_45 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_11 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_46 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_7 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_9 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_47 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_11 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_10 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_48 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_12 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_49 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_9 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_12 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_50 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_51 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_2 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_52 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_10 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_53 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_2 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_2 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_2 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_13 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_54 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_14 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_4 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_2 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_11 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_55 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_13 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_12 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_56 ...
2025-07-22 08:08:43,895 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_14 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_57 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_11 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_15 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_58 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_59 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/ConstantOfShape_3 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_60 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_12 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_61 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Equal_3 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Where_3 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Expand_3 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_16 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_62 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_17 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_5 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Reshape_3 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_63 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_64 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_18 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_65 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_8 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_13 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Shape_13 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_66 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_15 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_67 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_68 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_2 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_69 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Div_1 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_70 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_14 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_9 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_71 ...
2025-07-22 08:08:43,896 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_15 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_10 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Neg_1 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_6 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Mul_16 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Add_3 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_72 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_19 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_73 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Constant_74 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Slice_11 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_7 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Gather_16 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_20 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_21 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Unsqueeze_22 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/rotary_emb/Concat_8 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_3 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_4 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_5 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_1 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_2 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_6 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Div_1 ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Add ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Softmax ...
2025-07-22 08:08:43,897 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - MatMul doesn't have const weight. Skip to quantize
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/MatMul_1 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Transpose_3 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_3 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_7 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_6 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_4 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_8 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_7 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_5 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_9 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_8 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Shape_6 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_10 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Gather_9 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Constant_11 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Mul_1 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_3 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_4 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Unsqueeze_5 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Concat_1 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/attn/Reshape_1 ...
2025-07-22 08:08:43,898 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/attn/out_proj/MatMul ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sub ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Pow ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/ReduceMean_1 ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Constant_1 ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Sqrt ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Div ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Mul ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm1/Add_1 ...
2025-07-22 08:08:43,902 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:43,908 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc11/MatMul ...
2025-07-22 08:08:43,909 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:43,915 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc12/MatMul ...
2025-07-22 08:08:43,915 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Sigmoid ...
2025-07-22 08:08:43,915 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul ...
2025-07-22 08:08:43,915 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/mlp/Mul_1 ...
2025-07-22 08:08:43,915 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - start to quantize /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - complete quantization of /encoder/layers.11/mlp/fc2/MatMul ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Add_1 ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/Cast_1 ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sub ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Pow ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/ReduceMean_1 ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Constant_1 ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Sqrt ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Div ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Mul ...
2025-07-22 08:08:43,921 onnxruntime.quantization.matmul_4bits_quantizer [INFO] - skip to quantize /encoder/layers.11/norm2/Add_1 ...
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 9.999999960041972e-13 will be truncated to 1e-07
  warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:85: UserWarning: the float32 number -3.4028234663852886e+38 will be truncated to -10000.0
  warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:73: UserWarning: the float32 number 1.7979866484552076e-08 will be truncated to 1e-07
  warnings.warn(
/home/ubuntu/src/tjsmigration/transformers.js/scripts/float16.py:92: UserWarning: the float32 number -4.1298839903447515e-09 will be truncated to -1e-07
  warnings.warn(

 - Quantizing to q4f16:  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    | 3/5 [00:12<00:08,  4.24s/it]

Processing /tmp/tmp6syzxlml/model.onnx:   0%|          | 0/1 [00:12<?, ?it/s]
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 377, in <module>
    main()
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 374, in main
    quantize(input_folder, output_folder, quantization_args)
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 326, in quantize
    quantize_fp16(
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/quantize.py", line 223, in quantize_fp16
    check_and_save_model(model_fp16, save_path)
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 29, in check_and_save_model
    strict_check_model(model)
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 21, in strict_check_model
    raise e
  File "/home/ubuntu/src/tjsmigration/transformers.js/scripts/utils.py", line 16, in strict_check_model
    onnx.checker.check_model(model_or_path, full_check=True)
  File "/home/ubuntu/.cache/uv/archive-v0/iAncxVR1WPOl_8LkA6LpD/lib/python3.12/site-packages/onnx/checker.py", line 179, in check_model
    C.check_model(
onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type:Range, node name: /encoder/layers.0/attn/rotary_emb/Range): start typestr: T, has unsupported type: tensor(float16)
Xenova changed pull request status to merged

Sign up or log in to comment