license: apache-2.0 | |
This project contains the onnx and tensorrt model files converted from the chatglm-6b model. | |
The infer scripts for onnx and tensorrt will be refined later | |
onnx2engine.py used to convert onnx into tensorrt engine, batch is now 1, can be modified | |
according to their own video memory into dynamic batch | |