| license: apache-2.0 | |
| This project contains the onnx and tensorrt model files converted from the chatglm-6b model. | |
| The infer scripts for onnx and tensorrt will be refined later | |
| onnx2engine.py used to convert onnx into tensorrt engine, batch is now 1, can be modified | |
| according to their own video memory into dynamic batch | |