| license: mit | |
| language: | |
| - en | |
| tags: | |
| - text-generation-inference | |
| - text | |
| ## TinyLLama TensorRT LLM Edition. | |
| This repo contains the TensorRT LLM version of TinyLlama Model. The conversion is done to support Float16 precision on Nvidia TensorRT. |
| license: mit | |
| language: | |
| - en | |
| tags: | |
| - text-generation-inference | |
| - text | |
| ## TinyLLama TensorRT LLM Edition. | |
| This repo contains the TensorRT LLM version of TinyLlama Model. The conversion is done to support Float16 precision on Nvidia TensorRT. |