LH-Tech-AI committed · verified
Commit f8d4fb3 · 1 parent: 44f99f0

Update README.md

Files changed (1):
  1. README.md (+1 −1)
README.md CHANGED
@@ -25,7 +25,7 @@ Then, use the prepare-script and the finetuning script in the files list of this
 
 # How to use it
 You can directly download the final model in ONNX format - so it runs without the need to install a huge Python environment with PyTorch, CUDA, etc. - as INT8 and in full precision.
-Use `inference.py` for local inference on CUDA or CPU!
+Use `inference.py` for local inference on CUDA or CPU! First, run `pip install onnxruntime-gpu tiktoken numpy nvidia-cudnn-cu12 nvidia-cublas-cu12` on your system (inside a Python venv for Linux users).
 
 
 Have fun! :D