MTSAIR/Kodify-Nano

Polushinm committed on May 30, 2025

Commit ac63253 · verified · 1 Parent(s): 9b738bb

Update README.md

Files changed (1)

README.md +8 -1

@@ -4,7 +4,7 @@ language:
 - ru
 - en
 pipeline_tag: text-generation
-license: other
+license: apache-2.0
 license_name: apache-2.0
 license_link: https://huggingface.co/MTSAIR/Kodify-Nano/blob/main/Apache%20License%20MTS%20AI.docx
 ---
@@ -22,6 +22,13 @@ Kodify-Nano is a lightweight LLM designed for code development tasks with minima
 ```bash
 python3 -m vllm.entrypoints.openai.api_server --model MTSAIR/Kodify-Nano --port 8985
 ```
+> **Important!** If you encounter the **"CUDA out of memory. Tried to allocate..."** error despite having sufficient GPU memory, try one of these solutions:
+> 1. Add the --enforce-eager argument
+> 2. Reduce GPU memory utilization (for example --gpu-memory-utilization 0.8)
+>
+> Note: This may decrease model performance.
 ---
 ## Using the Ollama Image
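
For reference, the two workarounds added in this commit can be sketched as full command lines. This is a minimal sketch, not part of the README: `build_vllm_cmd` is a hypothetical helper that only prints the launch command from the README with extra flags appended, so it can be inspected without vLLM or a GPU present. The flags `--enforce-eager` and `--gpu-memory-utilization` are the ones named in the commit, and `0.8` is the example value it gives.

```bash
# Hypothetical helper (not in the README): print the README's vLLM launch
# command with any extra flags appended. Nothing is executed here.
build_vllm_cmd() {
  printf '%s\n' "python3 -m vllm.entrypoints.openai.api_server --model MTSAIR/Kodify-Nano --port 8985 $*"
}

# Workaround 1: run in eager mode.
build_vllm_cmd --enforce-eager

# Workaround 2: lower the fraction of GPU memory vLLM may use.
build_vllm_cmd --gpu-memory-utilization 0.8
```

To start the server, run one of the printed lines directly. The commit presents the two flags as alternatives, so trying them one at a time makes it easier to see which one resolves the error and what it costs in performance.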