Upload README.md with huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -4,6 +4,7 @@ pipeline_tag: text-generation
 tags:
 - qwen
 - qwen2
 - lora
 - vllm
 - open-webui
@@ -11,21 +12,27 @@ tags:
 - coding
 ---
-# 7bcustom-model
-This is a public deployment package for a local DGX AI Factory coding assistant runtime.
 ## Model
-- Public name: `7bcustom-model`
 - Runtime served name: `dgx-stable-current`
-- Base family: Qwen2 7B Instruct class
 - Runtime: vLLM OpenAI-compatible API
 - Open-WebUI compatible: yes
-## Deployment status
-This public release is based on the locally validated stable deployment.
 ```text
 average_score: 97.75
@@ -35,21 +42,12 @@ critical_fail_count: 0
 decision: DEPLOY_CANDIDATE
 ```
-## Runtime policy
-The local production runtime uses router/template safeguards for deterministic operational answers:
-- Linux guarded prompt
-- vLLM medium prompt
-- CUDA check template
-- LoRA/stable/rejected policy template
 ## vLLM example
 ```bash
 python -m vllm.entrypoints.openai.api_server \
   --model ./ \
-  --served-model-name 7bcustom-model \
   --dtype float16 \
   --host 0.0.0.0 \
   --port 8000 \
@@ -62,10 +60,19 @@ python -m vllm.entrypoints.openai.api_server \
 ```text
 Base URL: http://<host>:8000/v1
-Model   : 7bcustom-model
 API Key : dummy
 ```
 ## Notes
-This repository is intended as a public model/runtime release record. Local absolute paths, private operational logs, and preservation tarballs are not required for public usage.

 tags:
 - qwen
 - qwen2
+- 8b
 - lora
 - vllm
 - open-webui
 - coding
 ---
+# 8bcustom-model
+This is the public model/runtime release for the DGX AI Factory stable coding assistant.
+## Correction notice
+The previous public repository name used `7bcustom-model`, but the checked model size/name should be treated as **8B-class** for this public title. This repository uses the corrected title:
+```text
+8bcustom-model
+```
 ## Model
+- Public name: `8bcustom-model`
 - Runtime served name: `dgx-stable-current`
+- Model class: 8B-class local custom coding assistant
 - Runtime: vLLM OpenAI-compatible API
 - Open-WebUI compatible: yes
+## Deployment benchmark
 ```text
 average_score: 97.75
 decision: DEPLOY_CANDIDATE
 ```
 ## vLLM example
 ```bash
 python -m vllm.entrypoints.openai.api_server \
   --model ./ \
+  --served-model-name 8bcustom-model \
   --dtype float16 \
   --host 0.0.0.0 \
   --port 8000 \
 ```text
 Base URL: http://<host>:8000/v1
+Model   : 8bcustom-model
 API Key : dummy
 ```
+## Runtime policy
+The local production runtime used router/template safeguards for deterministic operational answers:
+- Linux guarded prompt
+- vLLM medium prompt
+- CUDA check template
+- LoRA/stable/rejected policy template
 ## Notes
+This public repository is a model/runtime release record. Private preservation archives and local absolute-path operational records are not required for public usage.