minpeter
/

HyperCLOVAX-SEED-Text-Think-32B-hf

Text Generation

text-generation-inference

Model card Files Files and versions

minpeter commited on Dec 31, 2025

Commit

885d2d4

·

verified ·

1 Parent(s): d9f34c1

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +71 -0

README.md CHANGED Viewed

@@ -126,6 +126,77 @@ If you use this model, please cite the original:
 }
 ```
 ## Acknowledgments
 - Original model by [NAVER Cloud HyperCLOVA X](https://huggingface.co/naver-hyperclovax)

 }
 ```
+## Reproduce This Extraction
+Want to extract the LLM yourself? Use the included [`extract_llm.py`](extract_llm.py) script.
+### Prerequisites
+```bash
+pip install safetensors torch tqdm huggingface_hub
+```
+### Step 1: Download Original VLM (~66GB)
+```bash
+huggingface-cli download naver-hyperclovax/HyperCLOVAX-SEED-Think-32B \
+    --local-dir ./HyperCLOVAX-SEED-Think-32B
+```
+### Step 2: Run Extraction Script
+```bash
+# Download the extraction script
+wget https://huggingface.co/minpeter/HyperCLOVAX-SEED-Text-Think-32B-hf/resolve/main/extract_llm.py
+# Run extraction
+python extract_llm.py \
+    --input ./HyperCLOVAX-SEED-Think-32B \
+    --output ./HyperCLOVAX-SEED-Text-Think-32B
+```
+### What the Script Does
+1. **Extracts LLM weights**: Filters `model.language_model.*` tensors from the VLM
+2. **Remaps keys**: Converts to standard LLaMA format
+   - `model.language_model.model.*` → `model.*`
+   - `model.language_model.lm_head.*` → `lm_head.*`
+3. **Creates config**: Generates LLaMA-compatible `config.json` from VLM's `text_config`
+4. **Copies tokenizer**: Preserves all tokenizer files unchanged
+### Output Structure
+```
+HyperCLOVAX-SEED-Text-Think-32B/
+├── config.json                      # LLaMA config
+├── generation_config.json
+├── model-00001-of-00013.safetensors # ~5GB shards
+├── ...
+├── model-00013-of-00013.safetensors
+├── model.safetensors.index.json
+├── tokenizer.json
+├── tokenizer_config.json
+├── special_tokens_map.json
+├── added_tokens.json
+├── vocab.json
+├── merges.txt
+└── chat_template.jinja
+```
+### Verify Extraction
+```bash
+# Quick test with vLLM
+vllm serve ./HyperCLOVAX-SEED-Text-Think-32B \
+    --dtype bfloat16 \
+    --tensor-parallel-size 2
+# In another terminal
+curl http://localhost:8000/v1/chat/completions \
+    -H "Content-Type: application/json" \
+    -d '{"model": "./HyperCLOVAX-SEED-Text-Think-32B", "messages": [{"role": "user", "content": "Hello!"}]}'
+```
 ## Acknowledgments
 - Original model by [NAVER Cloud HyperCLOVA X](https://huggingface.co/naver-hyperclovax)