---
pipeline_tag: text-generation
library_name: transformers
tags:
- mining
- fp8
license: cc-by-nc-sa-4.0
language:
- ru
base_model: nn-tech/MetalGPT-1
---
## Description
**MetalGPT-1** is a model built on Qwen/Qwen3-32B that incorporates both continual pre-training and supervised fine-tuning on domain-specific data from the mining and metallurgy industry.
---
### Quantization
For convenience and improved performance, we also provide this FP8 checkpoint of the nn-tech/MetalGPT-1 model. Using FP8 precision enables faster inference and lower memory usage, while preserving model quality and numerical stability.
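The memory saving can be illustrated with back-of-the-envelope arithmetic. The sketch below is only an estimate under stated assumptions: the parameter count is taken as roughly 32 billion (inferred from the base model name, not a measured figure), the unquantized checkpoint is assumed to store 2 bytes per weight, and FP8 is assumed to store 1 byte per weight. It ignores the KV cache, activations, and any layers kept in higher precision. `weight_memory_gib` is a hypothetical helper, not part of any library.

```python
def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    return num_params * bytes_per_param / 1024**3


# Assumption: ~32 billion parameters, inferred from the name "Qwen3-32B".
NUM_PARAMS = 32e9

# Assumption: 2 bytes per weight for a 16-bit checkpoint, 1 byte for FP8.
mem_16bit = weight_memory_gib(NUM_PARAMS, 2)
mem_fp8 = weight_memory_gib(NUM_PARAMS, 1)

print(f"16-bit weights: ~{mem_16bit:.1f} GiB")
print(f"FP8 weights:    ~{mem_fp8:.1f} GiB")
```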
---
### vLLM usage
```bash
vllm serve nn-tech/MetalGPT-1-FP8 --reasoning-parser qwen3
```
```python
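from openai import OpenAI

# Connect to the local vLLM server through its OpenAI-compatible API.
client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy",  # placeholder; the OpenAI client requires a non-empty key
)

response = client.chat.completions.create(
    model="nn-tech/MetalGPT-1-FP8",
    messages=[
        # System prompt: "You are a specialist in metallurgy."
        {"role": "system", "content": "Ты специалист в области металлургии."},
        # User prompt: "List the pros and cons of the chloride and sulfate
        # technologies for nickel production."
        {"role": "user", "content": "Назови плюсы и минусы хлоридной и сульфатной технологии производства никеля."},
    ],
    temperature=0.7,
    max_tokens=1024,
)

# Print the model's final answer.
print(response.choices[0].message.content)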
```
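Because the server is started with `--reasoning-parser qwen3`, the reasoning trace may come back in a separate field from the final answer. The following is a minimal sketch of reading both, assuming the field is exposed on the message as `reasoning_content` (or as `reasoning`, depending on the vLLM version; both names are checked here as an assumption). `split_reasoning` is a hypothetical helper, and the `SimpleNamespace` object only stands in for `response.choices[0].message` so the snippet runs without a server.

```python
from types import SimpleNamespace


def split_reasoning(message):
    """Return (reasoning, answer) from a chat completion message.

    Assumption: the reasoning trace, if present, lives in an attribute
    named `reasoning_content` or `reasoning`; otherwise None is returned.
    """
    reasoning = (
        getattr(message, "reasoning_content", None)
        or getattr(message, "reasoning", None)
    )
    answer = message.content or ""
    return reasoning, answer


# Stand-in for response.choices[0].message (illustrative values only).
demo_message = SimpleNamespace(
    reasoning_content="example reasoning trace",
    content="example final answer",
)

reasoning, answer = split_reasoning(demo_message)
if reasoning:
    print("Reasoning:", reasoning)
print("Answer:", answer)
```

With a live server, `split_reasoning(response.choices[0].message)` can be called in place of the stand-in object.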