---
pipeline_tag: text-generation
library_name: transformers
tags:
- mining
- fp8
license: cc-by-nc-sa-4.0
language:
- ru
base_model: nn-tech/MetalGPT-1
---

## Description

**MetalGPT-1** is a model built on Qwen/Qwen3-32B. It was adapted to the mining and metallurgy industry through continual pre-training and supervised fine-tuning on domain-specific data.

---

### Quantization

For convenience and improved performance, we also provide this FP8 checkpoint of the nn-tech/MetalGPT-1 model. Using FP8 precision enables faster inference and lower memory usage, while preserving model quality and numerical stability.
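
As a rough illustration of the memory savings, the sketch below compares the weight footprint at 16-bit and 8-bit precision. It is a back-of-the-envelope estimate, not a measurement. The parameter count of about 32 billion is an assumption inferred from the "32B" in the base model name, and the helper `weight_memory_gib` is hypothetical. The estimate also assumes every weight is stored in FP8, which may not hold for all layers of a real checkpoint, and it ignores the KV cache, activations, and runtime overhead.

```python
# Back-of-the-envelope estimate of weight memory only.
# KV cache, activations, and runtime overhead are NOT included.

def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Return the approximate size of the model weights in GiB."""
    return num_params * bytes_per_param / 1024**3

# Assumption: ~32e9 parameters, inferred from the "32B" in Qwen/Qwen3-32B.
NUM_PARAMS = 32e9

bf16_gib = weight_memory_gib(NUM_PARAMS, 2)  # 16-bit: 2 bytes per parameter
fp8_gib = weight_memory_gib(NUM_PARAMS, 1)   # FP8: 1 byte per parameter

print(f"16-bit weights: ~{bf16_gib:.1f} GiB")
print(f"FP8 weights:    ~{fp8_gib:.1f} GiB")
```

Under these assumptions the FP8 weights take about half the memory of a 16-bit checkpoint, which is where most of the savings come from.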

---

### vLLM usage

```bash
vllm serve nn-tech/MetalGPT-1-FP8 --reasoning-parser qwen3
```

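```python
from openai import OpenAI

# Point the OpenAI client at the local vLLM server started above.
client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy",  # placeholder value; the client requires a non-empty key
)

response = client.chat.completions.create(
    model="nn-tech/MetalGPT-1-FP8",
    messages=[
        # System prompt (Russian): "You are a specialist in metallurgy."
        {"role": "system", "content": "Ты специалист в области металлургии."},
        # User prompt (Russian): "Name the pros and cons of the chloride and
        # sulfate technologies for nickel production."
        {"role": "user", "content": "Назови плюсы и минусы хлоридной и сульфатной технологии производства никеля."},
    ],
    temperature=0.7,
    max_tokens=1024,
)

# Print the model's final answer.
print(response.choices[0].message.content)
```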
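
Because the server is started with `--reasoning-parser qwen3`, the model's reasoning may be returned separately from the final answer. The sketch below shows one way to read both parts. It assumes the server attaches the reasoning to the message as an extra `reasoning_content` attribute; the attribute name can differ between vLLM versions, so check the documentation for the version you run. The helper `split_reasoning` and the stand-in `demo` message are hypothetical and exist only so the example runs without a server.

```python
from types import SimpleNamespace


def split_reasoning(message):
    """Return (reasoning, answer) from a chat completion message.

    Assumption: the parsed reasoning is exposed as `reasoning_content`.
    If the attribute is missing, the reasoning is returned as None.
    """
    reasoning = getattr(message, "reasoning_content", None)
    return reasoning, message.content


# Stand-in for response.choices[0].message, so the sketch runs offline.
demo = SimpleNamespace(
    reasoning_content="step-by-step thoughts",
    content="final answer",
)

reasoning, answer = split_reasoning(demo)
print("Reasoning:", reasoning)
print("Answer:", answer)
```

With a live server, pass `response.choices[0].message` from the example above instead of `demo`.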