---
pipeline_tag: text-generation
library_name: transformers
tags:
- mining
- fp8
license: cc-by-nc-sa-4.0
language:
- ru
base_model: nn-tech/MetalGPT-1
---

## Description

**MetalGPT-1** is a model built on Qwen/Qwen3-32B. It was adapted through both continual pre-training and supervised fine-tuning on domain-specific data from the mining and metallurgy industry.

---

### Quantization

For convenience and improved performance, we also provide this FP8 checkpoint of the nn-tech/MetalGPT-1 model. FP8 precision enables faster inference and lower memory usage while preserving model quality and numerical stability.

---

### vLLM usage

Start an OpenAI-compatible server:

```bash
vllm serve nn-tech/MetalGPT-1-FP8 --reasoning-parser qwen3
```

Then query it with the OpenAI Python client:

```python
from openai import OpenAI

# The local vLLM server does not check the API key, so any placeholder works.
client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy"
)

response = client.chat.completions.create(
    model="nn-tech/MetalGPT-1-FP8",
    messages=[
        # System prompt: "You are a specialist in metallurgy."
        {"role": "system", "content": "Ты специалист в области металлургии."},
        # User prompt: "Name the pros and cons of the chloride and sulfate
        # technologies for nickel production."
        {"role": "user", "content": "Назови плюсы и минусы хлоридной и сульфатной технологии производства никеля."}
    ],
    temperature=0.7,
    max_tokens=1024
)

# Print the final answer text.
print(response.choices[0].message.content)
```
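
Because the server is started with `--reasoning-parser qwen3`, the model's reasoning trace is returned separately from the final answer. The sketch below shows one possible way to read both parts and to toggle reasoning per request. It rests on assumptions not stated in this card: that the checkpoint keeps the Qwen3 chat template with its `enable_thinking` switch, and that the server exposes the trace as `reasoning_content` (some vLLM versions may name the field `reasoning`). The helpers `build_request` and `ask` are hypothetical names introduced for illustration.

```python
from typing import Any, Dict, Optional, Tuple

MODEL = "nn-tech/MetalGPT-1-FP8"


def build_request(question: str, thinking: bool = True) -> Dict[str, Any]:
    """Build keyword arguments for client.chat.completions.create()."""
    return {
        "model": MODEL,
        "messages": [
            # System prompt: "You are a specialist in metallurgy."
            {"role": "system", "content": "Ты специалист в области металлургии."},
            {"role": "user", "content": question},
        ],
        "temperature": 0.7,
        "max_tokens": 1024,
        # Assumption: the chat template honours the Qwen3 `enable_thinking`
        # switch; vLLM forwards `chat_template_kwargs` to the template.
        "extra_body": {"chat_template_kwargs": {"enable_thinking": thinking}},
    }


def ask(client: Any, question: str, thinking: bool = True) -> Tuple[Optional[str], Optional[str]]:
    """Send one question and return (reasoning, answer)."""
    response = client.chat.completions.create(**build_request(question, thinking))
    message = response.choices[0].message
    # Assumption: the reasoning field name depends on the vLLM version,
    # so both known spellings are checked; None means no trace was returned.
    reasoning = getattr(message, "reasoning_content", None)
    if reasoning is None:
        reasoning = getattr(message, "reasoning", None)
    return reasoning, message.content
```

With the server above running, `ask(OpenAI(base_url="http://localhost:8000/v1", api_key="dummy"), question, thinking=False)` would request a direct answer; if the template ignores the switch, the reasoning trace may still be present.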