How to use from the Transformers library
# Gated model: Login with a HF token with gated access permission
hf auth login
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="bmwlab-ntust/log_copilot_32b_with_rag")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("bmwlab-ntust/log_copilot_32b_with_rag")
model = AutoModelForCausalLM.from_pretrained("bmwlab-ntust/log_copilot_32b_with_rag")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
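
The base model, Qwen/QwQ-32B, is a reasoning model, so the decoded text may contain a reasoning segment closed by a </think> tag before the final answer. Whether this fine-tune keeps that behavior is an assumption, and split_reasoning is a hypothetical helper, not part of the model's or the library's API. A minimal sketch for separating the two parts of a decoded string:

```python
def split_reasoning(decoded: str, close_tag: str = "</think>") -> tuple[str, str]:
    """Split decoded model output into (reasoning, answer).

    Assumption: the fine-tune emits QwQ-style reasoning that ends with
    `close_tag`. If the tag is absent, the whole text is treated as the answer.
    """
    reasoning, sep, answer = decoded.rpartition(close_tag)
    if not sep:
        # No closing tag found: nothing to strip.
        return "", decoded.strip()
    # Drop the opening tag if the decoded text happens to include it.
    reasoning = reasoning.replace("<think>", "", 1).strip()
    return reasoning, answer.strip()
```

It could be applied to the output of tokenizer.decode(..., skip_special_tokens=True). With a reasoning model, max_new_tokens=40 may cut generation off before the answer begins, so a larger budget is probably needed in practice.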

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

log_copilot_32b_with_rag

A log-analysis copilot fine-tuned from Qwen/QwQ-32B with supervised fine-tuning (SFT) on RAG-style data. It is intended for log triage, troubleshooting, and root-cause analysis.
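
The card does not document the prompt layout used in the RAG-style training data. As an illustration only, the sketch below packs retrieved log excerpts and a question into a single user message in the chat format shown earlier. The helper name build_rag_messages, the "Context:" / "Question:" layout, and the sample log lines are all assumptions made up for this example.

```python
def build_rag_messages(question: str, retrieved_chunks: list[str]) -> list[dict[str, str]]:
    """Build a chat `messages` list that places retrieved context before the question.

    Assumption: the layout below is hypothetical; the prompt format used
    during fine-tuning is not documented in the model card.
    """
    # Number each chunk so an answer can refer back to its source.
    context = "\n\n".join(
        f"[{i}] {chunk.strip()}" for i, chunk in enumerate(retrieved_chunks, start=1)
    )
    content = f"Context:\n{context}\n\nQuestion: {question.strip()}"
    return [{"role": "user", "content": content}]


# Placeholder log excerpts, invented for illustration.
messages = build_rag_messages(
    "Why did the service restart?",
    ["ERROR oom-killer invoked for pid 4242", "INFO service restarted by supervisor"],
)
```

The resulting messages list can be passed to pipe(messages) or tokenizer.apply_chat_template in place of the "Who are you?" example.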

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("bmwlab-ntust/log_copilot_32b_with_rag")
tokenizer = AutoTokenizer.from_pretrained("bmwlab-ntust/log_copilot_32b_with_rag")
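
The page lists 33B parameters stored as BF16 (2 bytes each), so the weights alone need on the order of 60 GiB before activations and KV cache are counted. The sketch below does that arithmetic and shows one way the model might be loaded across available devices. The helper names are hypothetical, and device_map="auto" assumes the accelerate package is installed.

```python
def estimate_weight_memory_gib(n_params: float, bytes_per_param: int = 2) -> float:
    """Rough memory needed for the weights alone, in GiB (excludes activations and KV cache)."""
    return n_params * bytes_per_param / 2**30


def load_log_copilot(repo_id: str = "bmwlab-ntust/log_copilot_32b_with_rag"):
    """Load tokenizer and model, keeping the checkpoint dtype and spreading layers over devices."""
    # Imported lazily so the estimate above can run without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        torch_dtype="auto",  # use the dtype stored in the checkpoint (BF16 per the page)
        device_map="auto",   # assumption: accelerate is available to place layers
    )
    return tokenizer, model


print(f"{estimate_weight_memory_gib(33e9):.1f} GiB")  # prints "61.5 GiB"
```

Calling load_log_copilot() still requires accepting the gating conditions and logging in with hf auth login first.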
Downloads last month: 42

Safetensors
Model size: 33B params
Tensor type: BF16

Model tree for bmwlab-ntust/log_copilot_32b_with_rag

Base model: Qwen/Qwen2.5-32B
Finetuned: Qwen/QwQ-32B
Finetuned (97): this model