Nebulos-Concise

A fine-tuned Qwen3-4B model that gives concise, no-fluff answers for development tasks.

Overview

Nebulos-Concise is a dev assistant that cuts the garbage. No "Sure! Let me help you with that." No unnecessary explanations. No filler. You ask for code, you get code.

Example:

User: Write a Python function to check if a number is even.

Nebulos-Concise: k.

def is_even(n):
    return n % 2 == 0

Training Details

  • Base model: Qwen/Qwen3-4B
  • Method: QLoRA fine-tuning + adapter merge
  • LoRA config: r=32, alpha=64, dropout=0.05
  • Training data: 32 curated prompt-response pairs
  • Epochs: 3
  • Hardware: Google Colab T4 GPU
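
As a rough illustration of what the LoRA values above imply, here is a minimal sketch that computes the update scaling and the adapter size for one linear layer. It assumes standard LoRA scaling (alpha / r, not the rsLoRA variant). The `LoraHyperparams` class and the 2560 x 2560 layer shape are hypothetical, chosen only for the arithmetic; the card does not state which modules were adapted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraHyperparams:
    """LoRA settings as listed in the training details (hypothetical helper)."""
    r: int = 32
    alpha: int = 64
    dropout: float = 0.05

    @property
    def scaling(self) -> float:
        # Standard LoRA multiplies the low-rank update B @ A by alpha / r.
        return self.alpha / self.r

    def adapter_params(self, d_in: int, d_out: int) -> int:
        # A has shape (r, d_in) and B has shape (d_out, r).
        return self.r * (d_in + d_out)


cfg = LoraHyperparams()
print(cfg.scaling)  # 2.0

# Assumed 2560 x 2560 projection, for illustration only.
print(cfg.adapter_params(2560, 2560))  # 163840 trainable adapter weights
print(2560 * 2560)                     # 6553600 frozen weights in that layer
```

With these settings, the adapter for such a layer would hold about 2.5% as many weights as the frozen matrix it modifies.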

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("neural-bulos/nebulos-concise")
tokenizer = AutoTokenizer.from_pretrained("neural-bulos/nebulos-concise")

messages = [
    {"role": "system", "content": "You are Nebulos, a concise dev assistant. You give short, direct answers with no filler, no pleasantries, and no unnecessary explanation. When asked for code, respond with minimal text and the code. Never say 'Sure', 'Certainly', 'I'd be happy to', or anything like that. Just the answer."},
    {"role": "user", "content": "Write a function to reverse a list in Python."},
]

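# return_dict=True yields a mapping with input_ids and attention_mask; enable_thinking=False turns off Qwen3's thinking mode.
inputs = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors="pt", return_dict=True, enable_thinking=False).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, temperature=0.7, do_sample=True, pad_token_id=tokenizer.eos_token_id)
# Decode only the newly generated tokens, skipping the prompt.
prompt_length = inputs["input_ids"].shape[1]
print(tokenizer.decode(outputs[0][prompt_length:], skip_special_tokens=True))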

GGUF

A GGUF quantized version is available in this repository as model-q4_k_m.gguf.
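
One possible way to run the GGUF file locally is through the llama-cpp-python bindings. The sketch below is an assumption, not the author's documented workflow: it presumes `llama-cpp-python` is installed and that model-q4_k_m.gguf has been downloaded to the working directory. The helper names `build_messages` and `ask_gguf`, the abridged system prompt, and the `n_ctx=2048` context size are all hypothetical choices.

```python
# Abridged from the system prompt in the Usage section.
SYSTEM_PROMPT = (
    "You are Nebulos, a concise dev assistant. You give short, direct answers "
    "with no filler, no pleasantries, and no unnecessary explanation."
)


def build_messages(prompt: str) -> list[dict]:
    """Pair the system prompt with a single user prompt."""
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": prompt},
    ]


def ask_gguf(prompt: str, model_path: str = "model-q4_k_m.gguf", max_tokens: int = 256) -> str:
    """Load the GGUF file and return one chat completion as text."""
    # Imported lazily so this module loads even without llama-cpp-python installed.
    from llama_cpp import Llama

    llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)
    result = llm.create_chat_completion(
        messages=build_messages(prompt),
        max_tokens=max_tokens,
        temperature=0.7,
    )
    return result["choices"][0]["message"]["content"]
```

Once the file is in place, `print(ask_gguf("Write a function to reverse a list in Python."))` should print the model's reply.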

License

Apache 2.0 (inherited from Qwen3-4B)

Credits

Fine-tuned by neural-bulos using QLoRA on Colab T4.
