Model Card for Qwen3-4B-Instruct-2507-LoRA-Intent-Classifier

Identify user intention in multi-turn conversations for AI-driven client engagement.

Model Details

Model Description

This repository contains a LoRA (Low-Rank Adaptation) adapter fine-tuned on top of the base model Qwen3-4B-Instruct-2507 for user intent classification.

The model is designed to analyze the latest user utterance within a conversation and classify it into a predefined set of intention categories. These categories are customizable and defined by business users (e.g., sales or customer service managers).

The adapter is lightweight and intended to be merged with the base model at inference time.

  • Developed by: Li Tuo
  • Model type: Causal Language Model (LLM) with LoRA adapter for classification
  • Language(s) (NLP): Chinese (primary), English (partial support)
  • License: Apache-2.0
  • Finetuned from model: Qwen3-4B-Instruct-2507

Uses

This model is intended to be used as a self-hosted intent classification module within AI-powered customer engagement systems, such as:

  • AI marketing agents
  • Customer service chatbots
  • Phone call automation systems

It is a core component of the project: 👉 AI CS agent generator

Direct Use

Given a prompt that includes customized intention classes and a multi-turn conversation, the model:

  • Focuses on the latest user input
  • Extracts a structured intent label
  • Outputs a predefined intent ID

Bias, Risks, and Limitations

  • Performance depends on quality of synthetic data
  • May degrade on unseen domains (e.g. conversation with non commercial event topics)

Recommendations

The fine-tuned model is intened to serve the AI customer service agent focused on commericial event engagement calls.

How to Get Started with the Model

Download the project repository, and set up the environment (Python 3.11 recommended):

git clone https://github.com/lituokobe/Qwen3-Fine-Tuning.git

cd Qwen3-Fine-Tuning

pip install -r requirements

Download the base model and adapter files to their respective locations:

huggingface-cli download Qwen/Qwen3-4B-Instruct-2507 \
  --local-dir models/Qwen/Qwen3-4B-Instruct-2507

huggingface-cli download lituokobe/Qwen3-4B-Instruct-2507-LoRA-Intent-Classifier \
  --local-dir Qwen3_reg/lora_adapter_lr1e-4_1epoch

Deploy in the same environment:

vllm serve models/Qwen/Qwen3-4B-Instruct-2507 \
    --enable-lora \
    --lora-modules qwen-finetune= Qwen3_reg/lora_adapter_lr1e-4_1epoch \
    --max-lora-rank 64 \
    --max-model-len 4096 \
    --trust-remote-code \
    --dtype float16 \
    --port 8000 \
    --served-model-name qwen3-4b

After the deployment is successful, you can test the model using a simple curl request:

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen3-4b",
    "messages": [
      {
        "role": "user",
        "content": "Please take any sample from the training data or adapt it with the correct format"
      }
    ],
    "temperature": 0.0,
    "max_tokens": 50
  }'

The model is expected to return a structured dictionary (JSON) with the following schema:

{
  "input_summary": "A concise summary of the user's latest message.",
  "intention_id": "short_intent_label"
}

Training Details

Training Data

LoRA-Samples-Intention-Classifier

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Framework versions

  • PEFT 0.18.1
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for lituokobe/Qwen3-4B-Instruct-2507-LoRA-Intent-Classifier

Finetuned
(1448)
this model

Dataset used to train lituokobe/Qwen3-4B-Instruct-2507-LoRA-Intent-Classifier

Paper for lituokobe/Qwen3-4B-Instruct-2507-LoRA-Intent-Classifier