Instructions to use ntphuc149/ViLegalQwen2.5-1.5B-Base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use ntphuc149/ViLegalQwen2.5-1.5B-Base with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="ntphuc149/ViLegalQwen2.5-1.5B-Base")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ntphuc149/ViLegalQwen2.5-1.5B-Base")
model = AutoModelForCausalLM.from_pretrained("ntphuc149/ViLegalQwen2.5-1.5B-Base", device_map="auto")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use ntphuc149/ViLegalQwen2.5-1.5B-Base with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ntphuc149/ViLegalQwen2.5-1.5B-Base"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ntphuc149/ViLegalQwen2.5-1.5B-Base",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/ntphuc149/ViLegalQwen2.5-1.5B-Base

SGLang

How to use ntphuc149/ViLegalQwen2.5-1.5B-Base with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ntphuc149/ViLegalQwen2.5-1.5B-Base" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ntphuc149/ViLegalQwen2.5-1.5B-Base",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ntphuc149/ViLegalQwen2.5-1.5B-Base" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ntphuc149/ViLegalQwen2.5-1.5B-Base",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use ntphuc149/ViLegalQwen2.5-1.5B-Base with Docker Model Runner:
```
docker model run hf.co/ntphuc149/ViLegalQwen2.5-1.5B-Base
```

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

ViLegalQwen2.5-1.5B-Base

ViLegalQwen2.5-1.5B-Base is a decoder-only language model for Vietnamese legal text understanding, part of the ViLegalLM suite. It is continually pretrained from Qwen2.5-1.5B on a newly curated 16GB Vietnamese legal corpus. ViLegalQwen2.5-1.5B-Base achieves state-of-the-art results among 1.5B-scale models across Vietnamese legal downstream tasks including Question Answering, Natural Language Inference, and Syllogism Reasoning.

Paper: ViLegalLM: Language Models for Vietnamese Legal Text — Read paper

Resources: GitHub | ViLegalBERT | ViLegalQwen3-1.7B-Base

How to Use

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ntphuc149/ViLegalQwen2.5-1.5B-Base")
model = AutoModelForCausalLM.from_pretrained("ntphuc149/ViLegalQwen2.5-1.5B-Base")

Note: This is a base (pretrained) model, not an instruction-tuned model. We do not recommend using base language models for conversations. Instead, you can apply post-training, e.g., SFT, RLHF, continued pretraining, etc., on this model.

Model Summary

Summary for ViLegalQwen2.5-1.5B-Base checkpoint (click to expand)

Attribute	Value
Architecture	Qwen2 (decoder-only, causal LM)
Parameters	1.54B
Base model	Qwen2.5-1.5B
Max sequence length	2048 tokens
Tokenizer	Qwen2 tokenizer
Training objective	Causal Language Modeling (CLM)
Training domain	Vietnamese legal text
Precision	BF16

Evaluation Results

ViLegalQwen2.5-1.5B-Base achieves state-of-the-art results among 1.5B-scale models across all evaluated Vietnamese legal benchmarks. Bold = best in the 1.5B parameter group. Italic = closed-source model scores.

Question Answering (click to expand)

True/False — ALQAC-TF

Model	Pre	Rec	F1
Qwen2-1.5B	85.05	86.84	85.94
Qwen2.5-1.5B	74.47	92.11	83.34
ViLegalQwen2.5-1.5B-Base	87.31	90.53	88.89
gpt-4o-mini (0-shot)	89.86	97.89	93.70

Multiple Choice — ALQAC-MCQ & VLSP-MCQ-LK

Model	Pre_mac	Rec_mac	F1_mac
ALQAC-MCQ
Qwen2-1.5B	82.19	81.05	81.42
Qwen2.5-1.5B	84.80	84.05	84.37
ViLegalQwen2.5-1.5B-Base	85.66	84.53	84.96
gpt-4o-mini (0-shot)	90.83	91.58	91.15
VLSP-MCQ-LK
Qwen2-1.5B	68.02	53.62	58.15
Qwen2.5-1.5B	65.05	52.34	56.54
ViLegalQwen2.5-1.5B-Base	65.24	54.54	58.39
gpt-4o-mini (0-shot)	69.05	51.16	58.17

Abstractive QA — ViBidLQA-AQA

Model	ROUGE-L	BLEU-4	BS-F1
Qwen2-1.5B	72.44	49.00	89.68
Qwen2.5-1.5B	73.02	49.12	89.74
ViLegalQwen2.5-1.5B-Base	73.45	49.90	89.91
gpt-4o-mini (0-shot)	67.06	40.46	86.85

Natural Language Inference (click to expand)

NLI — VLSP-NLI

Model	Precision	Recall	F1
Qwen2-1.5B	92.86	86.67	89.66
Qwen2.5-1.5B	100.00	80.00	88.89
ViLegalQwen2.5-1.5B-Base	84.90	100.00	91.84
gpt-4o-mini (0-shot)	100.00	86.67	92.86

Syllogism Reasoning (click to expand)

Syllogism Reasoning — VLSP-Syllogism

Model	BS-F1	LLM-Judge
Qwen2-1.5B	76.19	0.2639
Qwen2.5-1.5B	76.89	0.2656
ViLegalQwen2.5-1.5B-Base	76.63	0.2674
gpt-4o-mini (0-shot)	78.63	0.5069

Also in ViLegalLM

Model	Architecture	Params	Context
ViLegalBERT	Encoder-only	135M	256
ViLegalQwen2.5-1.5B-Base (this model)	Decoder-only	1.54B	2,048
ViLegalQwen3-1.7B-Base	Decoder-only	1.72B	4,096

Limitations and Biases

Domain scope: Trained exclusively on Vietnamese legal texts; may not generalize to other legal systems or jurisdictions.
Base model only: Not instruction-tuned; outputs may be incomplete or incoherent without task-specific fine-tuning.
Temporal bias: Legal corpus reflects Vietnamese law as of the collection date; model outputs may not reflect recent legislative changes.
Inherited biases: May reflect biases present in the source legal corpora, including regional variations in legal practice and domain coverage imbalances.
Not a legal authority: Model outputs should never be used as definitive legal interpretations without expert validation.

Intended Use

Intended for:

Further fine-tuning on Vietnamese legal downstream tasks (QA, NLI, reasoning)
Vietnamese legal text generation and completion
Research on Vietnamese legal NLP and continual pretraining

Not intended for:

Direct conversational or instruction-following use without fine-tuning
Replacing professional legal counsel or human judgment in legal decision-making
Providing legal advice without expert validation
Legal systems outside Vietnam without careful domain adaptation

Citation

If you use ViLegalQwen2.5-1.5B-Base, please cite our paper:

@inproceedings{nguyen-etal-2026-vilegallm,
    title = "{V}i{L}egal{LM}: Language Models for {V}ietnamese Legal Text",
    author = "Nguyen, Truong-Phuc  and
      Nguyen, Quy-Nhan  and
      Nguyen, Minh-Tien",
    editor = "Liakata, Maria  and
      Moreira, Viviane P.  and
      Zhang, Jiajun  and
      Jurgens, David",
    booktitle = "Findings of the {A}ssociation for {C}omputational {L}inguistics: {ACL} 2026",
    month = jul,
    year = "2026",
    address = "San Diego, California, United States",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2026.findings-acl.1801/",
    pages = "36136--36150",
    ISBN = "979-8-89176-395-1",
    abstract = "We present **ViLegalLM**, comprising **ViLegalBERT** and **ViLegalQwen**, the first suite of Vietnamese pretrained language models for legal text understanding and generation. It includes one encoder-only model (ViLegalBERT, 135M parameters) and two decoder-only models (ViLegalQwen2.5-1.5B-Base and ViLegalQwen3-1.7B-Base), all continually pretrained on a newly curated 16GB Vietnamese legal corpus, significantly larger than previous work. To mitigate data scarcity, we construct three synthetic datasets using LLM-based generation and hard negative mining for True/False QA, Multiple Choice QA, and Natural Language Inference. We establish state-of-the-art results among open-source models on four main Vietnamese legal downstream tasks spanning ten benchmarks, demonstrating that continual pretraining from base models consistently outperforms instruction-tuned adaptation. Source codes, corpus, datasets, and model checkpoints are publicly available at https://github.com/ntphuc149/ViLegalLM."
}