Model Card for FM-FCI/DateArith-VLSP2025

This model card documents FM-FCI/DateArith-VLSP2025, a Vietnamese LLM fine-tuned for the date arithmetic task. It ranked first on the date-arith task of the VLSP 2025 benchmark.

Model Details

Model Description

This work investigates two subtasks in temporal reasoning: (1) Date Arithmetic (date-arith) and (2) Duration Question Answering (durationQA). For date-arith, we focus on fine-tuning large language models (LLMs) to directly extract and compute answers. For durationQA, the challenge lies in identifying both explicit and implicit duration expressions in text and in reasoning with world knowledge to assess correctness. We explore multiple approaches, from naive supervised fine-tuning (SFT) to SFT augmented with reasoning-based synthetic data and Group Relative Policy Optimization (GRPO). Our findings highlight the critical role of carefully constructed data and appropriate training strategies in enabling effective temporal reasoning.
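To make the date-arith task concrete, the following minimal sketch computes the kind of answer the model is expected to produce, using only the Python standard library. The helper names `add_days` and `add_months`, the example dates, and the month-end clamping rule are assumptions for illustration; this card does not specify the benchmark's question format or its convention for month offsets.

```python
import calendar
from datetime import date, timedelta


def add_days(start: date, days: int) -> date:
    """Shift a date by a number of days (negative values go backwards)."""
    return start + timedelta(days=days)


def add_months(start: date, months: int) -> date:
    """Shift a date by whole months.

    Assumption: if the target month is shorter, the day is clamped to the
    last day of that month (31 January + 1 month -> 29 February in 2024).
    """
    month_index = start.year * 12 + (start.month - 1) + months
    year, month_zero_based = divmod(month_index, 12)
    month = month_zero_based + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(start.day, last_day))


# Hypothetical questions: "What date is 5 days after 27/02/2025?"
print(add_days(date(2025, 2, 27), 5))    # 2025-03-04
print(add_months(date(2024, 1, 31), 1))  # 2024-02-29
```

A reference calculator like this can also serve as a sanity check on model outputs, provided its conventions match those of the dataset.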

Developed by: FPT Smart Cloud, FPT Corporation
Model type: MoE
Language(s) (NLP): Vietnamese (primary)
License: ?

Model Sources

Repository: https://github.com/duccd4/vlsp2025-temporal-qa
Paper: Enabling Temporal Commonsense in Vietnamese LLMs – Date-Arith and DurationQA

Training Details

Training Data

40,000 synthetic samples
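This card does not describe how the synthetic samples were constructed. As a purely hypothetical illustration of what template-based synthesis for date arithmetic could look like, the sketch below draws a random start date and day offset and computes the answer programmatically. The English question template, the `dd/mm/yyyy` format, the field names, and the sampling ranges are all assumptions, not the authors' pipeline.

```python
import random
from datetime import date, timedelta


def make_sample(rng: random.Random) -> dict:
    """Build one hypothetical question/answer pair with a computed answer."""
    # Assumed ranges: start dates within ~30 years of 2000-01-01, offsets up to 400 days.
    start = date(2000, 1, 1) + timedelta(days=rng.randrange(0, 365 * 30))
    offset = rng.randint(1, 400)
    answer = start + timedelta(days=offset)
    return {
        "question": f"What is the date {offset} days after {start:%d/%m/%Y}?",
        "answer": f"{answer:%d/%m/%Y}",
    }


def make_dataset(n: int, seed: int = 0) -> list[dict]:
    """Generate n samples reproducibly from a fixed seed."""
    rng = random.Random(seed)
    return [make_sample(rng) for _ in range(n)]


for sample in make_dataset(3):
    print(sample["question"], "->", sample["answer"])
```

Because every answer is computed rather than written by hand, labels produced this way are correct by construction, which is one reason synthetic data suits arithmetic-style tasks.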

Training Procedure

Training Hyperparameters

Precision: BF16
Learning rate: 5.0e-5
Batch size per device: 16
Epochs: 5
Cutoff length: 2048
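For a rough sense of training length, the sketch below restates the hyperparameters above as a plain dictionary and derives the number of optimizer steps for the 40,000 training samples. The key names are illustrative rather than tied to any particular training framework, and the device count and gradient accumulation factor are assumptions, since this card does not report them.

```python
import math

# Values taken from this card; key names are hypothetical.
hyperparameters = {
    "precision": "bf16",
    "learning_rate": 5.0e-5,
    "per_device_batch_size": 16,
    "num_epochs": 5,
    "cutoff_length": 2048,
}

num_samples = 40_000
num_devices = 1                  # assumption: not stated in this card
gradient_accumulation_steps = 1  # assumption: not stated in this card

effective_batch_size = (
    hyperparameters["per_device_batch_size"]
    * num_devices
    * gradient_accumulation_steps
)
steps_per_epoch = math.ceil(num_samples / effective_batch_size)
total_steps = steps_per_epoch * hyperparameters["num_epochs"]

print(steps_per_epoch, total_steps)  # 2500 12500
```

With more devices or gradient accumulation, the effective batch size grows and the step counts shrink proportionally.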

Evaluation

Testing Data

Evaluation uses the validation, public test, and private test sets provided by the organizers.

Metrics

Accuracy
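A minimal sketch of accuracy as exact-match scoring is shown below. The whitespace stripping and the example answer strings are assumptions; the organizers' official scorer may normalize answers differently.

```python
def accuracy(predictions: list[str], references: list[str]) -> float:
    """Fraction of predictions that exactly match their reference answer."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("cannot compute accuracy on an empty set")
    # Assumption: only surrounding whitespace is ignored when comparing.
    correct = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return correct / len(references)


# Hypothetical answers: one of two predictions matches.
print(accuracy(["04/03/2025", "29/02/2024"], ["04/03/2025", "01/03/2024"]))  # 0.5
```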

Results

Accuracy is 98% on the public test set and 99% on the private test set.

BibTeX:

Enabling Temporal Commonsense in Vietnamese LLMs – Date-Arith and DurationQA

Duc Dinh Chu*, Thanh-Bac Nguyen Ba*, Duy Dinh Le, Khanh Van Tran
