Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@
|
|
| 8 |
|
| 9 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 10 |
|
| 11 |
-
This modelcard
|
| 12 |
|
| 13 |
## Model Details
|
| 14 |
|
|
@@ -16,23 +16,19 @@ This modelcard aims to be a base template for new models. It has been generated
|
|
| 16 |
|
| 17 |
<!-- Provide a longer summary of what this model is. -->
|
| 18 |
|
|
|
|
| 19 |
|
| 20 |
-
|
| 21 |
-
- **
|
| 22 |
-
- **
|
| 23 |
-
- **
|
| 24 |
-
- **Model type:** [More Information Needed]
|
| 25 |
-
- **Language(s) (NLP):** [More Information Needed]
|
| 26 |
-
- **License:** [More Information Needed]
|
| 27 |
-
- **Finetuned from model [optional]:** [More Information Needed]
|
| 28 |
|
| 29 |
### Model Sources [optional]
|
| 30 |
|
| 31 |
<!-- Provide the basic links for the model. -->
|
| 32 |
|
| 33 |
-
- **Repository:** [More Information Needed]
|
| 34 |
-
- **Paper [optional]:**
|
| 35 |
-
- **Demo [optional]:** [More Information Needed]
|
| 36 |
|
| 37 |
## Uses
|
| 38 |
|
|
|
|
| 8 |
|
| 9 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 10 |
|
| 11 |
+
This modelcard documents FM-FCI/DateArith-VLSP2025, a Vietnamese LLM fine-tuned for date arithmetic task. It achieved #1 in the VLSP 2025 benchmark for date-arith task.
|
| 12 |
|
| 13 |
## Model Details
|
| 14 |
|
|
|
|
| 16 |
|
| 17 |
<!-- Provide a longer summary of what this model is. -->
|
| 18 |
|
| 19 |
+
This work investigates two subtasks in temporal reasoning: 1. Date Arithmetic (datearith) and 2. Duration Question Answering (durationQA). For date-arith, we focus on finetuning large language models (LLMs) to directly extract and compute answers. For durationQA, the challenge lies in identifying both explicit and implicit duration expressions in text and reasoning with world knowledge to assess correctness. We explore multiple approaches, from naive supervised fine-tuning (SFT) to SFT augmented with reasoning-based synthetic data and GRPO. Our findings highlight the critical role of carefully constructed data and appropriate training strategies in enabling effective temporal reasoning.
|
| 20 |
|
| 21 |
+
- **Developed by:** FPT Smart Cloud, FPT Corporation
|
| 22 |
+
- **Model type:** MoE
|
| 23 |
+
- **Language(s) (NLP):** Vietnamese (primary)
|
| 24 |
+
- **License:** ?
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
|
| 26 |
### Model Sources [optional]
|
| 27 |
|
| 28 |
<!-- Provide the basic links for the model. -->
|
| 29 |
|
| 30 |
+
- **Repository:** [[More Information Needed]](https://github.com/duccd4/vlsp2025-temporal-qa)
|
| 31 |
+
- **Paper [optional]:** Enabling Temporal Commonsense in Vietnamese LLMs – Date-Arith and DurationQA
|
|
|
|
| 32 |
|
| 33 |
## Uses
|
| 34 |
|