---
license: mit
language:
- en
- vi
base_model:
- BlossomsAI/BloomVN-8B-chat
pipeline_tag: text-generation
library_name: transformers
tags:
- conversational
---
# 🌟 BloomVN-8B-Chat-Reasoning
### A fine-tuned multilingual model for Vietnamese reasoning
## NOTE
- This model is a test version for our new training pipeline. THIS IS NOT SUITABLE FOR PRODUCTION
- The full model will be updated soon.
## 📋 Overview
- A language model with reasoning capability that provides step-by-step reasoning in Vietnamese before delivering answers.
- The model follows a structured XML format with explicit reasoning tags.
- It's designed for educational applications and complex problem-solving tasks in Vietnamese.
## 🔧 Method
- Fine-tuned using Group Relative Policy Optimization (GRPO) with [Unsloth](https://github.com/unslothai/unsloth) for hardware efficiency.
- Employs rule-based reward functions to encourage adherence to Vietnamese XML reasoning format.
- Uses LoRA adaptation on a Vietnamese dataset spanning various task types.
## 💫 Quantization
Coming Soon!
## 🤝 Contributors
Developed with ❤️ by [BlossomAI](https://github.com/BlossomAI)
---
Star ⭐️ this repo if you find it valuable!