File size: 4,255 Bytes

---
license: apache-2.0
language:
- ta
---

# **Debug Divas: Colloquial Tamil Translation Model**
🚀 **Fine-tuned Mistral-7B for English-to-Colloquial Tamil Translation**  

![Hugging Face](https://img.shields.io/badge/HuggingFace-Model-yellow?style=flat)  
[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)  

## 🌟 **Overview**
This model is a **fine-tuned version of Mistral-7B** using **Unsloth's FastLanguageModel**, designed specifically to translate **English text into colloquial Tamil** (spoken Tamil). It is optimized for **real-world Tamil conversations**, making it useful for chatbots, assistants, and translation tools.

---

## 📖 **Model Details**
- **Base Model**: [Mistral-7B-Instruct](https://huggingface.co/mistral-7b-instruct)
- **Fine-Tuned Dataset**: Custom dataset (`debug_divas_dataset.json`) with **English → Colloquial Tamil** translation pairs.
- **Training Library**: [Unsloth](https://github.com/unslothai/unsloth) (optimized training for large models)
- **Max Sequence Length**: 128 tokens  
- **Batch Size**: 8  
- **Epochs**: 3  
- **Optimizer**: AdamW  

---

## 🔧 **Installation & Setup**
To use this model, install the necessary dependencies:

```bash
pip install torch transformers datasets unsloth accelerate
```

---

## 🚀 **Usage**
### **Load Model & Tokenizer**
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load fine-tuned model from Hugging Face
model_name = "your-huggingface-username/debug-divas-tamil-translation"
device = "cuda" if torch.cuda.is_available() else "cpu"

model = AutoModelForCausalLM.from_pretrained(model_name).to(device)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Translation function
def translate_english_to_tamil(input_text):
    instruction = "Translate the following English sentence to colloquial Tamil"
    inputs = tokenizer(f"{instruction}: {input_text}", return_tensors="pt").to(device)
    
    translated_tokens = model.generate(**inputs, max_length=128)
    translated_text = tokenizer.decode(translated_tokens[0], skip_special_tokens=True)
    
    return translated_text

# Example usage
input_text = "The pharmacy is near the bus stop."
translated_text = translate_english_to_tamil(input_text)
print("Colloquial Tamil:", translated_text)
```

---

## 📝 **Example Outputs**
| **English** | **Colloquial Tamil Translation** |
|------------|---------------------------------|
| "How are you?" | "நீங்க எப்படி இருக்கீங்க?" |
| "I am going to the market." | "நான் மார்க்கெட்டுக்கு பொறேன்." |
| "The pharmacy is near the bus stop." | "மருந்துக் கடை பஸ்ஸ்டாப் அருகே இருக்க." |

---

## 📚 **Dataset**
The dataset contains **pairs of English sentences** with their **colloquial Tamil translations**.  
Example format:

```json
[
  {
    "input": "How are you?",
    "output": "நீங்க எப்படி இருக்கீங்க?"
  },
  {
    "input": "I am going to the market.",
    "output": "நான் மார்க்கெட்டுக்கு பொறேன்."
  }
]
```

---

## 🏗 **Training Details**
The model was fine-tuned using **UnslothTrainer** with the following hyperparameters:

- **Batch Size**: 8  
- **Epochs**: 3  
- **Learning Rate**: 2e-5  
- **FP16 Training**: Disabled  
- **Optimizer**: AdamW  
- **Dataset Split**: 80% Train, 20% Test  



---

## ⚖ **License & Citation**
This model is released under the **MIT License**. If you use it in your work, please cite:

```bibtex
@misc{debugdivas2025,
  author = {Debug Divas},
  title = {Fine-tuned Mistral-7B for Colloquial Tamil Translation},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/your-huggingface-username/debug-divas-tamil-translation}
}
```

---

Dataset Linke : https://huggingface.co/datasets/anitha2520/debug_divas45/tree/main
anitha2520/debug_divas45

## ❤️ **Contributions & Feedback**
We welcome feedback and contributions! Feel free to open an issue or contribute to our dataset.  

📧 **Contact:** [xidanitha@gmail.com]  

---