Debug Divas: Colloquial Tamil Translation Model

🚀 Fine-tuned Mistral-7B for English-to-Colloquial Tamil Translation

Hugging Face
License: MIT

🌟 Overview

This model is a fine-tuned version of Mistral-7B using Unsloth's FastLanguageModel, designed specifically to translate English text into colloquial Tamil (spoken Tamil). It is optimized for real-world Tamil conversations, making it useful for chatbots, assistants, and translation tools.


📖 Model Details

  • Base Model: Mistral-7B-Instruct
  • Fine-Tuned Dataset: Custom dataset (debug_divas_dataset.json) with English → Colloquial Tamil translation pairs.
  • Training Library: Unsloth (optimized training for large models)
  • Max Sequence Length: 128 tokens
  • Batch Size: 8
  • Epochs: 3
  • Optimizer: AdamW

🔧 Installation & Setup

To use this model, install the necessary dependencies:

pip install torch transformers datasets unsloth accelerate

🚀 Usage

Load Model & Tokenizer

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load fine-tuned model from Hugging Face
model_name = "your-huggingface-username/debug-divas-tamil-translation"
device = "cuda" if torch.cuda.is_available() else "cpu"

model = AutoModelForCausalLM.from_pretrained(model_name).to(device)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Translation function
def translate_english_to_tamil(input_text):
    instruction = "Translate the following English sentence to colloquial Tamil"
    inputs = tokenizer(f"{instruction}: {input_text}", return_tensors="pt").to(device)
    
    translated_tokens = model.generate(**inputs, max_length=128)
    translated_text = tokenizer.decode(translated_tokens[0], skip_special_tokens=True)
    
    return translated_text

# Example usage
input_text = "The pharmacy is near the bus stop."
translated_text = translate_english_to_tamil(input_text)
print("Colloquial Tamil:", translated_text)

📝 Example Outputs

English Colloquial Tamil Translation
"How are you?" "நீங்க எப்படி இருக்கீங்க?"
"I am going to the market." "நான் மார்க்கெட்டுக்கு பொறேன்."
"The pharmacy is near the bus stop." "மருந்துக் கடை பஸ்ஸ்டாப் அருகே இருக்க."

📚 Dataset

The dataset contains pairs of English sentences with their colloquial Tamil translations.
Example format:

[
  {
    "input": "How are you?",
    "output": "நீங்க எப்படி இருக்கீங்க?"
  },
  {
    "input": "I am going to the market.",
    "output": "நான் மார்க்கெட்டுக்கு பொறேன்."
  }
]

🏗 Training Details

The model was fine-tuned using UnslothTrainer with the following hyperparameters:

  • Batch Size: 8
  • Epochs: 3
  • Learning Rate: 2e-5
  • FP16 Training: Disabled
  • Optimizer: AdamW
  • Dataset Split: 80% Train, 20% Test

License & Citation

This model is released under the MIT License. If you use it in your work, please cite:

@misc{debugdivas2025,
  author = {Debug Divas},
  title = {Fine-tuned Mistral-7B for Colloquial Tamil Translation},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/your-huggingface-username/debug-divas-tamil-translation}
}

Dataset Linke : https://huggingface.co/datasets/anitha2520/debug_divas45/tree/main anitha2520/debug_divas45

❤️ Contributions & Feedback

We welcome feedback and contributions! Feel free to open an issue or contribute to our dataset.

📧 Contact: [xidanitha@gmail.com]


Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support