Debug Divas: Colloquial Tamil Translation Model

🚀 Fine-tuned Mistral-7B for English-to-Colloquial Tamil Translation

🌟 Overview

This model is a fine-tuned version of Mistral-7B using Unsloth's FastLanguageModel, designed specifically to translate English text into colloquial Tamil (spoken Tamil). It is optimized for real-world Tamil conversations, making it useful for chatbots, assistants, and translation tools.

📖 Model Details

Base Model: Mistral-7B-Instruct
Fine-Tuned Dataset: Custom dataset (debug_divas_dataset.json) with English → Colloquial Tamil translation pairs.
Training Library: Unsloth (optimized training for large models)
Max Sequence Length: 128 tokens
Batch Size: 8
Epochs: 3
Optimizer: AdamW

🔧 Installation & Setup

To use this model, install the necessary dependencies:

pip install torch transformers datasets unsloth accelerate

🚀 Usage

Load Model & Tokenizer

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load fine-tuned model from Hugging Face
model_name = "your-huggingface-username/debug-divas-tamil-translation"
device = "cuda" if torch.cuda.is_available() else "cpu"

model = AutoModelForCausalLM.from_pretrained(model_name).to(device)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Translation function
def translate_english_to_tamil(input_text):
    instruction = "Translate the following English sentence to colloquial Tamil"
    inputs = tokenizer(f"{instruction}: {input_text}", return_tensors="pt").to(device)
    
    translated_tokens = model.generate(**inputs, max_length=128)
    translated_text = tokenizer.decode(translated_tokens[0], skip_special_tokens=True)
    
    return translated_text

# Example usage
input_text = "The pharmacy is near the bus stop."
translated_text = translate_english_to_tamil(input_text)
print("Colloquial Tamil:", translated_text)

📝 Example Outputs

English	Colloquial Tamil Translation
"How are you?"	"நீங்க எப்படி இருக்கீங்க?"
"I am going to the market."	"நான் மார்க்கெட்டுக்கு பொறேன்."
"The pharmacy is near the bus stop."	"மருந்துக் கடை பஸ்ஸ்டாப் அருகே இருக்க."

📚 Dataset

The dataset contains pairs of English sentences with their colloquial Tamil translations.
Example format:

[
  {
    "input": "How are you?",
    "output": "நீங்க எப்படி இருக்கீங்க?"
  },
  {
    "input": "I am going to the market.",
    "output": "நான் மார்க்கெட்டுக்கு பொறேன்."
  }
]

🏗 Training Details

The model was fine-tuned using UnslothTrainer with the following hyperparameters:

Batch Size: 8
Epochs: 3
Learning Rate: 2e-5
FP16 Training: Disabled
Optimizer: AdamW
Dataset Split: 80% Train, 20% Test

⚖ License & Citation

This model is released under the MIT License. If you use it in your work, please cite:

@misc{debugdivas2025,
  author = {Debug Divas},
  title = {Fine-tuned Mistral-7B for Colloquial Tamil Translation},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/your-huggingface-username/debug-divas-tamil-translation}
}

Dataset Linke : https://huggingface.co/datasets/anitha2520/debug_divas45/tree/main anitha2520/debug_divas45

❤️ Contributions & Feedback

We welcome feedback and contributions! Feel free to open an issue or contribute to our dataset.

📧 Contact: [xidanitha@gmail.com]

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support