Instructions to use Talip7/email-domain-adapt-en-tr with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Talip7/email-domain-adapt-en-tr with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="Talip7/email-domain-adapt-en-tr")# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("Talip7/email-domain-adapt-en-tr") model = AutoModelForSeq2SeqLM.from_pretrained("Talip7/email-domain-adapt-en-tr") - Notebooks
- Google Colab
- Kaggle
Email Domain Adaptation (EN β TR)
This model is NOT intended as a general-purpose EnglishβTurkish translation model.
It was trained exclusively for demonstrating domain adaptation and terminology biasing, following the methodology described in:
π€ Hugging Face LLM Course β Chapter 7: Domain Adaptation
π― Purpose
The sole objective of this model is to demonstrate that:
Domain-specific terminology preferences can be learned via fine-tuning,
even with a very small, controlled parallel dataset.
Concretely, the model is trained to consistently translate:
email β e-posta regardless of context.
This behavior is intentionally biased and should not be interpreted as an improvement in general translation quality.
π§ͺ Training Setup
- Base model:
Helsinki-NLP/opus-mt-en-trk - Fine-tuning method: Seq2Seq fine-tuning
- Dataset: Synthetic, controlled parallel corpus
- Training size: ~150 sentence pairs
- Evaluation: sacreBLEU (for demonstration only)
The dataset was intentionally constructed so that:
emailappears only on the English sidee-postaappears only on the Turkish side- alternative forms (
mail,e-mail) are excluded
π Evaluation Notes
BLEU scores are intentionally high due to the controlled nature of the dataset. They should not be interpreted as real-world translation benchmarks.
The success criterion is qualitative:
- Whether the model exhibits the desired terminology shift
- Including on out-of-distribution examples
β οΈ Intended Use
β
Educational
β
Research on domain adaptation
β Production translation
β General-purpose ENβTR translation
π Use this model
This model can be loaded using the π€ Transformers pipeline API.
Translation (English β Turkish)
from transformers import pipeline
translator = pipeline(
"translation_en_to_tr",
model="Talip7/email-domain-adapt-en-tr"
)
translator("I did not receive your email.")
π Reference
- Downloads last month
- 3