| --- |
| base_model: |
| - facebook/bart-large |
| pipeline_tag: translation |
| library_name: transformers |
| tags: |
| - code |
| --- |
| Hindi to Bengali Translation using BART |
|
|
| Overview |
|
|
| This project fine-tunes the BART model for Hindi-to-Bengali translation using the Hind-Beng-5k dataset. |
| The model is trained using the Hugging Face transformers library with PyTorch. |
|
|
| Dataset |
|
|
| We use the Hind-Beng-5k dataset from Hugging Face, which contains parallel Hindi and Bengali text samples. |
| Dataset: sudeshna84/Hind-Beng-5k |
|
|
| Model |
|
|
| The model used for translation is facebook/bart-large. |
| It is fine-tuned for sequence-to-sequence translation from Hindi to Bengali using the BART architecture. |
|
|
| Installation |
| To run the project, install the required dependencies: |
| pip install transformers datasets torch |
|
|
| Preprocessing |
| The dataset is preprocessed by tokenizing the Hindi input text and Bengali target text using the BART tokenizer. |
|
|
| Training |
| The model is trained using the Trainer API from Hugging Face with the following parameters: |
| Batch size: 8 |
| Learning rate: 2e-5 |
| Epochs: 3 |
| Weight decay: 0.01 |
|
|
|
|
| Credits Tag |
| Sudeshna Sani- https://huggingface.co/sudeshna84 |