ai4bharat
/

MultiIndicParaphraseGenerationSS

text2text-generation

paraphrase-generation

Model card Files Files and versions

himani commited on Mar 29, 2022

Commit

2227687

·

1 Parent(s): 5d058d8

Update README.md

Files changed (1) hide show

README.md +7 -0

README.md CHANGED Viewed

@@ -28,6 +28,13 @@ licenses:
 This repository contains the [IndicBARTSS](https://huggingface.co/ai4bharat/IndicBARTSS) checkpoint finetuned on the 11 languages of [IndicParaphrase](https://huggingface.co/datasets/ai4bharat/IndicParaphrase) dataset. For finetuning details,
 see the [paper](https://arxiv.org/abs/2203.05437).
 ## Using this model in `transformers`

 This repository contains the [IndicBARTSS](https://huggingface.co/ai4bharat/IndicBARTSS) checkpoint finetuned on the 11 languages of [IndicParaphrase](https://huggingface.co/datasets/ai4bharat/IndicParaphrase) dataset. For finetuning details,
 see the [paper](https://arxiv.org/abs/2203.05437).
+<ul>
+<li >Supported languages: Assamese, Bengali, Gujarati, Hindi, Marathi, Odiya, Punjabi, Kannada, Malayalam, Tamil, and Telugu. Not all of these languages are supported by mBART50 and mT5. </li>
+<li >The model is much smaller than the mBART and mT5(-base) models, so less computationally expensive for decoding. </li>
+<li> Trained on large Indic language corpora (5.53 million sentences). </li>
+<li> Unlike [MultiIndicParaphraseGeneration](https://huggingface.co/ai4bharat/MultiIndicParaphraseGeneration) each language is written in its own script, so you do not need to perform any script mapping to/from Devanagari. </li>
+</ul>
 ## Using this model in `transformers`