webbigdata/ALMA-7B-Ja

dahara1 committed on Oct 8, 2023

Commit 06810cd · 1 Parent(s): 150c1de

Update README.md

Files changed (1)

README.md +15 -13


@@ -10,19 +10,6 @@ language:
 ---
 # webbigdata/ALMA-7B-Ja
-**ALMA** (**A**dvanced **L**anguage **M**odel-based tr**A**nslator) is an LLM-based translation model, which adopts a new translation model paradigm: it begins with fine-tuning on monolingual data and is further optimized using high-quality parallel data. This two-step fine-tuning process ensures strong translation performance.
-Please find more details in our [paper](https://arxiv.org/abs/2309.11674).
-```
-@misc{xu2023paradigm,
-      title={A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models},
-      author={Haoran Xu and Young Jin Kim and Amr Sharaf and Hany Hassan Awadalla},
-      year={2023},
-      eprint={2309.11674},
-      archivePrefix={arXiv},
-      primaryClass={cs.CL}
-}
-```
 Original ALMA Model [ALMA-7B](https://huggingface.co/haoranxu/ALMA-7B). (26.95GB)
 https://huggingface.co/haoranxu/ALMA-7B
@@ -43,5 +30,20 @@ And translation ability for languages other than Japanese and English has deteri
 [webbigdata/ALMA-7B-Ja-GPTQ-Ja-En](https://huggingface.co/webbigdata/ALMA-7B-Ja-GPTQ-Ja-En)
+**ALMA** (**A**dvanced **L**anguage **M**odel-based tr**A**nslator) is an LLM-based translation model, which adopts a new translation model paradigm: it begins with fine-tuning on monolingual data and is further optimized using high-quality parallel data. This two-step fine-tuning process ensures strong translation performance.
+Please find more details in their [paper](https://arxiv.org/abs/2309.11674).
+```
+@misc{xu2023paradigm,
+      title={A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models},
+      author={Haoran Xu and Young Jin Kim and Amr Sharaf and Hany Hassan Awadalla},
+      year={2023},
+      eprint={2309.11674},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
 ## about this work
 - **This work was done by :** [webbigdata](https://webbigdata.jp/).