nvidia
/

Riva-Translate-4B-Instruct

Text Generation

text-generation-inference

Model card Files Files and versions

Mujojojo commited on Dec 10, 2025

Commit

9e2aad5

·

verified ·

1 Parent(s): 7e0a807

Update README.md

Files changed (1) hide show

README.md +7 -3

README.md CHANGED Viewed

@@ -3,17 +3,21 @@ license: other
 license_name: nvidia-open-model-license-agreement
 license_link: >-
   https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
 library_name: transformers
 base_model:
-- nvidia/Mistral-NeMo-Minitron-4B-8K-Base
 ---
 # Riva-Translate-4B-Instruct
 ## Model Overview
 The Riva-Translate-4B-Instruct Neural Machine Translation model translates text in 12 languages. The supported languages are: English(en), German(de), European Spanish(es-ES), LATAM Spanish(es-US), France(fr), Brazillian Portugese(pt-BR), Russian(ru), Simplified Chinese(zh-CN), Traditional Chinese(zh-TW), Japanese(ja),Korean(ko), Arabic(ar).
-This model was developed based on the decoder-only Transformer architecture. It is a fine-tuned version of a 4B Base model that was pruned and distilled from [nvidia/Mistral-NeMo-Minitron-8B-Base](https://huggingface.co/nvidia/Mistral-NeMo-Minitron-8B-Base) using our LLM compression technique. The model was trained using a multi-stage CPT and SFT. It uses tiktoken as the tokenizer. The model supports a context length of 8K tokens.
 **Model Developer:** NVIDIA

 license_name: nvidia-open-model-license-agreement
 license_link: >-
   https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
 library_name: transformers
 base_model:
+- nvidia/Mistral-NeMo-12B-Base
 ---
 # Riva-Translate-4B-Instruct
+## 🚀 **Announcement**
+We’re excited to introduce the latest update to our Riva-Translate-4B-Instruct model!
+Explore **[nvidia/Riva-Translate-4B-Instruct-v1.1](https://huggingface.co/nvidia/Riva-Translate-4B-Instruct-v1.1)** to experience improved translation quality and enhanced performance.
 ## Model Overview
 The Riva-Translate-4B-Instruct Neural Machine Translation model translates text in 12 languages. The supported languages are: English(en), German(de), European Spanish(es-ES), LATAM Spanish(es-US), France(fr), Brazillian Portugese(pt-BR), Russian(ru), Simplified Chinese(zh-CN), Traditional Chinese(zh-TW), Japanese(ja),Korean(ko), Arabic(ar).
+This model was developed based on the decoder-only Transformer architecture. It is a fine-tuned version of a 4B Base model that was pruned and distilled from [nvidia/Mistral-NeMo-12B-Base](https://huggingface.co/nvidia/Mistral-NeMo-12B-Base) using our LLM compression technique. The model was trained using a multi-stage CPT and SFT. It uses tiktoken as the tokenizer. The model supports a context length of 8K tokens.
 **Model Developer:** NVIDIA